Sitecore
EPAM Systems
Lviv, Lviv Oblast, Ukraine,
Posted on Jan 16, 2026
Responsibilities
- Provide L3 on-call support as needed
- Define and implement SLI/SLO monitoring standards
- Conduct detailed root cause analyses for incidents and devise preventive measures
- Design and develop infrastructure and product monitoring systems
- Lead postmortem processes and incident response drills
- Analyze and enhance product performance and scalability
- Automate recurring operational tasks to boost efficiency
- Implement CI/CD pipelines using infrastructure-as-code principles
- Manage cloud infrastructure and configurations using tools like Terraform and Ansible
- Collaborate closely with cross-functional teams to align on operational goals and business needs
Requirements
- 3+ years of experience in Site Reliability Engineering, DevOps, or a related role
- Expertise in scripting languages such as Python, Go, Bash, or PowerShell
- Proficiency in cloud infrastructure technologies including GCP, Azure, AWS, or Terraform
- Strong skills in monitoring and observability tools such as DataDog, Prometheus, or Grafana
- In-depth knowledge of CI/CD platforms like Jenkins, Gitlab-CI, or Azure DevOps
- Solid understanding of configuration management tools such as Ansible
- Competency in containerization technologies, including Docker and Kubernetes
- Exceptional problem-solving abilities, troubleshooting skills, and attention to detail
- Ability to reconstruct incident conditions using robust root cause analysis approaches
Nice to have
- Familiarity with end-to-end observability stacks like ELK, Dynatrace, or Zabbix
- Background in Groovy SDK and Jenkinsfile scripting
- Experience designing scalable and fault-tolerant cloud-native architectures
- Understanding of network performance optimization in cloud environments
We offer/Benefits
With us you can:
- Work on a flexible schedule remotely or from any of our comfortable offices or coworking spaces in Ukraine
- Receive the necessary equipment to perform your work tasks
- Change projects and technology stacks within EPAM
- Gain experience in various business domains (Insurance, E-commerce, Healthcare, Finance, Travelling, Media, Artificial Intelligence, and more)
- Relocation opportunities may be available for eligible candidates, depending on the role and openings at other EPAM locations
- Participate in volunteer, charity programs and communities (both technical and interest-based)
We focus on your professional growth:
- You can plan your individual career path together with your manager
- Receive regular feedback from colleagues
- Improve your English for free with certified teachers (Speaking Clubs, client interview preparation courses, etc.)
- Get the opportunity to undergo free training and certification in AWS, GCP, or Azure Clouds
- Use the internal E-learn training program (18,200+ specialized training and mentoring programs)
- Access corporate accounts on LinkedIn Learning, Get Abstract and other partner resources
- Study at EPAM Solution Architecture School with the instructors who are practicing architects
- Develop as a leader, join Delivery Management, Resource Management, Leadership Essentials school and more
- Participate in internal communities (500+ meetups, technical discussions, brainstorming sessions, online events and conferences annually)
What we offer:
- Vacation and sick leave (including a sick leave without a medical certificate)
- A wide range of Voluntary Medical Insurance programs providing both medical treatment and various preventive options (including sports activities)
- Medical insurance for family members at corporate rates
- Company support during significant life events (childbirth or adoption, marriage, etc.)
- Support for psychological comfort: discounts on services from mental health specialists or coaches, thematic training
- E-kids program - a free programming language training program for EPAMers' children