DevOps & 16 others
EPAM Systems
Software Engineering
Mexico · Remote
Posted on Nov 24, 2025
Responsibilities
- Build and operate a client's cloud infrastructure platform
- Keep the Lights On by ensuring system stability and availability
- Define, collect, and report on service quality metrics to drive continuous improvement
- Work across teams to optimize reliability, performance, latency, and efficiency
- Implement and maintain CI/CD pipelines using tools like Codefresh and JetBrains TeamCity
- Develop and manage infrastructure as code using Terraform and related IaC tooling
- Support and troubleshoot issues related to AWS cloud services and Kubernetes solutions such as EKS
- Automate deployment, configuration, and repair activities using scripting languages including Bash and Go
- Manage containerization technologies such as Docker and artifact repositories like JFrog Artifactory
- Monitor system health and performance with appropriate monitoring tools
- Collaborate with clients and internal teams to ensure fast turnaround of feature requests and bug fixes
- Participate in on-call rotation to provide timely support for owned services
- Maintain and improve network protocol configurations within AWS environments
- Document system architectures and operational procedures
- Ensure compliance with security and operational best practices
Requirements
- 3+ years of experience with Amazon Web Services (AWS) cloud platform
- Proficient in Terraform and infrastructure as code (IaC) tooling
- Experience with CI/CD pipelines and related tools such as Codefresh and JetBrains TeamCity
- In-depth knowledge of Linux operating systems and shell scripting including Bash
- Solid understanding of network protocols and networking in AWS environments
- Programming experience in Go language and familiarity with Python
- Experience managing containerization technologies such as Docker
- Knowledge of Kubernetes solutions, particularly Amazon EKS
- Experience with artifact management tools such as JFrog Artifactory
- Ability to troubleshoot complex system and network issues
- Strong analytical and problem-solving skills
- Experience working in a DevOps or Dev+Ops environment practicing 'you build it, you run it'
- Willingness to participate in on-call rotation for service support
- Excellent communication and collaboration skills
- English proficiency B2 (Upper-Intermediate) or higher
We offer/Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn