Amazon Web Services & 5 others
EPAM Systems
Mexico · Amp. Gabriel Hernández, Ciudad de México, CDMX, Mexico · Remote
Posted on Nov 19, 2025
Responsibilities
- Lead the creation and engineering of scalable, secure, and resilient enterprise software systems and cloud platforms
- Establish and implement best practices for software development, DevSecOps, and infrastructure automation
- Manage the deployment, monitoring, and maintenance of systems using Site Reliability Engineering (SRE) principles
- Design and oversee cloud infrastructure (AWS, Azure, GCP), ensuring optimal performance, cost-efficiency, and compliance with standards
- Develop and refine CI/CD pipelines and Infrastructure as Code (IaC) methodologies to ensure efficient workflows
- Drive the adoption and implementation of containerized and serverless architectures, including tools like Docker, AKS, and Azure Functions
- Lead security initiatives by ensuring adherence to compliance standards and conducting risk assessments
- Translate complex business needs into scalable technical solutions and strategic roadmaps
- Mentor and support engineering teams, fostering a culture of continuous improvement and innovation
- Research and integrate generative AI technologies to enhance automation and software capabilities
Requirements
- Bachelor’s degree in Computer Science, a related field, or equivalent professional experience
- At least 5 years of relevant hands-on experience in similar roles
- At least one year of experience in leading and managing teams
- Advanced skills in scripting languages like Bash and Python for automation and infrastructure-related tasks
- Strong expertise in programming frameworks such as .NET and Java, with a focus on modern application development
- Extensive experience with Infrastructure as Code (IaC) tools like Terraform, including creating and managing modules
- Proficiency with CI/CD tools such as Azure DevOps, Jenkins, GitLab CI, or GitHub Actions
- In-depth knowledge of cloud platforms (AWS, Azure, GCP), including service design, deployment, and optimization
- Familiarity with Site Reliability Engineering (SRE) practices and tools for monitoring, incident response, and performance tuning
- Fluent English language skills, both written and spoken, at a B2+ level or higher
We offer/Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn