Generative AI Operations & 5 others
EPAM Systems
Software Engineering, Operations, Data Science
Ukraine
Posted on Jan 16, 2026
Responsibilities
- Design, implement, and maintain automated CI/CD pipelines for the development, training, and deployment of Large Language Models (LLMs) and AI agents
- Build and manage agentic AI systems, ensuring efficient agent-to-agent collaboration and orchestration of complex workflows
- Integrate AI agents with external tools and APIs using modern standards such as the Model Context Protocol (MCP)
- Leverage AI-powered development tools to streamline software delivery, infrastructure management, and troubleshooting processes
- Define and manage cloud infrastructure for GenAI workloads using Infrastructure as Code (IaC) tools such as Terraform, AWS CDK, or CloudFormation
- Implement monitoring and observability solutions for models, agents, and system health using tools like Prometheus, Grafana, or Datadog
- Optimize scalability, performance, and cost-efficiency of GenAI services in production environments
- Enforce AI security, safety, and governance practices, ensuring compliance with organizational and industry standards
Requirements
- Minimum 3 years of experience in DevOps, Site Reliability Engineering (SRE)
- Minimum 1 year of experience in MLOps roles with a strong focus on cloud infrastructure
- Proven experience with AWS, Google Cloud, or Azure
- Proficiency in Python or Bash, and experience with containerization/orchestration tools such as Docker and Kubernetes
- Strong background in building and maintaining CI/CD pipelines using Jenkins, GitLab CI, or similar tools
- Experience with cloud-native GenAI platforms (e.g., AWS Bedrock, Azure AI Foundry, Google Vertex AI)
- Familiarity with LLM architectures and the challenges of deploying large-scale models
- Experience designing or managing multi-agent systems and orchestrated AI workflows
- Hands-on experience implementing infrastructure using IaC frameworks
- B2+ level of English proficiency
Nice to have
- Master’s or PhD in Computer Science, AI, or related field
- Relevant cloud or DevOps certifications (e.g., AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer)
- Strong problem-solving mindset and ability to thrive in a fast-paced, innovative environment
We offer/Benefits
With us you can:
- Work on a flexible schedule remotely or from any of our comfortable offices or coworking spaces in Ukraine
- Receive the necessary equipment to perform your work tasks
- Change projects and technology stacks within EPAM
- Gain experience in various business domains (Insurance, E-commerce, Healthcare, Finance, Travelling, Media, Artificial Intelligence, and more)
- Relocation opportunities may be available for eligible candidates, depending on the role and openings at other EPAM locations
- Participate in volunteer, charity programs and communities (both technical and interest-based)
We focus on your professional growth:
- You can plan your individual career path together with your manager
- Receive regular feedback from colleagues
- Improve your English for free with certified teachers (Speaking Clubs, client interview preparation courses, etc.)
- Get the opportunity to undergo free training and certification in AWS, GCP, or Azure Clouds
- Use the internal E-learn training program (18,200+ specialized training and mentoring programs)
- Access corporate accounts on LinkedIn Learning, Get Abstract and other partner resources
- Study at EPAM Solution Architecture School with the instructors who are practicing architects
- Develop as a leader, join Delivery Management, Resource Management, Leadership Essentials school and more
- Participate in internal communities (500+ meetups, technical discussions, brainstorming sessions, online events and conferences annually)
What we offer:
- Vacation and sick leave (including a sick leave without a medical certificate)
- A wide range of Voluntary Medical Insurance programs providing both medical treatment and various preventive options (including sports activities)
- Medical insurance for family members at corporate rates
- Company support during significant life events (childbirth or adoption, marriage, etc.)
- Support for psychological comfort: discounts on services from mental health specialists or coaches, thematic training
- E-kids program - a free programming language training program for EPAMers' children