Senior/Lead AI DevOps/SRE
EPAM Systems
This job is no longer accepting applications
Description
We are seeking an experienced Senior/Lead AI DevOps/SRE to join our team. In this role, you will collaborate closely with data scientists and software developers to ensure seamless integration and optimize the operational efficiency of our AI deployments. Your expertise will be pivotal in deploying, maintaining, and scaling our cutting-edge AI solutions, including LLMs and RAG systems.
As a key team member, you will take on traditional DevOps responsibilities and lead innovative approaches to MLOps. Your proactive involvement will be essential in driving the success of our AI initiatives and maximizing their impact across the organization.
Responsibilities
- Implement and maintain CI/CD pipelines for AI and machine learning projects, ensuring robust deployment strategies and continuous integration
- Monitor and ensure the reliability, availability, and performance of AI applications, particularly those involving LLMs and RAG
- Collaborate with AI research teams to operationalize machine learning models and systems efficiently
- Develop and enforce best practices for version control, configuration management, and testing of AI-driven software solutions
- Utilize MLOps tools such as Kubeflow, MLflow, or TensorFlow Extended (TFX) to streamline the machine learning lifecycle from experimentation to production
- Implement monitoring solutions that track both system metrics and model performance to facilitate proactive issue resolution
- Participate in on-call rotations to support the operational health of critical systems, employing SRE principles to meet service-level objectives (SLOs) and reduce downtime
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field
- Proven experience as a DevOps Engineer or SRE, with a strong background in software development and automation
- Expertise in deploying and managing LLMs and related techniques such as RAG
- Proficient in CI/CD tools (Jenkins, GitLab CI, CircleCI) and infrastructure as code (Terraform, Ansible)
- Solid knowledge of containerization and orchestration technologies (Docker, Kubernetes)
- Familiarity with MLOps tools and practices to support machine learning lifecycle management
Nice to have
- Experience with cloud services (AWS, GCP, Azure), particularly in AI/ML deployments
- Background in monitoring and logging tools such as Prometheus, Grafana, and the ELK stack
- Understanding of Python, particularly in data science and machine learning contexts
- Certification in Kubernetes, AWS/GCP/Azure, or similar technologies
We offer
- We gather like-minded people:
  - Engineering community of industry professionals
  - Friendly team and enjoyable working environment
  - Flexible schedule and opportunity to work remotely within Poland
  - Chance to work abroad for up to 60 days annually
  - Relocation within our 50+ offices
- We provide growth opportunities:
  - Outstanding career roadmap
  - Leadership development, career advising, soft skills, and well-being programs
  - Certification (GCP, Azure, AWS)
  - Unlimited access to LinkedIn Learning, Get Abstract, O’Reilly, Cloud Guru
  - Language classes in English and Polish for foreigners
- We cover it all:
  - Stable income (Employment Contract or B2B)
  - Participation in the Employee Stock Purchase Plan
  - Benefits package (health insurance, multisport, shopping vouchers)
  - Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
  - Referral bonuses
  - Corporate, social, and well-being events
- Please note:
  - The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview
  - We will reach out to selected candidates exclusively
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multinational teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.