Cloud Site Reliability Engineer (AWS or GCP)
EPAM Systems
This job is no longer accepting applications
See open jobs at EPAM Systems.See open jobs similar to "Cloud Site Reliability Engineer (AWS or GCP)" FinTech Australia.Software Engineering
Remote
Posted 6+ months ago
Cloud Site Reliability Engineer (AWS or GCP) Description
DESCRIPTION
Are you a skilled Cloud Site Reliability Engineer with experience in AWS or GCP?
Do you have a passion for maintaining CI/CD frameworks, integrating observatory stacks, and supporting Cloud applications?
If so, we have an exciting opportunity for you!
We're currently seeking a Cloud Site Reliability Engineer to join our vibrant team.
This role offers the chance to help the product team in maximizing the reliability of software solutions and ensure that the energy needs of the planet are met. If you're ready to take your career to the next level, we'd love to hear from you!
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Responsibilities
- Maintain and improve and optimize a LightOps monitoring system
- Incident management: troubleshooting, resolve, write documentation, perform post-mortem analysis on all relevant happenings for Cloud applications
- Share knowledge and actively collaborate with other teams in the organization for playbook development
- Be the liaison between the engineering and operations teams, in order to ensure proper communication and collaboration
Requirements
- Knowledge of OS Administration, PowerShell, Bash
- Experience with Automation using Scripting and Programming Languages
- Mastery of Observability and monitoring; Grafana/Prometheus and Open source (e.g. Loki)
- Familiarity with containerization (Docker), Kubernetes
- Hands-on experience in infrastructure performance and capacity planning (design, build or implementation)
- Competence in technologies such as NAT, DNS and DHCP
- Know-how of tools such as Git and PagerDuty
- Solid programming skills with Python and GO are a strong Plus
- Database skills (MongoDB, Oracle, Postgres)
- Disaster Recovery experience (backup georedundancy, recovery scripting)
- Networking e.g. VPN, Load Balancer, NSG or familiar with medium-> high Networking concepts
We Offer
- Career plan and real growth opportunities
- Unlimited access to LinkedIn learning solutions
- International Mobility Plan within 25 countries
- Constant training, mentoring, online corporate courses, eLearning and more
- English classes with a certified teacher
- Support for employee’s initiatives (Algorithms club, toastmasters, agile club and more)
- Enjoyable working environment (Gaming room, napping area, amenities, events, sport teams and more)
- Flexible work schedule and dress code
- Collaborate in a multicultural environment and share best practices from around the globe
- Hired directly by EPAM & 100% under payroll
- Law benefits (IMSS, INFONAVIT, 25% vacation bonus)
- Major medical expenses insurance: Life, Major medical expenses with dental & visual coverage (for the employee and direct family members)
- 13 % employee savings fund, capped to the law limit
- Grocery coupons
- 30 days December bonus
- Employee Stock Purchase Plan
- 12 vacations days plus 4 floating days
- Official Mexican holidays, plus 5 extra holidays (Maundry Thursday and Friday, November 2nd, December 24th & 31st)
- Relocation bonus: transportation, 2 weeks of accommodation for you and your family and more
- Monthly non-taxable amount for the electricity and internet bills
Conditions
- By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM´s Privacy Notice and Policy
This job is no longer accepting applications
See open jobs at EPAM Systems.See open jobs similar to "Cloud Site Reliability Engineer (AWS or GCP)" FinTech Australia.