Site Reliability Engineer
EPAM Systems
This job is no longer accepting applications
Software Engineering
Remote
Posted on Jul 30, 2024
Site Reliability Engineer Description
Join EPAM as a remote Site Reliability Engineer.
This is a Senior SRE position covering the LatAm timezone. You will work in a team of 3 SREs with a hands-on Lead SRE in the same timezone, and collaborate with another team of 3 SREs (a hands-on Lead and 2 senior SREs) in the European timezone. Together, the teams ensure follow-the-sun, 24/7 on-call support for the entire customer platform, which includes a few Java backend services.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Responsibilities
- Provide follow-the-sun, 12/7 on-call support for all Java backend services currently owned by the customer backend team, including ownership of API Gateway observability
- Prepare and deploy patches for issues found in both the Java code and the related service cloud infrastructure
- Assist in establishing top-of-the-line metrics and dashboards that enable this group and the customer backend team to quickly assess overall platform health
- Assist in establishing/improving runbooks for all EOS Backend services
- Assist in monitoring the SLOs of all involved backend services, submitting code changes that improve SLOs as errors occur
Requirements
- 3–8 years of experience as a DevOps engineer or SRE
- Coding exposure is highly desired
- Experience with Amazon DynamoDB, Amazon ElastiCache, Amazon Web Services
- Soft skills:
  - Fast learner who can handle information dumps quickly, learn from them, and apply them dynamically during on-call efforts
  - Able to troubleshoot complex systems efficiently using logs & telemetry, identifying and resolving root causes
  - Able to communicate operational issues clearly and concisely in writing as part of live incident response
  - Motivated to track and improve SLOs across a number of systems through repeatable processes
We Offer
- Career plan and real growth opportunities
- Unlimited access to LinkedIn learning solutions
- International Mobility Plan within 25 countries
- Constant training, mentoring, online corporate courses, eLearning and more
- English classes with a certified teacher
- Support for employee initiatives (Algorithms club, Toastmasters, Agile club and more)
- Enjoyable working environment (gaming room, napping area, amenities, events, sports teams and more)
- Flexible work schedule and dress code
- Collaborate in a multicultural environment and share best practices from around the globe
- Hired directly by EPAM & 100% on payroll
- Benefits required by law (IMSS, INFONAVIT, 25% vacation bonus)
- Insurance: life and major medical expenses, with dental & vision coverage (for the employee and direct family members)
- 13% employee savings fund, capped at the legal limit
- Grocery coupons
- 30-day December bonus
- Employee Stock Purchase Plan
- 12 vacation days plus 4 floating days
- Official Mexican holidays, plus 5 extra holidays (Maundy Thursday and Good Friday, November 2nd, December 24th & 31st)
- Relocation bonus: transportation, 2 weeks of accommodation for you and your family, and more
- Monthly non-taxable amount for the electricity and internet bills
Conditions
- By applying to our role, you agree that your personal data may be used as set out in EPAM's Privacy Notice and Policy