FinTech Australia
Senior Site Reliability Engineer

EPAM Systems


This job is no longer accepting applications

Software Engineering
Remote
Posted on Apr 11, 2025

Senior Site Reliability Engineer Description

We are seeking a skilled Senior Site Reliability Engineer to join our team and contribute to the development and maintenance of highly reliable and scalable systems. This role will involve optimizing infrastructure, automating processes, and ensuring system performance across cloud platforms and distributed systems. You will collaborate with cross-functional teams, lead technical initiatives, and provide mentorship to team members, fostering a culture of continuous improvement and operational excellence.

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.



Responsibilities

  • Optimize Linux-based operating systems to ensure high performance for production services and distributed systems
  • Implement advanced telemetry solutions using tools like Grafana, Prometheus, and Splunk to enhance monitoring and organizational capabilities
  • Troubleshoot complex issues in Kubernetes, establishing best practices and standards for the team
  • Create and maintain automation scripts using Bash and Python to improve operational workflows
  • Develop and manage container orchestration systems such as Kubernetes or EKS, sharing expertise with the team
  • Design and maintain high-performance cloud infrastructure with AWS to ensure availability and reliability
  • Lead automation initiatives to reduce manual processes and enhance team efficiency
  • Provide strong leadership and foster a collaborative team environment through effective communication and ownership
  • Encourage continuous learning and professional growth within the team, cultivating a culture of improvement and curiosity
  • Offer technical mentorship and guidance to team members, ensuring clarity and efficiency in communication
  • Strategically manage disaster recovery and capacity planning to maintain system scalability and resilience
  • Automate deployment processes using tools like Terraform or CloudFormation to increase productivity and reliability
  • Integrate open-source technologies such as Cassandra, Kafka, Postgres, Solr, and Redis to strengthen SRE practices

Requirements

  • Bachelor's degree in Computer Science or a related field involving coding (e.g., physics or mathematics), or equivalent practical experience
  • At least three years of hands-on experience as a Site Reliability Engineer
  • Proficiency in Bash for scripting and automation tasks
  • Experience using Grafana for monitoring and visualization
  • Strong understanding of Linux systems and their optimization for production environments
  • Familiarity with Microsoft Internet Information Services (IIS) for managing web server infrastructure
  • Knowledge of Prometheus for monitoring and alerting in distributed systems
  • Proficiency in Python for developing automation and improving operational workflows
  • English language proficiency at a B2 level or higher, with excellent written and verbal communication skills

Nice to have

  • Experience working with Amazon Web Services (AWS) and designing scalable cloud solutions
  • Familiarity with cloud platforms and their integration into system architecture
  • Expertise in Kubernetes for container orchestration and management
  • Experience with Splunk for advanced telemetry and log management
  • Knowledge of Terraform and Terraform Cloud for infrastructure as code and deployment automation
  • Strong troubleshooting skills for identifying and resolving complex system issues

We offer

  • Connectivity bonus (ARS 15,000 paid at the end of each month with the salary receipt, as a non-wage item)
  • Medicina Prepaga (prepaid health coverage for the employee and their immediate family)
  • Paternity leave (two days on top of the statutory entitlement, for a total of 4 days)
  • Discount card
  • English training (lessons twice per week)
  • Training program (access to multiple training plans tailored to the needs of each role within the company)
  • Marriage bonus (the company doubles the statutory allowance paid by ANSES)
  • Referral program (a bonus is paid when a candidate referred by an employee joins the company)
  • External Agreements and Discounts
  • Vacations: 14 calendar days a year

By applying to this role, you agree that your personal data may be used as set out in EPAM's Privacy Notice and Policy.

FINTECH AUSTRALIA

FinTech Australia exists to help our country become one of the world’s top markets for fintech innovation and investment.

© 2023 FinTech Australia