PHP & 6 others
EPAM Systems
Argentina · Amp. Gabriel Hernández, Ciudad de México, CDMX, Mexico · Remote
Posted on Dec 23, 2025
Responsibilities
- Partner with engineering teams to assess system architecture and pinpoint key performance areas
- Deliver data-driven advice for enhancing platform performance
- Improve load testing approaches, tools, and standards within the team
- Record and share test plans, outcomes, and technical findings clearly
- Resolve bottlenecks to boost platform scalability and reliability
- Create and enforce limits across platform entry points
- Facilitate continuous integration and deployment processes to uphold platform health
- Track platform metrics using Prometheus and Elastic Stack to guarantee operational quality
- Utilize OpenTelemetry for tracing and diagnosing performance issues
- Collaborate with cross-functional teams to align governance strategies with business objectives
- Lead efforts to proactively manage resource distribution and fair use across the platform
- Guide team members to encourage growth and deepen technical skills
- Keep up-to-date with new technologies relevant to platform governance and ecommerce
Requirements
- Extensive software engineering experience with at least 3 years in complex, scalable systems
- Proficient in PHP and experienced with Go or Scala, capable of working across all three languages
- Knowledgeable in OpenTelemetry, Prometheus, and Elastic Stack (ELK) for monitoring and performance evaluation
- Familiarity with cloud platforms like Google Cloud Platform, AWS, or Azure and microservices architectures
- Experience with CI/CD pipelines
- Strong problem-solving and debugging skills focused on system performance
- Background in ecommerce, including store operations, catalog management, and checkout processes
- Capability to explain complex technical topics to both technical and non-technical stakeholders
- Proactive mindset with a strong drive to detect issues and implement fixes
- Eagerness to learn and adapt to new technologies and challenges
- Upper-Intermediate English skills (B2) for effective team communication
Nice to have
- Understanding of distributed systems and event-driven architectures
- Experience with scalability and performance testing, including load generation and monitoring
- Skill in automating testing, data gathering, and reporting
- Practical knowledge of load testing tools such as K6, JMeter, or Blazemeter
- Prior experience with Scala programming language
We offer/Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn