Data Architect
EPAM Systems
IT
Prague, Czechia
Posted on Sep 13, 2024
Data Architect Description
We are looking for Data Architect any level for data-driven projects. Together we design and drive lots of solutions which generate value from data, taking advantage of scalable platforms, cutting-edge technologies, and machine learning algorithms.Set of used technologies is very wide, so any technology background of the Data Architect is acceptable. We provide a solid architecture framework, educational programs, and strong SA community to support you in a deep dive to data domain.Some architectural areas we are focusing on:• Data Processing Architecture• Streaming Architecture• Data Platform Operations• Metadata Management Architecture• Cloud Data Services Architecture• ML and MLOps Architecture• Data Warehouse Architecture• Data Management• Business Intelligence Solutions• Data Integration Architecture• Data Security ArchitectureSome examples from tool set/technology stack we are using:• Clouds: AWS, Azure, GCP• Distributed data processing & ETL Frameworks: Apache Spark (and related cloud specific technologies such as AWS EMR, GCP DataProc, Azure HD Insight), GCP DataFlow, AWS Glue, Databricks• Distributed Environments: Kubernetes, Docker, AWS ECS, Google Kubernetes Engine, Azure Kubernetes Services• Analytical Data Warehousing: Snowflake, AWS Redshift, Azure Synapse, GCP BigQuery• Relational Databases: PostgreSQL, AWS SQL DB, GCP Cloud SQL• Lightweight/Serverless Compute: AWS Lambda Functions, GCP Cloud Functions, Azure Functions• No-SQL/Specialized Databases: Cassandra, MongoDB, Azure Cosmos DB, GCP BigTable, Redis (including cloud analogs)• Data Catalogs & Metadata Management: Collibra, Alation, Informatica,Azure Purview, Google Dataplex• Integration & flow management: AWS Step Functions, Airflow/ GCP Cloud Composer, Azure Data Factory, Kafka Connect• Data Streaming: Kafka, AWS Kinesis, GCP Pub/Sub, Azure Event Hub• Object storages: S3, ADLS, GCS, HDFS, Minio• Search platforms: Solr, ElasticSearch• ML: MLflow, Kubeflow, AWS Sagemaker, Azure ML, GCP AI Platform• Data Visualization: Power BI, Tableau, QlikView, Spotfire, Jupyter• Platform Operations: IaaC (Terraform, AWS CloudFormation, Azure DevOps etc.), IaM (Azure AD, AWS Cognito, etc.), monitoring (Prometheus, Splunk, Azure Monitor, etc.), CI/CD (Jenkins, GCP Cloud Build, etc.), Cloud cost managment, secuity & networking tools• Programming Languages: Java, Scala, Python
#LI-DNI#Not found
#LI-DNI#Not found
Responsibilities
- Design and evolve large-scale data-driven solutions
- Drive direct communications with business stakeholders
- Elaborate on all technical aspects for the development team, provide justification for any architectural decision
- Lead implementation of the solutions from establishing project requirements and goals to solution "go-live"
- Create and present solution architecture documentation with deep technical details to customer and implementation teams
- Participate in the full cycle of pre-sale activities
- Lead solution architecture evaluation and assessment activities
- Continuously research emerging technologies, participate in company level knowledge sharing initiatives, PoCs and training programs
Requirements
- Experience in requirements engineering, solution architecture, systems development, deployment and maintenance
- Knowledge of architecture, design patterns and technological landscape in at least 3 technology domains (Data Platforms, IoT, ML, Backend, Mobile, etc.)
- Profound knowledge of the technology’s internals for at-least 1 technology domain
- Solid understanding of the core concepts in data and analytics platfrom architectures, data warehousing, business intelligence, data management, integration, security and operations areas
- Wide experience in design, implementation, deployment, troubleshooting and replatforming of distributed systems both on premises and in the Cloud
- Structured and systematic knowledge of the entire of architecture design process (requirements, quality attributes, technology selection, estimation, proposal verification, documentation, etc.)
- Experience in all phases of the software development life cycle using different development methodologies and best practices
- Highly organized and detail-oriented
- Good communication skills
- Fluent English
We offer
- Opportunity to work in a fast-paced, agile, software engineering culture
- Comfortable modern office in Prague 7, with support of hybrid or fully remote mode
- Benefit program (5 weeks of vacation, paid sick days, paid days off for special occasions, meal vouchers, flexi pass, Prague city public transport annual coupon, multisport cards, optional contribution to pension fund, health insurance for family member)
- EPAM Employee Stock Purchase Plan (ESPP) (subject to certain eligibility requirements)
- English language courses
- Czech language courses upon request
- Referral bonuses for recommended candidates
- Mobile Phone Tariff’s program for managerial-level candidates
- Great learning and development opportunities, including in-house professional training, career advisory and coaching, sponsored professional certifications, well-being programs, LinkedIn Learning Solutions and much more
Certain benefits and perks may be subject to eligibility requirements and may be available only after you have passed your probationary period.