Azure Data Engineer

hirewand

Gurgaon 5 Years Exp Posted 78d ago

Key Responsibilities

Design, develop, and maintain scalable ETL/ELT pipelines using Python, PySpark, and Spark SQL

Build and optimize workloads on Azure Databricks (notebooks, job clusters, workflows)

Implement Delta Lake best practices (schema evolution, partitioning, performance tuning, merge strategies)

Develop robust ingestion frameworks integrating with Azure Data Lake Storage Gen2 and Azure SQL
Implement data validation, transformation logic, and quality control frameworks
Ensure performance optimization and cost-efficient execution
Support production deployments, monitor pipelines, and resolve incidents
Maintain version control using Git and follow CI/CD best practices
Collaborate with architects, analysts, and business stakeholders

Mandatory Skills

Strong hands-on experience in PySpark and Spark SQL

Proven experience in Spark performance tuning & optimization

Solid expertise in Azure Databricks

Strong knowledge of Delta Lake architecture

Good to Have

Exposure to data governance, data security, and compliance frameworks

Experience integrating data from SAP ERP systems
Knowledge of Azure Data Factory (ADF)
- Understanding of DevOps and CI/CD pipelines