Azure Data Engineer

hirewand

Pune | 5 Years Exp | Posted 65d ago

Key Responsibilities

Design, develop, and maintain scalable ETL/ELT pipelines using Python, PySpark, and Spark SQL
Build and optimize workloads on Azure Databricks (notebooks, job clusters, workflows)
Implement Delta Lake best practices (schema evolution, partitioning, performance tuning, merge strategies)
Develop robust ingestion frameworks integrating with Azure Data Lake Storage Gen2 and Azure SQL
Implement data validation, transformation logic, and quality control frameworks
Optimize pipeline performance and keep execution cost-efficient
Support production deployments, monitor pipelines, and resolve incidents
Maintain version control using Git and follow CI/CD best practices
Collaborate with architects, analysts, and business stakeholders

Mandatory Skills

Strong hands-on experience in PySpark and Spark SQL
Proven experience in Spark performance tuning & optimization
Solid expertise in Azure Databricks
Strong knowledge of Delta Lake architecture
Experience working with Azure Data Lake Storage Gen2
Git-based version control experience
Production support and troubleshooting experience