Azure Data Engineer
hirewand
Job Description
Key Responsibilities
- Design, develop, and maintain scalable ETL/ELT pipelines using Python, PySpark, and Spark SQL
- Build and optimize workloads on Azure Databricks (notebooks, job clusters, workflows)
- Implement Delta Lake best practices (schema evolution, partitioning, performance tuning, merge strategies)
- Develop robust ingestion frameworks integrating with Azure Data Lake Storage Gen2 and Azure SQL
- Implement data validation, transformation logic, and quality control frameworks
- Optimize pipeline performance and keep execution cost-efficient
- Support production deployments, monitor pipelines, and resolve incidents
- Maintain version control using Git and follow CI/CD best practices
- Collaborate with architects, analysts, and business stakeholders
Mandatory Skills
- Strong hands-on experience in PySpark and Spark SQL
- Proven experience in Spark performance tuning and optimization
- Solid expertise in Azure Databricks
- Strong knowledge of Delta Lake architecture
- Experience working with Azure Data Lake Storage Gen2
- Git-based version control experience
- Production support and troubleshooting experience