Azure Data Engineer

hirewand

Gurgaon | 5 Years Exp | Posted 30d ago

Job Description

Key Responsibilities

  • Design, develop, and maintain scalable ETL/ELT pipelines using Python, PySpark, and Spark SQL
  • Build and optimize workloads on Azure Databricks (notebooks, job clusters, workflows)
  • Implement Delta Lake best practices (schema evolution, partitioning, performance tuning, merge strategies)
  • Develop robust ingestion frameworks integrating with Azure Data Lake Storage Gen2 and Azure SQL
  • Implement data validation, transformation logic, and quality control frameworks
  • Optimize pipeline performance and keep execution cost-efficient
  • Support production deployments, monitor pipelines, and resolve incidents
  • Maintain version control using Git and follow CI/CD best practices
  • Collaborate with architects, analysts, and business stakeholders

Mandatory Skills

  • Strong hands-on experience in PySpark and Spark SQL
  • Proven experience in Spark performance tuning & optimization
  • Solid expertise in Azure Databricks
  • Strong knowledge of Delta Lake architecture
  • Experience working with Azure Data Lake Storage Gen2
  • Git-based version control experience
  • Production support and troubleshooting experience

Good to Have

  • Exposure to data governance, data security, and compliance frameworks
  • Experience integrating data from SAP ERP systems
  • Knowledge of Azure Data Factory (ADF)
  • Understanding of DevOps and CI/CD pipelines
