Senior Data Engineer, Specialist

msd

Hyderabad NM Years Exp Posted 5h ago

Job Description

What will you do in this role

  • Design, develop, and maintain ETL/ELT pipelines using SQL, Python, and Spark
  • Build and manage data workflows using Apache Airflow for orchestration and scheduling
  • Develop scalable and optimized solutions using AWS services (S3, Glue, Redshift, EMR, Lambda, etc.)
  • Implement and manage data processing pipelines in Databricks (Delta Lake, notebooks, workflows, Unit Catalog)
  • Ensure data quality, reliability, and performance across pipelines
  • Collaborate with analytics, product, and business teams to deliver data solutions
  • Monitor, troubleshoot, and optimize production pipelines

What should you have

  • Strong proficiency in SQL and Python
  • Hands-on experience with Apache Spark (PySpark preferred)
  • Experience working with Apache Airflow for workflow orchestration
  • Solid experience with AWS cloud platform. Redshift performance optimization skills
  • Hands-on experience in Databricks
  • Understanding of data warehousing, data modeling, and ETL design

🔹 Good to Have

  • Experience with CI/CD pipelines and GitHub Actions
  • Knowledge of Pharmaceutical / Life Sciences domain
  • Familiarity with data governance and quality frameworks
  • Exposure to Docker, Kubernetes, or similar technologies

Primary Skills.

  • SQL, Pyton,
  • PySpark
  • Aws cloud Platform

Similar Openings for You