Senior Data Engineer, Specialist
MSD
Job Description
What will you do in this role
- Design, develop, and maintain ETL/ELT pipelines using SQL, Python, and Spark
- Build and manage data workflows using Apache Airflow for orchestration and scheduling
- Develop scalable and optimized solutions using AWS services (S3, Glue, Redshift, EMR, Lambda, etc.)
- Implement and manage data processing pipelines in Databricks (Delta Lake, notebooks, workflows, Unity Catalog)
- Ensure data quality, reliability, and performance across pipelines
- Collaborate with analytics, product, and business teams to deliver data solutions
- Monitor, troubleshoot, and optimize production pipelines
What should you have
- Strong proficiency in SQL and Python
- Hands-on experience with Apache Spark (PySpark preferred)
- Experience working with Apache Airflow for workflow orchestration
- Solid experience with the AWS cloud platform, including Redshift performance optimization
- Hands-on experience in Databricks
- Understanding of data warehousing, data modeling, and ETL design
Good to have
- Experience with CI/CD pipelines and GitHub Actions
- Knowledge of Pharmaceutical / Life Sciences domain
- Familiarity with data governance and quality frameworks
- Exposure to Docker, Kubernetes, or similar technologies
Primary skills
- SQL, Python
- PySpark
- AWS cloud platform