Lead Data Engineer
darwinbox
Job Description
Leading and mentoring a team of Data Engineers
Data Pipeline Development:
Design, implement, and optimize ETL/ELT pipelines for structured and unstructured data.
Integrate data from multiple internal and external sources into centralized systems.
Build scalable batch and real-time data processing workflows.
Data Infrastructure & Architecture:
Develop and maintain data lake, data warehouse, or data mesh architectures.
Ensure high availability, performance, and scalability of data systems.
Implement data modeling best practices to support analytics and ML use cases.
Software Engineering:
Write clean, efficient, and maintainable code in Python, Java, or Scala.
Implement unit tests, integration tests, and CI/CD pipelines for data workflows.
Apply software engineering principles to build reusable data services and APIs.