Principal Data Software Engineer
greenhouse
Job Description
Platform Architecture & Design
- Design and implement scalable, secure, and efficient data architectures (e.g., data lakehouse, medallion architecture) to support enterprise AI and analytics initiatives.
- Define technical specifications for data ingestion, transformation, storage, and access layers.
- Evaluate and recommend tools, frameworks, and cloud infrastructure to optimize performance, scalability, and cost-efficiency.
- Ensure architectural alignment with organizational AI, analytics, and clinical application goals.
Data Engineering & Operations
- Develop and maintain high-performance data pipelines for both batch and streaming data processing, leveraging tools such as Apache Spark, Databricks or Snowflake.
- Implement deployment and orchestration frameworks (e.g., Airflow, dbt, Spark, Databricks, Snowflake)
- Adhere to engineering best practices for CI/CD, observability, incident response, and SLA adherence
- Optimize data processing systems for performance, reliability, and reusability.
- Mentor engineers to foster technical growth and ensure alignment with best practices in data engineering.
Governance, Quality & Compliance
- Collaborate with data governance and compliance teams to implement data security, encryption, and access control policies in line with HIPAA, GDPR, and other regulations.
- Ensure data quality through validation, lineage tracking, and metadata management practices.
- Support the development and maintenance of data catalogs for discoverability and stewardship
Collaboration & Technical Leadership
- Partner with data engineers, product teams, and clinical informatics specialists to deliver integrated data solutions.
- Provide technical guidance and mentorship to junior engineers, fostering a culture of technical excellence and collaboration.
- Communicate complex technical designs and solutions to both technical and non-technical stakeholders effectively.
- Contribute to agile development practices, participating in code reviews, design discussions, and continuous improvement initiatives.
Innovation & Technical Expertise
- Stay informed about emerging trends in data engineering, cloud-native architectures, and healthcare data standards.
- Prototype and evaluate modern data frameworks and tools (e.g., Delta Lake, Apache Iceberg) to enhance platform capabilities.
- Communicate complex technical concepts to both technical and non-technical stakeholders.
- Advocate for best practices in data platform design to ensure scalability, explainability, and compliance in AI-driven systems.