Data Engineer
Job Description:
- Design and deploy automated ETL/ELT pipelines to ingest data from various sources (APIs, databases, logs) into cloud data warehouses.
- Maintain cloud storage solutions (data lakes and data warehouses).
- Monitor and tune query performance and cloud resource consumption to manage costs.
Technical Requirements:
- Cloud Platforms: AWS (Redshift, S3, Glue), Azure (Data Factory, Synapse), or GCP (BigQuery, Dataflow).
- Languages: Advanced SQL (essential) and Python.
- Data Processing: Apache Spark, Flink, or Kafka for real-time streaming.
- Orchestration: Apache Airflow, dbt, or Prefect.