Data Engineer
oraclecloud
Job Description
- Core Programming: Strong proficiency in Python, including experience with libraries like Pandas, NumPy, and logging frameworks.
- Big Data: 3+ years of hands-on experience with Apache Spark (PySpark) for distributed data processing.
- GCP Ecosystem: Practical experience with Google Cloud services, specifically:
- BigQuery (Optimization, Partitioning, Clustering).
- Cloud DataProc or Dataflow.
- Cloud Storage (GCS) and Cloud Functions.
- Cloud Composer (Apache Airflow) for orchestration.
- Data Warehousing: Solid understanding of relational databases and SQL (PostgreSQL, MySQL) as well as NoSQL environments.
- DevOps & Tools: Experience with Git, Docker, and CI/CD pipelines. Familiarity with Terraform or other IaC tools is a significant plus.