GCP Data Engineer

hsbc

pune NM Years Exp Posted 1h ago

Job Description

  • Design and deliver batch and streaming ETL/ELT pipelines to ingest data from multiple sources into GCP.
  • Gather requirements with stakeholders and translate business needs into scalable data solutions.
  • Build Apache Beam pipelines on GCP Dataflow for extraction, transformation, and analytical preparation.
  • Integrate data from databases, APIs, and flat files while enforcing data quality, consistency, and governance.
  • Manage and optimise GCP storage/warehouse layers (BigQuery, Cloud Storage) for analytics and reporting.
  • Orchestrate and automate workflows using Airflow/Cloud Composer; reduce manual operations through automation.
  • Implement observability (monitoring/alerting), troubleshoot bottlenecks, and ensure SLA/SLO adherence.
  • Lead SRE practices: reliability strategy, incident response, post-mortems, documentation, security/compliance, and continuous improvement.

To be successful in this role, you should meet the following requirements:

  • Bachelor’s degree in Computer Science/IT (or equivalent experience).
  • Proven ETL/ELT experience: data modelling, data warehousing concepts, and strong SQL.
  • Strong GCP experience (certification preferred), especially BigQuery/Cloud Storage/Dataflow.
  • Big data batch + streaming expertise (e.g., Kafka, Spark/Flink, Hadoop/Hive/HBase, dbt).
  • Proficiency in Java and/or Python for data engineering and data manipulation.
  • Proven SRE/DevOps experience: architecture, reliability, automation, and operational excellence.
  • Incident management capability: on-call/response, RCA, and post-mortem execution.
    • Strong communication and problem-solving; experience with AI/LLM (preferably financial services).

Similar Openings for You