GCP Data Engineer
HSBC
Job Description
- Design and deliver batch and streaming ETL/ELT pipelines to ingest data from multiple sources into GCP.
- Gather requirements with stakeholders and translate business needs into scalable data solutions.
- Build Apache Beam pipelines on GCP Dataflow for extraction, transformation, and analytical preparation.
- Integrate data from databases, APIs, and flat files while enforcing data quality, consistency, and governance.
- Manage and optimise GCP storage/warehouse layers (BigQuery, Cloud Storage) for analytics and reporting.
- Orchestrate and automate workflows using Airflow/Cloud Composer; reduce manual operations through automation.
- Implement observability (monitoring/alerting), troubleshoot bottlenecks, and ensure SLA/SLO adherence.
- Lead SRE practices: reliability strategy, incident response, post-mortems, documentation, security/compliance, and continuous improvement.
To be successful in this role, you should meet the following requirements:
- Bachelor’s degree in Computer Science/IT (or equivalent experience).
- Proven ETL/ELT experience: data modelling, data warehousing concepts, and strong SQL.
- Strong GCP experience (certification preferred), especially BigQuery/Cloud Storage/Dataflow.
- Big data batch and streaming expertise (e.g., Kafka, Spark/Flink, Hadoop/Hive/HBase, dbt).
- Proficiency in Java and/or Python for data engineering and data manipulation.
- Proven SRE/DevOps experience: architecture, reliability, automation, and operational excellence.
- Incident management capability: on-call/response, RCA, and post-mortem execution.
- Strong communication and problem-solving skills; experience with AI/LLM, preferably in financial services.