GCP Platform Engineer
fint
Job Description
Platform Engineering & Production Ownership
· Engineer, configure, and operate production-grade GCP data platform services, including:
- Dataproc
- Dataflow
- Cloud Composer (Airflow)
- Pub/Sub
- Google Cloud Storage (GCS)
· Own platform configurations to ensure high availability, performance, scalability, and security compliance.
· Design and maintain Terraform-based Infrastructure-as-Code (IaC) modules for standardized and repeatable platform provisioning.
· Implement and enforce security and governance controls, including:
- IAM least-privilege models
- CMEK
- VPC Service Controls
- Organization policies
- Workload Identity
L3 Troubleshooting & Deep Technical Support
· Act as the L3 escalation point for platform-related issues from data and application engineering teams.
· Troubleshoot complex production issues, including:
- Dataproc / Spark / YARN job failures and performance bottlenecks
- Dataflow pipeline backlogs, worker tuning, and throughput issues
- Composer / Airflow scheduler failures, DAG dependency issues
- Pub/Sub throughput, retention, and delivery behaviour issues
· Analyse and clearly explain how GCP service configurations impact runtime behaviour, monitoring metrics, performance, and cost.
· Perform root-cause analysis (RCA) and implement long-term engineering fixes rather than short-term workarounds.
Monitoring, Performance & Reliability Engineering
· Design and maintain monitoring, alerting, and observability using Cloud Monitoring and logging.
· Interpret service-level metrics and logs to diagnose:
- Performance degradation
- Scaling and capacity bottlenecks
- Reliability and availability risks
· Tune platform configurations for optimal performance, reliability, and cost efficiency.
· Ensure platforms meet production uptime, SLA, and compliance requirements.
Automation, CI/CD & Engineering Tooling
· Build automation using Terraform, Python, and scripting to minimize manual intervention.
· Integrate CI/CD pipelines for:
- Platform configuration changes
- Cloud Composer DAG deployments
- Dataflow template promotions
· Develop reusable frameworks for:
- Dependency packaging
- Deployment workflows
- Environment provisioning
- Operational consistency
Platform Standards, Documentation & Enablement
· Define and enforce platform standards and best practices.
· Create and maintain:
- Platform runbooks
- Troubleshooting guides
- Onboarding documentation
· Enable development teams through self-service tooling, templates, and clear usage guidelines.
What This Role Is NOT
To avoid confusion, this role is explicitly not the following:
- Not an L1 / L2 Operations or Support role – no ticket triage, alert acknowledgment, or routine support tasks.
- Not a shift-based or NOC role – no 24×7 rotations or follow-the-sun support.
- Not a DevOps-only role – CI/CD is an enabler, not the primary responsibility.
- Not a pure Data Engineer role – focus is on platform engineering, not business data transformations.
- Not a Cloud Administrator role – requires deep understanding of service behaviour, not just provisioning.
- Not a break-fix role&