GCP Platform Engineer

fint

Chennai, India 7 Years Exp Posted 1d ago

Job Description

Platform Engineering & Production Ownership

·      Engineer, configure, and operate production-grade GCP data platform services, including:

  1. Dataproc
  2. Dataflow
  3. Cloud Composer (Airflow)
  4. Pub/Sub
  5. Google Cloud Storage (GCS)

·      Own platform configurations to ensure high availability, performance, scalability, and security compliance.

·      Design and maintain Terraform-based Infrastructure-as-Code (IaC) modules for standardized and repeatable platform provisioning.

·      Implement and enforce security and governance controls, including:

  1. IAM least-privilege models
  2. CMEK
  3. VPC Service Controls
  4. Organization policies
  5. Workload Identity

L3 Troubleshooting & Deep Technical Support

·      Act as the L3 escalation point for platform-related issues from data and application engineering teams.

·      Troubleshoot complex production issues, including:

  1. Dataproc / Spark / YARN job failures and performance bottlenecks
  2. Dataflow pipeline backlogs, worker tuning, and throughput issues
  3. Composer / Airflow scheduler failures, DAG dependency issues
  4. Pub/Sub throughput, retention, and delivery behaviour issues

·      Analyse and clearly explain how GCP service configurations impact runtime behaviour, monitoring metrics, performance, and cost.

·      Perform root-cause analysis (RCA) and implement long-term engineering fixes rather than short-term workarounds.

Monitoring, Performance & Reliability Engineering

·      Design and maintain monitoring, alerting, and observability using Cloud Monitoring and logging.

·      Interpret service-level metrics and logs to diagnose:

  1. Performance degradation
  2. Scaling and capacity bottlenecks
  3. Reliability and availability risks

·      Tune platform configurations for optimal performance, reliability, and cost efficiency.

·      Ensure platforms meet production uptime, SLA, and compliance requirements.

Automation, CI/CD & Engineering Tooling

·      Build automation using Terraform, Python, and scripting to minimize manual intervention.

·      Integrate CI/CD pipelines for:

  1. Platform configuration changes
  2. Cloud Composer DAG deployments
  3. Dataflow template promotions

·      Develop reusable frameworks for:

  1. Dependency packaging
  2. Deployment workflows
  3. Environment provisioning
  4. Operational consistency

Platform Standards, Documentation & Enablement

·      Define and enforce platform standards and best practices.

·      Create and maintain:

  1. Platform runbooks
  2. Troubleshooting guides
  3. Onboarding documentation

·      Enable development teams through self-service tooling, templates, and clear usage guidelines.


What This Role Is NOT

To avoid confusion, this role is explicitly not the following:

  • Not an L1 / L2 Operations or Support role – no ticket triage, alert acknowledgment, or routine support tasks.
  • Not a shift-based or NOC role – no 24×7 rotations or follow-the-sun support.
  • Not a DevOps-only role – CI/CD is an enabler, not the primary responsibility.
  • Not a pure Data Engineer role – focus is on platform engineering, not business data transformations.
  • Not a Cloud Administrator role – requires deep understanding of service behaviour, not just provisioning.
    • Not a break-fix role&

Similar Openings for You