Site Reliability Engineer

equifax

Pune 1 Years Exp Posted 473d ago

Job Description

What you will do:

  • Kubernetes: Deploy and manage Kubernetes clusters, optimizing for performance and reliability.

  • Cloud Infrastructure: Build and maintain scalable infrastructure on GCP (or other cloud providers), leveraging automation tools like Terraform.

  • Observability: Implement monitoring and logging solutions to proactively detect and resolve issues.

  • Incident Response: Participate in on-call rotations as part of the first responder team, troubleshooting and resolving production incidents with a focus on minimizing downtime.

  • Collaboration: Work closely with product development teams to ensure smooth deployments.

What experience you need:

  • 1+ years of experience working with Containerized environments like Docker and Kubernetes.

  • 1+ years of experience working with public cloud environments (GCP preferred)

  • Programming experience in one or more languages such as Python, Bash, Java, Go, Groovy or similar languages.

  • Proficiency with continuous integration and continuous delivery (CI/CD) using tools like Jenkins, Git.

  • 1+ years of experience monitoring infrastructure and application performance.

  • Knowledge of network infrastructure and security basics (DNS, subnets, firewalls, load balancers).

  • Sound knowledge of application design principles

What can set you apart:

  • Hands-on experience with GCP/GKE.

  • Certifications in Kubernetes (CKA, CKAD) or cloud certification.

Similar Openings for You