Site Reliability Engineer

equifax

Pune 4 Years Exp Posted 551d ago

Job Description

Key Responsibilities:

Kubernetes: Deploy and manage Kubernetes clusters, optimizing for performance and reliability.
Cloud Infrastructure: Build and maintain scalable infrastructure on GCP (or other cloud providers), leveraging automation tools like Terraform.
Observability: Implement monitoring and logging solutions to proactively detect and resolve issues.
Incident Response: Participate in on-call rotations, troubleshooting and resolving production incidents with a focus on minimizing downtime.
Collaboration: Work closely with product development teams to promote reliability best practices and ensure smooth deployments.

Qualifications:

4+ years of experience working with Docker and Kubernetes.
3+ years of experience working with public cloud environments ( GCP preferred)
Programming experience in one or more languages such as Python, Bash, Java, Go, Groovy or similar languages.
Proficiency with continuous integration and continuous delivery (CI/CD) using tools like Jenkins, Git.
2+ years of experience monitoring infrastructure and application performance.
Solid understanding of application design principles and trade-offs.
Knowledge of network infrastructure and security basics (DNS, subnets, firewalls, load balancers).

Bonus Points:

Experience with GCP/GKE .
Certifications in Kubernetes (CKA, CKAD) or cloud certification.

Keywords: Site Reliability Engineer, SRE, Kubernetes, Google Cloud Platform, GCP, Cloud Infrastructure, DevOps, CI/CD, Monitoring, Linux, Automation, Terraform

Site Reliability Engineer

Job Description

Similar Openings for You

Senior Software Test Engineer

Software Engineer III

Technical Lead - Java Backend

Java Fullstack Developer