Senior SRE Engineer
harman
Job Description
Description & Requirements
Experience: 6-8 years
Responsibilities:
- Setup Monitoring and observability for the system
- Take lead on complex incidents and provide deep technical expertise to resolve issues quickly.
- Perform RCA indepth for incident management and suggest permanent fix
- Design and implement automation for high-availability systems and fault-tolerant architectures
- Implement scalable and reliable infrastructure solutions
- Participate in design reviews focusing on reliability and scalability
- Propose efficient with DevOps Skills of Continuous Integration and Continuous Delivery/Change Delivery Setup (CI/CD)
- Collaborate with security, compliance, and other teams to ensure adherence to best practices
- Drive the documentation of processes and best practices for reliability
- Define, track, and report on SLOs and SLIs for critical service
Skills:
- Strong CI/CD skills (Jenkins, Ansible)
- Good experience with Monitoring tools like Datadog
- Strong experience with Python/Shell
- Hands on experience with ITSM tools like ServiceNow, JIRA
- Good experience in any Cloud (AWS/Azure)
- Good experience with Infrastructure Solutions like Terraform