Site Reliability Engineer
cisco
Job Description
Responsibilities:
- Setup and maintain monitoring, alerting and logging system to track health and performance of identity service platform
- Using data and telemetry to improve feature work and propose feature improvements
- Drive cost optimization efforts
- Respond to incidents and outages, troubleshoot issues and implement solutions to restore service
Minimum Qualifications:
- 3+ years of experience as Cloud Engineer, DevOps Engineer, SRE, Software Engineer or Systems Engineer
- At least 1 year programming with any of the following: Go, Python, Java, Bash, Linux Shell or similar languages
- At least 1 year experience working with container technologies such as Docker, Kubernetes etc.
- Experience with observability platforms like Splunk, Grafana, Prometheus
Preferred Qualifications:
- Cloud computing and working with cloud providers (AWS, GCP, Azure)
- Experience with Source control and continuous integration tools like Git, Jenkins
- Ability and the passion to learn new things quickly