Site Reliability Engineer
smartrecruiters
Job Description
- Monitor and improve the availability, performance and security of production services
- Apply preventive measures to improve the reliability of production services
- Mitigate issues on production systems and build automated solutions to prevent them from recurring
- Enhance and feed the monitoring system to improve service reliability, and provide other teams at CyberArk with dashboards that help them deliver excellent service to our customers
- Automate common, repeatable tasks using Ansible and scripting languages
- Triage and manage escalation of cases
- Perform deliberate and structured troubleshooting
- Share the on-call rotation and act as an escalation contact for incidents
- Influence the design and architecture of services to proactively prevent system failures