Staff Site Reliability Engineer
servicenow
Job Description
Job Description
What you get to do in this role:
- Provide relief and sustainable resolution to issues within our infrastructure.
- Use your experience in software development, systems engineering, and networking to proactively prevent repeatable issues.
- Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design.
- Drive a culture of intolerance to manual activity which results in a highly automated environment delivering scalable solutions.
- Drive monitoring and automation initiatives
Qualifications
To be successful in this role you have:
- Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving. This may include using AI-powered tools, automating workflows, analyzing AI-driven insights, or exploring AI's potential impact on the function or industry.
- Deep knowledge of Linux systems
- 10+ years Coding in various languages; we normally prefer Python, JavaScript, and Ruby
- 5+ Years’ experience with DevOps automation, CI/CD pipeline and agile methodologies
- 5+ Years’ experience with Cloud technologies, preferably Azure
- Expertise in Observability and Monitoring of applications, services, and networks at scale
- MySQL database administration, troubleshooting, and performance tuning
- Networking skills, IP addressing and routing.
- Team-first attitude and an uncompromising attention to detail.
- Good collaboration and communication skills
- Ability to work in shifts that cover one weekend day.
- Experience developing on the ServiceNow Platform is a bonus