Senior Site Reliability Engineer
Okta
Job Description
What you’ll be doing
- Design, build, maintain and deploy tools that allow Okta’s engineers to execute infrastructure production changes and deploy code.
- Manage multiple environments spanning a globally distributed infrastructure.
- Improve environment visibility and management in a repeatable and automatable way.
- Collaborate with all engineering and operations teams to improve overall product health and reliability.
- Respond to production incidents and determine how we can prevent them in the future.
- Triage and troubleshoot complex production issues to ensure reliability and performance.
- Design and build scalable and extensible platforms/services/tools in Java, Python, Go with a focus on automation and reliability.
- Work cross-functionally with Operations and Product teams to identify bottlenecks and manual processes, and build solutions that address them with scale and reliability.
- Leverage industry best practices in infrastructure, automation, and orchestration to explore greenfield opportunities that will form the basis of future infrastructure improvements.
- Identify areas for automation that are self-serviceable to reduce manual onboarding. Develop tools and processes to address these areas.
- Improve the security posture of team-owned services and infrastructure. This involves maintaining base images, updating hosts with newer library versions from vendors, and updating services with vulnerability-free libraries as vulnerabilities are identified.
What we are looking for
- 5+ years of experience with Java, Go, Python, or similar backend languages
- 5+ years of experience building, maintaining and debugging services, internal tools and frameworks
- 3+ years of experience automating and deploying large-scale production services in AWS, GCP, or similar. Experience with deployment pipelines such as Jenkins or Spinnaker is strongly preferred.
- 3+ years of hands-on experience working with Kubernetes, with a good understanding of Kubernetes fundamentals