Senior Site Reliability Engineer
Okta
Job Description
What you’ll be doing
- Designing, building, running, and monitoring Okta Workflows’ global production infrastructure
- Evangelizing security best practices and leading initiatives and projects to strengthen our security posture for critical infrastructure
- Responding to production incidents and determining how we can prevent them in the future
- Triaging and troubleshooting complex production issues to ensure reliability and performance
- Identifying and automating manual processes
- Continuously evolving our monitoring tools and platform
- Promoting and applying best practices for building scalable and reliable services across engineering
- Developing and maintaining technical documentation, runbooks, and procedures
- Supporting a highly available, large-scale Kubernetes and AWS environment as part of an on-call rotation
- Serving as a technical subject matter expert (SME) for a team that designs and builds Okta's production infrastructure, focusing on security at scale in the cloud
What you’ll bring to the role
- Are always willing to go the extra mile: see a problem, fix the problem.
- Are passionate about encouraging the development of engineering peers and leading by example.
- Have experience with Kubernetes deployments in AWS and/or GCP cloud environments.
- Have an understanding of and familiarity with configuration management tools like Chef, Terraform, or Ansible.
- Have expert-level abilities in operational tooling languages such as Go and shell, and experience using source control.
- Have knowledge of various types of data stores, particularly PostgreSQL, Redis, and OpenSearch.
- Have experience with industry-standard security tools like Nessus and osquery.
- Have knowledge of CI/CD principles, Linux fundamentals, OS hardening, networking concepts, and the IP protocol suite.
Experience in the following
- 5+ years of experience architecting and running complex AWS or other cloud networking infrastructure resources
- 5+ years of experience with Ansible, Chef, and/or Terraform
- Strong leadership skills
- Strong Linux understanding and experience
- Strong security background and knowledge
- BS in computer science (or equivalent experience)
#LI-Hybrid