Site Reliability Engineer
mokahr
Job Description
- Work hands-on with developers to deploy applications and provide support
- Build new features to improve the platform's stability and updates
- Manage our Kubernetes clusters on-prem and in the cloud to support our growing workloads
- Participate in the architecture design process and troubleshoot live applications with the product teams
- Participate in a 24x7 on-call rotation
- Influence architectural decisions with a focus on security, scalability, and high performance
- Set up and maintain monitoring, metrics, and reporting systems for fine-grained observability and actionable alerting
- Author technical documentation for workflows, processes, and best practices