Site Reliability Engineer - SRE
Cisco
Job Description
Your Impact
- Develop full-fledged software tooling to deliver programmable infrastructure (infrastructure as code)
- Develop tooling to drive end-to-end microservices monitoring and management
- Implement Kubernetes compliance and best practices for security, audits, network policies, and reporting
- Develop a self-service Console to provide infrastructure visibility
- Manage the availability, scalability, and performance of the platform's infrastructure
- Create tools and infrastructure leveraged by the rest of the engineering teams
- Turn other engineering teams' application development bottlenecks into opportunities to automate and scale the tooling of the platform's infrastructure
- Create and maintain continuous integration and continuous deployment (CI/CD) environments for scaling SaaS applications to multi-region and multi-cloud patterns
Minimum Qualifications
- BS/MS in Computer Science or related area
- 7 or more years of relevant work experience
- Hands-on experience working with Kubernetes infrastructure in AWS
- Excellent understanding of container networking and microservices architecture
- Experience with VM hosting in AWS