Senior Cloud Site Reliability Engineer
zs
Job Description
- 3+ years’ experience working as a Site Reliability Engineer or an equivalent position
- 2+ years’ experience with AWS cloud technologies and at least one AWS certifications is required (Solution Architect / DevOps Engineer)
- 1+ years’ experience functioning as a senior member in an infrastructure/software team
- Hands-on experience with AWS services like EC2, RDS, EMR, CloudFront, ELB, API Gateway, CodeBuild, AWS Config, Systems Manager, Service Catalog, Lambda, etc.
- Full-stack IT experience with *nix, Windows, network/firewall concepts, source control (BitBucket) and build/dependency management and continuous integration systems (TeamCity, Jenkins)
- Expertise in at least one scripting language, Python preferred
- Must have firm understanding of application reliability, performance tuning and scalability
- Exposure to big data technologies (Spark, Hadoop, Scala, etc.) stack is preferred
- Solid knowledge of infrastructure and cloud-native services along with network technologies
- Solid understanding of RDBMS and Cloud Database engines like Postgres SQL, MySQL etc.
- Firm understanding of Clusters, Load balancers and CDN
- Experience in fault-tolerant system design
- Familiarity with Splunk data analysis, Datadog or similar tools is a plus
- A Bachelor’s degree (Master’s preferred) in a related technical field
- Excellent analytical, troubleshooting and communication skills
- Possess strong verbal, written and team presentation communication skills. ZS is a global firm; fluency in English is required
- This role requires healthy doses of initiative and the ability to remain flexible and responsive in a very dynamic environment
- Ability to quickly learn new platforms, languages, tools, and techniques as needed to meet project requirements