Site Reliability Engineer
Siemens
Job Description
What You’ll Be Doing
- Drive quality accountability within the organization with well-defined processes, metrics, and goals for process quality.
- Own effective post-mortems and ensure action items are followed up
- Manage availability, latency, scalability, and efficiency of application development
- Instill engineering reliability into the life cycle with a focus on fault-tolerant approaches
- Drive capacity planning, performance analysis, instrumentation, and other systems requirements
- Define and report "progress" on strategic initiatives and project-level tasks
- Communicate frequently and effectively with stakeholders
- Implement metrics-driven processes to ensure service quality targets are met
- Innovate with a cloud-forward mindset to improve durability and reliability and to drive cost-saving initiatives.
- Effectively plan projects in stages and versions reflecting future needs and improvements
- Focus on "DevOps as a Service" initiatives where the service both addresses process inefficiencies and provides a complete solution that can be easily adopted
- Push for the adoption of new tools by making the case for increased velocity
- Educate and promote knowledge both inside and outside the DevOps team
- Serve as a subject-matter expert (SME) with a core focus on new developments in cloud services and technologies
- Clearly communicate progress, challenges, timelines, requests, potential tasks, and areas for improvement
What You Need
- Bachelor’s or Master’s degree in computer science or a related discipline, or equivalent work experience
- 4+ years of demonstrated experience
- 3+ years of cloud administration
- 2+ years of CI/CD
- 2+ years of coding and scripting
- Experience with most (but not necessarily all):
  - AWS
  - Jenkins, TeamCity, or similar
  - Ansible, Terraform, CloudFormation, CDK, SAM
  - PowerShell, Bash, C#, Python, Go
  - SQL Server, Oracle, PostgreSQL
  - Bitbucket, GitHub, SVN
  - DataDog, ELK, CloudWatch, SentryOne
- Knowledge of cloud administration, automation, deployment practices, databases, SQL, programming and scripting, design patterns, and SOLID principles.
- Demonstrated experience working in large, complex systems environments.
- Experience in a mature CI/CD SDLC environment, including implementing exception handling, logging, monitoring, and performance measurement, plus knowledge of operational metrics.
- Strong communication, partnership, teamwork, and influencing skills that can close gaps in time and distance.
- Excellent time management, communication, decision-making, presentation, and organizational skills.
- Demonstrated knowledge and depth in cloud services.
- A DevOps mindset.