Senior Site Reliability Engineer
Baker Hughes
Job Description
As a Senior Site Reliability Engineer, you will be responsible for:
- Demonstrating best practices in Cloud DevOps development, along with a willingness to continually learn Cloud-native technologies.
- Following security guidelines to develop secure and compliant Cloud services by working with Risk and Security teams.
- Monitoring configuration management, platform layout, and hosting infrastructure.
- Automating deployment of applications and infrastructure
- Working independently and in a team environment, managing a range of customers and technical situations.
- Providing technical application support for enterprise-level systems
- Running our infrastructure with Chef, Ansible, Terraform, GitHub CI/CD, and Kubernetes
- Participating in capacity planning, system performance monitoring, resource utilization trending, and incident and change management.
- Coordinating with Cloud infrastructure partners on server, network, database, and other service-related incidents and projects
- Deploying application upgrades/patches in production and test environments
- Troubleshooting application alerts and Azure and AWS policy issues using monitoring tools and code inspection, and performing root cause analyses (RCAs)
- Creating tutorials, how-to videos, knowledge base articles, and other technical articles for the customer community, and keeping them up to date
- Working on critical, complex customer problems that may span multiple services
- Participating in 24x7 on-call rotation and working with global teams
- Collaborating with cross-functional stakeholders
- Providing mentorship and guidance to team members
- Ensuring security best practices are integrated into the development lifecycle, including compliance with data protection regulations.
- Collaborating with stakeholders to understand requirements, set priorities, and communicate progress and challenges.