Sr. DevOps Engineer
BMC
Job Description
• Drive rollback strategies, release validation, and version management for complex, continuously deployed SaaS systems.
• Design and implement robust CI/CD pipelines with strong automated test coverage at unit, integration, system, and end-to-end levels.
• Establish automation-first practices for environment provisioning, test execution, deployment, monitoring, and recovery.
• Lead debugging and troubleshooting efforts for distributed systems in production, partnering closely with development teams.
• Build and evolve cloud infrastructure using infrastructure-as-code, ensuring scalability, security, and cost efficiency.
• Champion SRE principles: SLOs, SLIs, error budgets, resilience testing, and operational metrics.
• Act as the point of accountability for production health, escalating risks early and driving corrective action.
To ensure you’re set up for success, you will bring the following skillset & experience:
• 8+ years of experience building and operating SaaS products in production environments.
• Strong hands-on background in SRE, DevOps, and Cloud Engineering (AWS, Azure, or GCP).
• Demonstrated experience handling production incidents, rollbacks, postmortems, and long-term reliability improvements.
• Deep experience with CI/CD systems and automated testing frameworks, with a strong bias toward test automation over manual processes.
• Proven ability to design and maintain automation and integration tests for distributed cloud-native systems.
• Experience managing multiple active versions of SaaS products in production.
• Strong debugging skills across services, infrastructure, and pipelines.
• Practical experience with containers, Kubernetes, and infrastructure-as-code tools.
• Ability to operate effectively in fast-moving environments with real production consequences.
• Direct, accountable leadership style with comfort owning operational outcomes.
• Bachelor’s or master’s degree in Computer Science, Engineering, or a related field.