DevOps Engineer – Product Development
ibm
Job Description
- Design, implement, and maintain scalable and secure CI/CD pipelines that support development, testing, staging, and production environments.
- Work closely with software engineering teams to enable seamless integration, continuous delivery, and faster release cycles.
- Automate build, deployment, and configuration management using industry-standard tools and frameworks (e.g., Jenkins, GitHub Actions, ArgoCD, Helm, Terraform).
- Ensure environment consistency and reliability through infrastructure-as-code (IaC) practices and containerized deployments.
- Monitor and optimize system performance, availability, and scalability across distributed environments.
- Develop and implement tools and frameworks for automated deployment, testing, observability, and incident response.
- Collaborate with developers to embed defect prevention, security, and compliance into the delivery pipeline.
- Enable A/B testing, canary releases, and blue-green deployments for safe and controlled rollouts.
- Partner with QA and development teams to ensure test automation integration within the CI/CD pipeline.
- Contribute to incident management and post-mortem analysis, driving continuous improvement in resilience and operational efficiency.
- Maintain monitoring, logging, and alerting systems to ensure proactive detection and resolution of issues.
Required education
Bachelor's Degree
Preferred education
Master's Degree
Required technical and professional expertise
Required Skills and Experience
- Strong hands-on experience with CI/CD tools such as Jenkins, GitHub Actions, GitLab CI, or ArgoCD.
- Expertise in containerization and orchestration using Docker and Kubernetes.
- Experience implementing infrastructure as code using Terraform, Ansible, or Helm.
- Proficiency in Linux system administration, scripting (Bash, Python, or Go), and environment management.
- Familiarity with build automation tools such as Maven, Gradle, or Ant.
- Experience with cloud platforms (AWS, Azure, or IBM Cloud) and understanding of cloud-native architectures.
- Hands-on experience in monitoring and observability stacks (Prometheus, Grafana, ELK/EFK, OpenTelemetry).
- Strong understanding of networking fundamentals, load balancing, and security best practices.
- Experience with source control and versioning tools (Git, GitHub, Bitbucket).
- Proven ability to troubleshoot performance and deployment issues in complex distributed systems.
Preferred technical and professional experience
Preferred Skills
- Experience with service mesh frameworks (Istio, Linkerd) and modern deployment strategies (e.g., canary, blue-green).
- Exposure to SRE principles — SLIs, SLOs, error budgets, and reliability metrics.
- Knowledge of chaos engineering or resilience testing tools.
- Familiarity with cost optimization and resource scaling in cloud environments.
- Prior experience in product-oriented DevOps environments, supporting large-scale SaaS or platform engineering teams.