DevOps Engineer
keka
Job Description
Key Responsibilities:
- Build and optimize CI/CD pipelines using GitHub Actions, GitLab CI, Jenkins, or similar.
- Develop and maintain Infrastructure as Code with Terraform (and CloudFormation where needed), including modules, state management, and reusable design.
- Deploy and manage containerized workloads with Docker, Kubernetes, ECS, and EKS.
- Monitor platform health, investigate incidents, and drive improvements in reliability and performance.
- Implement secure cloud configurations: IAM, encryption, secrets management, logging, and network segmentation.
- Support observability using Prometheus, Grafana, CloudWatch, FluentBit, or OpenTelemetry.
- Partner with developers to improve release quality, deployment speed, and operational readiness.
- Continuously raise the bar on automation, documentation, and DevOps practices across the org.
Required Skills
- 5+ years in DevOps, SRE, or cloud infrastructure roles.
- Strong hands-on experience with AWS and cloud-native architecture.
- Solid expertise in Terraform (modules, state, reusable patterns).
- Proven experience designing and running CI/CD pipelines and release automation.
- Working knowledge of Docker, Kubernetes, and container deployment workflows.
- Scripting in Python, Bash, or similar.
- Solid grasp of monitoring, logging, alerting, and incident response.
- Strong security fundamentals: least-privilege access, secrets management, audit logging.
Preferred Experience
- Working in customer cloud accounts or multi-tenant environments.
- Compliance-aligned infrastructure: SOC 2, HIPAA, or HITRUST.
- Hands-on with ECS, EKS, ECR, Security Hub, GuardDuty, and modern observability stacks.
- Certifications such as AWS, CKAD, or other cloud/DevOps credentials.
- Experience in high-growth product or services environments where speed and ownership matter.
What Success Looks Like
- CI/CD pipelines are faster, more reliable, and reduce manual effort across teams.
- Infrastructure is automated, repeatable, and easy to scale across customer environments.
- Production systems show stronger uptime, observability, and faster incident response.
- Engineering teams ship confidently with mature release and operational support.
- Cloud environments meet security and compliance bars without slowing delivery.