Associate Director - DevOps
flutterinternational
Job Description
Responsibilities:
- Lead and manage the DevOps team, providing mentorship, guidance, and support to ensure team success.
- Architect, implement, and maintain highly available, scalable, and secure infrastructure on cloud platforms such as AWS, GCP, or Azure.
- Drive continuous improvement of the CI/CD pipelines, ensuring seamless integration and deployment processes across the organization.
- Collaborate closely with engineering, product, and operations teams to define and implement DevOps strategies that align with business goals.
- Ensure proper monitoring, logging, and alerting systems are in place to proactively detect and resolve issues in production environments.
- Develop and enforce best practices for infrastructure as code (IaC) using tools like Terraform, Ansible, or CloudFormation.
- Lead the design, deployment, and management of Kubernetes clusters, ensuring efficient container orchestration and scalability.
- Ensure robust disaster recovery and business continuity plans are in place and tested regularly.
- Manage cloud costs and optimize infrastructure utilization for maximum efficiency.
- Stay up-to-date with the latest industry trends and emerging technologies in the DevOps space, and advocate for their adoption where appropriate.
- Foster a culture of collaboration, innovation, and continuous learning within the DevOps team and across the organization.
Requirements:
- BE/B. Tech. in Computer Science, Information Technology, or a related field.
- 10+ years of experience in DevOps, with at least 3+ years in a leadership role.
- Strong experience with cloud platforms such as AWS, GCP, or Azure, including architecture and cost management.
- Expertise in CI/CD tools such as Jenkins, GitLab CI, or CircleCI.
- Extensive experience with containerization technologies, particularly Docker and Kubernetes, including orchestration and cluster management.
- Proficiency in scripting languages like Python, Bash, or Ruby.
- Experience with infrastructure as code (IaC) tools such as Terraform, Ansible, or CloudFormation.
- Strong knowledge of monitoring and logging tools such as Prometheus, Grafana, ELK Stack, or Datadog.
- Excellent problem-solving skills, with the ability to troubleshoot complex issues in distributed systems.
- Strong communication and interpersonal skills, with the ability to work effectively in a collaborative environment.
- Experience in a fast-paced, agile development environment, preferably in a gaming or technology-driven company.