DevOps/ML Engineer AVP
db
Job Description
What we’ll offer you
As part of our flexible scheme, here are just some of the benefits that you’ll enjoy:
- Best-in-class leave policy
- Gender-neutral parental leave
- 100% reimbursement under childcare assistance benefit (gender-neutral)
- Sponsorship for industry-relevant certifications and education
- Employee Assistance Program for you and your family members
- Comprehensive Hospitalization Insurance for you and your dependents
- Accident and term life insurance
- Complimentary health screening for those aged 35 and above
Your key responsibilities
- Design, implement, and maintain our team’s infrastructure and workflows on Google Cloud Platform (GCP), using services such as Google Kubernetes Engine (GKE), Cloud Storage, Vertex AI, Anthos, and Monitoring.
- Design, implement, and maintain our containerization and orchestration strategy using Docker and Kubernetes. Collaborate with development teams to ensure seamless integration of containerized applications into our production environment.
- Collaborate with software developers to integrate machine learning models and algorithms into our products, using PyTorch, TensorFlow or other machine learning frameworks.
- Develop and maintain CI/CD pipelines for our products, using tools such as GitHub and GitHub Actions.
- Create and maintain Infrastructure as Code templates using Terraform.
- Ensure the reliability, scalability, and security of our infrastructure and products, using monitoring and logging tools such as Anthos Service Mesh (ASM) and Google Cloud's operations suite (GCO).
- Work closely with other teams, such as software development, data science, and product management, to identify and prioritize infrastructure and machine learning requirements.
- Stay up to date with the latest developments in Google Cloud Platform and machine learning, and apply this knowledge to improve our products and processes.
Your skills and experience:
- Bachelor’s degree in Computer Science, Engineering, or a related field.
- At least 3 years of experience in a DevOps or SRE role, with a focus on Google Cloud Platform.
- Strong experience with Infrastructure as Code tools such as Terraform or CloudFormation.
- Experience with containerization technologies such as Docker and container orchestration tools such as Kubernetes.
- Knowledge of machine learning frameworks such as TensorFlow or PyTorch.
- Experience with CI/CD pipelines and automated testing.
- Strong understanding of security and compliance best practices, including GCP security and compliance features.
- Excellent communication and collaboration skills, with the ability to work closely with cross-functional teams.
Preferred Qualifications:
- Master’s degree in Computer Science, Engineering, or a related field.
- Knowledge of cloud-native application development, including serverless computing and event-driven architecture.
- Experience with cloud cost optimization and resource management.
- Familiarity with agile software development methodologies and version control systems such as Git.