Manager, AI/ML Ops Engineering
greenhouse
Job Description
You Will:
- Focus on team members and coaching them to play to their strengths, grow and deliver peak performance
- Lead and mentor a team of AI/ML Ops engineers and operations specialists.
- Define and govern solutions for AI/ML operations, ensuring scalability, cost-efficiency, and reliability
- Develop and maintain standardized AI/MLOps workflows, including CI/CD/CT (Continuous Integration/Continuous Delivery/Continuous Training) pipelines
- Delegate and harness the aggregate strength of your team.
- Focuses on individual and team needs to foster a positive culture consistent with Smartsheet values.
- Actively helps individuals and the overall team to set priorities and focus on delivery of commitments
- Review team's designs and provide feedback on deployment safety, resilience, scale, performance, and security
- Lead and facilitate cross-team interactions, communication, and dependencies
- Ensure all changes are fully tested before being deployed
- Ensure deployment plans are well-considered and include appropriate scalability and load tests
- Work with stakeholders to align AI projects with business strategies.
- Ensure all AI/ML Ops solutions adhere to regulatory compliance, security, and ethical guidelines
- Drive Engineering and Operational excellence initiatives
- Perform other duties as assigned
You Have:
- Enterprise SaaS software solutions with high availability and scalability
- Experience building teams through recruiting and retention
- Experience in Leading and Mentoring a team of ML engineers and operations specialists.
- Experience in building and maintaining AI/ML Ops platform systems ensuring scalability, reliability, efficiency and security
- AI/MLOps workflows on Databricks , MLFlow, Mosaic AI Agent Framework, Unity Catalog, Vector Search, Knowledge Graph
- Knowledge of AI/ML frameworks like LangChain, LangGraph for AI/ML Ops pipeline integration
- Cloud Platforms: Hands-on experience with at least one major cloud provider (AWS, Azure, or GCP). Experience in AWS hosted data platform is preferable
- Programming languages like Python and SQL
- Modern software engineering practices like Kubernetes, CI/CD, IAC tools (Preferably Terraform), Observability, monitoring and alerting
- Solution Cost Optimisations and design to cost
- Legally eligible to work in India on an ongoing basis