JD LLM DevOps Engineer
ABB
Job Description
Your role and responsibilities
In this role, you will have the opportunity to develop and enhance complete and sizable software modules in the assigned software engineering function, in platform management, application management, or both. Each day, you will execute assigned design and development activities focused on building solutions in an efficient and cost-effective manner and in accordance with quality standards. You will also showcase your expertise by providing accurate project schedule estimates and ensuring their successful completion within the deadline.

The work model for the role is: #LI-Onsite

This role contributes to the Process Automation business, in the Process Automation Digital division based in Bangalore, India.

You will be mainly accountable for:
- Designing, implementing, and managing CI/CD pipelines for deploying large language models (LLMs).
- Automating the deployment and scaling of AI models on cloud platforms.
- Monitoring and maintaining the health and performance of AI infrastructure.
- Collaborating with data scientists and machine learning engineers to streamline model integration and deployment.
- Implementing security best practices for AI model deployment and data handling.
- Optimizing cloud resource usage to ensure cost-effective operations.
- Troubleshooting and resolving issues related to AI model deployment and infrastructure.
- Staying updated with the latest DevOps tools and practices.
Qualifications for the role
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- 6+ years of proven experience in DevOps engineering, with a focus on AI and machine learning models.
- Strong proficiency in scripting languages such as Python, Bash, or PowerShell.
- Experience with cloud platforms (e.g., Azure, AWS, Google Cloud).
- Solid understanding of CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI).
- Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills.
- Ability to work in a fast-paced and dynamic environment.
- Primary Skills: Python with Machine Learning, Optimization techniques, NLP, Deep Learning techniques
- Secondary Skills: SQL, MongoDB, Flask development
- Additional: Azure deployment, Kubernetes, ML with Spark Compute
Preferred qualifications
- Experience with large language models (LLMs) and natural language processing (NLP).
- Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, Ansible).
- Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana).
- Experience with version control systems (e.g., Git).