AI Platform & DevOps Engineer
emergys
Job Description
Core Duties: Manage the development and deployment of AI-related applications, ensuring a robust pipeline.
- Design, implement, and manage container orchestration systems using Docker and Kubernetes (AKS).
- Deploy and manage AI applications on Azure, ensuring scalability and reliability.
- Knowledge on Azure and opensource AI models and usage.
- Knowledge on Security tools like Nexus or Blackduck to maintain a DevSecOps mindset.
- Setup and maintain continuous integration and continuous deployment (CI/CD) pipelines using tools such as Gitlab CI or Jenkins.
- Setup observability tools like langfuse or ELK or DataDog.
- Develop and maintain scripts to automate various deployment and system tasks.
- Manage toolchain installations, including version control systems, artifact repositories, and build tools.
- Collaborate with development and operations teams to ensure a seamless and effective deployment pipeline.
- Monitor and maintain systems to ensure high availability and performance under restrictive network infrastructure.
- Stay informed about new technologies and incorporate them into existing processes when appropriate.
- Report progress, issues, and solutions clearly and concisely to management and stakeholders.
Key Requirements
- Experience: Minimum of 5-10 years of relevant experience in DevOps.
- Containerization: Expertise in Docker, including Docker Swarm setup and use.
- Orchestration Technologies: Proficient in Azure Kubernetes Service (AKS) or other similar orchestration technologies.
- Deployment: Experience with deploying applications on Azure.
- CI/CD Tools: Working knowledge and experience with Gitlab CI and Jenkins for continuous deployment.
- Operating Systems: Proficient in working with Linux operating systems.
- Tool Chain: Experience with installation and configuration of toolchains including Git, Nexus, Maven, Jenkins.
- Scripting: Skilled in writing scripts to automate tasks and workflows (e.g., Bash, Python).
- Build Tools: Experienced with build tools such as Maven, Gradle, or NPM.
- Methodologies: Familiar with Agile-Scrum methodologies.
- Communication: Fluent in English with the ability to clearly communicate complex technical information.
- Learning and Adaptability: Demonstrates a willingness to investigate and learn new technologies quickly.
- Work Under Pressure: Ability to perform effectively under pressure, especially in a restrictive network infrastructure.
Personal Attributes
- Self-Motivated: A proactive and independent worker with a strong drive to succeed.
- Quick Learner: An individual who adapts to new technologies and challenges swiftly.
- Team Player: Works effectively within a team environment, collaborating and contributing towards common goals.