Lead Software Engineer - DevOps, Python, Azure
blueyonder
Job Description
What you’ll do:
- Design, develop, and maintain backend services and cloud-native applications.
- Build and automate CI/CD pipelines for both application code and AI Foundry models/agents.
- Automate cloud infrastructure provisioning using Terraform.
- Manage Azure AI Foundry deployments, including model endpoints, agent workflows, service configurations, and lifecycle tracking.
- Deploy containerized workloads on Kubernetes (AKS) and manage Helm-based releases.
- Implement monitoring, alerting, and observability for applications, Foundry agents, and pipelines.
- Perform debugging, root-cause analysis, and environment support for production systems.
- Integrate automated tests and validation steps into CI/CD pipelines.
- Collaborate with product, AI, and engineering teams to build scalable and reliable solutions.
- Maintain strong coding standards, security best practices, and version control discipline.
- Participate in Agile ceremonies and contribute to continuous improvement.
- Identify performance, cost, and security optimization opportunities across cloud and AI workloads.
What we are looking for:
- Education: Bachelor’s degree in Computer Science, Engineering, or a related discipline (or equivalent hands-on experience)
- Years of Experience: 5–7 years of combined DevOps and development experience
Experience in:
- Azure Cloud and Azure AI Foundry (model deployment, agent operations, flow orchestration)
- Microservices development (Java/Python/Node.js)
- CI/CD automation (GitHub Actions)
- Kubernetes, Docker, container-based deployments
- Infrastructure as Code using Terraform
- REST API design and integration
- SQL and distributed data platforms (Snowflake preferred)
Expertise in:
- Azure AI Foundry workspace setup, model region support, model/agent lifecycle
- Git branching strategies, deployment governance
- Observability frameworks: Prometheus, Grafana/Elastic
- Secure development practices and secret management (Azure Key Vault)
- System debugging, performance tuning, and release automation