Senior Software Engineer, RAG and Agentic AI

nvidia

Bengaluru, India 5 Years Exp Posted 53d ago

Job Description

What you’ll be doing:

  • Plan, build and refine a GPU-accelerated, scalable, configurable Retrieval Augmented Generation (RAG) workflow and optimize it for accuracy, relevance, grounding and performance.

  • Design and implement AI agents to enhance RAG pipeline which are capable of reasoning, planning, multi-step execution, and collaboration across tools and services

  • Run fast, high-quality POCs on emerging agent and RAG architectures; harden successful patterns into generalized, reusable implementations and integrate them as part of production software.

  • Build and deploy a disaggregated, end-to-end RAG pipeline using on-prem microservices architecture, orchestrating complex, multi-service deployments from local Docker environments to enterprise-scale Kubernetes clusters.

  • Drive the continuous improvement of the pipelines by rigorously evaluating system accuracy, characterizing performance metrics across components, analyzing the data and recommending actionable strategic enhancements."

  • Collaborate with various teams on new product features and the improvement of existing product. Provide guidance and support to NVIDIA internal teams and external partners on domain-adaptation, customization and integration of the RAG pipeline.

  • Champion engineering excellence by leading rigorous code, architecture, and test plan reviews, authoring robust user documentation, and driving collaborative problem-solving and triage initiatives.

  • Drive software excellence by designing with clean architectural patterns and automating the path to production through advanced CI/CD, testing, and telemetry workflows.

 

What we need to see:

  • 5+ years of professional software engineering experience, with deep expertise in Python, and AI applications.

  • Bachelor's degree or Master’s degree (or equivalent experience) in Computer Science, Electrical Engineering, Data Science, Artificial Intelligence or other related fields

  • Hands-on experience building and deploying LLM-powered AI applications or RAG or Agentic AI workflows.

  • Strong understanding of LLM design patterns, including tool calling, prompt engineering, structured outputs, reasoning.

  • Experience with agent frameworks or orchestration systems such as LangGraph, LangChain, OpenAI Agents SDK, or similar.

  • Have working experience with microservices, Docker, Helm, Kubernetes.

  • Experience with end-to-end software lifecycle, release packaging, and CI/CD pipelines.

  • Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic environment involving teams across the globe.

 

Ways to stand out from the crowd:

  • Experience designing multi-agent systems and sophisticated workflow orchestration engines.

  • Familiarity with evaluation frameworks, MLOps pipelines, and AI observability tooling.

  • Background in deploying AI models on data center, cloud, and embedded systems.

    • Strong python programming skills and experience of working with AI coding agents.

Similar Openings for You