Staff Data Scientist R&D

avathon

Bengaluru, India 8 Years Exp Posted 36d ago

Job Description

What You’ll Do

Build and fine-tune SLMs/LLMs for production use cases such as semantic search, forecasting, contract intelligence, and conversational insights. Optimize models for performance, cost, and latency using PEFT, quantization, and efficient inference techniques

Embed supply chain knowledge into models by working closely with domain experts and product teams

Enable agentic workflows, allowing AI Assistants to execute planning, optimization, and decision-support tasks

Integrate models into products using custom Lambdas, and a Graph-based microservices architecture

Apply vector search and semantic reasoning to help customers navigate complex supply chain relationships

Work with real enterprise data—ERP, logistics, inventory, supplier, and contract datasets

Deploy and operate models using strong MLOps practices on platforms like AWS, GCP or AZURE

Measure and improve quality, reducing hallucinations and improving reliability through continuous evaluation
Partner with engineering, product, and operations teams to ship impactful AI features end-to-end

What We’re Looking For

Experience working with SLMs (Mistral, Phi, Nemotron, Llama variants), applying parameter-efficient fine-tuning (LoRA/QLoRA) to optimize task-specific natural language query answering with high accuracy and minimized hallucinations.

Strong understanding of supply chain data and workflows (logistics, inventory, procurement, ERP systems)

Familiarity with Knowledge Graphs and graph-based data modeling, including designing and querying Neo4j graphs for relationship-driven insights.

Solid hands-on skills in Python, PyTorch/TensorFlow, GraphQL, and vector databases

8+ years of experience in forecasting, optimization, supply chain analytics, or AI-driven products

Experience designing Retrieval-Augmented Generation pipelines using document chunking, metadata filtering, hybrid (dense + sparse) retrieval, and SLM/LLM grounding for enterprise knowledge search.

Hands-on experience building and tuning FAISS-based vector indexes (IVF, HNSW, PQ) for large-scale semantic similarity search with low-latency production deployment

Experience taking models from prototype to production

Bonus: reinforcement learning, MARL, graph neural networks, or probabilistic modeling

Clear communication skills and a strong sense of product ownership

Similar Openings for You

Data Engineer

ibm • Hyderabad

Lead Data Engineer

thermofisher • Bangalore

Data Engineering

ripplehire • Trivandrum

Senior Data Engineer

ups • Chennai, India