Staff Data Scientist R&D
avathon
Job Description
What You’ll Do
- Build and fine-tune SLMs/LLMs for production use cases such as semantic search, forecasting, contract intelligence, and conversational insights. Optimize models for performance, cost, and latency using PEFT, quantization, and efficient inference techniques
- Embed supply chain knowledge into models by working closely with domain experts and product teams
- Enable agentic workflows, allowing AI Assistants to execute planning, optimization, and decision-support tasks
- Integrate models into products using custom Lambdas, and a Graph-based microservices architecture
- Apply vector search and semantic reasoning to help customers navigate complex supply chain relationships
- Work with real enterprise data—ERP, logistics, inventory, supplier, and contract datasets
- Deploy and operate models using strong MLOps practices on platforms like AWS, GCP or AZURE
- Measure and improve quality, reducing hallucinations and improving reliability through continuous evaluation
- Partner with engineering, product, and operations teams to ship impactful AI features end-to-end
What We’re Looking For
- Experience working with SLMs (Mistral, Phi, Nemotron, Llama variants), applying parameter-efficient fine-tuning (LoRA/QLoRA) to optimize task-specific natural language query answering with high accuracy and minimized hallucinations.
- Strong understanding of supply chain data and workflows (logistics, inventory, procurement, ERP systems)
- Familiarity with Knowledge Graphs and graph-based data modeling, including designing and querying Neo4j graphs for relationship-driven insights.
- Solid hands-on skills in Python, PyTorch/TensorFlow, GraphQL, and vector databases
- 8+ years of experience in forecasting, optimization, supply chain analytics, or AI-driven products
- Experience designing Retrieval-Augmented Generation pipelines using document chunking, metadata filtering, hybrid (dense + sparse) retrieval, and SLM/LLM grounding for enterprise knowledge search.
- Hands-on experience building and tuning FAISS-based vector indexes (IVF, HNSW, PQ) for large-scale semantic similarity search with low-latency production deployment
- Experience taking models from prototype to production
- Bonus: reinforcement learning, MARL, graph neural networks, or probabilistic modeling
- Clear communication skills and a strong sense of product ownership