Synapse XTL - Data Scientist/AI Engineer
hirist
Job Description
Key Responsibilities :
- LLM Development & Integration : Build, fine-tune, and deploy LLM-based applications using OpenAI (GPT-4o), Anthropic (Claude), and Google Gemini APIs.
- Agentic AI Systems : Design and implement multi-agent workflows using LangChain and LangGraph for complex analytical procedures, risk analysis, and financial document processing.
- Document Intelligence : Develop and maintain OCR pipelines, PDF extraction, and entity extraction systems for financial documents (balance sheets, income statements, audit workpapers).
- API Development : Build and maintain RESTful APIs using FastAPI and Flask for AI microservices.
- Frontend Integration : Collaborate with frontend teams and build demo interfaces using Gradio and Streamlit.
- Cloud Infrastructure : Deploy and manage AI applications on AWS (Elastic Beanstalk, S3, Lambda, Textract).
- MLOps & Monitoring : Implement observability using LangFuse, New Relic, and Sentry for LLM tracing and performance monitoring.
- Code Quality : Write clean, tested, and production-ready Python code following best practices.
Required Technical Skills :
Core Python & AI/ML Frameworks :
- Strong proficiency in Python 3.10+.
- Hands-on experience with LangChain, LangGraph, and LangChain Hub.
- Working knowledge of OpenAI API, Anthropic Claude API, and Google Generative AI.
- Experience with Pydantic for data validation and schema modeling.
Document Processing & OCR :
- Familiarity with OCR libraries : Tesseract, EasyOCR, AWS Textract.
- PDF processing with PyMuPDF, PyPDF2, pdf2image, pdfplumber.
- Document parsing with Unstructured, MarkItDown, python-docx.
Web Frameworks & APIs :
- FastAPI and Flask for building production APIs.
- Uvicorn and Gunicorn for ASGI/WSGI servers.
- RESTful API design and implementation.