Java Fullstack + AI _D-2662

allianz

pune 5 Years Exp Posted 22d ago

Job Description

KEY RESPONSIBILITIES

  • Design, develop, and enhance backend services and APIs using Java and Spring Boot.
  • Build high-throughput and resilient event-driven components using Kafka.
  • Work with MongoDB or other NoSQL databases to design efficient data models and optimize queries.
  • Integrate Large Language Models (LLMs) such as OpenAI, Claude, or Gemini into backend services via REST APIs and SDKs.
  • Build and maintain AI-powered microservices including RAG (Retrieval-Augmented Generation) pipelines, semantic search, and document intelligence features.
  • Develop and expose AI agent workflows using frameworks such as LangChain4j, Spring AI, or similar Java-native AI toolkits.
  • Implement prompt engineering strategies, context management, and output validation layers for LLM interactions.
  • Design vector database integrations (Pinecone, Weaviate, pgvector) for embedding storage and similarity search.
  • Collaborate closely with architects, BAs, and cross-functional teams to deliver robust technical solutions.
  • Participate in code reviews, refactoring efforts, and optimisation of system performance.
  • Ensure best practices for coding standards, security, maintainability, and AI model governance.
  • Troubleshoot production issues and provide root cause analysis and long-term fixes.
  • Support CI/CD pipeline integration, model deployment automation, and MLOps tooling.
  • Contribute to documentation, technical specifications, and architectural diagrams.

REQUIRED SKILLS & QUALIFICATIONS

  • 5+ years of hands-on development experience in Java-based applications.
  • Strong expertise in Java (8/11/17), Spring Framework, Spring Boot, and RESTful services.
  • Experience with Kafka for messaging, streaming, or event-driven architecture.
  • Practical knowledge of MongoDB or other NoSQL databases (e.g., Cassandra, DynamoDB, Couchbase).
  • Solid understanding of microservices architecture and distributed systems.
  • Hands-on experience consuming LLM APIs (OpenAI GPT-4o, Anthropic Claude, Google Gemini) in production Java applications.
  • Familiarity with Spring AI or LangChain4j for building LLM-backed services in Java ecosystems.
  • Experience with prompt engineering — crafting, versioning, and testing prompts for accuracy, safety, and cost efficiency.
  • Knowledge of embedding models and vector stores for semantic search and RAG pipelines.
  • Strong debugging, analytical, and problem-solving skills.
  • Experience with Git, Maven/Gradle, Jenkins, Docker, or Kubernetes is a plus.
  • Excellent communication and collaboration skills.

PREFERRED SKILLS

  • Experience working in the Insurance domain (Policy, Claims, Underwriting, Billing, etc.).
  • Hands-on experience building or fine-tuning ML models using Python-based frameworks (Hugging Face, scikit-learn, PyTorch) integrated with Java services.
  • Exposure to AI agent orchestration tools — LangGraph, AutoGen, CrewAI, or OpenAI Assistants API.
  • Experience deploying models on cloud AI services: AWS Bedrock, Azure OpenAI Service, or Google Vertex AI.
  • Familiarity with MLOps platforms (MLflow, SageMaker, Kubeflow) for experiment tracking and model lifecycle management.
  • Knowledge of AI safety, responsible AI practices, and PII/data privacy considerations when working with LLMs.
  • Exposure to cloud platforms such as AWS, Azure, or GCP.
  • Knowledge of caching frameworks (Redis, Hazelcast) for LLM response caching and rate-limit management.
  • Familiarity with containerization and orchestration platforms.
    • Understanding of DevOps principles and observability tools (Grafana, Prometheus, ELK) — including LLM observability (token usage, latency, hallucination monitoring).

Similar Openings for You