Machine Learning Engineer
Morningstar
Job Description
- Design and build ML-driven data pipelines that ingest and process structured and unstructured data from multiple sources.
- Develop models for information extraction, named entity recognition (NER), entity resolution, classification, and data normalization.
- Apply NLP, transformer models, and LLMs to extract and enrich data from documents such as reports, filings, and news articles.
- Build systems that improve data coverage, accuracy, freshness, and consistency across datasets.
- Integrate ML models into scalable production systems with strong reliability, latency, and throughput guarantees.
- Collaborate with data collection and curation teams to incorporate human-in-the-loop feedback and improve model performance.
- Design evaluation frameworks and metrics for data quality, extraction accuracy, and enrichment effectiveness.
- Optimize pipelines for large-scale processing using distributed systems and streaming technologies.
- Contribute to architecture decisions for data infrastructure, ensuring scalability and maintainability.
- Stay current with advancements in NLP, GenAI, and information extraction, and translate research into production-ready systems.
- Follow and promote best practices for monitoring, observability, data governance, and responsible AI use.
- Mentor junior engineers and contribute to a culture of technical excellence through reviews and knowledge sharing.