Portrait of Shyam Subramanian

Boston, MA

Shyam Subramanian

Data Scientist | ML Engineer | Researcher

I enjoy solving challenging problems end-to-end at the intersection of language, vision, and human-centric systems.

About Me

I am a Data Scientist with strong AI/ML background and Software engineering expertise focused on building end-to-end AI solutions from modeling to production. I build intelligent language and visual understanding systems that work alongside humans to solve real-world problems.

At Fidelity Investments, my recent work has focused on applying Agentic AI to reduce manual work for associates, enabling faster and more accurate customer request resolution. My active areas of interest include Multi-Modal Document Understanding and Task-Oriented Dialogue Systems. This drives my research & experimentation in Agentic Document Graph Navigation and Script-Based Dialog Policy Planning.

Outside of work and research, I enjoy playing piano, chess, volleyball, and visiting exciting places. If you want to jam, battle over a chess game, or share similar interests, let's connect!

Education

Master's in Business Administration

2023 - 2024

University of the Cumberlands (UC)

ucumberlands.edu

Master's in Data Science

2018 - 2020

Worcester Polytechnic Institute (WPI)

rajalakshmi.org

Bachelor's in Computer Science & Engineering

2012 - 2016

Rajalakshmi Engineering College (Anna University)

rajalakshmi.org

Experience

Principal Data Scientist @ Fidelity Investments

Jan 2024 - Present

Agentic AI for Service Request Automation

Multi-Agent

Tool Use

LLaMA/GPT/Claude

vLLM

OpenSearch

LangGraph

LangSmith

  • Developed and Deployed: A multi-agent AI system to automate contact center service request classification, description generation, and routing with human-in-the-loop validation.
  • Impact: 10% ↑ routing accuracy, 6% ↓ rework rate, ~40% ↓ manual request creation time scaled across multiple business groups.

Conversational Intelligence for Contact Center Assistants

Conversational RAG

Dialog Management

LLaMA/Gemma/GPT

vLLM

DeepSpeed

Milvus

LangGraph

  • Researched and prototyped: A multi-turn Conversational RAG pipeline on customer calls with script-based dialog management, powered by a domain-adapted LLM continuously pre-trained and fine-tuned on a large curated customer call dataset.
  • Impact: Combined latest research on LLM pre-training and fine-tuning, call embeddings, and multi-turn dialog management demonstrating an average 4.6/5 response rating and 87% response acceptance rate.

Web Platform for AI Model Management

ReactJS

NodeJS

PostgreSQL

OpenSearch

Strapi CMS

Kubernetes

Jenkins

  • Developed and Deployed: An AI Model Management application that registers AI models, and integrates usecases, model risk, governance, and deployment workflows.
  • Impact: 1000+ models, 100+ usecases, 15+ business groups, 700+ users enabling model owners, risk managers, and leadership to track cost, performance, risk, and prioritize usecases in a single system.

Senior Data Scientist @ Fidelity Investments

May 2021 - Dec 2023

Multi-Modal Document Understanding for 401-K Client Onboarding

Layout Analysis

Visual Understanding

Knowledge Graphs

Flan-T5/GPT

PEFT/LoRA

DocOwl/GPT4V

  • Developed and Deployed: A document understanding system to extract complex multi-hop, multi-modal information from 401-K client onboarding documents that are long, hierarchically structured with varied layouts.
  • Impact: 80+ extracted items, ~85% accuracy in extraction, estimated time savings equivalent of ~6 Full Time Employees.

Principal Data Scientist @ Fidelity Investments

Jan 2024 - Present

Agentic AI for Service Request Automation

Multi-Agent

Tool Use

LLaMA/GPT/Claude

vLLM

OpenSearch

LangGraph

LangSmith

  • Developed and Deployed: A multi-agent AI system to automate contact center service request classification, description generation, and routing with human-in-the-loop validation.
  • Impact: 10% ↑ routing accuracy, 6% ↓ rework rate, ~40% ↓ manual request creation time scaled across multiple business groups.

Conversational Intelligence for Contact Center Assistants

Conversational RAG

Dialog Management

LLaMA/Gemma/GPT

vLLM

DeepSpeed

Milvus

LangGraph

  • Researched and prototyped: A multi-turn Conversational RAG pipeline on customer calls with script-based dialog management, powered by a domain-adapted LLM continuously pre-trained and fine-tuned on a large curated customer call dataset.
  • Impact: Combined latest research on LLM pre-training and fine-tuning, call embeddings, and multi-turn dialog management demonstrating an average 4.6/5 response rating and 87% response acceptance rate.

Web Platform for AI Model Management

ReactJS

NodeJS

PostgreSQL

OpenSearch

Strapi CMS

Kubernetes

Jenkins

  • Developed and Deployed: An AI Model Management application that registers AI models, and integrates usecases, model risk, governance, and deployment workflows.
  • Impact: 1000+ models, 100+ usecases, 15+ business groups, 700+ users enabling model owners, risk managers, and leadership to track cost, performance, risk, and prioritize usecases in a single system.

Senior Data Scientist @ Fidelity Investments

May 2021 - Dec 2023

Multi-Modal Document Understanding for 401-K Client Onboarding

Layout Analysis

Visual Understanding

Knowledge Graphs

Flan-T5/GPT

PEFT/LoRA

DocOwl/GPT4V

  • Developed and Deployed: A document understanding system to extract complex multi-hop, multi-modal information from 401-K client onboarding documents that are long, hierarchically structured with varied layouts.
  • Impact: 80+ extracted items, ~85% accuracy in extraction, estimated time savings equivalent of ~6 Full Time Employees.

Patents & Publications

Mohamed Mahdi Alouane, Shyam Subramanian, Hui Su

US Patent 2025

Filed 2022

Fidelity Investments

Keerthan Ramnath, Punitha Chandrasekar, Hui Su, Shyam Subramanian et al.

US Patent 2025

Filed 2022

Fidelity Investments

Shyam Subramanian, Kyumin Lee

EMNLP 2020

Cited By 45

Git

WPI

Selected Work

Image preview for selected work: Agentic AI for Service Request Automation

Agentic AI for Service Request Automation

Contact center representatives manually create thousands of service requests under real-world time constraints. Built a multi-agent AI system that classifies, generates, and routes requests with significantly higher accuracy and speed. Designed to scale across multiple business groups with measurable impact on quality and handling time.

AI Agents

Workflow Automation

Multi-Business Impact

Explore this work

Image preview for selected work: Conversational Intelligence for Contact Center Assistants

Conversational Intelligence for Contact Center Assistants

Large language models lack the company-specific knowledge, operational expertise, and conversational context that experienced contact center representatives build over years. Built a multi-turn Conversational RAG pipeline with structured dialog management paired with a domain-adapted LLM through continuous pre-training and fine-tuning. Evaluated through an expert pilot study with contact center representatives, demonstrating strong retrieval quality, response acceptability, and human ratings.

Conversational Agents

Pre-training & Fine-tuning

Research

Explore this work

Image preview for selected work: Document Extraction for 401-K Client Onboarding

Document Extraction for 401-K Client Onboarding

Manually extracting information from complex, varied 401-K business documents is a months-long bottleneck that limits how many clients the business can onboard at any given time. Built an end-to-end multi-modal document extraction pipeline combining custom layout detection, hierarchical document parsing, fine-tuned retrieval models, and generative LLMs to automatically extract a large number of fields from scanned and digital documents. Deployed in production, significantly reducing manual effort and enabling the business to scale client onboarding.

Multi-Modal Doc Extraction

Visual Understanding

End-to-End Production

Explore this work

Shyam Subramanian

© All rights reserved. 2026.

CONTENT

Work

Blog

Built with Reflex