Building Scalable ML Pipelines with Kubernetes
A deep dive into orchestrating machine learning workloads at scale using Kubernetes and custom operators.
5+ years architecting LLM-powered systems, RAG pipelines, and multi-agent orchestration across finance, retail, and healthcare.
From research to production—building scalable AI that delivers.
Production-grade AI systems and research experiments.
Insights on AI engineering, ML systems, and production deployment.
A deep dive into orchestrating machine learning workloads at scale using Kubernetes and custom operators.
Practical insights from deploying fine-tuned language models in production environments with real user traffic.
Comparing Pinecone, Weaviate, and Qdrant for real-world RAG applications. Performance benchmarks and trade-offs.
How to transform experimental ML models into robust production systems that handle millions of requests.
Techniques for achieving sub-50ms inference latency: model quantization, batching, and infrastructure optimization.
Emerging patterns in AI engineering: multimodal models, edge AI, and the rise of AI-native architectures.
A journey through innovation, research, and engineering excellence.
Jitterbit
Building APIM Bot (API lifecycle automation via NL) and iPaaS Bot (multi-agent integration automation). Architecting RAG pipelines, token-aware summarization, and provider-agnostic LLM adapters. Achieving 60% cost reduction, 99.5% uptime on Kubernetes microservices.
American Express
Led 5-member team building enterprise GenAI Governance Framework (50+ tests, 60% faster approvals). Delivered NLP solutions for risk monitoring, underwriting (85% accuracy), and complaints analysis. Pioneered neural embedding architectures outperforming legacy models by 40 bps.
Manthan Software Systems
Fine-tuned BERT and GPT-2 for personalized query auto-completion on 1.4M retail queries. Applied deep learning and statistical models on sparse datasets.
National Taiwan University
Big-data genomic analytics on axolotl genome at the Epigenetics Lab. Identified regenerative epigenomic factors through computational analysis.
IIT Kharagpur
CGPA: 7.64/10. Focused on computational biology, bioinformatics, and data science. Built foundation in ML and statistical modeling.