Skip to the content.
AI & GENAI PROJECTS

AI & GenAI Projects

Production-grade RAG platforms, AI evaluation systems, and agentic workflows for enterprise environments.

This section highlights my work across multi-tenant retrieval systems, evaluation infrastructure, and enterprise AI orchestration — with a focus on reliability, scalability, and measurable engineering impact.

LLM Fine-Tuning Pipeline

LoRA/PEFT for Llama2 & Llama3

Built an end-to-end LoRA fine-tuning pipeline for Llama2 (7B) and Llama3 using HuggingFace Transformers and PEFT. Delivered 20% domain-specific accuracy improvement with substantially reduced compute cost versus full fine-tuning. Includes configurable training datasets, domain evaluation scripts, and GGUF export for local inference.

20% Accuracy Improvement
Llama2/3 LoRA PEFT HuggingFace GGUF Cloud GPU

Intelligent Document Query Bot

Microsoft Teams RAG Bot

Built a document retrieval bot for Microsoft Teams using OpenAI and LangChain for context-aware question answering over enterprise documents. Implemented FastAPI backend with ChromaDB vector storage and deployed via Azure Bot Services.

OpenAI LangChain ChromaDB FastAPI Azure Bot Services MS Teams

Commercial LLM Fine-Tuning

OpenAI Davinci & GPT-3.5 Turbo

Fine-tuned OpenAI models (Davinci-002, GPT-3.5 Turbo) for domain-specific conversational support. Built multi-turn conversation data preparation pipelines, evaluation harnesses using scikit-learn metrics, and deployed interactive Streamlit chat applications for inference.

GPT-3.5 Turbo Davinci-002 OpenAI Fine-Tuning API Streamlit scikit-learn

Architecture Diagrams

System-level design thinking behind the featured projects

Mila Chat — Multi-Tenant RAG Architecture

Multi-tenant enterprise retrieval architecture with tenant-scoped query execution, vector search, relational metadata, and async ingestion workflows.

Query Path

User / Client FastAPI Chat Service Tenant Isolation Layer

Parallel Execution

RAG Pipeline | Text2SQL Path

Data Layer

Qdrant PostgreSQL S3 Azure OpenAI

Ingestion Path

Documents Azure Doc Intelligence AWS SQS Step Functions Qdrant + S3

Unified AI Evaluation Platform — Evaluation Flow

Centralized LLM evaluation workflow with YAML-configured product adapters, judge-based metrics, deterministic metrics, CI/CD integration, and evaluation dashboards.

Trigger

CI/CD Pipeline YAML Config Product Adapter

Evaluation Engine

Evaluation Runner

Metrics

Azure OpenAI Judge | Deterministic Metrics

Storage & Output

PostgreSQL S3 Artifacts Streamlit Dashboard Reports / Comparison

System Design Focus

Multi-Tenant AI Architecture Retrieval-Augmented Generation Vector Search & Filtering Async Ingestion Pipelines LLM Cost Optimization Evaluation Infrastructure