Skip to the content.

Data Science Projects

Intelligent Document Query Bot for Microsoft Teams

Problem Statement

In the modern workplace, accessing information from documents efficiently can be a challenging task. To address this issue, our project focuses on developing an intelligent bot for Microsoft Teams that leverages advanced language models and document retrieval systems to provide users with accurate and context-aware answers from uploaded PDF documents.

Tech Stack

Responsibilities

Fine-Tuning OpenAI’s Davinci Model for Conversational Support

Problem Statement

In today’s fast-paced business environment, providing efficient and accurate support is crucial. This project aims to fine-tune OpenAI’s Davinci language model for conversational support queries. By leveraging a dataset of support queries and responses, the model is trained to understand user inquiries and generate contextually relevant and helpful responses.

Tech Stack

Responsibilities

Fine-Tuning GPT-3.5 Turbo for Customer Support Conversations

Problem Statement

Efficient customer support relies on clear and helpful interactions. This project focuses on fine-tuning OpenAI’s GPT-3.5 Turbo model for customer support conversations. By utilizing a dataset containing system prompts, user instructions, and assistant responses, the model is trained to understand and generate contextually relevant replies for effective customer support.

Tech Stack

Responsibilities

DataTalker - A Conversation Interface for Datasets

Overview

Explore DataTalker, a Streamlit web application that enables natural language interactions with your dataset. This project showcases the fusion of language models and data processing to facilitate intuitive conversations with your data.

Description

DataTalker leverages LangChain and OpenAI to create an interactive chat interface. By uploading your dataset and utilizing OpenAI’s API, you can converse with your data, extracting insights and information in a conversational manner.

Advantages

Implementation

Utilizing Streamlit, LangChain, and OpenAI, DataTalker provides an intuitive interface for uploading datasets and querying them using natural language. The conversation history is retained for seamless interactions.

Conclusion

DataTalker exemplifies the power of combining natural language processing with data exploration. It showcases the potential for intuitive interactions with datasets, potentially revolutionizing the way we analyze and understand data.

Experience DataTalker

Live Website Link


Image Captioning with Transformers: A Fusion of Vision and Language

Overview

Explore my transformative project uniting the power of transformers with image captioning. This endeavor showcases my adeptness at leveraging cutting-edge techniques in machine learning and computer vision to tackle real-world challenges.

Description

Witness the magic of transformers as they infuse life into image captions, surpassing traditional methods. The VisionEncoderDecoderModel chosen for this project seamlessly encodes image content and decodes it into meaningful captions, outshining models like VGG16.

Advantages of Transformers

Transformers redefine image captioning, leveraging their self-attention mechanism to capture intricate relationships and context, resulting in contextually rich captions.

Implementation

Driven by a user-friendly Streamlit interface, the project effortlessly preprocesses images using the VisionEncoderDecoderModel, ViTImageProcessor, and AutoTokenizer. Captivating captions are generated and displayed alongside the original image.

Conclusion

This project exemplifies my expertise in merging transformers and image captioning to pioneer solutions at the crossroads of machine learning and computer vision.

Setup and Usage

  1. Prerequisites: Python 3.6+, Git.
  2. Installation: Clone the repository, download specified folders, and install required packages.
  3. Run the Project: Immerse yourself in the world of image captioning with a seamless user experience.

Experience the Fusion of Vision and Language! Live Website Link: Image Captioning with Transformers


BoomBikes Shared Bikes


House Prices Prediction (Ridge and Lasso Regression)


Telecom Churn (Linear Regression)


Melanoma Detection (CNN)


Drone Detection Model


Gesture Recognition


Automatic Ticket Classification (NLP)


Bank Note Authentication Using Random Forest Algorithm


Chances of Surviving The Titanic

back