Skip to content

Develop a chatbot that can effectively adapt to context and topic shifts in a conversation, leveraging the Stanford Question Answering Dataset to provide informed and relevant responses, and thereby increasing user satisfaction and engagement.

License

Notifications You must be signed in to change notification settings

kraviteja95usd/smartchat-conversational-chatbot

Repository files navigation

Contents

  1. Repository Name
  2. Title of the Project
  3. Short Description of the Project
  4. Objectives of the Project
  5. Name of the Dataset
  6. Description of the Dataset
  7. Goal of the Project using this Dataset
  8. Size of the dataset
  9. Algorithms which are used as part of our investigation
  10. Project Requirements
  11. Usage of the Project
  12. Which chatbot architecture should the users use
  13. Authors

Repository Name

smartchat-conversational-chatbot

Title of the Project

SmartChat: A Context-Aware Conversational Agent

Short Description of the Project

Develop a chatbot that can effectively adapt to context and topic shifts in a conversation, leveraging the Stanford Question Answering Dataset to provide informed and relevant responses, and thereby increasing user satisfaction and engagement.

Objectives of the Project

Create a user-friendly web or app interface that enables users to have natural and coherent conversations with the chatbot, with high satisfaction rating.

Name of the Dataset

The dataset used in this project is Stanford Question Answering Dataset.

Data Source: Kaggle

Type of the Dataset: Text

Description of the Dataset

The Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset consisting of questions posed by crowdworkers on a set of Wikipedia articles. The answer to every question is a segment of text, or span, from the corresponding reading passage. There are 100,000+ question-answer pairs on 500+ articles. More information can be found at: https://rajpurkar.github.io/SQuAD-explorer/

Goal of the Project using this Dataset

  • The goal of the project is to develop a chatbot that can carry out multi-turn conversations, adapt to context, and handle a variety of topics.

Size of the Dataset:

  • The dataset has 2 JSON files. One is for training and the other is for testing
    • dev-v1.1.json – 4.9 MB
    • train-v1.1.json – 30.3 MB

Algorithms which are used as part of our investigation

  • 2 different architectures are used:
    • GPT2-Medium architecture using LoRA and PEFT
    • BERT (bert-base-uncased)

Project Requirements

  • python3
  • datasets
  • torch
  • peft
  • transformers
  • evaluate
  • safetensors
  • numpy
  • pandas
  • matplotlib
  • scikit-learn
  • seaborn
  • nltk
  • rouge-score
  • rouge
  • gradio
  • tqdm

Usage of the Project

Which chatbot architecture should the users use

Authors

Author Contact Details
Ravi Teja Kothuru rkothuru@sandiego.edu
Soumi Ray soumiray@sandiego.edu
Anwesha Sarangi asarangi@sandiego.edu

About

Develop a chatbot that can effectively adapt to context and topic shifts in a conversation, leveraging the Stanford Question Answering Dataset to provide informed and relevant responses, and thereby increasing user satisfaction and engagement.

Topics

Resources

License

Stars

Watchers

Forks