Logithon-AI-Hack

Self Learning AI PDF to Data Converter

This project extracts structured data from PDFs with a large language model (LLM), so that extraction is context-aware and does not depend on a fixed page layout.

About the project

The project provides a question-answering system over text: users type a question, and the system generates an answer with the Llama 2 13B model.

Tech Stack

Web Technologies

Next.js
TypeScript
FastAPI

Machine Learning Technologies

Python
Hugging Face LLMs
PyTorch
Reinforcement Learning

Data Analysis

NumPy
Pandas
Matplotlib

Databases

ChromaDB (Vector database)
Firebase
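
To illustrate the role a vector database such as ChromaDB plays in a stack like this, here is a minimal sketch of similarity search over text chunks. It is not the project's code and does not use the ChromaDB API: the `embed`, `cosine`, and `top_k` names are hypothetical, and the bag-of-words "embedding" is a stand-in for a real embedding model.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy bag-of-words vector (assumption: stands in for a real embedding model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query, best match first."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    # Hypothetical chunks, as if extracted from a PDF.
    pdf_chunks = [
        "invoice number 1234 issued to acme",
        "shipping address 12 main street",
        "total amount due 500 usd",
    ]
    print(top_k("what is the total amount due", pdf_chunks, k=1))
```

In a real pipeline the retrieved chunks would typically be passed to the language model as context; a vector database replaces the linear scan above with an index over dense embeddings.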

Theory and Approach

The project starts from the Llama 2 13B language model, which was fine-tuned on a task-specific dataset. Reinforcement learning from human feedback (RLHF) was then applied, with GPT-2 serving as the reward model, to further adapt the model to data conversion tasks.
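
The README does not say how the reward model was trained. As a hedged illustration, one common choice in RLHF setups is a pairwise preference loss, where the reward model should score the preferred output higher than the rejected one. The sketch below computes that loss in plain Python; the function names and the example scores are hypothetical, and in practice the scores would come from the reward model (here, GPT-2 with a scalar head).

```python
import math


def pairwise_reward_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Pairwise preference loss: -log(sigmoid(reward_chosen - reward_rejected))."""
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch to avoid overflow in exp.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def mean_pairwise_loss(pairs: list[tuple[float, float]]) -> float:
    """Average the loss over (reward_chosen, reward_rejected) pairs."""
    return sum(pairwise_reward_loss(c, r) for c, r in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical reward scores for two preference pairs.
    print(mean_pairwise_loss([(2.0, 0.5), (0.1, 1.3)]))
```

The loss shrinks as the preferred output's score exceeds the rejected one by a wider margin, and equals log 2 when the two scores tie.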

Results and demo

Future work

Implementation of vision transformers

Speech-to-text conversion

Contributors

PARAM THAKKAR

ABHI MEHTA

ANUSHKA YADAV

AKSHITA BHASIN

Acknowledgements

Logithon 2024

