Sign Language Converter

This project uses a YOLOv8 model to detect and convert sign language gestures from images and videos into text. The annotated images highlight the detected gestures with bounding boxes and labels.

Here's an example of an annotated image with a detected sign language gesture:

Annotated Image

Installation

  1. Clone the repository:

    git clone https://github.com/justsomerandomdude264/Sign_Language_Converter.git
    cd Sign_Language_Converter
  2. Build the Docker image:

    docker-compose build

Usage

  1. Run the Backend Server:

    docker-compose up
  2. Upload Media:

    • Use the web interface to upload images or videos containing sign language gestures, or try the sample images in the test_images folder.
    • The results will be displayed with annotations and converted text (a scripted upload sketch follows this list).
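
Alternatively, the backend can be exercised from a small Python script once the server is running. The sketch below is only an illustration: the port, endpoint path, form field name, and test image filename are assumptions and may not match the actual backend.

    import requests

    # Upload one of the provided test images (filename, endpoint, and field name
    # are placeholders, not documented values).
    with open("test_images/example.jpg", "rb") as f:
        response = requests.post(
            "http://localhost:8000/detect/",
            files={"file": f},
        )

    print(response.json())  # detection results returned by the backend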

How It Works

  1. Frontend:

    • The frontend is built with React and provides a user-friendly interface to upload images and videos.
    • It displays the selected file, shows a loading indicator during processing, and presents the annotated media along with the detected text.
  2. Backend:

    • The backend is built with Python and Django, utilizing the YOLOv8 model for object detection.
    • When a media file is uploaded, the backend processes the file, runs the YOLOv8 model to detect sign language gestures, and returns the results to the frontend.
  3. Processing Images:

    • The image is read using OpenCV and passed to the YOLOv8 model for detection.
    • Detected gestures are annotated with bounding boxes and labels.
    • The processed image and annotations are returned to the frontend for display.
  4. Processing Videos:

    • The video is processed frame-by-frame using OpenCV.
    • Every frame is analyzed with the fine-tuned YOLOv8 model to detect gestures.
    • Detected gestures are annotated in the same way as images, and unique detected texts are aggregated.
    • The annotations and converted text are sent back to the frontend (a minimal sketch of this flow follows this list).
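
The following Python sketch illustrates how the image and video processing described above could look with ultralytics and OpenCV. It is a minimal illustration under stated assumptions, not the repository's exact code: the weight file name best.pt and the function names are placeholders.

    import cv2
    from ultralytics import YOLO

    # Load the fine-tuned sign-language weights (file name is an assumption).
    model = YOLO("best.pt")

    def detect_image(path):
        """Run detection on a single image and return the annotated frame plus labels."""
        image = cv2.imread(path)
        results = model(image)[0]      # single-image inference
        annotated = results.plot()     # draws bounding boxes and labels
        labels = [results.names[int(box.cls)] for box in results.boxes]
        return annotated, labels

    def detect_video(path):
        """Process a video frame-by-frame and aggregate unique detected labels."""
        capture = cv2.VideoCapture(path)
        seen = []
        while True:
            ok, frame = capture.read()
            if not ok:
                break
            results = model(frame)[0]
            for box in results.boxes:
                label = results.names[int(box.cls)]
                if label not in seen:  # keep each detected letter once, in order
                    seen.append(label)
        capture.release()
        return seen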

How I Made It

  1. Model Training:

    • The YOLOv8 model was trained using a dataset of sign language gestures.
    • The training process involved annotating images with bounding boxes for each gesture and fine-tuning the model to improve accuracy (a training sketch follows this list).
  2. Backend Development:

    • The backend server was developed using Django.
    • Routes were created to handle file uploads and process images and videos using the trained YOLOv8 model.
    • The model's output is formatted and sent back to the frontend (a minimal upload-view sketch follows this list).
  3. Frontend Development:

    • The frontend was created using React.
    • Components were designed to handle file selection, upload, and display of results.
    • Integration with the backend was done using Axios for POST API requests.
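
For reference, fine-tuning YOLOv8 with the ultralytics API typically looks like the sketch below. The base checkpoint, data.yaml path, epoch count, and image size are illustrative assumptions, not this project's exact settings.

    from ultralytics import YOLO

    # Start from a pretrained YOLOv8 checkpoint (variant chosen here is an assumption).
    model = YOLO("yolov8n.pt")

    # Fine-tune on the ASL letters dataset exported from Roboflow (path assumed).
    model.train(
        data="american-sign-language-letters/data.yaml",
        epochs=100,
        imgsz=640,
    )

    metrics = model.val()  # evaluate on the validation split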
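
A Django route for handling uploads could look roughly like this minimal sketch; the view name, URL wiring, form field name, and response format are assumptions about the backend rather than its actual implementation.

    import cv2
    import numpy as np
    from django.http import JsonResponse
    from django.views.decorators.csrf import csrf_exempt
    from ultralytics import YOLO

    model = YOLO("best.pt")  # fine-tuned weights (file name assumed)

    @csrf_exempt
    def detect(request):
        """Accept an uploaded image and return the detected labels as JSON."""
        if request.method != "POST" or "file" not in request.FILES:
            return JsonResponse({"error": "POST an image under the 'file' key"}, status=400)

        # Decode the uploaded image directly from memory.
        data = np.frombuffer(request.FILES["file"].read(), dtype=np.uint8)
        image = cv2.imdecode(data, cv2.IMREAD_COLOR)

        results = model(image)[0]
        labels = [results.names[int(box.cls)] for box in results.boxes]
        return JsonResponse({"text": labels})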

Project Structure

  • Sign_Language_Converter/: Contains the server-side code (Django backend) and YOLOv8 model processing.
  • frontend/: Contains the React-based web interface for uploading media and displaying results.
  • test_images/: Includes three sample images for trying the program out.

Dataset

This project uses the American Sign Language Letters dataset created by David Lee.
Link: https://public.roboflow.com/object-detection/american-sign-language-letters

Contributing

Contributions are welcome! Please fork the repository and submit a pull request.