The Image OCR Tool is designed to extract text from uploaded images using Optical Character Recognition (OCR) technology. This web-based application supports multiple languages and provides a user-friendly interface for uploading images, monitoring processing progress, and exporting extracted text in CSV format.
- Multiple Language Support: Extract text from images in various languages including English, Traditional Chinese, Simplified Chinese, Japanese, and Korean.
- Drag and Drop Interface: Conveniently upload images using a drag-and-drop interface or by selecting files from the local system.
- Real-Time Progress Updates: Monitor the progress of text extraction with a dynamic progress bar.
- CSV Export: Download the extracted text for each image in CSV format.
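
The drag-and-drop upload described above could be wired up roughly as follows. This is a minimal sketch, not the tool's actual code: `filterImageFiles` and `attachDropZone` are hypothetical names, and the sketch assumes that only files with an `image/*` MIME type should be accepted.

```javascript
// Hypothetical helper: keep only the files whose MIME type marks them as images.
// Accepts a FileList or any array-like of objects with a `type` property.
function filterImageFiles(files) {
  return Array.from(files).filter(
    (file) => typeof file.type === "string" && file.type.startsWith("image/")
  );
}

// Hypothetical wiring for a drop area. `dropZone` is any DOM element and
// `onImages` is a callback that receives the accepted image files.
function attachDropZone(dropZone, onImages) {
  dropZone.addEventListener("dragover", (event) => {
    event.preventDefault(); // without this the browser refuses the drop
  });
  dropZone.addEventListener("drop", (event) => {
    event.preventDefault(); // stop the browser from opening the dropped file
    onImages(filterImageFiles(event.dataTransfer.files));
  });
}
```

Keeping the filter separate from the event wiring means the same check can be reused for files chosen through the file selector.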
To run the Image OCR Tool locally, follow these steps:
- Download the repository files or clone the repository to your local machine.
- Open the `index.html` file in a web browser.
- Launch the tool by opening the `index.html` file in any modern web browser.
- Choose the languages from which text should be extracted using the checkboxes provided.
- Upload images either by dragging and dropping into the specified area or by using the file selector.
- Click 'Process Images' to start the OCR process.
- Once processing is complete, you can download the results as a CSV file.
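
The steps above can be sketched in code as follows. This is an illustration under stated assumptions rather than the tool's implementation: the helper names (`buildLanguageString`, `overallPercent`, `processImages`) and the language keys are hypothetical. The sketch assumes a Tesseract.js version whose `recognize(image, langs, options)` accepts a `logger` callback that reports `status` and a `progress` value between 0 and 1. The `recognize` function is passed in as a parameter so the logic does not depend on a global.

```javascript
// Tesseract language codes for the five languages listed in this README.
// The keys on the left are hypothetical checkbox values.
const LANGUAGE_CODES = new Map([
  ["english", "eng"],
  ["traditionalChinese", "chi_tra"],
  ["simplifiedChinese", "chi_sim"],
  ["japanese", "jpn"],
  ["korean", "kor"],
]);

// Turn the checked language keys into the "+"-joined string Tesseract expects.
function buildLanguageString(selectedKeys) {
  if (selectedKeys.length === 0) throw new Error("Select at least one language");
  return selectedKeys
    .map((key) => {
      if (!LANGUAGE_CODES.has(key)) throw new Error(`Unsupported language: ${key}`);
      return LANGUAGE_CODES.get(key);
    })
    .join("+");
}

// Combine "which image are we on" with that image's own progress (0 to 1)
// into a single percentage for the progress bar.
function overallPercent(imageIndex, imageProgress, totalImages) {
  return Math.round(((imageIndex + imageProgress) / totalImages) * 100);
}

// Run OCR on each file in turn. In the browser, `recognize` would be
// Tesseract.recognize (an assumption about how the library is loaded).
async function processImages(files, selectedKeys, recognize, onProgress) {
  const langs = buildLanguageString(selectedKeys);
  const results = [];
  for (let i = 0; i < files.length; i++) {
    const { data } = await recognize(files[i], langs, {
      logger: (message) => {
        // Only the recognition phase maps cleanly onto the progress bar.
        if (message.status === "recognizing text") {
          onProgress(overallPercent(i, message.progress, files.length));
        }
      },
    });
    results.push({ fileName: files[i].name, text: data.text });
  }
  return results;
}
```

In a page that loads Tesseract.js from a script tag, a call might look like `processImages(files, ["english", "japanese"], Tesseract.recognize, updateBar)`, where `updateBar` is a hypothetical function that sets the progress bar width.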
- HTML/CSS/JavaScript for frontend development.
- Bootstrap for responsive design.
- jQuery for DOM manipulation and AJAX.
- Tesseract.js for performing OCR.
- DataTables to display results in a table format that supports downloading as CSV.
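
For readers curious about what the CSV export involves, the sketch below serializes OCR results by hand. The tool itself relies on DataTables for this, so the code is only a standalone illustration: `escapeCsvField`, `resultsToCsv`, and `downloadCsv` are hypothetical names, and the `File` and `Text` column headers are assumptions. Quoting matters here because OCR output often contains commas, quotes, and line breaks.

```javascript
// Quote a field only when it contains a comma, a double quote, or a line break;
// embedded double quotes are doubled, following the usual CSV convention.
function escapeCsvField(value) {
  const text = String(value);
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Convert an array of { fileName, text } results into a CSV string
// with a header row. The column names are assumptions.
function resultsToCsv(results) {
  const rows = [["File", "Text"], ...results.map((r) => [r.fileName, r.text])];
  return rows.map((row) => row.map(escapeCsvField).join(",")).join("\r\n");
}

// Browser-only: offer the CSV string as a file download.
function downloadCsv(csv, fileName) {
  const blob = new Blob([csv], { type: "text/csv;charset=utf-8" });
  const url = URL.createObjectURL(blob);
  const link = document.createElement("a");
  link.href = url;
  link.download = fileName;
  link.click();
  URL.revokeObjectURL(url); // release the temporary object URL
}
```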
Contributions are welcome. Please create a fork of the repository and submit a pull request with your proposed changes.