OCR Translator (Linux OS)

Keywords: OCR, Tesseract-OCR, Google Translate, Shell Script, Linux

1. Introduction: OCR Translator

Immigrants often struggle to understand letters in a foreign language received by mail. OCR Translator aims to overcome language barriers, by using Tesseract-OCR and Google Translate.

2. Workflow

notice: the preferred way is using a flatbed scanner, camera-based functionality will be added in future releases.

3. Config

Install Tesseract OCR; at time of writing, tesseract 4.0.0-beta.1 was used as OCR engine.
Install dependencies (using conda virtualenv)

    # navigate to ./anaconda 
    conda env create --file environment.yml
    
    # activate OCR_Translator_env
    source activate OCR_Translator_env

Notes:

currently supported data types: PDF, png
one page only (multiple pdf pages won't work)

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
anaconda		anaconda
docs		docs
resources		resources
src		src
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR Translator (Linux OS)

1. Introduction: OCR Translator

2. Workflow

3. Config

License

About

Releases

Packages

Languages

License

kapitsa2811/OCRTranslator

Folders and files

Latest commit

History

Repository files navigation

OCR Translator (Linux OS)

1. Introduction: OCR Translator

2. Workflow

3. Config

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages