...
Installing Tesseract is straightforward.
On a Debian/Ubuntu system, run the following command:
Code Block |
---|
sudo apt-get install tesseract-ocr |
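A quick way to confirm the installation is to check that the `tesseract` binary is on your PATH and ask it for its version. The snippet below is a minimal sketch; the `tess_status` variable is a name chosen here for illustration and is not part of Tesseract.

```shell
# Check whether the tesseract binary is on PATH and, if so, print its version.
# tess_status is a hypothetical variable used only in this example.
if command -v tesseract >/dev/null 2>&1; then
    # Some versions print the version banner on stderr, so merge both streams.
    tesseract --version 2>&1 | head -n 1
    tess_status="installed"
else
    echo "tesseract not found on PATH; check the apt-get output for errors"
    tess_status="missing"
fi
```

If the second message appears, re-run the install command and read its output for errors.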
Congrats! Tesseract is now installed and ready to use. By default, however, it only handles English. To work with other languages, you need to install the corresponding language package (if one is available). Language package names follow this pattern: tesseract-ocr-[language_code]
The code identifies the language, not the country, and is usually three letters long. For example, the Tesseract package for French is: tesseract-ocr-fra
So to install the French package, run the following command:
Code Block |
---|
sudo apt-get install tesseract-ocr-fra |
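If you need several languages, the naming pattern above can be applied to a list of codes. The following is a sketch only: `tess_packages` is a hypothetical helper written for this example, and the codes `fra`, `deu` and `spa` are just sample values. It prints the install command instead of running it, so you can review it first.

```shell
# Hypothetical helper: turn language codes into Debian/Ubuntu package names,
# one per line, following the tesseract-ocr-[language_code] pattern.
tess_packages() {
    for code in "$@"; do
        printf 'tesseract-ocr-%s\n' "$code"
    done
}

# Print the resulting command for review (nothing is installed here).
# The command substitution is left unquoted on purpose so that each
# package name becomes a separate word.
echo sudo apt-get install $(tess_packages fra deu spa)
```

Once the printed command looks right, you can copy and run it in your terminal.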
You can find the list of available Tesseract language packages here: https://packages.ubuntu.com/search?keywords=tesseract-ocr
You can also install all the available languages in a single command:
Code Block |
---|
sudo apt-get install tesseract-ocr-all |
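After installing language packages, you may want to check which languages Tesseract actually sees. `tesseract --list-langs` prints one language code per line after a header line. The sketch below wraps it in a small function; `has_tess_lang` is a hypothetical name used here for illustration.

```shell
# Hypothetical helper: succeed if the given language code appears in the
# output of `tesseract --list-langs`, fail otherwise (including when
# tesseract itself is not installed).
has_tess_lang() {
    # Merge stderr into stdout because some versions print the list on stderr.
    # grep -x matches the whole line, -q suppresses output.
    tesseract --list-langs 2>&1 | grep -qx "$1"
}

if has_tess_lang fra; then
    echo "French is available"
else
    echo "French is not available; install tesseract-ocr-fra"
fi
```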