Pure Javascript OCR for more than 100 Languages 📖🎉🖥
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Guillermo 5ee01d4265 readme stuff 8 years ago
dist build browser 8 years ago
src moving things to shared 8 years ago
.babelrc add babelrc, switch to createTesseractWorker 8 years ago
.gitignore ginignore, package.json 10 years ago
README.md readme stuff 8 years ago
devServer.js rewrite 8 years ago
index.html add index 8 years ago
package.json rewrite 8 years ago
tesseract_lang_list.md move langs to own file 8 years ago
tesseract_parameters.md remove giant table 8 years ago
webpack.config.dev.js moving things to shared 8 years ago
webpack.config.prod.js add webpack config 8 years ago

README.md

UNDER CONTRUCTION

Due for Release on Monday, Oct 3, 2016 Tuesday, Oct 4, 2016

Tesseract.js

Tesseract.js is a javascript library that gets words in almost any language out of images. Under the hood, Tesseract.js wraps tesseract.js-core an emscripten port of the Tesseract OCR Engine.

fancy demo gif

Tesseract.js works with script tags, webpack/browserify, and node. Once you've installed it, using it is as simple as

Tesseract.recognize(my_image)
         .progress(function  (p) { console.log('progress', p)    })
         .then(function (result) { console.log('result', result) })

Check out the docs for a full treatment of the API.

Installation

Tesseract.js works with a <script> tag via local copy or cdn, with webpack and browserify via npm, and on node via npm. Check out the docs for a full treatment of the API.