[Archive] v2 to v4 Guide
The following comment contains the old guide for upgrading from v2 to v4. Users are encouraged to update to the latest version (v5), but this is still provided for informational purposes.
Changes Impacting Most Users
- `createWorker` is now async
  - In most code this means `worker = Tesseract.createWorker()` should be replaced with `worker = await Tesseract.createWorker()`
- `worker.load` is no longer needed (`createWorker` now returns worker pre-loaded)
  - Simply delete `worker.load` from existing code
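As a rough illustration of the two changes above, here is a minimal sketch of a v4-style helper. The function name `recognizeText` is hypothetical, and the `Tesseract` module is passed in as a parameter so the snippet makes no assumption about how the library is loaded. The `loadLanguage` and `initialize` calls are assumed to be kept from existing v2 code; only the `createWorker` and `worker.load` lines differ.

```javascript
// Sketch only: `recognizeText` is a hypothetical helper, not part of Tesseract.js.
// `Tesseract` is whatever object exposes createWorker (require, import, or <script>).
async function recognizeText(Tesseract, image, lang = 'eng') {
  // v2: const worker = Tesseract.createWorker();
  // v4: createWorker is async, so the result must be awaited.
  const worker = await Tesseract.createWorker();

  // v2 code had `await worker.load();` here. In v4 the worker is returned
  // pre-loaded, so that line is simply deleted.
  await worker.loadLanguage(lang);
  await worker.initialize(lang);

  try {
    const { data } = await worker.recognize(image);
    return data.text;
  } finally {
    // Release the worker even if recognition throws.
    await worker.terminate();
  }
}
```

Forgetting the `await` leaves `worker` holding a promise rather than a worker, so later calls such as `worker.recognize` fail because the method does not exist on the promise.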
Changes Impacting Fewer Users
- Electron users
  - Use the browser version of Tesseract.js
    - In v2, many users used the Node.js version
- Users of the `getPDF` function
- Users who set `cacheMethod: 'none'` or `cacheMethod: 'refresh'` as a workaround for the caching bug
  - This workaround can be removed; the underlying bug has been fixed (see this comment)
- Users who set the optional `corePath` argument
  - You will need to point `corePath` to a compatible version of Tesseract.js-core (the latest version of Tesseract.js should be used with the latest version of Tesseract.js-core)
  - For significantly faster performance, set `corePath` to a directory that includes both `tesseract-core.wasm.js` and `tesseract-core-simd.wasm.js`
    - See this comment for explanation
- Node.js <14 users
  - Node.js v14 is now the earliest version supported
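For the `corePath` item above, a minimal sketch of passing a directory rather than a single file might look like the following. The directory `/vendor/tesseract.js-core` and the helper name `createWorkerWithCore` are hypothetical. The sketch assumes you serve a Tesseract.js-core release matching your Tesseract.js version from that directory, with both `tesseract-core.wasm.js` and `tesseract-core-simd.wasm.js` present.

```javascript
// Assumption: a hypothetical self-hosted directory containing both
// tesseract-core.wasm.js and tesseract-core-simd.wasm.js.
const CORE_DIR = '/vendor/tesseract.js-core';

// Sketch only: `createWorkerWithCore` is a hypothetical helper.
// `Tesseract` is passed in so no particular loading mechanism is assumed.
async function createWorkerWithCore(Tesseract, corePath = CORE_DIR) {
  // corePath names the directory, not one specific core file, so both
  // builds listed above are available to the library.
  return Tesseract.createWorker({ corePath });
}
```

If existing code also passes `cacheMethod: 'none'` or `cacheMethod: 'refresh'` purely as the caching workaround, that option can be dropped from the same options object.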