Releases: tesseract-ocr/tesseract
Releases · tesseract-ocr/tesseract
5.2.0
This is a new minor version of Tesseract 5.
- Improvements and fixes for continuous integration,
autoconf and cmake builds. - Set /Os for some 32 bit MS compilers (fixes #3769).
- Improve comments and other documentation.
- Add initial support for Intel AVX512F.
- Fix for very large PDF files on 32 bit hosts (fixes #3805).
- Fix NEON detection on FreeBSD.
- Fix regression with UZN files (fixes #3837).
- Fix calling delete[] for memory allocated by malloc in C API.
- Add an API function to init tesseract with traineddata from memory
(fixes #3691). - Replace direct access to Leptonica internal data structures by
function calls and support latest releases of Leptonica. - Replace std::regex by std::string functions (fixes issue #3830).
- Use compiled-in TESSDATA_PREFIX also on Windows (fixes #3767).
- Add new parameter 'invert_threshold', change the default threshold
from 0.5 to 0.7 and mark parameter 'tessedit_do_invert' as deprecated.
See also list of all changes.
5.1.0
This is a new minor version of Tesseract 5.
- Handle image and line regions in output formats ALTO, hOCR and text.
- New parameter curl_timeout for curl_easy_setop.
- Build fixes and improvements.
- Catch nullptr in PageIterator::Orientation to improve robustness.
- Remove unused code.
See also list of all changes.
5.0.1
This is a bug fix release of Tesseract 5.0.
- Add SPDX-License-Identifier to public include files.
- Support redirections when running OCR on a URL.
- Lots of fixes and improvements for cmake builds.
Distributions should use the autoconf build. - Fix broken msys2 build with gcc 11.
- Fix parameter certainty_scale (was duplicated).
- Fix some compiler warnings and clean code.
- Correctly detect amd64 and i386 on FreeBSD.
- Add libarchive and libcurl in continuous integration actions.
- Update submodule googletest to release v1.11.0.
See also list of all changes.
5.0.0
This is the final stable release of Tesseract 5.0.0.
- Limit BCER to interval [0,1]
- Improved build process
- Cleaned code
See also list of all changes.
5.0.0-rc3
This is the third release candidate of Tesseract 5.0.0.
- Improve training messages
- Add RowAttributes getter to PageIterator
See also list of all changes.
4.1.3
5.0.0-rc2
This is the second release candidate of Tesseract 5.0.0.
- Fix regression for OCR with more than one model file
- Bug fixes
- Optimizations
See also list of all changes.
4.1.2
5.0.0-rc1
This is the first release candidate of Tesseract 5.0.0.
- Enable fast float32 LSTM by default
- Switch to NFC normalisation everywhere
- Remove banner message
- Disable music staff detection and removal
- Add new command line option --loglevel
- Bug fixes
See also list of all changes.
5.0.0-beta-20210916
This is a new pre-release of Tesseract 5.0.0.
- Bug fixes
- Extend URI support for Tesseract with libcurl
- Rename processed TIFF output file and add page number if needed
See also list of all changes.