ge.tess4j.tess4j.5.13.0.source-code.versionchanges.txt Maven / Gradle / Ivy
Go to download
Show more of this group Show more artifacts with this name
Show all versions of tess4j Show documentation
Show all versions of tess4j Show documentation
# Tess4J
## Description:
A Java JNA wrapper for Tesseract OCR API.
Tess4J is released and distributed under the Apache License, v2.0.
## Features:
The library provides optical character recognition (OCR) support for:
TIFF, JPEG, GIF, PNG, and BMP image formats
Multi-page TIFF images
PDF document format
The newest version!
Tess4J Change Summary
Version 0.1 - initial release (14 Aug 2010):
- Java JNA-based wrapper for Tesseract OCR DLL 2.04
- Support uncompressed, binary TIFF images
Version 0.2 (16 Aug 2010):
- Add support for more image formats (PNG, BMP, GIF, PDF, JPEG)
- Add support for compressed, grayscale and colored images
Version 0.3 (22 Aug 2010):
- Include API support for BufferedImage
- Clean up codes. Remove unsupported API and files
- Document the API
Version 0.3.1 (26 Aug 2010):
- Send only pixel data, not whole image data, to Tesseract engine, to fix a bug that has erroneously put some words at beginning of line towards end of line
Version 0.4 (1 Nov 2010):
- Add JNA Direct Mapping calls, which can provide performance near that of custom JNI
Version 1.0 (30 October 2012):
- Upgrade to Tesseract 3.02 (r798), which is not backward compatible with Tesseract 2.04.
- Implement a new JNA wrapper for the new Tesseract OCR API
- Add more unit test cases
- Update documentation
Version 1.1 (3 March 2013)
- Update Tesseract DLL to r828
- Additional API methods, image helper methods, and unit test cases
- Improve handling of Unicode character encoding
- Fix memory leaks
- Add support for determining skew angle and image rotation
Version 1.2 (22 September 2013)
- Update Tesseract DLL to r866
- More efficient OCR of multiple images
- Various minor improvements
- Update JNA to v4.0
Version 1.3 (31 May 2014)
- Update JNA to v4.1.0
- Update Ghost4J to v0.5.1
- Refactoring
- Bundle Tesseract and Leptonica 64-bit DLLs
Version 1.4 (18 January 2015)
- Refactor to reduce code duplication
- Embed Windows native resources in JAR
- Autoload Windows native libraries
Version 1.4.1 (24 January 2015)
- Enable use of jna.library.path system property for user-customizable path
Version 1.5 (13 March 2015)
- Add UNLV zone file support
- Refactor
Version 2.0 (29 March 2015)
- Upgrade to Tesseract 3.03 (r1050), which is compatible with Tesseract 3.03RC on Linux
- Refactor Tesseract class for extensibility and thread-safety
- Update English language data for Tesseract 3.02
Version 3.0 (25 December 2015)
- Upgrade to Tesseract 3.04 (953523b)
- Include Lept4J library
- Incorporate slf4j and logback libraries for logging
- Make GhostScript calls thread safe
Version 3.1 (21 March 2016)
- Update Tesseract to 3.04.01 (4ef68a0)
- Use Lept4J-1.1.2 (Leptonica 1.72)
- Update JNA to 4.2.2
- Update Ghost4J to 1.0.1
- Delete ResultRenderer after use to release PDF file handler
Version 3.2 (15 May 2016)
- Revert JNA to 4.1.0 due to "Invalid calling convention 63" errors invoking GhostScript via Ghost4J on Linux
- Update Lept4J to 1.2.2 (Leptonica 1.73)
- Recompile Tesseract 3.04.01 DLL against Leptonica 1.73
- Update GhostScript Windows binary to 9.19
Version 3.2.1 (29 May 2016)
- Properly release Box and Boxa resources
- Update Lept4J to 1.2.3
Version 3.2.2 (16 February 2017)
- Update GhostScript to 9.20
- Fix possible NPE with PDF-related codes
- Update dependencies
- Additional image utility methods
Version 3.3.0 (16 February 2017)
- Upgrade to Tesseract 3.05 (2ca5d0a)
- Update Lept4J to 1.3.0 (Leptonica 1.74.1)
Version 3.3.1 (23 March 2017)
- Update Lept4J to 1.3.1
- Update other dependencies
Version 3.4.0 (1 June 2017)
- Upgrade to Tesseract 3.05.01 (2158661)
- Update Lept4J to 1.4.0
- Add support for jboss-vfs protocol
Version 3.4.1 (22 September 2017)
- Not extract/copy native resource if it exists and has same file size
- Update Tesseract 3.05.01 (e2e79c4); link against Leptonica 1.74.4
- Update Lept4J to 1.6.1
Version 3.4.2 (14 November 2017)
- Update Lept4J to 1.6.2
- Update GhostScript to 9.22
- Improve handling of PDF files in multi-threaded environment
- Lift limits on number of pages in PDF
- Use TESSDATA_PREFIX environment variable by default, if defined
Version 3.4.3 (14 January 2018)
- Not extract/copy resource if it exists and has same file size
Version 3.4.4 (22 February 2018)
- Exclude logback.xml from JAR
- Add image rotate and deskew methods
- Update Lept4J to 1.6.3
Version 3.4.5 (21 March 2018)
- Remove GS DLL due to license incompatibility
- Use PDFBox
Version 3.4.6 (25 March 2018)
- Update PDFBox dependencies
Version 3.4.7 (16 April 2018)
- Update dependencies for Java 9 fixes
Version 3.4.8 (2 May 2018)
- Fix a path issue when extracting resources from JAR to temp directory on Windows server
Version 4.0.0 (28 April 2018)
- Upgrade to Tesseract 4.0.0-beta.1 (45bb942)
- Update Lept4J to 1.9.3 (Leptonica 1.75.3)
Version 4.0.1 (2 May 2018)
- Fix a path issue when extracting resources from JAR to temp directory on Windows server
Version 4.0.2 (3 May 2018)
- Replace JNA string constant Platform.RESOURCE_PREFIX
- Update jai-imageio url
- Update Lept4J to 1.9.4
Version 4.1.0 (20 July 2018)
- Upgrade to Tesseract 4.0.0-beta.3 (b502bbf)
- Update Lept4J to 1.10.0
- Improve handling of PDF
- Refactor
Version 4.1.1 (28 July 2018)
- Properly dispose of resources and temporary image files
- Clean up code and test output resources
- Fix NPE in Java 10
Version 4.2.0 (11 August 2018)
- Upgrade to Tesseract 4.0.0-beta.4 (fd49206)
Version 4.2.1 (11 August 2018)
- Recompile using JDK8 to avoid NoSuchMethodError: Method flip() does not exist in class java.nio.ByteBuffer
- Use explicit cast for compatibility with covariant return type on JDK 9's ByteBuffer methods, e.g., flip()
Version 4.2.2 (3 September 2018)
- Fix Invalid memory access exception due of incorrect bit depth value
Version 4.2.3 (17 October 2018)
- Update pdfbox dependencies
Version 4.3.0 (29 October 2018)
- Upgrade to Tesseract 4.0.0 (5131699)
Version 4.3.1 (26 December 2018)
- Fix Windows build
- Improve RenderedImage to ByteBuffer conversion
Version 4.4.0 (13 July 2019)
- Upgrade to Tesseract 4.1.0 (5280bbc)
- Upgrade to Leptonica 1.78.0 (lept4j-1.12.2)
- Update dependencies
Version 4.4.1 (7 October 2019)
- Use tessdata_fast data
- Use Native.loadLibrary method for backward compatibility with older JNA versions
Version 4.5.0 (27 December 2019)
- Upgrade to Tesseract 4.1.1 (7510304)
Version 4.5.1 (3 January 2020)
- Update Leptonica 1.79.0 (lept4j-1.13.0)
- Fix Permission denied issue with Ghostscript 9.50
Version 5.0.0 (30 November 2021)
- Upgrade to Tesseract 5.0.0 (924b746)
- Update Leptonica 1.82.0 (lept4j-1.16.1)
- Update dependencies
Version 5.1.0 (11 January 2022)
- Update Tesseract 5.0.1 (853aa66)
- Update dependencies
Version 5.1.1 (26 January 2022)
- Remove ghost4j dependency due to log4j vulnerabilities
Version 5.2.0 (4 March 2022)
- Upgrade to Tesseract 5.1.0 (d737372)
- Update dependencies
Version 5.2.1 (26 April 2022)
- Only extract resources appropriate to the platform
- Update dependencies
Version 5.3.0 (7 July 2022)
- Upgrade to Tesseract 5.2.0 (5ad5325)
- Update dependencies
Version 5.4.0 (24 July 2022)
- Fix createDocumentsWithResults for multi-page image (Issue #233)
- Fix TessResultRendererBeginDocument exception with null input (Issue #234)
- New API method
Version 5.5.0 (26 December 2022)
- Upgrade to Tesseract 5.3.0
- Update dependencies and Maven plugins
Version 5.6.0 (29 January 2023)
- Update Leptonica 1.83.1 (lept4j-1.18.0)
Version 5.7.0 (3 April 2023)
- Upgrade to Tesseract 5.3.1
Version 5.8.0 (29 July 2023)
- Upgrade to Tesseract 5.3.2
Version 5.9.0 (3 December 2023)
- Upgrade to Tesseract 5.3.3
- Update PDFBox and other dependencies
- Add API support for multiple regions of interest (ROI) per image or page
- Add a utility method to merge hocr file into pdf file
Version 5.10.0 (4 January 2024)
- Update Leptonica 1.84.1 (lept4j-1.19.0)
Version 5.11.0 (7 March 2024)
- Upgrade to Tesseract 5.3.4
- Update lept4j-1.19.1
Version 5.12.0 (24 June 2024)
- Upgrade to Tesseract 5.4.1
Version 5.13.0 (20 August 2024)
- Add a convenient method to get OSD data
- Use default methods in interface to reduce code duplication