Download JAR files tagged by tesseract with all dependencies
pdf2txt-ghostact_2.12 from group org.clulab (version 1.1.5)
The pdf2txt-ghostact subproject converts PDF to text by first converting the PDF to an image with Ghostscript and then converting the image to text with Tesseract.
Artifact pdf2txt-ghostact_2.12
Group org.clulab
Version 1.1.5
Last update 30. October 2023
Organization Computational Language Understanding (CLU) Lab
URL https://github.com/clulab/pdf2txt
License Apache License, Version 2.0
Dependencies amount 2
Dependencies scala-library, pdf2txt-common_2.12,
There are maybe transitive dependencies!
Group org.clulab
Version 1.1.5
Last update 30. October 2023
Organization Computational Language Understanding (CLU) Lab
URL https://github.com/clulab/pdf2txt
License Apache License, Version 2.0
Dependencies amount 2
Dependencies scala-library, pdf2txt-common_2.12,
There are maybe transitive dependencies!
pdf2txt-ghostact_2.11 from group org.clulab (version 1.1.5)
The pdf2txt-ghostact subproject converts PDF to text by first converting the PDF to an image with Ghostscript and then converting the image to text with Tesseract.
Artifact pdf2txt-ghostact_2.11
Group org.clulab
Version 1.1.5
Last update 30. October 2023
Organization Computational Language Understanding (CLU) Lab
URL https://github.com/clulab/pdf2txt
License Apache License, Version 2.0
Dependencies amount 2
Dependencies scala-library, pdf2txt-common_2.11,
There are maybe transitive dependencies!
Group org.clulab
Version 1.1.5
Last update 30. October 2023
Organization Computational Language Understanding (CLU) Lab
URL https://github.com/clulab/pdf2txt
License Apache License, Version 2.0
Dependencies amount 2
Dependencies scala-library, pdf2txt-common_2.11,
There are maybe transitive dependencies!
easyocr from group cn.easyproject (version 3.0.4-RELEASE)
Java Image cleanup, OCR recognition component (based Tesseract OCR engine, automatically cleanup image and identification CAPTCHA verification code picture content).
62 downloads
Artifact easyocr
Group cn.easyproject
Version 3.0.4-RELEASE
Last update 28. October 2015
Organization not specified
URL http://easyproject.cn/easyocr
License The Apache Software License, Version 2.0
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!
Group cn.easyproject
Version 3.0.4-RELEASE
Last update 28. October 2015
Organization not specified
URL http://easyproject.cn/easyocr
License The Apache Software License, Version 2.0
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!
tess4j from group net.sourceforge.tess4j (version 5.13.0)
# Tess4J
## Description:
A Java JNA wrapper for Tesseract OCR API.
Tess4J is released and distributed under the Apache License, v2.0.
## Features:
The library provides optical character recognition (OCR) support for:
TIFF, JPEG, GIF, PNG, and BMP image formats
Multi-page TIFF images
PDF document format
4847 downloads
Artifact tess4j
Group net.sourceforge.tess4j
Version 5.13.0
Last update 20. August 2024
Organization Tess4J
URL http://tess4j.sourceforge.net
License Apache License 2.0
Dependencies amount 9
Dependencies jna, jai-imageio-core, pdfbox, pdfbox-tools, jbig2-imageio, commons-io, lept4j, jboss-vfs, slf4j-api,
There are maybe transitive dependencies!
Group net.sourceforge.tess4j
Version 5.13.0
Last update 20. August 2024
Organization Tess4J
URL http://tess4j.sourceforge.net
License Apache License 2.0
Dependencies amount 9
Dependencies jna, jai-imageio-core, pdfbox, pdfbox-tools, jbig2-imageio, commons-io, lept4j, jboss-vfs, slf4j-api,
There are maybe transitive dependencies!
aocr from group de.niklasfi.aocr (version 2.4)
Swiftly add ocr layers to scanned pdf files.
Unfortunately existing open source ocr solutions (tesseract) pale in comparison with the ones commercially
available. The azure read api provides particularly good results. It is also easy to set up, but while it can
annotate text in images, there is no easy way to upload and ocr a full pdf document.
That is, until now. aocr provides an easy way to ocr full pdf documents.
Artifact aocr
Group de.niklasfi.aocr
Version 2.4
Last update 19. December 2023
Organization not specified
URL https://github.com/niklasfi/aocr
License MIT License
Dependencies amount 10
Dependencies mapstruct, lombok, pdfbox, pdfbox-tools, jbig2-imageio, commons-cli, httpclient5, jackson-databind, jackson-datatype-jsr310, commons-text,
There are maybe transitive dependencies!
Group de.niklasfi.aocr
Version 2.4
Last update 19. December 2023
Organization not specified
URL https://github.com/niklasfi/aocr
License MIT License
Dependencies amount 10
Dependencies mapstruct, lombok, pdfbox, pdfbox-tools, jbig2-imageio, commons-cli, httpclient5, jackson-databind, jackson-datatype-jsr310, commons-text,
There are maybe transitive dependencies!
Page 2 from 2 (items total 15)