All Downloads are FREE. Search and download functionalities are using the official Maven repository.

Download JAR files tagged by tesseract with all dependencies

Search JAR files by class name

pdf2txt-ghostact_2.12 from group org.clulab (version 1.1.5)

The pdf2txt-ghostact subproject converts PDF to text by first converting the PDF to an image with Ghostscript and then converting the image to text with Tesseract.

Group: org.clulab Artifact: pdf2txt-ghostact_2.12
Show documentation Show source 
 

0 downloads
Artifact pdf2txt-ghostact_2.12
Group org.clulab
Version 1.1.5
Last update 30. October 2023
Organization Computational Language Understanding (CLU) Lab
URL https://github.com/clulab/pdf2txt
License Apache License, Version 2.0
Dependencies amount 2
Dependencies scala-library, pdf2txt-common_2.12,
There are maybe transitive dependencies!

pdf2txt-ghostact_2.11 from group org.clulab (version 1.1.5)

The pdf2txt-ghostact subproject converts PDF to text by first converting the PDF to an image with Ghostscript and then converting the image to text with Tesseract.

Group: org.clulab Artifact: pdf2txt-ghostact_2.11
Show documentation Show source 
 

0 downloads
Artifact pdf2txt-ghostact_2.11
Group org.clulab
Version 1.1.5
Last update 30. October 2023
Organization Computational Language Understanding (CLU) Lab
URL https://github.com/clulab/pdf2txt
License Apache License, Version 2.0
Dependencies amount 2
Dependencies scala-library, pdf2txt-common_2.11,
There are maybe transitive dependencies!

easyocr from group cn.easyproject (version 3.0.4-RELEASE)

Java Image cleanup, OCR recognition component (based Tesseract OCR engine, automatically cleanup image and identification CAPTCHA verification code picture content).

Group: cn.easyproject Artifact: easyocr
Show all versions Show documentation Show source 
 

62 downloads
Artifact easyocr
Group cn.easyproject
Version 3.0.4-RELEASE
Last update 28. October 2015
Organization not specified
URL http://easyproject.cn/easyocr
License The Apache Software License, Version 2.0
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

tess4j from group net.sourceforge.tess4j (version 5.13.0)

# Tess4J ## Description: A Java JNA wrapper for Tesseract OCR API. Tess4J is released and distributed under the Apache License, v2.0. ## Features: The library provides optical character recognition (OCR) support for: TIFF, JPEG, GIF, PNG, and BMP image formats Multi-page TIFF images PDF document format

Group: net.sourceforge.tess4j Artifact: tess4j
Show all versions Show documentation Show source 
 

4847 downloads
Artifact tess4j
Group net.sourceforge.tess4j
Version 5.13.0
Last update 20. August 2024
Organization Tess4J
URL http://tess4j.sourceforge.net
License Apache License 2.0
Dependencies amount 9
Dependencies jna, jai-imageio-core, pdfbox, pdfbox-tools, jbig2-imageio, commons-io, lept4j, jboss-vfs, slf4j-api,
There are maybe transitive dependencies!

aocr from group de.niklasfi.aocr (version 2.4)

Swiftly add ocr layers to scanned pdf files. Unfortunately existing open source ocr solutions (tesseract) pale in comparison with the ones commercially available. The azure read api provides particularly good results. It is also easy to set up, but while it can annotate text in images, there is no easy way to upload and ocr a full pdf document. That is, until now. aocr provides an easy way to ocr full pdf documents.

Group: de.niklasfi.aocr Artifact: aocr
Show all versions Show documentation Show source 
 

0 downloads
Artifact aocr
Group de.niklasfi.aocr
Version 2.4
Last update 19. December 2023
Organization not specified
URL https://github.com/niklasfi/aocr
License MIT License
Dependencies amount 10
Dependencies mapstruct, lombok, pdfbox, pdfbox-tools, jbig2-imageio, commons-cli, httpclient5, jackson-databind, jackson-datatype-jsr310, commons-text,
There are maybe transitive dependencies!



Page 2 from 2 (items total 15)


© 2015 - 2024 Weber Informatics LLC | Privacy Policy