All Downloads are FREE. Search and download functionalities are using the official Maven repository.

Download JAR files tagged by extract with all dependencies

Search JAR files by class name

parent-pom from group org.apache.uima (version 17)

UIMA (Unstructured Information Management Architecture). UIMA promotes community development and reuse of annotators that extract metadata from unstructured information (text, audio, video, etc.); it provides for externalized declaration of type systems, component configuration, aggregation, and more, supports scalablity, and provides tooling. This is the common UIMA parent pom. It contains the project-wide defaults.

Group: org.apache.uima Artifact: parent-pom
Show all versions 
There is no JAR file uploaded. A download is not possible! Please choose another version.
0 downloads
Artifact parent-pom
Group org.apache.uima
Version 17
Last update 05. October 2023
Organization not specified
URL https://uima.apache.org
License not specified
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

norconex-importer from group com.norconex.collectors (version 3.0.1)

Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a computer file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before importing/using it in your own service or application.

Group: com.norconex.collectors Artifact: norconex-importer
Show all versions Show documentation Show source 
 

0 downloads
Artifact norconex-importer
Group com.norconex.collectors
Version 3.0.1
Last update 09. July 2023
Organization not specified
URL https://opensource.norconex.com/importer
License not specified
Dependencies amount 12
Dependencies tika-core, tika-parsers, tika-translate, commons-cli, jj2000, opencsv, luaj-jse, webp-imageio, norconex-commons-lang, log4j-slf4j-impl, log4j-core, jcl-over-slf4j,
There are maybe transitive dependencies!

jwat-tools from group org.jwat (version 0.7.1)

JWAT-Tools uses the available JWAT libraries to make high level tasks available either from command-line or programmatically. Common tasks include: Test, Compress, Decompress, CDX, Arc2Warc. More specialised tasks include: Changed, ContainerMD, Delete, Extract, Interval, PathIndex, Unpack, Headers2CDX.

Group: org.jwat Artifact: jwat-tools
Show all versions Show documentation Show source 
 

0 downloads
Artifact jwat-tools
Group org.jwat
Version 0.7.1
Last update 12. May 2023
Organization not specified
URL http://jwat.org/
License Apache License, Version 2.0
Dependencies amount 10
Dependencies jwat-common, jwat-gzip, jwat-arc, jwat-warc, jwat-archive, common-cli, common-json, common-datastructures, jna, openwayback-core,
There are maybe transitive dependencies!

spring-batch-context from group org.digibooster.spring.batch (version 1.1.0-RELEASE)

The aim of this library is to allow developers who use spring-batch to propagate information from the main thread that runs the batch to the executions context of the batch items : ItemReader, ItemProcessor and ItemWriter. For example we some times need to extract the current user from Security Context, so instead of writing the code that passes the current user information as a job parameter we let this library to handle it. This library can be extended to support any information developer want to add.

Group: org.digibooster.spring.batch Artifact: spring-batch-context
Show all versions 
There is no JAR file uploaded. A download is not possible! Please choose another version.
0 downloads
Artifact spring-batch-context
Group org.digibooster.spring.batch
Version 1.1.0-RELEASE
Last update 17. February 2023
Organization not specified
URL https://github.com/zahidMed/spring-batch-context
License GNU General Public License v3.0
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

skrapeit-core from group it.skrape (version 1.0.0-alpha8)

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. First and foremost it aims to be a testing lib, but it can also be used to scrape websites in a convenient fashion.

Group: it.skrape Artifact: skrapeit-core
Show all versions Show documentation Show source 
 

0 downloads
Artifact skrapeit-core
Group it.skrape
Version 1.0.0-alpha8
Last update 14. October 2020
Organization not specified
URL https://docs.skrape.it
License MIT License
Dependencies amount 1
Dependencies kotlin-stdlib-jdk8,
There are maybe transitive dependencies!

skrape-it from group it.skrape (version 1.0.0-alpha7)

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. First and foremost it aims to be a testing lib, but it can also be used to scrape websites in a convenient fashion.

Group: it.skrape Artifact: skrape-it
Show documentation Show source 
 

0 downloads
Artifact skrape-it
Group it.skrape
Version 1.0.0-alpha7
Last update 04. October 2020
Organization not specified
URL https://docs.skrape.it
License MIT License
Dependencies amount 1
Dependencies kotlin-stdlib-jdk8,
There are maybe transitive dependencies!

cloudsdk-client from group com.ocrsdk.abbyy (version 1.0.0)

ABBYY Cloud OCR SDK allows to recognize printed text (OCR) and hand-printed text (ICR) on images, convert image-only documents to searchable PDF or editable Microsoft Office formats, extract data from receipts, business cards and IDs through the simple HTTP API. This Java library provides classes and methods to access ABBYY Cloud OCR SDK API endpoints.

Group: com.ocrsdk.abbyy Artifact: cloudsdk-client
Show all versions Show documentation Show source 
 

0 downloads
Artifact cloudsdk-client
Group com.ocrsdk.abbyy
Version 1.0.0
Last update 24. January 2020
Organization not specified
URL https://github.com/abbyy/cloudsdk-client-java
License The Apache License, Version 2.0
Dependencies amount 1
Dependencies jackson-databind,
There are maybe transitive dependencies!

abnffuzzer from group com.github.nradov (version 0.2.2)

Fuzz tester for Augmented Backus-Naur Form (ABNF) rules as defined in IETF RFC 5234. This library can directly parse a variety of input sources and extract the ABNF rules, then generate random output matching those rules. That output can be fed in a test framework such as JUnit to search for defects in an application which implements the rules. The most common use case would be testing implementations of other IETF RFCs.

Group: com.github.nradov Artifact: abnffuzzer
Show all versions Show documentation Show source 
 

0 downloads
Artifact abnffuzzer
Group com.github.nradov
Version 0.2.2
Last update 07. December 2019
Organization not specified
URL https://github.com/nradov/${project.artifactId}
License GNU Lesser General Public License, version 2.1
Dependencies amount 3
Dependencies commons-cli, antlr4, antlr4-runtime,
There are maybe transitive dependencies!

rxtx-rebundled from group de.root1 (version 2.1-7r2-6)

rxtx-rebundled is a slightly modified version of the original rxtx 2.1-7r2. The only difference is: 1) It's a working mavenized library 2) There's just one JAR required. Nothing else. The native libs are bundled with the jar-file. 3) You don't have to care about "java.library.path". This version will on startup extract the correct native libs to system temp folder and loads them automatically. When JVM terminates, temp files will be removed.

Group: de.root1 Artifact: rxtx-rebundled
Show all versions Show documentation Show source 
 

2 downloads
Artifact rxtx-rebundled
Group de.root1
Version 2.1-7r2-6
Last update 09. May 2015
Organization not specified
URL http://dev.root1.de/projects/rxtx-rebundled/
License GNU LESSER GENERAL PUBLIC LICENSE Version 3, 29 June 2007
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

pdf-extractor from group de.cit-ec.scie (version 2.0.1)

This is an optimized version of Apache PDFBox. It allows to extract the rough structure of a document (pages, blocks of text and paragraphs as well as formatting information) and was made with the intent to optimize text extraction results for scientific papers. The output can easily be transformed to plaintext (toString) or to an XML format (toXML).

Group: de.cit-ec.scie Artifact: pdf-extractor
Show all versions Show documentation Show source 
 

11 downloads
Artifact pdf-extractor
Group de.cit-ec.scie
Version 2.0.1
Last update 10. December 2014
Organization not specified
URL http://openresearch.cit-ec.de/projects/scie/
License The GNU Affero General Public License, Version 3
Dependencies amount 1
Dependencies pdfbox,
There are maybe transitive dependencies!



Page 29 from 30 (items total 296)


© 2015 - 2024 Weber Informatics LLC | Privacy Policy