All downloads are free. Search and download use the official Maven repository.

Download JAR files tagged by detection with all dependencies

toolbox-utils from group de.uni_leipzig.asv.toolbox (version 1.0)

ASV Toolbox is a modular collection of tools for the exploration of written language data. The tools work on either word lists or text and solve several linguistic classification and clustering tasks. The topics covered include language detection, POS tagging, base form reduction, named entity recognition, and terminology extraction. On a more abstract level, the algorithms deal with various kinds of word similarity, using pattern-based and statistical approaches. The collection can be used to work on large real-world data sets as well as to study the underlying algorithms. The ASV Toolbox can work on plain text files and connect to a MySQL database. While it is designed especially for corpora of the Leipzig Corpora Collection, it can easily be adapted to other sources.

Group: de.uni_leipzig.asv.toolbox Artifact: toolbox-utils

0 downloads
Artifact toolbox-utils
Group de.uni_leipzig.asv.toolbox
Version 1.0
Last update 13. August 2013
Organization not specified
URL http://wortschatz.uni-leipzig.de/~cbiemann/software/toolbox/
License MIT License
Dependencies amount 0
Dependencies No dependencies
There may be transitive dependencies!
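
The group, artifact, and version fields listed for an entry are its Maven coordinates. As a minimal sketch, they can be mapped to a JAR location using the standard Maven 2 repository layout. The base URL `https://repo1.maven.org/maven2` and the names `MavenCoordinates` and `jarUrl` are assumptions made for illustration and do not come from this page. The sketch only builds the conventional path; it does not check that the artifact is actually published there.

```java
// Sketch: map Maven coordinates to the conventional Maven 2 repository path.
// The base URL and the class/method names are illustrative assumptions.
final class MavenCoordinates {
    // Assumed repository root; substitute a mirror if needed.
    static final String BASE_URL = "https://repo1.maven.org/maven2";

    /** Builds base/group-as-path/artifact/version/artifact-version.jar */
    static String jarUrl(String groupId, String artifactId, String version) {
        // Dots in the group ID become directory separators.
        String groupPath = groupId.replace('.', '/');
        return BASE_URL + "/" + groupPath + "/" + artifactId + "/" + version
                + "/" + artifactId + "-" + version + ".jar";
    }

    public static void main(String[] args) {
        // Coordinates taken from the toolbox-utils entry above.
        System.out.println(jarUrl("de.uni_leipzig.asv.toolbox", "toolbox-utils", "1.0"));
    }
}
```

The same function applies to the other entries on this page by passing their group, artifact, and version values.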

boilerpipe from group de.l3s.boilerpipe (version 1.1.0)

The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page. The library already provides specific strategies for common tasks (for example, news article extraction) and can easily be extended for individual problem settings. Extraction is very fast (milliseconds), needs only the input document (no global or site-level information), and is usually quite accurate. Boilerpipe is a Java library written by Christian Kohlschütter and released under the Apache License 2.0. Its algorithms build on and extend concepts from the paper "Boilerplate Detection using Shallow Text Features" by Christian Kohlschütter et al., presented at WSDM 2010, the Third ACM International Conference on Web Search and Data Mining, New York City, NY, USA.

Group: de.l3s.boilerpipe Artifact: boilerpipe

10 downloads
Artifact boilerpipe
Group de.l3s.boilerpipe
Version 1.1.0
Last update 03. November 2010
Organization not specified
URL http://code.google.com/p/boilerpipe/
License Apache License 2.0
Dependencies amount 0
Dependencies No dependencies
There may be transitive dependencies!

groupdocs-metadata from group com.groupdocs (version 18.5)

GroupDocs.Metadata for Java is a class library to process metadata associated with various Document, Image, CAD, Audio, Video and Archive formats.

Key Features:
- Covers most popular metadata standards: XMP, EXIF, IPTC, Image Resource Blocks, ID3, document properties
- Covers most popular document formats: Microsoft Word, Microsoft Excel, Microsoft PowerPoint, PDF, Microsoft OneNote, Microsoft Visio, Open Document Format
- Covers most popular image formats: BMP, GIF, DjVu, JPEG, PNG, TIFF, PSD, WebP, WMF, EMF, DICOM
- Covers most popular email formats: Outlook Message, Email Message
- Covers most popular audio formats: MP3, WAV
- Covers most popular video formats: AVI, MOV
- Create, modify and remove metadata associated with supported document and image formats with a few lines of code
- Manage EXIF metadata in JPEG and TIFF formats
- Manage XMP metadata in image and PDF formats
- Manage Image Resource Blocks in image formats
- Manage audio metadata: ID3 tag (ID3v1, ID3v2), Lyrics3 tag, APE
- Utilities to inspect and clean hidden metadata in document formats
- Utilities to search and compare all metadata
- Utilities to export metadata to Excel/CSV
- Metadata cleaner utility
- MIME type detection
- Read track changes; accept or reject track changes
- Read EXIF maker-notes: Sony, Nikon, Canon, Panasonic

For more details on the library, please visit the GroupDocs website at: https://products.groupdocs.com/metadata/Java

Note: The library has some limitations in evaluation mode. To test the full features of the GroupDocs.Metadata for Java library, please request a free 30-day temporary license.

Group: com.groupdocs Artifact: groupdocs-metadata

6 downloads
Artifact groupdocs-metadata
Group com.groupdocs
Version 18.5
Last update 10. May 2018
Organization not specified
URL https://products.groupdocs.com/metadata/Java
License GroupDocs License, Version 1.0
Dependencies amount 0
Dependencies No dependencies
There may be transitive dependencies!



Page 46 of 46 (453 items total)


© 2015 - 2024 Weber Informatics LLC | Privacy Policy