All Downloads are FREE. Search and download functionalities are using the official Maven repository.

Download JAR files tagged by extracting with all dependencies

Search JAR files by class name

boilerpipe from group de.l3s.boilerpipe (version 1.1.0)

The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page. The library already provides specific strategies for common tasks (for example: news article extraction) and may also be easily extended for individual problem settings. Extracting content is very fast (milliseconds), just needs the input document (no global or site-level information required) and is usually quite accurate. Boilerpipe is a Java library written by Christian Kohlschütter. It is released under the Apache License 2.0. The algorithms used by the library are based on (and extending) some concepts of the paper "Boilerplate Detection using Shallow Text Features" by Christian Kohlschütter et al., presented at WSDM 2010 -- The Third ACM International Conference on Web Search and Data Mining New York City, NY USA.

Group: de.l3s.boilerpipe Artifact: boilerpipe
Show all versions Show documentation Show source 
 

10 downloads
Artifact boilerpipe
Group de.l3s.boilerpipe
Version 1.1.0
Last update 03. November 2010
Organization not specified
URL http://code.google.com/p/boilerpipe/
License Apache License 2.0
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

openimaj from group org.openimaj (version 1.3.10)

OpenIMAJ (Open Intelligent Multimedia in Java) is a collection of libraries and tools for multimedia analysis written in the Java programming language. OpenIMAJ intends to be the first truly complete multimedia analysis library and contains modules for analysing images, videos, text, audio and even webpages. The OpenIMAJ image and video analysis and feature extraction modules contain methods for processing visual content and extracting state-of-the-art features, including SIFT. The OpenIMAJ clustering and nearest-neighbour libraries contain efficient, multi-threaded implementations of clustering algorithms including Hierarchical K-Means and Approximate K-Means. The clustering library makes it possible to easily create visual-bag-of-words representations for images and video with very large vocabularies. The text-analysis modules contain implementations of a statistical language classifier and low-level processing pipeline. A number of modules deal with content creation, including interactive slideshows and animations. The hardware integration modules allow cross-platform integration with devices including webcams, the Microsoft Kinect, and even devices such as GPS's. OpenIMAJ also incorporates a number of tools to enable extremely-large-scale multimedia analysis using a distributed computing approach based on Apache Hadoop.

Group: org.openimaj Artifact: openimaj
Show all versions 
There is no JAR file uploaded. A download is not possible! Please choose another version.
0 downloads
Artifact openimaj
Group org.openimaj
Version 1.3.10
Last update 09. February 2020
Organization The University of Southampton
URL http://www.openimaj.org
License New BSD
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!



Page 32 from 32 (items total 312)


© 2015 - 2024 Weber Informatics LLC | Privacy Policy