All downloads are free. Search and download functionality uses the official Maven repository.

Download JAR files tagged with crawl, including all dependencies

crawl-tool from group com.github.houbb (version 1.0.0)

Crawl tool for Java (basic crawler utility classes).

Group: com.github.houbb Artifact: crawl-tool

Downloads: 0
Artifact: crawl-tool
Group: com.github.houbb
Version: 1.0.0
Last update: 20 September 2023
Organization: not specified
URL: not specified
License: The Apache Software License, Version 2.0
Dependency count: 8
Dependencies: log-integration, heaven, html2md, md2html, junit, jsoup, fastjson, commons-lang3
There may be transitive dependencies.

jlibs-xml-crawler from group in.jlibs (version 3.0.1)

Crawl XML(wsdl, xsd, xsl) documents

Group: in.jlibs Artifact: jlibs-xml-crawler

Downloads: 4
Artifact: jlibs-xml-crawler
Group: in.jlibs
Version: 3.0.1
Last update: 5 June 2021
Organization: not specified
URL: not specified
License: not specified
Dependency count: 1
Dependencies: jlibs-xml
There may be transitive dependencies.

metrics-crawler from group org.bytemechanics (version 1.0.2)

Little library to crawl metrics

Group: org.bytemechanics Artifact: metrics-crawler

Downloads: 0
Artifact: metrics-crawler
Group: org.bytemechanics
Version: 1.0.2
Last update: 24 May 2020
Organization: Byte Mechanics
URL: https://metrics-crawler.bytemechanics.org
License: Apache License 2.0
Dependency count: 0
Dependencies: none
There may be transitive dependencies.

poseidon from group com.github.houbb (version 0.0.1)

The web crawl base framework based on jsoup.

Group: com.github.houbb Artifact: poseidon

Downloads: 0
Artifact: poseidon
Group: com.github.houbb
Version: 0.0.1
Last update: 3 June 2019
Organization: not specified
URL: not specified
License: The Apache Software License, Version 2.0
Dependency count: 9
Dependencies: heaven, csv, junit, jsoup, okhttp, fastjson, lombok, log4j-api, log4j-core
There may be transitive dependencies.

logtrix from group org.netpreserve (version 0.1.0)

Parses and summarises Heritrix crawl logs

Group: org.netpreserve Artifact: logtrix

Downloads: 0
Artifact: logtrix
Group: org.netpreserve
Version: 0.1.0
Last update: 15 May 2019
Organization: not specified
URL: https://github.com/iipc/logtrix
License: Apache License, Version 2.0
Dependency count: 4
Dependencies: slf4j-api, jackson-databind, jackson-datatype-jsr310, guava
There may be transitive dependencies.

folder-source-reader from group org.ow2.weblab.service (version 1.0)

Use this component to crawl a folder with RDF weblab resources.

Group: org.ow2.weblab.service Artifact: folder-source-reader

Downloads: 0
Artifact: folder-source-reader
Group: org.ow2.weblab.service
Version: 1.0
Last update: 9 July 2010
Organization: not specified
URL: not specified
License: not specified
Dependency count: 1
Dependencies: slf4j-log4j12
There may be transitive dependencies.

registry-crawler-service from group gov.nasa.pds (version 1.1.0)

A web service to crawl a file system for PDS4 labels.

Group: gov.nasa.pds Artifact: registry-crawler-service

Downloads: 0
Artifact: registry-crawler-service
Group: gov.nasa.pds
Version: 1.1.0
Last update: 5 October 2023
Organization: not specified
URL: https://nasa-pds.github.io/${project-name}
License: not specified
Dependency count: 10
Dependencies: commons-cli, log4j-api, log4j-core, log4j-slf4j-impl, gson, amqp-client, activemq-client, jetty-server, jetty-servlet, registry-common
There may be transitive dependencies.

testcasegenerator from group com.crawljax.plugins (version 5.2.3)

Generates test cases from the crawl session.

Group: com.crawljax.plugins Artifact: testcasegenerator

Downloads: 0
Artifact: testcasegenerator
Group: com.crawljax.plugins
Version: 5.2.3
Last update: 1 June 2023
Organization: not specified
URL: http://crawljax.com
License: not specified
Dependency count: 13
Dependencies: velocity-engine-core, jgrapht-core, commons-configuration, commons-lang3, gson, xmlunit-core, diffutils, crawloverview-plugin, rtree, rxjava, guava-mini, commons-io, testng
There may be transitive dependencies.

crawler from group com.soulgalore (version 1.5.11)

Simple Java (1.6) crawler to crawl web pages on one and the same domain.

Group: com.soulgalore Artifact: crawler

Downloads: 0
Artifact: crawler
Group: com.soulgalore
Version: 1.5.11
Last update: 8 February 2014
Organization: not specified
URL: https://github.com/soulgalore/crawler
License: Apache License 2.0
Dependency count: 4
Dependencies: guice, jsoup, httpclient, commons-cli
There may be transitive dependencies.

heritrix-modules from group org.archive.heritrix (version 3.4.0-20240909)

This project contains some of the configurable modules used within the Heritrix application to crawl the web. The modules in this project can be used in applications other than Heritrix, however.

Group: org.archive.heritrix Artifact: heritrix-modules

Downloads: 0
Artifact: heritrix-modules
Group: org.archive.heritrix
Version: 3.4.0-20240909
Last update: 9 September 2024
Organization: not specified
URL: not specified
License: not specified
Dependency count: 9
Dependencies: heritrix-commons, bsh, groovy-jsr223, groovy-templates, jetty-server, jetty-security, crawler-commons, jsch, pdfbox
There may be transitive dependencies.



Page 1 of 2 (18 items total)


© 2015 - 2024 Weber Informatics LLC