All Downloads are FREE. Search and download functionalities are using the official Maven repository.

Download JAR files tagged by crawleractions with all dependencies


SiteCrawler from group io.github.jasperroel (version 1.0.0)

This project provides a simple WebCrawler with retry-capabilities, functionality to distinguish between http/https sites. It biggest feature is that it allows for plugins (or CrawlerActions), which allows you to hook your scripts into the crawling process. It also allow for setting "blocked" URLs. Those URLs or patterns will not be crawled.

Group: io.github.jasperroel Artifact: SiteCrawler
Show documentation Show source 
Download SiteCrawler.jar (1.0.0)
 

0 downloads
Artifact SiteCrawler
Group io.github.jasperroel
Version 1.0.0
Last update 30. July 2018
Organization Salesforce.com
URL https://github.com/forcedotcom/SiteCrawler
License The BSD 2-Clause License
Dependencies amount 3
Dependencies jcl-over-slf4j, htmlunit, commons-lang,
There are maybe transitive dependencies!



Page 1 from 1 (items total 1)


© 2015 - 2024 Weber Informatics LLC | Privacy Policy