All Downloads are FREE. Search and download functionalities are using the official Maven repository.

Download JAR files tagged by crawlers with all dependencies

Search JAR files by class name

storm-crawler from group com.digitalpebble (version 0.7)

A collection of resources for building low-latency, scalable web crawlers on Apache Storm.

Group: com.digitalpebble Artifact: storm-crawler
Show all versions 
There is no JAR file uploaded. A download is not possible! Please choose another version.
0 downloads
Artifact storm-crawler
Group com.digitalpebble
Version 0.7
Last update 03. November 2015
Organization DigitalPebble Ltd
URL https://github.com/DigitalPebble/storm-crawler
License The Apache License, Version 2.0
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

su-sdk from group com.searchunify (version 1.0.2)

The SearchUnify SDK enables developers to easily work with the SearchUnify platform and build scalable solutions with search, analytics, crawlers and more.

Group: com.searchunify Artifact: su-sdk
Show all versions Show documentation Show source 
 

0 downloads
Artifact su-sdk
Group com.searchunify
Version 1.0.2
Last update 24. January 2023
Organization not specified
URL https://github.com/searchunify/su-sdk-java
License MIT License
Dependencies amount 7
Dependencies jackson-core, jackson-annotations, jackson-databind, jackson-module-afterburner, okhttp, log4j-core, log4j-api,
There are maybe transitive dependencies!

crawler-detect from group org.nekosoft.utils (version 1.0.0)

A Java port of crawlerdetect.io, a PHP class for detecting bots/crawlers/spiders via the user agent and http_from header

Group: org.nekosoft.utils Artifact: crawler-detect
Show all versions Show documentation Show source 
 

0 downloads
Artifact crawler-detect
Group org.nekosoft.utils
Version 1.0.0
Last update 01. September 2022
Organization not specified
URL https://github.com/nekosoftllc/crawler-detect/wiki
License The Apache License, Version 2.0
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

feku from group io.github.siddharthgoel88 (version 1.0.0)

Java utility to get User Agents from an exhaustive list of browsers, crawlers and many other softwares.

Group: io.github.siddharthgoel88 Artifact: feku
Show documentation Show source 
 

6 downloads
Artifact feku
Group io.github.siddharthgoel88
Version 1.0.0
Last update 30. December 2015
Organization not specified
URL https://github.com/siddharthgoel88/feku
License MIT License
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

bot-detector from group ch.javacamp.bot-detector (version 1.0.1)

A small, fast and dependency less Java library to detect and verify bots and crawlers on the basis of user-agents and IP addresses. The authenticity of a bot can be verified with reverse dns lookups.

Group: ch.javacamp.bot-detector Artifact: bot-detector
Show all versions Show documentation Show source 
 

0 downloads
Artifact bot-detector
Group ch.javacamp.bot-detector
Version 1.0.1
Last update 28. March 2023
Organization Javacamp.ch
URL https://github.com/UeliKurmann/bot-detector
License mit
Dependencies amount 0
Dependencies No dependencies
There are maybe transitive dependencies!

opensearchserver from group com.jaeksoft (version 1.5.14)

OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, ...) and the REST/RESTFul API you will be able to integrate quickly and easily advanced full-text search capabilities in your application. OpenSearchServer runs on Windows and Linux/Unix/BSD.

Group: com.jaeksoft Artifact: opensearchserver
Show all versions Show documentation Show source 
 

1 downloads
Artifact opensearchserver
Group com.jaeksoft
Version 1.5.14
Last update 09. August 2016
Organization not specified
URL http://www.open-search-server.com
License General Public License, Version 3.0
Dependencies amount 89
Dependencies jsp-api, cxf-rt-frontend-jaxws, cxf-rt-frontend-jaxrs, cxf-rt-transports-http-hc, lucene-core, lucene-analyzers, lucene-memory, lucene-queries, lucene-spellchecker, httpclient, nekohtml, tagsoup, fastutil, gson, icu4j, google-api-data-youtube-v2, httpmime, args4j, quartz-commonj, jsoup, json, xml-apis, jedis, apache-mime4j-core, commons-email, htmlcleaner, imgscalr-lib, fluent-hc, httpcore, httpcore-nio, httpasyncclient, pdfbox-ant, icepdf-core, zkbind, zul, zkplus, zhtml, hadoop-client, commons-codec, poi, poi-excelant, poi-ooxml, poi-ooxml-schemas, poi-scratchpad, simple-odf, batik-awt-util, batik-dom, batik-svg-dom, batik-util, batik-xml, sanselan, json-simple, commons-net, jaudiotagger, slf4j-log4j12, zk, woodstox-core-asl, jaxb-api, cxf-rt-rs-extension-providers, jackson-core, jackson-jaxrs-xml-provider, jackson-jaxrs-json-provider, json-path, dropbox-core-sdk, selenium-java, selenium-remote-driver, htmlunit-driver, phantomjsdriver, commons-compress, rome, lucene-stempel, langdetect, bcprov-jdk15, bcmail-jdk15, antlr4-runtime, opencsv, groovy-all, mongo-java-driver, jmimemagic, hunspell-bridj, RoaringBitmap, mysql-connector-java, jcifs-krb5-jdk7, postgresql, hsqldb, derby, iijdbc, jtds, sqlite-jdbc,
There are maybe transitive dependencies!



Page 2 from 2 (items total 16)


© 2015 - 2024 Weber Informatics LLC | Privacy Policy