All Downloads are FREE. Search and download functionalities are using the official Maven repository.

org.archive.modules.fetcher.FetchFTP_en.utf8 Maven / Gradle / Ivy

Go to download

This project contains some of the configurable modules used within the Heritrix application to crawl the web. The modules in this project can be used in applications other than Heritrix, however.

There is a newer version: 3.5.0
Show newest version
description:
FTP Fetcher.


extract-from-dirs-description:
Set to true to extract further URIs from FTP directories. Default is 
true. 


extract-parent-description:
Set to true to extract the parent URI from all FTP URIs. Default is true. 


fetch-bandwidth-description:
The maximum KB/sec to use when fetching data from a server. The default 
of 0 means no maximum. 


max-length-bytes-description:
Maximum length in bytes to fetch. Fetch is truncated at this length. A 
value of 0 means no limit. 


password-description:
The password to send to FTP servers. By convention, anonymous users send 
their email address in this field. 


timeout-seconds-description:
If the fetch is not completed in this number of seconds, give up (and 
retry later). 


username-description:
The username to send to FTP servers. By convention, the default value of 
"anonymous" is used for publicly available FTP sites. 






© 2015 - 2024 Weber Informatics LLC | Privacy Policy