mulk


NAME

mulk - Non-interactive multi-connection network downloader with image filtering and Metalink support.

CONTENTS

Name
Synopsis
Description
General Options
Download Options
Recursive Download Options
Input Options
Metalink Options
Reporting Options
Saving Options
Image Options
Bugs
See Also
Author

SYNOPSIS

mulk [OPTION]... [URL]...

DESCRIPTION

Multi-connection command line tool for downloading Internet sites with image filtering and Metalink support. Similar to wget and cURL, but it manages up to 50 simultaneous and parallel links. Main features are: HTML code parsing, recursive fetching, Metalink retrieving, segmented download and image filtering by width and height. It is based on libcurl, liburiparser, libtidy, libmetalink and libcrypto.

GENERAL OPTIONS

    --version Show Mulk version number.
-h, --help Show the help message and exit.
-q, --quiet Quiet (no output).
-v, --verbose Verbosity enabled.

DOWNLOAD OPTIONS

-U, --user-agent Identify as different agent instead of Mulk/VERSION.
-p, --max-sim-conns-per-host Maximum simultaneous connections per each host (default = 2, max value = 5).
-m, --max-sim-conns Maximum simultaneous connections (default = 50, max value = 50).
    --user Username to use for the connection.
    --password Password to use for the connection.
    --proxy <host[:port]> use HTTP proxy on given host and port.
    --cookie Cookie string for HTTP session. The string should be in the format "parameter1=value1; parameter2=value2".
    --load-cookies Load cookies from file for HTTP session. File should be in Netscape/Mozilla cookie file format.
    --save-cookies Save cookies to file after HTTP session. All cookies previously read from file, loaded from command line and sent by the remote server are written to the file.
    --exec-filter Execute an external program to modify or drop URLs to download.
Two parameters will be passed to the program, the first is the URL to analyse, second one is the level of depth where the URL was found parsing HTML. The standard output generated by the program is considered the new URL to enqueue for downloading. If no output is generated, the URL will be dropped.
URLs contained in Metalink files are ignored by this option.
For example, this script drops all URLs that contain a question mark:

#!/bin/sh
mark=‘echo "$1" | grep "?"‘ if [ "$mark" = "" ]; then echo "$1" fi

RECURSIVE DOWNLOAD OPTIONS

-d, --depth Maximum recursion depth (0 for infinite, default = 1).
    --no-html-dependencies Don’t get all images, links, etc. needed to display HTML page.
-k, --save-relative-links Make links relative in downloaded HTML pages.
-H, --span-hosts Go to foreign hosts.
-D, --domains Comma-separated list of accepted domains.
    --exclude-domains Comma-separated list of rejected domains.
    --follow-ftp Follow FTP links from HTML documents.

INPUT OPTIONS

-t, --option-file Text file with list of options. The file accepts only long options without ’--’.
The lines starting with ’#’ are skipped.
-f, --url-file Text file with list of URLs to download. The lines starting with ’#’ are skipped.

METALINK OPTIONS

-l, --metalink-file Metalink XML file to download.
    --metalink-list-file Text file with list of Metalink files to download.
The lines starting with ’#’ are skipped.
    --metalink-location Comma-separated list of accepted countries.
    --metalink-print-locations Print full list of valid countries.
    --metalink-continent Comma-separated list of accepted continents.
Valid values are: AF, AS, EU, NA, SA, OC, AN.
    --metalink-print-continents Print full list of valid continents.
    --metalink-os Operating system of the file to download.
    --metalink-language Language version of the file to download.
    --metalink-resume-file Partially downloaded Metalink file to resume. It needs a reference Metalink file.
    --follow-metalink Follow Metalink files from HTML documents.

REPORTING OPTIONS

-r, --report-file Generate a report in a text file.
-c, --report-csv-file Generate a report in a CSV file.
    --report-every-lines Write reports to files every n lines/downloads (0 for writing all reports at the end, default = 500).

SAVING OPTIONS

-x, --disable-site-save Don’t save whole site tree to disk (enabled by default).
-g, --save-gif-image Save GIF images to mime output directory.
-n, --save-png-image Save PNG images to mime output directory.
-j, --save-jpeg-image Save JPEG images to mime output directory.
    --save-mime-type Save URLs with specific mime type to output directory.
    --mime-output-dir Output directory to save files by mime types (default = ’data’).
    --file-output-dir Output directory to save downloaded files (default = ’.’).
    --temp-dir Temporary downloading directory (default = ’.tmp-mulk’).

IMAGE OPTIONS (active only with -g, -n or -j)

    --min-image-width Minimum image width.
    --max-image-width Maximum image width.
    --min-image-height Minimum image height.
    --max-image-height Maximum image height.

BUGS

Please report any bugs using the bug reporting form on the Mulk web site: http://sourceforge.net/projects/mulk

SEE ALSO

Additional information and the latest version is available at the web site: http://sourceforge.net/projects/mulk
A MediaWiki manual with other informations and some examples can be found here: http://sourceforge.net/apps/mediawiki/mulk

AUTHOR

Emanuele Bovisio <pocoyo at users.sourceforge dot net> wrote this manpage. Mulk was written by Emanuele Bovisio.


Generated by man2tidyhtml 1.1 through manServer 1.07 from mulk.1 using man macros.

Get mulk at SourceForge.net. Fast, secure and Free Open Source software downloads