sparkwarc (version 0.1.1)

cc_warc: Provides WARC paths for commoncrawl.org

Description

Provides WARC file paths for Common Crawl (commoncrawl.org), for use with spark_read_warc.

Usage

cc_warc(start, end = start)

Arguments

start

Index of the first path to retrieve.

end

Index of the last path to retrieve. Defaults to start, so a single path is retrieved when end is omitted.

Examples

# NOT RUN {
cc_warc(1)
cc_warc(2, 3)

# }
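
The description says cc_warc is meant to be used with spark_read_warc. A minimal sketch of that pairing might look like the following. The helper read_cc_warc_range is hypothetical (not part of sparkwarc), and the argument order passed to spark_read_warc (connection, table name, path) is an assumption to check against that function's own help page.

```r
# Hypothetical helper, not part of sparkwarc: read a range of
# Common Crawl WARC paths into a Spark table.
read_cc_warc_range <- function(sc, start, end = start, name = "warc") {
  # cc_warc() provides the path(s) for the requested range
  paths <- sparkwarc::cc_warc(start, end)
  # Assumption: spark_read_warc() accepts a connection, a table name, and a path
  sparkwarc::spark_read_warc(sc, name, paths)
}

# Usage (needs a Spark connection and access to the Common Crawl data):
# sc <- sparklyr::spark_connect(master = "local")
# read_cc_warc_range(sc, 1)
# sparklyr::spark_disconnect(sc)
```

The package calls are namespace-qualified, so defining the helper does not require attaching sparkwarc or sparklyr until it is actually called.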
