{"name":"gutenberg_corpus","title":"Project Gutenberg Corpora","pagetitle":"Project Gutenberg Corpora — gutenberg_corpus","aliases":"gutenberg_corpus","author":[],"keywords":[],"description":"

Get a corpus of texts from Project Gutenberg.

","usage":"gutenberg_corpus(ids, filter = NULL, mirror = NULL, verbose = TRUE, ...)","arguments":[{"name":"ids","description":"

an integer vector of requested Gutenberg text IDs.

"},{"name":"filter","description":"

a text filter to set on the corpus.

"},{"name":"mirror","description":"

a character string giving the URL of the Gutenberg mirror to use,\n or NULL to choose a mirror automatically.

"},{"name":"verbose","description":"

a logical scalar indicating whether to print progress\n updates to the console.

"},{"name":"...","description":"

additional arguments passed to as_corpus.

"}],"has_args":true,"examples":"# NOT RUN {\n# get the texts of George Eliot's novels\neliot <- gutenberg_corpus(c(145, 550, 6688))\n# }\n","sections":[],"details":"

gutenberg_corpus downloads a set of texts from Project Gutenberg\nand returns a corpus with one row per text. Specify the texts to include\nby passing their Project Gutenberg IDs in the ids argument.

You can search for Project Gutenberg texts and get their IDs using the\ngutenberg_works function from the gutenbergr package.

","value":"

A corpus (data frame) with three columns: \"title\", \"author\",\nand \"text\".

","seealso":"

corpus_frame.

","package":{"package":"corpus","version":"0.10.1"}}

Description

Usage

Arguments

Value

Details

See Also

Examples
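
The usage and value described above can be sketched as follows. This is a minimal illustration, not the package's official example: it reuses the three IDs from the example field, assumes the corpus package is installed and that network access is available, and passes a text_filter with drop_punct = TRUE purely as an illustrative choice for the filter argument.

```r
library(corpus)

# IDs reused from the package example (George Eliot's novels)
ids <- c(145, 550, 6688)

# Downloads the texts, so this needs network access.
# verbose = FALSE suppresses the progress updates;
# the filter below is an illustrative assumption, not a recommended default.
eliot <- gutenberg_corpus(ids,
                          filter = text_filter(drop_punct = TRUE),
                          verbose = FALSE)

# Per the Value section, the result is a data frame with the
# columns "title", "author", and "text", one row per requested text.
names(eliot)
nrow(eliot)
eliot$title
```

Leaving filter and mirror at their NULL defaults, as in the original example, is the simplest call; the extra arguments are shown here only to make their roles concrete.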