
Create a ZIP file source.
ZipSource(zipfile,
pattern = NULL,
recursive = FALSE,
ignore.case = FALSE,
mode = "text")
An object inheriting from ZipSource, SimpleSource, and Source.
zipfile: A character string with the full path name of a ZIP file.
pattern: An optional regular expression. Only file names in the ZIP file which match the regular expression will be returned.
recursive: Logical. Should the listing recurse into directories?
ignore.case: Logical. Should pattern matching be case-insensitive?
mode: A character string specifying if and how files should be read in. Available modes are:
"": No read. In this case getElem and pGetElem only deliver URIs.
"binary": Files are read in binary raw mode (via readBin).
"text": Files are read as text (via readLines).
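The difference between the three modes can be illustrated with the base R readers named above. This is a minimal sketch that does not call tm itself: the temporary file stands in for one file extracted from a ZIP archive, and the "file://" prefix used for the URI is an assumption made for illustration, not necessarily the exact form tm produces.

```r
# A small text file stands in for one extracted archive member (assumption).
path <- tempfile(fileext = ".txt")
writeLines(c("first line", "second line"), path)

# mode = "text": content is read line by line into a character vector.
as_text <- readLines(path)

# mode = "binary": content is read as a raw vector of bytes.
as_binary <- readBin(path, what = "raw", n = file.size(path))

# mode = "": nothing is read; only a location is kept.
# The "file://" prefix here is illustrative only.
as_uri <- paste0("file://", path)

file.remove(path)
```

Text mode suits plain-text documents, while binary mode leaves decoding to a later reader, which is useful when the files are not plain text.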
A ZIP file source extracts a compressed ZIP file via unzip and interprets each file as a document.
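How pattern, recursive, and ignore.case might select files after extraction can be sketched with base R's list.files, which takes arguments of the same names. This is an illustration under assumptions: the temporary directory and its file names are hypothetical stand-ins for extracted archive contents, and the sketch does not claim to reproduce tm's internal code.

```r
# A temporary directory stands in for the extracted ZIP contents (assumption).
root <- tempfile("extracted")
dir.create(file.path(root, "sub"), recursive = TRUE)
writeLines("a", file.path(root, "notes.txt"))
writeLines("b", file.path(root, "README.TXT"))
writeLines("c", file.path(root, "sub", "deep.txt"))

# Non-recursive, case-sensitive match: only top-level names ending in ".txt".
top_level <- list.files(root, pattern = "\\.txt$")

# Recursive, case-insensitive match: also reaches into "sub" and accepts ".TXT".
everything <- list.files(root, pattern = "\\.txt$",
                         recursive = TRUE, ignore.case = TRUE)

unlink(root, recursive = TRUE)
```

With the defaults recursive = FALSE and ignore.case = FALSE, files in subdirectories and files whose extension differs only in case would be skipped in this sketch.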
See Source for basic information on the source infrastructure employed by package tm.
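library(tm)
# Build a ZIP archive from the sample texts shipped with tm.
zipfile <- tempfile()
files <- Sys.glob(file.path(system.file("texts", "txt", package = "tm"), "*"))
zip(zipfile, files)
# The zip utility appends ".zip" when the target name has no extension.
zipfile <- paste0(zipfile, ".zip")
# Read the archive as a corpus and inspect the first document.
Corpus(ZipSource(zipfile, recursive = TRUE))[[1]]
file.remove(zipfile)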