Learn R Programming

JSTORr (version 1.0.20161214)

JSTOR_unpack_multiple_archives: Unpacks multiple zip files of JSTOR journal articles and bibliographic data to a Document Term Matrix of 1-grams

Description

Import journal articles and bibliographic data from multiple downloaded zipfiles, and reshape ready for simple text mining. For use with JSTOR's Data for Research datasets (http://dfr.jstor.org/).

Usage

JSTOR_unpack_multiple_archives(mydir = getwd())

Arguments

mydir
path to directory containing multiple zip files dowloaded from dfr.jstor.org (default is the current working directory)

Value

Returns a list of two items. First is "wordcounts", a Document Term Matrix of 1-grams, and second is 'bibliodata', a data frame of bibliographic information for all articles.

Examples

Run this code
## multiple_archives <- JSTOR_unpack_multiple_archives(mydir = "~/my_data")

Run the code above in your browser using DataLab