Learn R Programming

JSTORr (version 1.0.20161214)

JSTOR_unpack1grams: Unpack JSTOR journal articles and bibliographic data to a Document Term Matrix of 1-grams

Description

Import journal articles and bibliographic data from the downloaded zipfile and reshape ready for simple text mining. For use with JSTOR's Data for Research datasets (http://dfr.jstor.org/).

Usage

JSTOR_unpack1grams(path = getwd())

Arguments

path
full path to directory containing 'wordcounts' folder and the citations.tsv file. These are obtained after unzipping the file downloaded from DfR (you should unzip the file before running this function). Default is the working directory.

Value

Returns a list of two items. First is "wordcounts", a Document Term Matrix of 1-grams, and second is 'bibliodata', a data frame of bibliographic information for all articles.

Examples

Run this code
## unpack1grams <- JSTOR_unpack1grams(path = "C:/Users/marwick/Downloads/JSTOR") 

Run the code above in your browser using DataLab