JSTOR_findassocs

object returned by the function JSTOR_unpack1grams.

unpack1grams

the object returned by the function JSTOR_dtmofnouns. A Document Term Matrix containing the documents.

nouns

The word to calculate the correlations with

word

the number years to aggregate documents by. For example, n = 5 (the default value) will create groups of all documents published in non-overlapping five year ranges. Note that high n values combined with high plimit and corlimit values will severly filter the output. For exploratory data analysis it's recommended to start with low n values and work up.

The lower threshold value of the Pearson correlation statistic (default is 0.4).

corlimit

The lower threshold value of the Pearson correlation statistic (default is 0.05).

plimit

An integer for the number of top ranking words to plot. For example, topn = 20 (the default value) will plot the top 20 words for each range of years.

topn

An integer to control the maximum size of the text in the plot

biggest

logical.  If TRUE attempts to run the function on multiple 
cores.  Note that this may actually be slower if you have one core, limited memory or if 
the data set is small due to communication of data between the cores.

parallel


Generates a plot of the top n words in all the documents that positively correlate with a given word, in ranges of years. For use with JSTOR's Data for Research datasets (http://dfr.jstor.org/). For best results, repeat the function after adding common words to the stopword list. To learn more about editing the stopword list, see the help for the JSTOR_dtmofnouns function.


Simple exploratory text mining and document clustering of journal
articles from JSTOR's Data for Research service. Go to
\url{http://dfr.jstor.org/}, make a request for data (specify CSV as outout
format and Word Counts as data type), then once you get a zip file, unzip
it and start with one of the unpack functions and then you're ready to go
with any of the other functions. For more details on installation and
usage, see \url{https://github.com/benmarwick/JSTORr/}

JSTOR_findassocs: Plot the words with the strongest correlation with a given word, by time intervals

Description

Usage

Arguments

Value

Examples