There's a newer version (0.1.5) of this package.
textreg (version 0.1.3)
n-Gram Text Regression, aka Concise Comparative Summarization
Description
Function for sparse regression on raw text, regressing a labeling vector onto a feature space consisting of all possible phrases.
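To make "a feature space consisting of all possible phrases" concrete, here is a minimal base-R sketch that enumerates contiguous word phrases in a toy corpus and counts them per document. It does not use textreg and is not the package's implementation. The corpus, the helper `ngrams`, and the `max_n` cap are assumptions made for illustration only.

```r
# Toy corpus: three short "documents" (made up for illustration).
docs <- c("the cat sat", "the cat ran", "a dog sat")

# Hypothetical helper: all contiguous word phrases of length 1..max_n in one string.
ngrams <- function(text, max_n = 2) {
  words <- strsplit(text, " ", fixed = TRUE)[[1]]
  out <- character(0)
  for (n in seq_len(min(max_n, length(words)))) {
    starts <- seq_len(length(words) - n + 1)
    out <- c(out, vapply(starts, function(i) {
      paste(words[i:(i + n - 1)], collapse = " ")
    }, character(1)))
  }
  out
}

# Phrase vocabulary across the whole corpus.
phrases <- sort(unique(unlist(lapply(docs, ngrams))))

# Document-by-phrase count matrix: one row per document, one column per phrase.
counts <- t(vapply(docs, function(d) {
  as.integer(table(factor(ngrams(d), levels = phrases)))
}, integer(length(phrases))))
colnames(counts) <- phrases

print(counts)
```

Even with phrases capped at two words, three tiny documents yield eleven candidate features. This rapid growth is why a sparse regression, one that keeps only a few phrases, is a natural fit for this kind of feature space.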
Available versions: 0.1.5, 0.1.4, 0.1.3, 0.1.2, 0.1.1
Install
install.packages('textreg')
Monthly Downloads: 630
Version: 0.1.3
License: GPL (>= 2)
Maintainer: Luke Miratrix
Last Published: November 11th, 2015
Functions in textreg (0.1.3)
make.count.table
Count number of times documents have a given phrase.
phrase.matrix
Make matrix of where phrases appear in corpus.
phrase.count
Count phrase appearance.
predict.textreg.result
Predict labeling with the selected phrases.
save.corpus.to.files
Save corpus to text (and RData) file.
plot.textreg.result
Plot the sequence of features as they are introduced with the textreg gradient descent program.
find.CV.C
K-fold cross-validation to determine optimal tuning parameter.
print.fragment.sample
Pretty print results of phrase sampling object.
make_search_phrases
Convert phrases to appropriate search string.
dirtyBathtub
Sample of raw-text OSHA accident summaries.
cluster.phrases
Cluster phrases based on similarity of appearance.
path.matrix.chart
Plot optimization path of textreg.
clean.text
Clean text and get it ready for textreg.
stem.corpus
Stem corpus with annotation.
is.fragment.sample
Is object a fragment.sample object?
reformat.textreg.model
Clean up output from textreg.
make.path.matrix
Generate matrix describing gradient descent path of textreg.
tm_gregexpr
Call gregexpr on the content of a tm Corpus.
cpp_build.corpus
Driver function for the C++ function.
bathtub
Sample of cleaned OSHA accident summaries.
convert.tm.to.character
Convert tm corpus to vector of strings.
list.table.chart
Graphic showing multiple word lists side-by-side.
make.phrase.matrix
Make a table of where phrases appear in a corpus.
sample.fragments
Sample fragments of text to contextualize a phrase.
make.appearance.matrix
Make phrase appearance matrix from textreg result.
calc.loss
Calculate total loss of model (Squared hinge loss).
make.list.table
Collate multiple regression runs.
testCorpora
Some small, fake test corpora.
find.threshold.C
Conduct permutation test on labeling to get null distribution of regularization parameter.
build.corpus
Build a corpus that can be used in the textreg call.
is.textreg.corpus
Is object a textreg.corpus object?
make.CV.chart
Plot K-fold cross-validation curves.
textreg
Sparse regression of labeling vector onto all phrases in a corpus.
make.similarity.matrix
Calculate similarity matrix for set of phrases.
is.textreg.result
Is object a textreg.result object?
print.textreg.result
Pretty print results of textreg regression.
print.textreg.corpus
Pretty print textreg corpus object.
cpp_textreg
Driver function for the C++ function.
grab.fragments
Grab all fragments in a corpus with given phrase.
textreg-package
Sparse regression package for text that allows for multiple word phrases.
make.phrase.correlation.chart
Generate visualization of phrase overlap.
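
A few of the functions listed above can be combined into a short session. The sketch below is hedged: it assumes the package is installed, that the bundled testCorpora data contains a `testI` entry with `corpus` and `labelI` elements, and that `textreg` takes the corpus, the labeling vector, and a vector of banned phrases as its first three arguments, with `C` as the regularization parameter. Check the help pages for the installed version before relying on any of these details.

```r
# Runs only if textreg is available, e.g. after install.packages('textreg').
if (requireNamespace("textreg", quietly = TRUE)) {
  library(textreg)

  # testCorpora: the small, fake test corpora shipped with the package.
  data(testCorpora)
  corpus <- testCorpora$testI$corpus   # assumed element name
  labels <- testCorpora$testI$labelI   # assumed element name

  # Regress the labeling onto phrases; c() means no banned phrases.
  # C = 1 is an arbitrary choice for illustration, not a recommended value.
  res <- textreg(corpus, labels, c(), C = 1, verbosity = 0)

  # Dispatches to print.textreg.result for a readable summary.
  print(res)
}
```

In practice `C` would be chosen with find.CV.C or find.threshold.C rather than fixed by hand.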