Learn R Programming

⚠️There's a newer version (0.1.5) of this package.Take me there.

textreg (version 0.1.3)

n-Gram Text Regression, aka Concise Comparative Summarization

Description

Function for sparse regression on raw text, regressing a labeling vector onto a feature space consisting of all possible phrases.

Copy Link

Version

Install

install.packages('textreg')

Monthly Downloads

479

Version

0.1.3

License

GPL (>= 2)

Maintainer

Luke Miratrix

Last Published

November 11th, 2015

Functions in textreg (0.1.3)

make.count.table

Count number of times documents have a given phrase.
phrase.matrix

Make matrix of where phrases appear in corpus.
phrase.count

Count phrase appearance.
predict.textreg.result

Predict labeling with the selected phrases.
save.corpus.to.files

Save corpus to text (and RData) file.
plot.textreg.result

Plot the sequence of features as they are introduced with the textreg gradient descent program.
find.CV.C

K-fold cross-validation to determine optimal tuning parameter
print.fragment.sample

Pretty print results of phrase sampling object.
make_search_phrases

Convert phrases to appropriate search string.
dirtyBathtub

Sample of raw-text OSHA accident summaries.
cluster.phrases

Cluster phrases based on similarity of appearance.
path.matrix.chart

Plot optimization path of textreg.
clean.text

Clean text and get it ready for textreg.
stem.corpus

Step corpus with annotation.
is.fragment.sample

Is object a fragment.sample object?
reformat.textreg.model

Clean up output from textreg.
make.path.matrix

Generate matrix describing gradient descent path of textreg.
tm_gregexpr

Call gregexpr on the content of a tm Corpus.
cpp_build.corpus

Driver function for the C++ function.
bathtub

Sample of cleaned OSHA accident summaries.
convert.tm.to.character

Convert tm corpus to vector of strings.
list.table.chart

Graphic showing multiple word lists side-by-side.
make.phrase.matrix

Make a table of where phrases appear in a corpus
sample.fragments

Sample fragments of text to contextualize a phrase.
make.appearance.matrix

Make phrase appearance matrix from textreg result.
calc.loss

Calculate total loss of model (Squared hinge loss).
make.list.table

Collate multiple regression runs.
testCorpora

Some small, fake test corpora.
find.threshold.C

Conduct permutation test on labeling to get null distribution of regularization parameter.
build.corpus

Build a corpus that can be used in the textreg call.
is.textreg.corpus

Is object a textreg.corpus object?
make.CV.chart

Plot K-fold cross validation curves
textreg

Sparse regression of labeling vector onto all phrases in a corpus.
make.similarity.matrix

Calculate similarity matrix for set of phrases.
is.textreg.result

Is object a textreg.result object?
print.textreg.result

Pretty print results of textreg regression.
print.textreg.corpus

Pretty print textreg corpus object
cpp_textreg

Driver function for the C++ function.
grab.fragments

Grab all fragments in a corpus with given phrase.
textreg-package

Sparse regression package for text that allows for multiple word phrases.
make.phrase.correlation.chart

Generate visualization of phrase overlap.