RDocumentation
Moon
Learn R
Search all packages and functions
⚠️
There's a newer version (4.0.1) of this package.
Take me there.
quanteda (version 0.9.6-9)
Quantitative Analysis of Textual Data
Description
A fast, flexible toolset for for the management, processing, and quantitative analysis of textual data in R.
Copy Link
Copy
Link to current version
Version
Version
4.0.1
4.0.0
3.3.1
3.3.0
3.2.4
3.2.3
3.2.2
3.2.1
3.2.0
3.1.0
3.0.0
2.1.2
2.1.1
2.1.0
2.0.1
2.0.0
1.5.2
1.5.1
1.5.0
1.4.3
1.4.1
1.4.0
1.3.14
1.3.13
1.3.4
1.3.0
1.2.0
1.1.1
1.0.0
0.99.22
0.99.12
0.99.9
0.99
0.9.9-65
0.9.9-50
0.9.9-24
0.9.9-22
0.9.9-17
0.9.9-3
0.9.8.5
0.9.8.3
0.9.8
0.9.7-17
0.9.6-9
0.9.6-1
0.9.4
0.9.2-0
0.9.0-1
0.8.6-0
0.8.4-2
0.8.2-1
0.8.2-0
0.8.0-4
0.8.0-3
0.7.2-1
Down Chevron
Install
install.packages('quanteda')
Monthly Downloads
18,454
Version
0.9.6-9
License
GPL-3
Issues
49
Pull Requests
2
Stars
824
Forks
187
Repository
http://github.com/kbenoit/quanteda
Maintainer
Kenneth Benoit
Last Published
May 20th, 2016
Functions in quanteda (0.9.6-9)
Search functions
convert
convert a dfm to a non-quanteda format
docnames
get or set document names
compress
compress a dfm by combining similarly named dimensions
features
extract the feature labels from a dfm
lexdiv
calculate lexical diversity
ndoc
get the number of documents or features
ntoken
count the number of tokens or types
print.tokenizedTexts
print a tokenizedTexts objects
sample
Randomly sample documents or features
changeunits
change the document units of a corpus
tf
compute (weighted) term frequency from a dfm
encodedTextFiles
a .zip file of texts containing a variety of differently encoded texts
docfreq
#' @rdname weight #' @return
weight, x
with no
type
argument queries the weighting applied to the dfm, returning setMethod("weight", signature(c("dfm", "MISSING")), function(x) { if (isS4(x)) x@weighting else attr(x, "weighting") })
head.dfm
Return the first or last part of a dfm
removeFeatures
remove features from an object
metacorpus
get or set corpus metadata
subset.corpus
extract a subset of a corpus
cbind.dfm
Combine dfm objects by Rows or Columns
corpus
constructor for corpus objects
dfm
create a document-feature matrix
show,dictionary-method
print a dictionary object
corpusSource-class
corpus source classes
encodedTexts
encoded texts for testing
textmodel_NB
Naive Bayes classifier for texts
texts
get corpus texts
ukimmigTexts
Immigration-related sections of 2010 UK party manifestos
sort.dfm
sort a dfm by one or more margins
similarity
compute similarities between documents and/or features
topfeatures
list the most frequent features
settings
Get or set the corpus settings
weight
weight the feature frequencies in a dfm
tokenize
tokenize a set of texts
LBGexample
dfm with example data from Table 1 of Laver Benoit and Garry (2003)
docvars
get or set for document-level variables
ngrams
Create ngrams and skipgrams
dfm-class
Virtual class "dfm" for a document-feature matrix
tfidf
compute tf-idf weights from a dfm
as.data.frame,dfm-method
coerce a dfm to a data.frame
metadoc
get or set document-level meta-data
kwic
List key words in context from a text or a corpus of texts.
collocations
Detect collocations from text
plot.kwic
plot a dispersion plot of key word(s)
nsentence
count the number of sentences
summary.corpus
summarize a corpus or a vector of texts
readability
calculate readability
trim
Trim a dfm using threshold-based or random feature selection
phrasetotoken
convert phrases into single tokens
textmodel_wordfish
wordfish text model
selectFeatures
select features from an object
textmodel_ca
correspondence analysis of a document-feature matrix
textfile
read a text corpus source from a file
toLower
Convert texts to lower case
textmodel_wordscores
Wordscores text model
segment
segment texts into component elements
predict.textmodel_NB_fitted
prediction method for Naive Bayes classifier objects
print.dfm
print a dfm object
textmodel_fitted-class
the fitted textmodel classes
syllables
count syllables in a text
wordstem
stem words
dictionary
create a dictionary
encoding
detect the encoding of texts
textmodel
fit a text model
ie2010Corpus
Irish budget speeches from 2010
inaugCorpus
A corpus of US presidential inaugural addresses from 1789-2013
stopwords
access built-in stopwords
plot.dfm
plot features as a wordcloud
wordlists
word lists used in some readability indexes
applyDictionary
apply a dictionary or thesarus to an object
mobydickText
Project Gutenberg text of Herman Melville's
Moby Dick
exampleString
A paragraph of text for testing various text-based functions
quanteda-package
An R package for the quantitative analysis of textual data
scrabble
compute the Scrabble letter values of text