There's a newer version (4.2.0) of this package.
quanteda (version 0.9.2-0)
Quantitative Analysis of Textual Data
Description
A fast, flexible toolset for the management, processing, and quantitative analysis of textual data in R.
Install
install.packages('quanteda')
Monthly Downloads: 22,704
Version: 0.9.2-0
License: GPL-3
Issues: 59
Pull Requests: 2
Stars: 854
Forks: 187
Repository: http://github.com/kbenoit/quanteda
Maintainer: Kenneth Benoit
Last Published: January 11th, 2016
Functions in quanteda (0.9.2-0)
applyDictionary: apply a dictionary or thesaurus to an object
dictionary: create a dictionary
convert: convert a dfm to a non-quanteda format
features: extract the feature labels from a dfm
ndoc: get the number of documents or features
encodedTexts: encoded texts for testing
tfidf: compute tf-idf weights from a dfm
dfm: create a document-feature matrix
exampleString: A paragraph of text for testing various text-based functions
metacorpus: get or set corpus metadata
ntoken: count the number of tokens or types
docnames: get or set document names
kwic: List key words in context from a text or a corpus of texts.
encoding: detect the encoding of texts
textmodel: fit a text model
phrasetotoken: convert phrases into single tokens
head.dfm: Return the first or last part of a dfm
plot.dfm: plot features as a wordcloud
cbind.dfm: combine dfm objects by columns
settings: Get or set the corpus settings
inaugCorpus: A corpus of US presidential inaugural addresses from 1789-2013
nsentence: count the number of sentences
textfile: read a text corpus source from a file
dfm-class: Virtual class "dfm" for a document-feature matrix
docfreq: compute the (weighted) document frequency of a feature
ngrams: Create ngrams and skipgrams
corpusSource-class: corpus source classes
metadoc: get or set document-level meta-data
similarity: compute similarities between documents and/or features
textmodel_fitted-class: the fitted textmodel classes
changeunits: change the document units of a corpus
quanteda-package: An R package for the quantitative analysis of textual data.
collocations: Detect collocations from text
textmodel_wordscores: Wordscores text model
corpus: constructor for corpus objects
trim: Trim a dfm using threshold-based or random feature selection
textmodel_wordfish: wordfish text model
removeFeatures: remove features from an object
ie2010Corpus: Irish budget speeches from 2010
LBGexample: dfm containing example data from Table 1 of Laver, Benoit and Garry (2003)
sample.corpus: Randomly sample documents or features
ukimmigTexts: Immigration-related sections of 2010 UK party manifestos
weight: weight the feature frequencies in a dfm by various methods
plot.kwic: plot a dispersion plot of key word(s)
readability: calculate readability
wordstem: stem words
scrabble: compute the Scrabble letter values of text
syllables: count syllables in a text
print.tokenizedTexts: print a tokenizedTexts object
subset.corpus: extract a subset of a corpus
docvars: get or set document-level variables
tokenize: tokenize a set of texts
lexdiv: calculate lexical diversity
sort.dfm: sort a dfm by one or more margins
summary.corpus: summarize a corpus or a vector of texts
show,dictionary-method: print a dictionary object
segment: segment texts into component elements
textmodel_ca: correspondence analysis of a document-feature matrix
tf: compute (weighted) term frequency from a dfm
toLower: Convert texts to lower case
stopwords: access built-in stopwords
texts: get corpus texts
selectFeatures: select features from an object
print.dfm: print a dfm object
wordlists: word lists used in some readability indexes
topfeatures: list the most frequent features