Rdocumentation
powered by
Learn R Programming
⚠️
There's a newer version (4.2.0) of this package.
Take me there.
quanteda (version 0.7.2-1)
Quantitative Analysis of Textual Data
Description
A fast, flexible toolset for for the management, processing, and quantitative analysis of textual data in R.
Copy Link
Link to current version
Version
Version
4.2.0
4.1.0
4.0.2
4.0.1
4.0.0
3.3.1
3.3.0
3.2.4
3.2.3
3.2.2
3.2.1
3.2.0
3.1.0
3.0.0
2.1.2
2.1.1
2.1.0
2.0.1
2.0.0
1.5.2
1.5.1
1.5.0
1.4.3
1.4.1
1.4.0
1.3.14
1.3.13
1.3.4
1.3.0
1.2.0
1.1.1
1.0.0
0.99.22
0.99.12
0.99.9
0.99
0.9.9-65
0.9.9-50
0.9.9-24
0.9.9-22
0.9.9-17
0.9.9-3
0.9.8.5
0.9.8.3
0.9.8
0.9.7-17
0.9.6-9
0.9.6-1
0.9.4
0.9.2-0
0.9.0-1
0.8.6-0
0.8.4-2
0.8.2-1
0.8.2-0
0.8.0-4
0.8.0-3
0.7.2-1
Install
install.packages('quanteda')
Monthly Downloads
24,111
Version
0.7.2-1
License
GPL-3
Issues
59
Pull Requests
2
Stars
854
Forks
187
Repository
http://github.com/kbenoit/quanteda
Maintainer
Kenneth Benoit
Last Published
June 2nd, 2015
Functions in quanteda (0.7.2-1)
Search all functions
lexdiv
calculate lexical diversity
docfreq
get the document frequency of a feature
ngrams
Create ngrams
clean
simple cleaning of text before processing
kwic
List key words in context from a text or a corpus of texts.
removeFeatures
remove features from an object
ntoken
count the number of tokens
textmodel_wordscores
Wordscores text model
dfm-class
Virtual class "dfm" for a document-feature matrix
inaugCorpus
A corpus of US presidential inaugural addresses from 1789-2013
settings
Get or set the corpus settings
summary.corpus
summarize a corpus or a vector of texts
topfeatures
list the most frequent features
print.dfm
print a dfm object
collocations
Detect collocations from text
textmodel_ca
correspondence analysis of a document-feature matrix
tokenize
tokenize a set of texts
docvars
get or set for document-level variables
docnames
get or set document names
ie2010Corpus
Irish budget speeches from 2010
bigrams
Create bigrams
textmodel_fitted-class
the fitted textmodel classes
dfm
create a document-feature matrix
encoding
get the encoding of documents in a corpus
stopwords
access built-in stopwords
syllables
count syllables in a text
subset.corpus
extract a subset of a corpus
exampleString
A paragraph of text for testing various text-based functions
textmodel_wordfish
wordfish text model
corpusSource-class
corpus source classes
metacorpus
get or set corpus metadata
ndoc
get the number of documents or features
phrasetotoken
convert phrases into single tokens
plot.dfm
plot features as a wordcloud
metadoc
get or set document-level meta-data
quanteda-package
An R package for the quantitative analysis of textual data.
texts
get or set corpus texts
segment
segment texts into component elements
textfile
read a text corpus source from a file
tokenizeOnly
tokenizeOnly
LBGexample
dfm containing example data from Table 1 of Laver Benoit and Garry (2003)
corpus
constructor for corpus objects
convert
convert a dfm to a non-quanteda format
dictionary-class
create a dictionary
language
get or set the language of corpus documents
features
extract the feature labels from a dfm
sort.dfm
sort a dfm by one or more margins
weight
Weight the feature frequencies in a dfm by various methods
textmodel
fit a text model
ukimmigTexts
Immigration-related sections of 2010 UK party manifestos
trim
Trim a dfm using threshold-based or random feature selection
changeunits
change the document units of a corpus
similarity
compute similarities between documents and/or features
wordstem
stem words