Learn R Programming

⚠️There's a newer version (4.2.0) of this package.Take me there.

quanteda (version 0.8.0-4)

Quantitative Analysis of Textual Data

Description

A fast, flexible toolset for for the management, processing, and quantitative analysis of textual data in R.

Copy Link

Version

Install

install.packages('quanteda')

Monthly Downloads

20,607

Version

0.8.0-4

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Kenneth Benoit

Last Published

July 13th, 2015

Functions in quanteda (0.8.0-4)

nsentence

count the number of sentences
clean

simple cleaning of text before processing
convert

convert a dfm to a non-quanteda format
corpus

constructor for corpus objects
inaugCorpus

A corpus of US presidential inaugural addresses from 1789-2013
weight

Weight the feature frequencies in a dfm by various methods
settings

Get or set the corpus settings
texts

get corpus texts
changeunits

change the document units of a corpus
docnames

get or set document names
features

extract the feature labels from a dfm
encodedTexts

encoded texts for testing
dfm-class

Virtual class "dfm" for a document-feature matrix
topfeatures

list the most frequent features
dfm

create a document-feature matrix
readability

calculate readability
lexdiv

calculate lexical diversity
metadoc

get or set document-level meta-data
summary.corpus

summarize a corpus or a vector of texts
textfile

read a text corpus source from a file
LBGexample

dfm containing example data from Table 1 of Laver Benoit and Garry (2003)
textmodel_ca

correspondence analysis of a document-feature matrix
exampleString

A paragraph of text for testing various text-based functions
ngrams

Create ngrams
ie2010Corpus

Irish budget speeches from 2010
phrasetotoken

convert phrases into single tokens
collocations

Detect collocations from text
tokenize

tokenize a set of texts
docvars

get or set for document-level variables
docfreq

get the document frequency of a feature
ntoken

count the number of tokens or types
quanteda-package

An R package for the quantitative analysis of textual data.
removeFeatures

remove features from an object
sort.dfm

sort a dfm by one or more margins
print.dfm

print a dfm object
wordstem

stem words
similarity

compute similarities between documents and/or features
stopwords

access built-in stopwords
encoding

detect the encoding of texts
ndoc

get the number of documents or features
toLower

Convert texts to lower case
textmodel

fit a text model
wordlists

word lists used in some readability indexes
ukimmigTexts

Immigration-related sections of 2010 UK party manifestos
textmodel_wordfish

wordfish text model
bigrams

Create bigrams
syllables

count syllables in a text
corpusSource-class

corpus source classes
textmodel_wordscores

Wordscores text model
textmodel_fitted-class

the fitted textmodel classes
dictionary

create a dictionary
kwic

List key words in context from a text or a corpus of texts.
metacorpus

get or set corpus metadata
plot.dfm

plot features as a wordcloud
subset.corpus

extract a subset of a corpus
segment

segment texts into component elements
trim

Trim a dfm using threshold-based or random feature selection