⚠️There's a newer version (4.3.1) of this package.Take me there.

quanteda (version 0.7.2-1)

Quantitative Analysis of Textual Data

Description

A fast, flexible toolset for for the management, processing, and quantitative analysis of textual data in R.

Copy Link

Version

Install

install.packages('quanteda')

Monthly Downloads

17,344

Version

0.7.2-1

License

GPL-3

Issues

Pull Requests

Maintainer

Kenneth Benoit

Last Published

June 2nd, 2015

Functions in quanteda (0.7.2-1)

lexdiv

calculate lexical diversity

docfreq

get the document frequency of a feature

ngrams

Create ngrams

clean

simple cleaning of text before processing

kwic

List key words in context from a text or a corpus of texts.

removeFeatures

remove features from an object

ntoken

count the number of tokens

textmodel_wordscores

Wordscores text model

dfm-class

Virtual class "dfm" for a document-feature matrix

inaugCorpus

A corpus of US presidential inaugural addresses from 1789-2013

settings

Get or set the corpus settings

summary.corpus

summarize a corpus or a vector of texts

topfeatures

list the most frequent features

print.dfm

print a dfm object

collocations

Detect collocations from text

textmodel_ca

correspondence analysis of a document-feature matrix

tokenize

tokenize a set of texts

docvars

get or set for document-level variables

docnames

get or set document names

ie2010Corpus

Irish budget speeches from 2010

bigrams

Create bigrams

textmodel_fitted-class

the fitted textmodel classes

dfm

create a document-feature matrix

encoding

get the encoding of documents in a corpus

stopwords

access built-in stopwords

syllables

count syllables in a text

subset.corpus

extract a subset of a corpus

exampleString

A paragraph of text for testing various text-based functions

textmodel_wordfish

wordfish text model

corpusSource-class

corpus source classes

metacorpus

get or set corpus metadata

ndoc

get the number of documents or features

phrasetotoken

convert phrases into single tokens

plot.dfm

plot features as a wordcloud

metadoc

get or set document-level meta-data

quanteda-package

An R package for the quantitative analysis of textual data.

texts

get or set corpus texts

segment

segment texts into component elements

textfile

read a text corpus source from a file

tokenizeOnly

LBGexample

dfm containing example data from Table 1 of Laver Benoit and Garry (2003)

corpus

constructor for corpus objects

convert

convert a dfm to a non-quanteda format

dictionary-class

create a dictionary

language

get or set the language of corpus documents

features

extract the feature labels from a dfm

sort.dfm

sort a dfm by one or more margins

weight

Weight the feature frequencies in a dfm by various methods

Trim a dfm using threshold-based or random feature selection

changeunits

change the document units of a corpus

wordstem

stem words

Quantitative Analysis of Textual Data

Description

Copy Link

Version

Install

Monthly Downloads

Version

License

Issues

Pull Requests

Stars

Forks

Repository

Maintainer

Last Published

Functions in quanteda (0.7.2-1)