⚠️There's a newer version (4.3.1) of this package.Take me there.

quanteda (version 0.8.0-3)

Quantitative Analysis of Textual Data

Description

A fast, flexible toolset for for the management, processing, and quantitative analysis of textual data in R.

Copy Link

Version

Install

install.packages('quanteda')

Monthly Downloads

17,344

Version

0.8.0-3

License

GPL-3

Issues

Pull Requests

Maintainer

Kenneth Benoit

Last Published

July 11th, 2015

Functions in quanteda (0.8.0-3)

ngrams

Create ngrams

removeFeatures

remove features from an object

textfile

read a text corpus source from a file

LBGexample

dfm containing example data from Table 1 of Laver Benoit and Garry (2003)

docfreq

get the document frequency of a feature

encoding

detect the encoding of texts

features

extract the feature labels from a dfm

kwic

List key words in context from a text or a corpus of texts.

collocations

Detect collocations from text

get or set corpus metadata

word lists used in some readability indexes

metadoc

get or set document-level meta-data

textmodel_ca

correspondence analysis of a document-feature matrix

changeunits

change the document units of a corpus

corpusSource-class

corpus source classes

segment

segment texts into component elements

ukimmigTexts

weight

Weight the feature frequencies in a dfm by various methods

toLower

Convert texts to lower case

subset.corpus

extract a subset of a corpus

docvars

get or set for document-level variables

syllables

count syllables in a text

docnames

get or set document names

textmodel_wordscores

Wordscores text model

textmodel_fitted-class

the fitted textmodel classes

quanteda-package

An R package for the quantitative analysis of textual data.

settings

Get or set the corpus settings

plot.dfm

plot features as a wordcloud

dfm

create a document-feature matrix

topfeatures

list the most frequent features

dfm-class

Virtual class "dfm" for a document-feature matrix

lexdiv

calculate lexical diversity

sort.dfm

sort a dfm by one or more margins

ie2010Corpus

Irish budget speeches from 2010

corpus

constructor for corpus objects

convert

convert a dfm to a non-quanteda format

stopwords

access built-in stopwords

textmodel_wordfish

wordfish text model

nsentence

count the number of sentences

textmodel

fit a text model

exampleString

A paragraph of text for testing various text-based functions

trim

Trim a dfm using threshold-based or random feature selection

clean

simple cleaning of text before processing

ndoc

get the number of documents or features

texts

get corpus texts

inaugCorpus

A corpus of US presidential inaugural addresses from 1789-2013

tokenize

tokenize a set of texts

ntoken

count the number of tokens or types

encodedTexts

encoded texts for testing

phrasetotoken

convert phrases into single tokens

readability

calculate readability

summary.corpus

summarize a corpus or a vector of texts

Quantitative Analysis of Textual Data

Description

Copy Link

Version

Install

Monthly Downloads

Version

License

Issues

Pull Requests

Stars

Forks

Repository

Maintainer

Last Published

Functions in quanteda (0.8.0-3)