⚠️There's a newer version (4.3.1) of this package.Take me there.

quanteda (version 0.9.0-1)

Quantitative Analysis of Textual Data

Description

A fast, flexible toolset for for the management, processing, and quantitative analysis of textual data in R.

Copy Link

Version

Install

install.packages('quanteda')

Monthly Downloads

17,344

Version

0.9.0-1

License

GPL-3

Issues

Pull Requests

Maintainer

Kenneth Benoit

Last Published

November 27th, 2015

Functions in quanteda (0.9.0-1)

encodedTexts

encoded texts for testing

exampleString

A paragraph of text for testing various text-based functions

LBGexample

dfm containing example data from Table 1 of Laver Benoit and Garry (2003)

kwic

List key words in context from a text or a corpus of texts.

ndoc

get the number of documents or features

syllables

count syllables in a text

collocations

Detect collocations from text

cbind.dfm

combine dfm object by columns

corpusSource-class

corpus source classes

textmodel_wordscores

Wordscores text model

changeunits

change the document units of a corpus

docvars

get or set for document-level variables

applyDictionary

apply a dictionary or thesarus to an object

dfm

create a document-feature matrix

convert

convert a dfm to a non-quanteda format

features

extract the feature labels from a dfm

corpus

constructor for corpus objects

print.tokenizedTexts

print a tokenizedTexts objects

head.dfm

Return the first or last part of a dfm

dictionary

create a dictionary

lexdiv

calculate lexical diversity

docnames

get or set document names

stopwords

access built-in stopwords

metacorpus

get or set corpus metadata

ie2010Corpus

Irish budget speeches from 2010

readability

calculate readability

selectFeatures

select features from an object

dfm-class

Virtual class "dfm" for a document-feature matrix

textfile

read a text corpus source from a file

textmodel_fitted-class

the fitted textmodel classes

An R package for the quantitative analysis of textual data.

tokenize

tokenize a set of texts

encoding

detect the encoding of texts

ngrams

Create ngrams and skipgrams

trim

Trim a dfm using threshold-based or random feature selection

wordlists

word lists used in some readability indexes

phrasetotoken

convert phrases into single tokens

print.dfm

print a dfm object

sample.corpus

Randomly sample documents or features

plot.kwic

plot a dispersion plot of key word(s)

texts

get corpus texts

toLower

Convert texts to lower case

docfreq

get the document frequency of a feature

show,dictionary-method

print a dictionary object

nsentence

count the number of sentences

ntoken

count the number of tokens or types

topfeatures

list the most frequent features

textmodel_ca

correspondence analysis of a document-feature matrix

settings

Get or set the corpus settings

subset.corpus

extract a subset of a corpus

scrabble

compute the Scrabble letter values of text

sort.dfm

sort a dfm by one or more margins

segment

segment texts into component elements

removeFeatures

remove features from an object

wordstem

stem words

weight

Weight the feature frequencies in a dfm by various methods

inaugCorpus

A corpus of US presidential inaugural addresses from 1789-2013

metadoc

get or set document-level meta-data

plot.dfm

plot features as a wordcloud

summary.corpus

summarize a corpus or a vector of texts

Quantitative Analysis of Textual Data

Description

Copy Link

Version

Install

Monthly Downloads

Version

License

Issues

Pull Requests

Stars

Forks

Repository

Maintainer

Last Published

Functions in quanteda (0.9.0-1)