⚠️There's a newer version (4.2.0) of this package.Take me there.

quanteda (version 0.9.6-1)

Quantitative Analysis of Textual Data

Description

A fast, flexible toolset for for the management, processing, and quantitative analysis of textual data in R.

Copy Link

Version

Install

install.packages('quanteda')

Monthly Downloads

14,820

Version

0.9.6-1

License

GPL-3

Issues

Pull Requests

Maintainer

Kenneth Benoit

Last Published

May 9th, 2016

Functions in quanteda (0.9.6-1)

metadoc

get or set document-level meta-data

encoding

detect the encoding of texts

ngrams

Create ngrams and skipgrams

encodedTextFiles

a .zip file of texts containing a variety of differently encoded texts

textmodel

fit a text model

settings

Get or set the corpus settings

textfile

read a text corpus source from a file

selectFeatures

select features from an object

weight

weight the feature frequencies in a dfm

wordlists

word lists used in some readability indexes

collocations

Detect collocations from text

applyDictionary

apply a dictionary or thesarus to an object

ie2010Corpus

Irish budget speeches from 2010

tfidf

compute tf-idf weights from a dfm

dfm

create a document-feature matrix

trim

Trim a dfm using threshold-based or random feature selection

topfeatures

list the most frequent features

dictionary

create a dictionary

head.dfm

Return the first or last part of a dfm

tokenize

tokenize a set of texts

LBGexample

dfm with example data from Table 1 of Laver Benoit and Garry (2003)

predict.textmodel_NB_fitted

prediction method for Naive Bayes classifier objects

docnames

get or set document names

quanteda-package

An R package for the quantitative analysis of textual data

sort.dfm

sort a dfm by one or more margins

syllables

count syllables in a text

textmodel_fitted-class

the fitted textmodel classes

subset.corpus

extract a subset of a corpus

textmodel_wordscores

Wordscores text model

textmodel_ca

correspondence analysis of a document-feature matrix

encodedTexts

encoded texts for testing

toLower

Convert texts to lower case

wordstem

stem words

dfm-class

Virtual class "dfm" for a document-feature matrix

inaugCorpus

A corpus of US presidential inaugural addresses from 1789-2013

ndoc

get the number of documents or features

plot.dfm

plot features as a wordcloud

convert

convert a dfm to a non-quanteda format

features

extract the feature labels from a dfm

ntoken

count the number of tokens or types

show,dictionary-method

print a dictionary object

plot.kwic

plot a dispersion plot of key word(s)

corpus

constructor for corpus objects

textmodel_wordfish

wordfish text model

kwic

List key words in context from a text or a corpus of texts.

nsentence

count the number of sentences

lexdiv

calculate lexical diversity

phrasetotoken

convert phrases into single tokens

print.dfm

print a dfm object

textmodel_NB

Naive Bayes classifier for texts

readability

calculate readability

exampleString

A paragraph of text for testing various text-based functions

ukimmigTexts

compress

compress a dfm by combining similarly named dimensions

cbind.dfm

Combine dfm objects by Rows or Columns

mobydickText

Project Gutenberg text of Herman Melville's Moby Dick

print.tokenizedTexts

print a tokenizedTexts objects

scrabble

compute the Scrabble letter values of text

sample

Randomly sample documents or features

segment

segment texts into component elements

docvars

get or set for document-level variables

texts

get corpus texts

changeunits

change the document units of a corpus

metacorpus

get or set corpus metadata

as.data.frame,dfm-method

coerce a dfm to a data.frame

docfreq

#' @rdname weight #' @return weight, x with no type argument queries the weighting applied to the dfm, returning setMethod("weight", signature(c("dfm", "MISSING")), function(x) { if (isS4(x)) x@weighting else attr(x, "weighting") })

removeFeatures

remove features from an object

summary.corpus

summarize a corpus or a vector of texts

stopwords

access built-in stopwords

corpusSource-class

corpus source classes

compute (weighted) term frequency from a dfm

Quantitative Analysis of Textual Data

Description

Copy Link

Version

Install

Monthly Downloads

Version

License

Issues

Pull Requests

Stars

Forks

Repository

Maintainer

Last Published

Functions in quanteda (0.9.6-1)