⚠️There's a newer version (4.3.1) of this package.Take me there.

quanteda (version 0.9.6-9)

Quantitative Analysis of Textual Data

Description

A fast, flexible toolset for for the management, processing, and quantitative analysis of textual data in R.

Copy Link

Version

Install

install.packages('quanteda')

Monthly Downloads

17,344

Version

0.9.6-9

License

GPL-3

Issues

Pull Requests

Maintainer

Kenneth Benoit

Last Published

May 20th, 2016

Functions in quanteda (0.9.6-9)

convert

convert a dfm to a non-quanteda format

docnames

get or set document names

compress

compress a dfm by combining similarly named dimensions

features

extract the feature labels from a dfm

lexdiv

calculate lexical diversity

ndoc

get the number of documents or features

ntoken

count the number of tokens or types

print.tokenizedTexts

print a tokenizedTexts objects

sample

Randomly sample documents or features

changeunits

change the document units of a corpus

compute (weighted) term frequency from a dfm

encodedTextFiles

a .zip file of texts containing a variety of differently encoded texts

docfreq

#' @rdname weight #' @return weight, x with no type argument queries the weighting applied to the dfm, returning setMethod("weight", signature(c("dfm", "MISSING")), function(x) { if (isS4(x)) x@weighting else attr(x, "weighting") })

head.dfm

Return the first or last part of a dfm

removeFeatures

remove features from an object

metacorpus

get or set corpus metadata

subset.corpus

extract a subset of a corpus

cbind.dfm

Combine dfm objects by Rows or Columns

corpus

constructor for corpus objects

dfm

create a document-feature matrix

show,dictionary-method

print a dictionary object

corpusSource-class

corpus source classes

encodedTexts

encoded texts for testing

textmodel_NB

Naive Bayes classifier for texts

sort a dfm by one or more margins

topfeatures

list the most frequent features

settings

Get or set the corpus settings

weight

weight the feature frequencies in a dfm

tokenize

tokenize a set of texts

LBGexample

dfm with example data from Table 1 of Laver Benoit and Garry (2003)

docvars

get or set for document-level variables

ngrams

Create ngrams and skipgrams

dfm-class

Virtual class "dfm" for a document-feature matrix

tfidf

compute tf-idf weights from a dfm

as.data.frame,dfm-method

coerce a dfm to a data.frame

metadoc

get or set document-level meta-data

kwic

List key words in context from a text or a corpus of texts.

collocations

Detect collocations from text

plot.kwic

plot a dispersion plot of key word(s)

nsentence

count the number of sentences

summary.corpus

summarize a corpus or a vector of texts

readability

calculate readability

trim

Trim a dfm using threshold-based or random feature selection

phrasetotoken

convert phrases into single tokens

textmodel_wordfish

wordfish text model

selectFeatures

select features from an object

textmodel_ca

correspondence analysis of a document-feature matrix

textfile

read a text corpus source from a file

toLower

Convert texts to lower case

textmodel_wordscores

Wordscores text model

segment

segment texts into component elements

predict.textmodel_NB_fitted

prediction method for Naive Bayes classifier objects

print.dfm

print a dfm object

textmodel_fitted-class

the fitted textmodel classes

syllables

count syllables in a text

detect the encoding of texts

textmodel

fit a text model

ie2010Corpus

Irish budget speeches from 2010

inaugCorpus

A corpus of US presidential inaugural addresses from 1789-2013

stopwords

access built-in stopwords

plot.dfm

plot features as a wordcloud

wordlists

word lists used in some readability indexes

applyDictionary

apply a dictionary or thesarus to an object

mobydickText

Project Gutenberg text of Herman Melville's Moby Dick

exampleString

A paragraph of text for testing various text-based functions

quanteda-package

An R package for the quantitative analysis of textual data

scrabble

compute the Scrabble letter values of text

Quantitative Analysis of Textual Data

Description

Copy Link

Version

Install

Monthly Downloads

Version

License

Issues

Pull Requests

Stars

Forks

Repository

Maintainer

Last Published

Functions in quanteda (0.9.6-9)