Convenience wrappers for dfm convert
Immigration-related sections of 2010 UK party manifestos
data_corpus_dailnoconf1991
Confidence debate from 1991 Irish Parliament
Coerce a compressed corpus to a standard corpus
Trim a dfm using frequency threshold-based feature selection
Weight the feature frequencies in a dfm
Compute the (weighted) document frequency of a feature
Convert a dfm to a data.frame
redefinition of network::as.network()
Coercion and checking functions for dfm objects
Get or set document names
Coercion and checking functions for dictionary objects
Coerce various objects to statistics_textmodel
Assign the summary.textmodel class to a list
Extract model coefficients from a fitted textmodel_ca object
Compute lexical diversity from a dfm or tokens
Coercion, checking, and combining functions for tokens objects
Count the number of documents or features
Compute the Moving-Average Type-Token Ratio (MATTR)
Recast the document units of a corpus
Utility function to generate a nested list
Randomly sample documents from a corpus
Compute the Mean Segmental Type-Token Ratio (MSTTR)
Declare a compound character to be a sequence of separate pattern matches
predict.textmodel_affinity
Prediction for a fitted affinity textmodel
Internal data sets
US presidential inaugural address texts
Import a Lexicoder dictionary
A paragraph of text for testing various text-based functions
data_corpus_irishbudget2010
Irish budget speeches from 2010
Import a LIWC-formatted dictionary
Apply a dictionary to a dfm
as.coefficients_textmodel
Coerce various objects to coefficients_textmodel
This is a helper function used in summary.textmodel_*
.
Match the feature set of a dfm to given feature names
coerce a compressed corpus to a standard corpus
Weight a dfm by tf-idf
View methods for quanteda
Convert an fcm to an igraph object
Internal function to fit the likelihood scaling mixture model.
Coerce a dist_selection object to a matrix
Get or set the corpus settings
Function to assign multiple slots to a S4 object
Coerce a simil object into a matrix
Convert the case of the features of a dfm and combine
Create a feature co-occurrence matrix
Recombine a dfm or fcm by combining identical dimension elements
Sort an fcm in alphabetical order of the features
textmodel_affinity-internal
Internal methods for textmodel_affinity
Convert quanteda dictionary objects to the YAML format
Class affinity maximum likelihood text scaling model
Combine documents in a dfm by a grouping variable
Function extending base::attributes()
Coerce a dist object into a list
Generate a grouping vector from docvars
Grouping variable(s) for various functions
Get or set corpus metadata
Coerce a dictionary object into a list
Construct a corpus object
Wordfish text model
Coerce a dist into a dist
Coercion functions for fcm objects
Wordscores text model
Coerce a dfm to a matrix or data.frame
Base method extensions for corpus objects
Coerce a dist_selection object into a list
Create a dictionary
Return the first or last part of a corpus
Get or set document-level meta-data
Plot a fitted scaling model
dfm from data in Table 1 of Laver, Benoit, and Garry (2003)
Convert the case of character objects
Return the first or last part of a dfm
Remove sentences based on their token lengths or a pattern match
Check if font is available on the system
Remove sentences based on their token lengths or a pattern match
Bootstrap a dfm
Compute keyness (internal functions)
Locate keywords-in-context
Count the number of tokens or types
Create a document-feature matrix
Lexicoder Sentiment Dictionary (2015)
Plot features as a wordcloud
Pattern for feature, token and keyword matching
Select or remove tokens from a tokens object
Convert a dfm to an lsa "textmatrix"
dfm_split_hyphenated_features
Split a dfm's hyphenated features into constituent parts
Combine dfm objects by Rows or Columns
Function to serialized list-of-character tokens
Extract a subset of a dfm
Objects exported from other packages
Utility function to create a object with new set of attributes
Get word types from a tokens object
Utility function to remove empty keys
Segment texts on a pattern match
Deprecated name for nscrabble
Extract a subset of a corpus
Datasets with deprecated or defunct names
Simpler and faster version of expand.grid() in base package
Replace features in dfm
Randomly sample documents or features from a dfm
Select features from a dfm or fcm
influence.predict.textmodel_affinity
Compute feature influence from a predicted textmodel_affinity object
Virtual class "fcm" for a feature co-occurrence matrix
The fcm class of object is a special type of fcm object with
additional slots, described below. Sort a dfm by frequency of one or more margins
Select types without performing slow regex search
Virtual class "dfm" for a document-feature matrix
Check if patterns contains glob wildcard
Internal function for select_types()
to escape regular expressions
Get or set document-level variables
Internal functions for dfm objects
Internal function to merge values of duplicated keys
Check if a glob pattern is indexed by index_types
Get the feature labels from a dfm
Count the number of sentences
Raise warning of unused dots
Internal function for select_types()
to check if a string is a regular expression
Flatten a hierarchical dictionary into a list of character vectors
Return an error message
Count the Scrabble letter values of text
format a sparsity value for printing
Defunct form of nfeat
Prediction from a fitted textmodel_nb object
friendly_class_undefined_message
Print friendly object class not defined message
Count syllables in a text
predict.textmodel_wordfish
Prediction from a textmodel_wordfish method
Internal function to convert a list to a dictionary
An R package for the quantitative analysis of textual data
Get or set package options for quanteda
Converts a Matrix to a dfm
Print a dfm object
replace_dictionary_values
Internal function to replace dictionary values
Converts a Matrix to a fcm
lowercase_dictionary_values
Internal function to lowercase dictionary values
Sample a vector by a group
Extensions for and from spacy_parse objects
summary.textmodel_wordfish
summary method for textmodel_wordfish
Naive Bayes classifier for texts
Calculate lexical diversity
Compute the sparsity of a document-feature matrix
Tokenize a set of texts
Split tokens by a separator pattern
Latent Semantic Analysis
[Experimental] Compute document/feature proximity
Segment tokens object by chunks of a given size
Extract a subset of a tokens
Summary statistics on a character vector
Print a phrase object
print.statistics_textmodel
Implements print methods for textmodel_statistics
Import a Wordstat dictionary
Print a dist_selection object
print method for summary.textmodel
Internal function for textplot_wordcloud
Pattern matching using valuetype
print method for a wordfish model
Import a Yoshikoder dictionary file.
Identify and score multi-word expressions
Compute entropy of documents or features
deprecated name for dfm_weight
Deprecated form of dfm_tfidf
Convert regex and glob patterns to type IDs or fixed patterns
recompile a serialized tokens object
Convert various input as pattern to a vector used in tokens_select,
tokens_compound and kwic.
predict.textmodel_wordscores
Predict textmodel_wordscores
Internal function for special handling of multi-word dictionary values
print.coefficients_textmodel
Print methods for textmodel features estimates
This is a helper function used in print.summary.textmodel
.
summary.character method to override the network::summary.character()
Plot word keyness
Set values to a dfm's S4 slots
Plot a network of feature co-occurrences
Internal function for select_types
to search the index using
fastmatch.
Replace tokens in a tokens object
Set values to a fcm's S4 slots
Wordshoal text model (redirect)
Influence plot for text scaling models
Internal function for textplot_wordcloud
Convert token sequences into compound tokens
Internal functions to set dimnames
Summarize a corpus
Plot the dispersion of key word(s)
summary method for textmodel_nb objects
Get or assign corpus texts
Correspondence analysis of a document-feature matrix
Stem the terms in an object
Recombine documents tokens by groups
Identify the most frequent features in a dfm
Similarity and distance computation between documents or features
textmodel_lsa-postestimation
Post-estimations methods for textmodel_lsa
Similarity and distance computation between documents or features
Randomly sample documents from a tokens object
Tabulate feature frequencies
Segment tokens object by patterns
Calculate keyness statistics
Calculate readability
Select rows of textstat objects by glob, regex or fixed patterns
Apply a dictionary to a tokens object
Create ngrams and skipgrams from tokens
Convert the case of tokens
[Experimental] Change direction of words in tokens
Convert a dfm to a non-quanteda format