coerce a compressed corpus to a standard corpus
as.coefficients_textmodel
Coerce various objects to coefficients_textmodel
This is a helper function used in summary.textmodel_*
.
Convert an fcm to an igraph object
Convert a dfm to a data.frame
Coerce a compressed corpus to a standard corpus
Coerce a dfm to a matrix or data.frame
Coerce a dist_selection object into a list
Coercion, checking, and combining functions for tokens objects
Coercion and checking functions for dfm objects
Coercion and checking functions for dictionary objects
Coerce a dist object into a list
Convert quanteda dictionary objects to the YAML format
Function extending base::attributes()
Combine dfm objects by Rows or Columns
Bootstrap a dfm
Convert the case of character objects
Construct a corpus object
Assign the summary.textmodel class to a list
Compute the Moving-Average Type-Token Ratio (MATTR)
Coerce various objects to statistics_textmodel
Compute lexical diversity from a dfm or tokens
Recast the document units of a corpus
Extract a subset of a corpus
Internal function to fit the likelihood scaling mixture model.
View methods for quanteda
as.matrix,textstat_simil_sparse-method
as.matrix method for textstat_simil_sparse
data_corpus_dailnoconf1991
Confidence debate from 1991 Irish Parliament
redefinition of network::as.network()
Compute the Mean Segmental Type-Token Ratio (MSTTR)
Convert a dfm to a non-quanteda format
A paragraph of text for testing various text-based functions
Base method extensions for corpus objects
US presidential inaugural address texts
Remove sentences based on their token lengths or a pattern match
Immigration-related sections of 2010 UK party manifestos
Internal functions for dfm objects
Create a document-feature matrix
Weight the feature frequencies in a dfm
Check if font is available on the system
convert same-value pairs to NA in a textstat_proxy object
Extract model coefficients from a fitted textmodel_ca object
Convenience wrappers for dfm convert
Lexicoder Sentiment Dictionary (2015)
Virtual class "dfm" for a document-feature matrix
dfm from data in Table 1 of Laver, Benoit, and Garry (2003)
data_corpus_irishbudget2010
Irish budget speeches from 2010
Remove sentences based on their token lengths or a pattern match
Create a feature co-occurrence matrix
Sort an fcm in alphabetical order of the features
Return the first or last part of a dfm
Utility function to create a object with new set of attributes
Weight a dfm by tf-idf
Segment texts on a pattern match
Extract a subset of a dfm
Randomly sample documents from a corpus
Combine documents in a dfm by a grouping variable
Apply a dictionary to a dfm
friendly_class_undefined_message
Print friendly object class not defined message
Get or set document-level variables
Internal function for select_types()
to escape regular expressions
Internal data sets
Datasets with deprecated or defunct names
Convert a dfm to an lsa "textmatrix"
Generate a grouping vector from docvars
Converts a Matrix to a dfm
Return the first or last part of a textstat_proxy object
Match the feature set of a dfm to given feature names
Recombine a dfm or fcm by combining identical dimension elements
Sort a dfm by frequency of one or more margins
dfm_split_hyphenated_features
Split a dfm's hyphenated features into constituent parts
Compute keyness (internal functions)
Converts a Matrix to a fcm
Get or set corpus metadata
Replace features in dfm
Locate keywords-in-context
Coerce a dictionary object into a list
Create a dictionary
Select features from a dfm or fcm
Randomly sample documents or features from a dfm
Compute the frequencies of features
Convert the case of the features of a dfm and combine
Trim a dfm using frequency threshold-based feature selection
Compute the (weighted) document frequency of a feature
Get or set document names
Prediction from a fitted textmodel_nb object
Get or set document-level meta-data
Grouping variable(s) for various functions
Return the first or last part of a corpus
Count the number of sentences
Count the number of tokens or types
Count syllables in a text
Pattern for feature, token and keyword matching
Print a dfm object
Get the feature labels from a dfm
Simpler and faster version of expand.grid() in base package
predict.textmodel_wordfish
Prediction from a textmodel_wordfish method
Flatten a hierarchical dictionary into a list of character vectors
format a sparsity value for printing
Virtual class "fcm" for a feature co-occurrence matrix
The fcm class of object is a special type of fcm object with
additional slots, described below. Print a phrase object
Internal function for select_types()
to check if a string is a regular expression
print.statistics_textmodel
Implements print methods for textmodel_statistics
Check if a glob pattern is indexed by index_types
Internal function to convert a list to a dictionary
Select types without performing slow regex search
Internal function for select_types
to search the index using
fastmatch.
lowercase_dictionary_values
Internal function to lowercase dictionary values
influence.predict.textmodel_affinity
Compute feature influence from a predicted textmodel_affinity object
Check if patterns contains glob wildcard
summary method for textmodel_nb objects
Set values to a fcm's S4 slots
Extensions for and from spacy_parse objects
Function to assign multiple slots to a S4 object
Wordfish text model
Naive Bayes classifier for texts
Get or set the corpus settings
Print a dist_selection object
Defunct form of nfeat
summary.textmodel_wordfish
summary method for textmodel_wordfish
Internal function to merge values of duplicated keys
Tabulate feature frequencies
Similarity and distance computation between documents or features
Compute entropy of documents or features
Declare a compound character to be a sequence of separate pattern matches
predict.textmodel_affinity
Prediction for a fitted affinity textmodel
Return an error message
Similarity and distance computation between documents or features
Class affinity maximum likelihood text scaling model
Randomly sample documents from a tokens object
Segment tokens object by patterns
Convert the case of tokens
[Experimental] Change direction of words in tokens
Correspondence analysis of a document-feature matrix
Get or assign corpus texts
Count the number of documents or features
Count the Scrabble letter values of text
Utility function to generate a nested list
predict.textmodel_wordscores
Predict textmodel_wordscores
Convert regex and glob patterns to type IDs or fixed patterns
Internal functions to import dictionary files
Identify and score multi-word expressions
Convert various input as pattern to a vector used in tokens_select,
tokens_compound and kwic.
print.coefficients_textmodel
Print methods for textmodel features estimates
This is a helper function used in print.summary.textmodel
.
Select or remove tokens from a tokens object
Function to serialized list-of-character tokens
Select rows of textstat objects by glob, regex or fixed patterns
Split tokens by a separator pattern
Calculate readability
Pattern matching using valuetype
Extract a subset of a tokens
print method for summary.textmodel
Internal functions to set dimnames
print method for a wordfish model
Objects exported from other packages
Set values to a dfm's S4 slots
An R package for the quantitative analysis of textual data
Sample a vector by a group
Deprecated name for nscrabble
Internal function for textplot_wordcloud
Get or set package options for quanteda
replace_dictionary_values
Internal function to replace dictionary values
Utility function to remove empty keys
Summary statistics on a character vector
textmodel_affinity-internal
Internal methods for textmodel_affinity
Compute the sparsity of a document-feature matrix
Influence plot for text scaling models
Internal function for special handling of multi-word dictionary values
Plot word keyness
summary.character method to override the network::summary.character()
Summarize a corpus
Wordscores text model
Calculate lexical diversity
Calculate keyness statistics
Plot features as a wordcloud
deprecated name for dfm_weight
Plot the dispersion of key word(s)
textmodel_lsa-postestimation
Post-estimations methods for textmodel_lsa
Wordshoal text model (redirect)
Deprecated form of dfm_tfidf
Latent Semantic Analysis
Convert token sequences into compound tokens
Tokenize a set of texts
Segment tokens object by chunks of a given size
Apply a dictionary to a tokens object
Create ngrams and skipgrams from tokens
Raise warning of unused dots
Unlist a list of integer vectors safely
Unlist a list of character vectors safely
Recombine documents tokens by groups
Get word types from a tokens object
Plot a network of feature co-occurrences
Plot a fitted scaling model
[Experimental] Compute document/feature proximity
textstat_simil/dist classes
recompile a serialized tokens object
Identify the most frequent features in a dfm
Replace tokens in a tokens object
Stem the terms in an object
Internal function for textplot_wordcloud
Coerce a dist into a dist
Coercion functions for fcm objects
Coerce a dist_selection object to a matrix
Coerce a simil object into a matrix