Extract a subset of a corpus
Coercion and checking functions for fcm objects
Internal data sets
Function to assign multiple slots to a S4 object
Convert the case of character objects
Combine dfm objects by Rows or Columns
Coercion, checking, and combining functions for tokens objects
Bootstrap a dfm
Recast the document units of a corpus
Compute the Mean Segmental Type-Token Ratio (MSTTR)
Function extending base::attributes()
Segment texts on a pattern match
Convert quanteda objects to non-quanteda formats
Compute the Moving-Average Type-Token Ratio (MATTR)
Construct a corpus object
Remove sentences based on their token lengths or a pattern match
Base method extensions for corpus objects
Formerly included data objects
Compute lexical diversity from a dfm or tokens
Replace features in dfm
Randomly sample documents from a corpus
Check if font is available on the system
Convert quanteda dictionary objects to the YAML format
dfm from data in Table 1 of Laver, Benoit, and Garry (2003)
Create a document-feature matrix
A paragraph of text for testing various text-based functions
convert same-value pairs to NA in a textstat_proxy object
Convert a dfm to an lsa "textmatrix"
dictionary class objects and functions
Immigration-related sections of 2010 UK party manifestos
Convenience wrappers for dfm convert
Return the first or last part of a corpus
Randomly sample documents or features from a dfm
Convert the case of the features of a dfm and combine
Weight a dfm by tf-idf
Get or set document names
Get or set document-level meta-data
Virtual class "dfm" for a document-feature matrix
Return the first or last part of a dfm
Remove sentences based on their token lengths or a pattern match
Apply a dictionary to a dfm
Simpler and faster version of expand.grid() in base package
Internal function for select_types()
to escape regular expressions
Combine documents in a dfm by a grouping variable
Extract a subset of a dfm
Get or set document-level variables
Flatten a hierarchical dictionary into a list of character vectors
US presidential inaugural address texts
Match the feature set of a dfm to given feature names
Convert regex and glob patterns to type IDs or fixed patterns
Internal functions for dfm objects
dfm_split_hyphenated_features
Split a dfm's hyphenated features into constituent parts
Sort a dfm by frequency of one or more margins
Create a feature co-occurrence matrix
Create a dictionary
Select features from a dfm or fcm
Sort an fcm in alphabetical order of the features
Internal function to extract docvars
Virtual class "fcm" for a feature co-occurrence matrix
Compute the (weighted) document frequency of a feature
Lexicoder Sentiment Dictionary (2015)
format a sparsity value for printing
Segment tokens object by patterns
Return the first or last part of a textstat_proxy object
friendly_class_undefined_message
Print friendly object class not defined message
Compute the frequencies of features
Count the Scrabble letter values of text
Count the number of sentences
Generate a grouping vector from docvars
Grouping variable(s) for various functions
Internal function for select_types()
to check if a string is a regular expression
Internal function for select_types
to search the index using
fastmatch.
Check if a glob pattern is indexed by index_types
Converts a Matrix to a dfm
Recombine a dfm or fcm by combining identical dimension elements
Get or set object metadata
Replace tokens in a tokens object
Trim a dfm using frequency threshold-based feature selection
lowercase_dictionary_values
Internal function to lowercase dictionary values
Count the number of documents or features
Converts a Matrix to a fcm
Get the package version that created an object
Get the feature labels from a dfm
Internal function to get, set or initialize system metadata
Check if patterns contains glob wildcard
Compute keyness (internal functions)
Internal function to merge values of duplicated keys
Weight the feature frequencies in a dfm
Object compilers
Plot a network of feature co-occurrences
Internal functions to create a list for the meta attribute
Pattern for feature, token and keyword matching
Shortcut functions to access or assign metadata
Internal functions to import dictionary files
Count syllables in a text
Print methods for quanteda core objects
Extensions for and from spacy_parse objects
Objects exported from other packages
Utility function to generate a nested list
Select types without performing slow regex search
Declare a compound character to be a sequence of separate pattern matches
Special handling for names of quanteda objects
Compute the sparsity of a document-feature matrix
Sample a vector by a group
Return an error message
Print a phrase object
Set values to a fcm's S4 slots
Count the number of tokens or types
Get or assign corpus texts
Plot the dispersion of key word(s)
Convert various input as pattern to a vector used in tokens_select,
tokens_compound and kwic.
Summary statistics on a character vector
Select rows of textstat objects by glob, regex or fixed patterns
Similarity and distance computation between documents or features
Models for scaling and classification of textual data
Internal functions to set dimnames
replace_dictionary_values
Internal function to replace dictionary values
Extract a subset of a tokens
Get word types from a tokens object
Utility function to remove empty keys
Plot word keyness
Calculate lexical diversity
Functions to add or retrieve corpus summary metadata
Function to serialize list-of-character tokens
textstat_simil/dist classes
Plot features as a wordcloud
Set values to a dfm's S4 slots
Split tokens by a separator pattern
Recombine documents tokens by groups
Summarize a corpus
Calculate keyness statistics
Internal function for special handling of multi-word dictionary values
Locate keywords-in-context
Create ngrams and skipgrams from tokens
Tabulate feature frequencies
Unlist a list of character vectors safely
Internal function for textplot_wordcloud
quanteda tokenizers
Compute entropies of documents or features
Apply a dictionary to a tokens object
Construct a tokens object
An R package for the quantitative analysis of textual data
Convert token sequences into compound tokens
Segment tokens object by chunks of a given size
Get or set package options for quanteda
recompile a serialized tokens object
Convert the case of tokens
Select or remove tokens from a tokens object
Internal function to convert a list to a dictionary
Pattern matching using valuetype
Stem the terms in an object
Internal function for textplot_wordcloud
[Experimental] Change direction of words in tokens
Identify the most frequent features in a dfm
Pipe operator
Identify and score multi-word expressions
summary.character method to override the network::summary.character()
[Experimental] Compute document/feature proximity
Calculate readability
Randomly sample documents from a tokens object
Unlist a list of integer vectors safely
Raise warning of unused dots
Coercion and checking functions for dictionary objects
View methods for quanteda
Coercion and checking functions for dfm objects
Coerce a dfm to a matrix or data.frame
Convert an fcm to an igraph object
redefinition of network::as.network()
Convert a dfm to a data.frame
coerce a compressed corpus to a standard corpus
as.matrix,textstat_simil_sparse-method
as.matrix method for textstat_simil_sparse