textProjection

Word or text variable to be plotted.

words

Word embeddings from textEmbed for the words to be plotted
(i.e., the aggregated word embeddings for the "words" parameter).

word_embeddings

Word embeddings from textEmbed for individual words
(i.e., decontextualized embeddings).

single_word_embeddings

Numeric variable that the words should be plotted according to on the x-axes.

Numeric variable that the words should be plotted according to on the y-axes (y=NULL).

Number of PCA dimensions applied to the word embeddings in the beginning of the function.
A number below 1 takes out % of variance; An integer specify number of components to extract.
(default is NULL as this setting has not yet been evaluated).

Method to aggregate the word embeddings
(default = "mean"; see also "min", "max", and "[CLS]").

aggregation

Method to split the axes
(default = "quartile" involving selecting lower and upper quartile; see also "mean"). However, if the variable is
only containing two different values (i.e., being dichotomous) mean split is used.

split

Compute the power of the frequency of the words and multiply
the word embeddings with this in the computation of aggregated word embeddings for
group low (1) and group high (2). This increases the weight of more frequent words.

word_weight_power

Option to select words that have occurred a specified number of
times (default = 0); when creating the Supervised Dimension Projection line
(i.e., single words receive Supervised Dimension Projection and p-value).

min_freq_words_test

Number of permutations in the creation of the null distribution.

Npermutations

A setting to split Npermutations to avoid reaching computer memory limits;
the higher the faster, but too high may lead to abortion.

n_per_split

seed

Compute Supervised Dimension Projection and related variables for plotting words.

Transforms text variables to word embeddings; where the word embeddings are used to statistically test the mean difference between set of texts, compute semantic similarity scores between texts, predict numerical variables, and visual statistically significant words according to various dimensions etc. For more information see <https://www.r-text.org>.

textProjection: Compute Supervised Dimension Projection and related variables for plotting words.

Description

Usage

Arguments

Value

Examples