- word_embeddings_layers
Layers returned by textEmbedRawLayers().
- layers
The numbers of the layers to aggregate
(e.g., c(11:12) to aggregate the eleventh and twelfth layers).
Note that layer 0 is the input embedding to the transformer and should normally not be used.
Selecting 'all' therefore excludes layer 0.
- aggregation_from_layers_to_tokens
Method used to aggregate across the layers for each word/token:
"min", "max" or "mean", which take the minimum, maximum or mean across each column;
or "concatenate", which links together each layer of the word embedding into one long row. Default is "concatenate".
- aggregation_from_tokens_to_texts
Method used to aggregate across the word embeddings
for the words/tokens: "min", "max" or "mean", which take the minimum, maximum or mean across each column;
or "concatenate", which links together each layer of the word embedding into one long row.
- return_tokens
If TRUE, return the tokens used by the specified transformer model.
- tokens_select
Option to select only the embeddings linked to specific tokens,
such as "[CLS]" and "[SEP]" (default NULL).
- tokens_deselect
Option to deselect the embeddings linked to specific tokens,
such as "[CLS]" and "[SEP]" (default NULL).
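
The two aggregation steps described above can be illustrated with a minimal base R sketch. It is not the package's implementation: the helpers aggregate_layers() and aggregate_tokens() are hypothetical, the toy values are made up, and the row-wise order used for "concatenate" at the token level is an assumption.

```r
# Toy data: 3 tokens, 2 layers, 4 dimensions per layer (values are made up).
tokens <- c("[CLS]", "hello", "[SEP]")
dims <- paste0("Dim", 1:4)
layer_11 <- matrix(1:12, nrow = 3, dimnames = list(tokens, dims))
layer_12 <- matrix(12:1, nrow = 3, dimnames = list(tokens, dims))

# Hypothetical helper: combine equally shaped layer matrices per token.
aggregate_layers <- function(layers, method = c("concatenate", "min", "max", "mean")) {
  method <- match.arg(method)
  if (method == "concatenate") {
    # Place the layers side by side: one long row per token.
    return(do.call(cbind, layers))
  }
  stacked <- simplify2array(layers)          # tokens x dims x layers
  apply(stacked, c(1, 2), match.fun(method)) # element-wise across layers
}

# Hypothetical helper: combine token rows into one text-level vector,
# optionally dropping tokens such as "[CLS]" and "[SEP]" first.
aggregate_tokens <- function(token_embeddings,
                             method = c("mean", "min", "max", "concatenate"),
                             tokens_deselect = NULL) {
  method <- match.arg(method)
  keep <- !(rownames(token_embeddings) %in% tokens_deselect)
  kept <- token_embeddings[keep, , drop = FALSE]
  if (method == "concatenate") {
    # Assumed order: token after token, each token's dimensions in sequence.
    return(as.vector(t(kept)))
  }
  apply(kept, 2, match.fun(method))          # column-wise across tokens
}

concatenated <- aggregate_layers(list(layer_11, layer_12), "concatenate")
layer_max <- aggregate_layers(list(layer_11, layer_12), "max")
text_embedding <- aggregate_tokens(layer_max, "mean",
                                   tokens_deselect = c("[CLS]", "[SEP]"))
print(dim(concatenated))
print(text_embedding)
```

With "concatenate" the number of columns grows with the number of layers, whereas "min", "max" and "mean" keep the dimensionality of a single layer.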