Generates sparse BM25 embeddings for keyword search
vocabVocabulary
languageLanguage setting ("en" or "ml")
new()Create a new SparseEmbedder
SparseEmbedder$new(language = "en")languageLanguage behavior ("en" = ASCII-focused, "ml" = Unicode-aware)
fit()Fit the embedder on a corpus
SparseEmbedder$fit(texts)textsCharacter vector of texts
textsCharacter vector of texts
Sparse matrix of BM25 scores
query_terms()Get term scores for a query
SparseEmbedder$query_terms(query)queryQuery text
Named vector of term scores
clone()The objects of this class are cloneable with this method.
SparseEmbedder$clone(deep = FALSE)deepWhether to make a deep clone.