A lightweight NLP toolkit for R organized as a four-stage pipeline: fetch (URLs from search/Wikipedia), read (content from URLs), process (split, tokenize, index), and search (regex, BM25, vector similarity, dictionary). Uses verb_noun naming for discoverability. Minimal dependencies; embeddings are built elsewhere and passed in for semantic search.
Maintainer: Jason Timm JaTimm@salud.unm.edu (2026)
Useful links: