tokenize: Recompute the tokens for a document or corpus
Description
Given a TextReuseTextDocument or a
TextReuseCorpus, this function recomputes the tokens and hashes
with the functions specified. Optionally, it can also recompute the minhash signatures.