tokenizer_encode: Encode Text to Token IDs
Description
Encodes a character string into an integer vector of token IDs, using a vocabulary mapping and BPE merge ranks.
Usage
tokenizer_encode(text, vocab, merge_ranks)
Value
Integer vector of token IDs
Arguments
- text
Character string to encode
- vocab
Vocabulary mapping (token -> id)
- merge_ranks
Merge ranks for byte-pair encoding (BPE)
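Examples
The page does not specify the data structures behind vocab and merge_ranks, so the following is only a minimal sketch of how a vocabulary and BPE merge ranks might combine to produce token IDs. It defines a hypothetical helper, toy_bpe_encode, rather than calling tokenizer_encode itself. It assumes that vocab is a named integer vector (token -> id), that merge_ranks is a named integer vector keyed by "left right" pairs, and that a lower rank merges first. None of these assumptions is confirmed by the package.

```r
# Toy BPE encoder: a hypothetical illustration, NOT the package's implementation.
# Assumption: `vocab` is a named integer vector (token -> id).
# Assumption: `merge_ranks` is a named integer vector keyed by "left right"
# pairs, where a lower rank means the pair is merged earlier.
toy_bpe_encode <- function(text, vocab, merge_ranks) {
  # Start from single characters
  symbols <- strsplit(text, "")[[1]]
  while (length(symbols) > 1) {
    # All adjacent pairs, written as "left right"
    pairs <- paste(symbols[-length(symbols)], symbols[-1])
    ranks <- merge_ranks[pairs]  # NA where the pair has no merge rule
    if (all(is.na(ranks))) break
    # Merge the best-ranked pair (first occurrence on ties); later
    # occurrences are picked up by subsequent loop iterations
    i <- which.min(ranks)
    merged <- paste0(symbols[i], symbols[i + 1])
    symbols <- c(symbols[seq_len(i - 1)], merged, symbols[-seq_len(i + 1)])
  }
  # Look up each final symbol in the vocabulary; unknown symbols yield NA
  unname(vocab[symbols])
}

vocab <- c(l = 1L, o = 2L, w = 3L, lo = 4L, low = 5L)
merge_ranks <- c("l o" = 1L, "lo w" = 2L)

toy_bpe_encode("low", vocab, merge_ranks)  # "l"+"o" -> "lo", then "lo"+"w" -> "low"
toy_bpe_encode("wol", vocab, merge_ranks)  # no merge rule applies; one ID per character
```

The sketch merges one pair per iteration to keep the loop easy to follow. It also does no pre-tokenization or byte-level handling, which a production tokenizer would normally need.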