Learn R Programming

whisper (version 0.1.0)

tokenizer_encode: Encode Text to Token IDs

Description

Encode Text to Token IDs

Usage

tokenizer_encode(text, vocab, merge_ranks)

Value

Integer vector of token IDs

Arguments

text

Character string to encode

vocab

Vocabulary mapping (token -> id)

merge_ranks

Merge ranking for BPE