powered by
Converts text to tokens
get_tokens(text, model)
a vector of tokens for the given text as integer
a character string to encode to tokens, can be a vector
a model to use for tokenization, either a model name, e.g., gpt-4o or a tokenizer, e.g., o200k_base. See also available tokenizers.
gpt-4o
o200k_base
model_to_tokenizer(), decode_tokens()
model_to_tokenizer()
decode_tokens()
get_tokens("Hello World", "gpt-4o") get_tokens("Hello World", "o200k_base")
Run the code above in your browser using DataLab