powered by
GPT-2/Whisper uses a specific byte-to-unicode mapping.
byte_to_token(byte)
Character token
Integer byte value (0-255)