load_vocab

<p>path to vocabulary file. File is assumed to be a text file,
with one token per line, with the line number corresponding to the index of
that token in the vocabulary.</p>

vocab_file

Apply 'Wordpiece' (<arXiv:1609.08144>) tokenization to input text,
given an appropriate vocabulary. The 'BERT' (<arXiv:1810.04805>) tokenization
conventions are used by default.

Jonathan Bratt

wordpiece

R Implementation of Wordpiece Tokenization

Jon Harmon

Bedford Freeman & Worth Pub Grp LLC DBA Macmillan Learning 

load_vocab function

Load a vocabulary file — load_vocab

Load a vocabulary file

load_vocab: Load a vocabulary file

Description

Usage

Arguments

Value

Examples