
Last chance! 50% off unlimited learning
Sale ends in
(R implementation of BasicTokenizer._run_split_on_punc from BERT: tokenization.py.)
split_on_punc(text)
A character scalar, encoded as utf-8.
The input text as a character vector, split on punctuation characters.