(R implementation of _is_control from BERT: tokenization.py.)
is_control(char)
A character scalar containing a single Unicode character.
TRUE if char is a control character.
"\t", "\n", and "\r" are technically control characters, but they are generally considered whitespace and are treated as such here, so is_control returns FALSE for them.
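
The behavior described above can be approximated with a short sketch. This is not the package's own code: the name is_control_sketch is hypothetical, and the use of base R's PCRE Unicode property classes is an assumption about one way to do the check. The BERT reference function treats the Unicode general categories Cc (control) and Cf (format) as control characters, and the sketch follows that rule.

```r
# Hypothetical helper for illustration only; not the package's implementation.
is_control_sketch <- function(char) {
  # Expect exactly one character, as the argument description requires.
  stopifnot(is.character(char), length(char) == 1L, nchar(char) == 1L)

  # Tab, newline, and carriage return are treated as whitespace,
  # so they are never reported as control characters.
  if (char %in% c("\t", "\n", "\r")) {
    return(FALSE)
  }

  # Cc = control, Cf = format (Unicode general categories).
  # Assumes R's PCRE build has Unicode property support.
  grepl("^[\\p{Cc}\\p{Cf}]$", char, perl = TRUE)
}
```

Under these assumptions, is_control_sketch("\001") gives TRUE, while is_control_sketch("\t") and is_control_sketch("a") both give FALSE.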