Learn R Programming

RBERT (version 0.1.11)

is_punctuation: Check whether `char` is a punctuation character.

Description

(R implementation of _is_punctuation from BERT: tokenization.py.)

Usage

is_punctuation(char)

Arguments

char

A character scalar, comprising a single unicode character.

Value

TRUE if char is a punctuation character.

Details

We treat all non-letter/number ASCII as punctuation. Characters such as "^", "$", and "`" are not in the Unicode Punctuation class but we treat them as punctuation anyway, for consistency.