(I'm not sure that this object-based approach is the best fit for an R implementation, but for now the goal is just to reproduce the Python functionality.)
BasicTokenizer(do_lower_case = TRUE)
do_lower_case: Logical; the value to give to the "do_lower_case" argument in the BasicTokenizer object.
An object of class BasicTokenizer.
Has methods:
`tokenize.BasicTokenizer()`
`run_strip_accents.BasicTokenizer()` (internal use)
`run_split_on_punc.BasicTokenizer()` (internal use)
`tokenize_chinese_chars.BasicTokenizer()` (internal use)
`is_chinese_char.BasicTokenizer()` (internal use)
`clean_text.BasicTokenizer()` (internal use)
# NOT RUN {
b_tokenizer <- BasicTokenizer(TRUE)
# }
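To illustrate the object-based pattern mentioned in the note, here is a minimal sketch of an S3 constructor with one method, written in base R. It is not the package's implementation: the names `SketchTokenizer` and `tokenize_sketch` are hypothetical, and the method only lowercases and splits on whitespace, whereas the real tokenizer also handles accents, punctuation, and Chinese characters through the internal methods listed above.

```r
# Hypothetical stand-in for the constructor: stores the option in a list
# and tags the list with an S3 class.
SketchTokenizer <- function(do_lower_case = TRUE) {
  stopifnot(is.logical(do_lower_case), length(do_lower_case) == 1L)
  structure(list(do_lower_case = do_lower_case), class = "SketchTokenizer")
}

# Hypothetical generic; dispatches on the class of `tokenizer`.
tokenize_sketch <- function(tokenizer, text) {
  UseMethod("tokenize_sketch")
}

# Simplified method (an assumption, not the package's logic):
# optional lowercasing, then whitespace splitting only.
tokenize_sketch.SketchTokenizer <- function(tokenizer, text) {
  if (tokenizer$do_lower_case) {
    text <- tolower(text)
  }
  tokens <- strsplit(trimws(text), "[[:space:]]+")[[1]]
  tokens[nzchar(tokens)]
}

s_tokenizer <- SketchTokenizer(TRUE)
tokens <- tokenize_sketch(s_tokenizer, "Hello  World")
```

Keeping the option on the object means each method can read `tokenizer$do_lower_case` rather than taking it as an extra argument, which is roughly how a Python class attribute would be carried over to S3.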