
MC_tokenizer(x)
scan_tokenizer(x)
A character vector consisting of tokens obtained by tokenization of x.
getTokenizers
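library(tm)  # provides the tokenizers and the crude corpus
data("crude")
MC_tokenizer(crude[[1]])
scan_tokenizer(crude[[1]])
# Custom tokenizer: coerce to character, then split on runs of whitespace
strsplit_space_tokenizer <- function(x)
    unlist(strsplit(as.character(x), "[[:space:]]+"))
strsplit_space_tokenizer(crude[[1]])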
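
As a rough illustration of how the choice of splitting rule changes the resulting tokens, the sketch below applies two hand-written splitters to a short string using only base R. The helper names space_tokens and word_tokens and the sample text are hypothetical and not part of tm, and neither helper is claimed to reproduce MC_tokenizer or scan_tokenizer.

```r
# Hypothetical sample text (not taken from the crude corpus).
txt <- "Oil prices rose 2.5%, traders said."

# Split on runs of whitespace only; punctuation stays attached to tokens.
space_tokens <- function(x) {
  unlist(strsplit(as.character(x), "[[:space:]]+"))
}

# Split on runs of whitespace or punctuation; drop any empty strings.
word_tokens <- function(x) {
  tokens <- unlist(strsplit(as.character(x), "[[:space:][:punct:]]+"))
  tokens[nzchar(tokens)]
}

space_tokens(txt)
word_tokens(txt)
```

With the whitespace-only rule, "2.5%," survives as a single token, while the punctuation-aware rule breaks it into "2" and "5". Which behaviour is preferable depends on the downstream analysis.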