Internal C++ implementation of n-gram recoding for multiword detection.
Users should call txt_recode_ngram_fast instead.
Character vector where matched n-grams are combined
Character vector of tokens
Character vector of multiword expressions to match
Integer vector indicating the length of each compound
String separator to use when joining tokens