Generate a List with BreakIterator Settings
A convenience function to tune the ICU BreakIterator's behavior
in some text boundary analysis functions, see stringi-search-boundaries.
stri_opts_brkiter(type, locale, skip_word_none, skip_word_number, skip_word_letter, skip_word_kana, skip_word_ideo, skip_line_soft, skip_line_hard, skip_sentence_term, skip_sentence_sep, ...)
type: single string; either the break iterator type, one of character, line_break, sentence, word; or a custom set of ICU break iteration rules; see stringi-search-boundaries
locale: single string, NULL or "" for default locale
skip_word_none: logical; perform no action for "words" that do not fit into any other categories
skip_word_number: logical; perform no action for words that appear to be numbers
skip_word_letter: logical; perform no action for words that contain letters, excluding hiragana, katakana, or ideographic characters
skip_word_kana: logical; perform no action for words containing kana characters
skip_word_ideo: logical; perform no action for words containing ideographic characters
skip_line_soft: logical; perform no action for soft line breaks, i.e. positions at which a line break is acceptable but not required
skip_line_hard: logical; perform no action for hard, or mandatory, line breaks
skip_sentence_term: logical; perform no action for sentences ending with a sentence terminator (".", "?", "!", etc.), possibly followed by a hard separator (CR, LF, PS, etc.)
skip_sentence_sep: logical; perform no action for sentences that do not contain an ending sentence terminator, but are ended by a hard separator or end of input
...: any other arguments to this function are purposely ignored
The skip_* family of settings may be used to prevent performing
any special actions on particular types of text boundaries, e.g.
in case of the stri_locate_all_boundaries and stri_split_boundaries functions.
Note that custom break iterator rules (advanced users only) should be specified as a single string. For a detailed description of the syntax of RBBI rules, please refer to the ICU User Guide on Boundary Analysis.
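As a minimal sketch of how the skip_* settings might be used, the snippet below builds a settings list for word boundaries and passes it to two boundary analysis functions through their opts_brkiter argument. The sample string x is hypothetical and chosen only for illustration.

```r
library(stringi)

# Hypothetical sample text, chosen only for illustration
x <- "The quick brown fox, 42 times."

# Word-boundary settings that skip "words" fitting no other category
# (e.g. white space and most punctuation)
opts <- stri_opts_brkiter(type = "word", skip_word_none = TRUE)

# Pass the settings list via the opts_brkiter argument
words <- stri_split_boundaries(x, opts_brkiter = opts)[[1]]
print(words)

# The same list can be reused with other boundary analysis functions
print(stri_locate_all_boundaries(x, opts_brkiter = opts)[[1]])
```

Building the list once and reusing it keeps the boundary settings consistent across several calls.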
Returns a named list object.
Omitted skip_* values act as if they had been set to FALSE.
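To see what the returned object looks like, one can inspect it directly. This is only an illustrative sketch; the particular argument values are arbitrary, and the exact set of elements printed may depend on the installed stringi version.

```r
library(stringi)

# Arbitrary settings, used only to inspect the returned value
opts <- stri_opts_brkiter(type = "sentence", skip_sentence_sep = TRUE)

# Show the structure and element names of the settings list
str(opts)
print(names(opts))
```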
ubrk.h File Reference -- ICU4C API Documentation
Boundary Analysis -- ICU User Guide, http://userguide.icu-project.org/boundaryanalysis