sentences
splits text at the sentence boundaries defined by
Unicode Standard Annex #29, Section 5.
These boundaries handle Unicode correctly and they give reasonable
behavior across a variety of languages. Unfortunately, the UAX 29
sentence-breaking rules do not handle abbreviations correctly. So, for
example, the text "I saw Mr. Jones today."
will get split into
two sentences.
Future versions of the sentences
function may change to
accommodate special rules for abbreviations like "Mr.", "Dr.", etc.