text_to_sentences

A string of text (required), 
typically a character vector.

Sentence delimiters (as regex) 
used to split <code>x</code> into substrings. 
By default, <code>split_delim = "\.|\?|!"</code>.

split_delim

Boolean: Enforce splitting at 
<code>split_delim</code>? 
If <code>force_delim = FALSE</code> (as per default), 
the function assumes a standard sentence-splitting pattern: 
<code>split_delim</code> is followed by a single space and a capital letter. 
If <code>force_delim = TRUE</code>, splits at <code>split_delim</code> are 
enforced (regardless of spacing or capitalization).

force_delim

<code>text_to_sentences</code> splits at given punctuation marks 
(as a regular expression, default: <code>split_delim = "\.|\?|!"</code>) 
and removes empty leading and trailing spaces before returning 
a vector of the remaining character sequences (as the sentences).

All datasets and functions required for the examples and exercises of the book "Data Science for Psychologists" (by Hansjoerg Neth, Konstanz University, 2020), available at <https://bookdown.org/hneth/ds4psy/>. The book and course introduce principles and methods of data science to students of psychology and other biological or social sciences. The 'ds4psy' package primarily provides datasets, but also functions for data generation and manipulation (e.g., of text and time data) and graphics that are used in the book and its exercises. All functions included in 'ds4psy' are designed to be instructive and entertaining, rather than elegant or efficient.

Hansjoerg Neth

ds4psy

Data Science for Psychologists

text_to_sentences function

text_to_sentences splits a string of text <code>x</code> 
(consisting of one or more character strings) 
into a vector of its constituting sentences. — text_to_sentences

text_to_sentences: text_to_sentences splits a string of text `x` (consisting of one or more character strings) into a vector of its constituting sentences.

Description

Usage

Arguments

Details

See Also

Examples