tm (version 0.5-10)

removePunctuation: Remove Punctuation Marks from a Text Document

Description

Remove punctuation marks from a text document.

Usage

## S3 method for class 'PlainTextDocument':
removePunctuation(x, preserve_intra_word_dashes = FALSE)

Arguments

x
A text document.
preserve_intra_word_dashes
A logical specifying whether intra-word dashes should be kept.

Value

  • The text document x with any punctuation marks in it removed (besides intra-word dashes if preserve_intra_word_dashes is set).

See Also

getTransformations to list available transformation (mapping) functions.

regex shows the class [:punct:] of punctuation characters.

Examples

Run this code
data("crude")
crude[[14]]
removePunctuation(crude[[14]])
removePunctuation(crude[[14]], preserve_intra_word_dashes = TRUE)

Run the code above in your browser using DataCamp Workspace