removePunctuation {tm}R Documentation

Remove Punctuation Marks from a Text Document

Description

Remove punctuation marks from a text document.

Usage

## S3 method for class 'PlainTextDocument'
removePunctuation(x, preserve_intra_word_dashes = FALSE)

Arguments

x

A text document.

preserve_intra_word_dashes

A logical specifying whether intra-word dashes should be kept.

Value

The text document x with any punctuation marks in it removed (besides intra-word dashes if preserve_intra_word_dashes is set).

See Also

getTransformations to list available transformation (mapping) functions.

regex shows the class [:punct:] of punctuation characters.

Examples

data("crude")
crude[[14]]
removePunctuation(crude[[14]])
removePunctuation(crude[[14]], preserve_intra_word_dashes = TRUE)

[Package tm version 0.5-10 Index]