textreadr (version 0.9.0)

read_doc: Read in .doc Content

Description

Read in the content from a .doc file using antiword via the antiword package.

Usage

read_doc(file, skip = 0, remove.empty = TRUE, trim = TRUE,
  format = FALSE, ...)

Arguments

file

The path to the .doc file.

skip

The number of lines to skip.

remove.empty

logical. If TRUE empty elements in the vector are removed.

trim

logical. If TRUE the leading/training white space is removed.

format

logical. If TRUE the output will keep doc formatting (e.g., bold, italics, underlined). This corresponds to the -f flag in antiword.

ignored.

Value

Returns a character vector.

Examples

Run this code
# NOT RUN {
x <- system.file("docs/Yasmine_Interview_Transcript.doc",
    package = "textreadr")
read_doc(x)
# }

Run the code above in your browser using DataLab