textreadr (version 0.5.1)

read_doc: Read in .doc Content

Description

Read in the content from a .doc file using antiword (http://www.winfield.demon.nl) via the antiword package.

Usage

read_doc(file, skip = 0, remove.empty = TRUE, trim = TRUE,
  format = FALSE, ...)

Arguments

file
The path to the .doc file.
skip
The number of lines to skip.
remove.empty
logical. If TRUE empty elements in the vector are removed.
trim
logical. If TRUE the leading/trailing white space is removed.
format
logical. If TRUE the output will keep doc formatting (e.g., bold, italics, underlined). This corresponds to the -f flag in antiword.
...
ignored.

Value

Returns a character vector.
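
To make the effect of skip, remove.empty, and trim on the returned character vector concrete, here is a rough base-R sketch. It is not textreadr's actual implementation: the helper name post_process, the sample lines, and the order in which the three options are applied are all assumptions made for illustration.

```r
# Hypothetical stand-in for the lines extracted from a .doc file
lines <- c("Title line", "", "  First paragraph.  ", "", "Second paragraph.")

# Hypothetical helper (not part of textreadr) that mimics the documented options
post_process <- function(x, skip = 0, remove.empty = TRUE, trim = TRUE) {
  if (skip > 0) x <- x[-seq_len(skip)]   # drop the first `skip` lines
  if (trim) x <- trimws(x)               # strip leading/trailing white space
  if (remove.empty) x <- x[x != ""]      # drop empty elements
  x
}

result <- post_process(lines, skip = 1)
print(result)
```

With skip = 1 the title line is dropped, the remaining elements are trimmed, and the empty strings are removed, leaving only the two paragraph strings.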

Examples

## Not run: ------------------------------------
# x <- system.file("docs/Yasmine_Interview_Transcript.doc",
#     package = "textreadr")
# read_doc(x)
## ---------------------------------------------
