Learn R Programming

⚠️There's a newer version (0.6.5) of this package.Take me there.

docxtractr (version 0.1.0.9000)

Extract Data Tables from Microsoft Word Documents

Description

Microsoft Word docx files provide an XML structure that is fairly straightforward to navigate, especially when it applies to Word tables. The docxtractr package provides tools to determine table count/structure and extract/clean tables from Microsoft Word docx documents.

Copy Link

Version

Install

install.packages('docxtractr')

Monthly Downloads

1,326

Version

0.1.0.9000

License

MIT + file LICENSE

Maintainer

Bob Rudis

Last Published

August 29th, 2015

Functions in docxtractr (0.1.0.9000)

read_docx

Read in a Word document for table extraction
assign_colnames

Make a specific row the column names for the specified data.frame
docx_extract_tbl

Extract a table from a Word document
docxtractr

docxtractr is an R pacakge for extracting tables out of Word documents (docx)
print.docx

Display information about the document
docx_describe_tbls

Returns a description of all the tables in the Word document
docx_tbl_count

Get number of tables in a Word document
docx_extract_all

Extract all tables from a Word document