tabulapdf (version 1.0.5-3)

get_page_dims: Page length and dimensions

Description

Get the number of pages in a PDF document and the dimensions of its pages.

Usage

get_page_dims(file, doc, pages = NULL, password = NULL, copy = FALSE)

get_n_pages(file, doc, password = NULL, copy = FALSE)

Value

For get_n_pages, an integer giving the number of pages. For get_page_dims, a list of two-element numeric vectors specifying the width and height of each page, respectively.

Arguments

file

A character string specifying the path or URL to a PDF file.

doc

Optionally, in lieu of file, an rJava reference to a PDDocument Java object.

pages

An optional integer vector specifying the pages whose dimensions should be returned.

password

Optionally, a character string containing a user password to access a secured PDF.

copy

Specifies whether the original local file(s) should be copied to tempdir() before processing. FALSE by default. The argument is ignored if file is a URL.

Author

Thomas J. Leeper <thosjleeper@gmail.com>

Details

get_n_pages returns the number of pages in a PDF document. get_page_dims extracts the dimensions of the specified pages in a PDF document. These dimensions can be useful when working out how to specify the area argument in extract_tables.
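
As a rough illustration of how page dimensions might feed into an area specification, the sketch below converts fractions of a page into a four-element c(top, left, bottom, right) vector. The helper area_from_fractions is hypothetical and not part of tabulapdf, and the page size used here is a stand-in (US Letter, assumed to be expressed in points) rather than real get_page_dims output.

```r
# Hypothetical helper (not part of tabulapdf): turn fractional page
# coordinates into a c(top, left, bottom, right) vector in page units.
# `dim` is one element of the list returned by get_page_dims(),
# i.e. c(width, height).
area_from_fractions <- function(dim, top = 0, left = 0, bottom = 1, right = 1) {
  stopifnot(is.numeric(dim), length(dim) == 2)
  width <- dim[1]
  height <- dim[2]
  c(top * height, left * width, bottom * height, right * width)
}

# Stand-in for one page's dimensions (assumed US Letter: 612 x 792)
letter <- c(612, 792)

# Area covering the upper half of the page
upper_half <- area_from_fractions(letter, bottom = 0.5)
print(upper_half)

# With tabulapdf installed, the same helper could be applied to real output:
# f <- system.file("examples", "mtcars.pdf", package = "tabulapdf")
# dims <- get_page_dims(f, pages = 1)
# extract_tables(f, pages = 1,
#                area = list(area_from_fractions(dims[[1]], bottom = 0.5)))
```

Working in fractions keeps the area specification independent of the actual page size, so the same call can be reused for documents with different page dimensions.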

References

Tabula

See Also

extract_tables, extract_text, make_thumbnails

Examples

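library(tabulapdf)

# simple demo file shipped with the package
f <- system.file("examples", "mtcars.pdf", package = "tabulapdf")

# number of pages in the document
get_n_pages(file = f)
# width and height of each page
get_page_dims(f)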
