Learn R Programming

Tivy (version 0.1.1)

extract_pdf_data: Extract data from PDF announcements

Description

Processes PDF files containing official fishing announcements and extracts relevant information such as dates, coordinates, and nautical miles. Handles both local files and URLs.

Usage

extract_pdf_data(
  pdf_sources = NULL,
  temp_dir = NULL,
  verbose = TRUE,
  max_retries = 3
)

Value

Data frame with extracted announcement information including coordinates, dates, and nautical mile distances.

Arguments

pdf_sources

Character vector of PDF file paths or URLs.

temp_dir

Temporary directory for downloaded files. If NULL, uses tempdir().

verbose

Show processing messages.

max_retries

Maximum download retries for URLs.

Examples

Run this code
if (FALSE) {
pdf_files <- c("announcement1.pdf", "announcement2.pdf")
results <- extract_pdf_data(pdf_sources = pdf_files)

pdf_urls <- c(
  "https://example.com/announcement1.pdf",
  "https://example.com/announcement2.pdf"
)
results <- extract_pdf_data(pdf_sources = pdf_urls)
}

Run the code above in your browser using DataLab