Learn R Programming

textpress (version 1.0.0)

.get_site: Get Site Content and Extract HTML Elements

Description

This function attempts to retrieve the HTML content of a URL, extract specific HTML elements (e.g., paragraphs, headings), and extract publication date information using the extract_date function.

Usage

.get_site(x)

Value

A data frame with columns for the URL, HTML element types, text content, extracted date, and date source.

Arguments

x

A URL to extract content and publication date from.