url
the original vector of urls provided
title
the page title, if found
lang
inferred page language
url2
the fetched url; this may differ from the original, for example after a redirect
links
a list of tibbles of hyperlinks found in <a> tags
rss
a list of embedded RSS feeds found on the page
tables
a list of tables found on the page in descending order of size, coerced to tibble wherever possible
images
list of tibbles containing image links found on the page
social
list of tibbles containing Twitter, LinkedIn and GitHub user info found on the page
code_lang
numeric indicating the inferred code language. Negative values near -1 indicate a high likelihood that the language is Python; positive values near 1 indicate R. If no code tags are detected, or the language could not be inferred, the value is NA.
size
the size of the downloaded page in bytes
server
the page server
accessed
datetime when the page was accessed
published
page publication or last updated date, if detected
generator
the page generator, if found
status
HTTP status code
source
character strings of xml documents. Each can be coerced to an xml_document using xml2::read_html() for further processing with rvest.
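The following is a minimal sketch of how two of the columns above might be used downstream. It assumes the xml2 and rvest packages are installed. The `src` string and the `code_lang` vector are hypothetical stand-ins for values from the returned tibble, not real output, and splitting `code_lang` at zero is an assumption made for illustration only.

```r
library(xml2)
library(rvest)

# Hypothetical stand-in for one element of the `source` column
src <- paste0(
  "<html><head><title>Example</title></head>",
  "<body><a href='https://example.org'>link</a></body></html>"
)

# Coerce the character string to an xml_document
doc <- xml2::read_html(src)

# Further processing with rvest: page title and hyperlink targets
page_title <- rvest::html_text2(rvest::html_element(doc, "title"))
hrefs <- rvest::html_attr(rvest::html_elements(doc, "a"), "href")

# Hypothetical `code_lang` values: negative leans Python, positive leans R,
# NA means no code tags were found or the language could not be inferred.
# The cut at zero is an assumed threshold, not part of the documented output.
code_lang <- c(-0.9, 0.8, NA)
code_label <- ifelse(
  is.na(code_lang), NA_character_,
  ifelse(code_lang < 0, "python", "R")
)
```

In practice a stricter cut-off (for example, only labelling values beyond some distance from zero) may be more appropriate, since values near zero carry little evidence either way.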