
getHTMLLinks(doc, externalOnly = TRUE, xpQuery = "//a/@href",
baseURL = docName(doc), relative = FALSE)
getHTMLExternalFiles(doc, xpQuery = c("//img/@src", "//link/@href",
"//script/@href", "//embed/@src"),
baseURL = docName(doc), relative = FALSE,
asNodes = FALSE, recursive = FALSE)
getHTMLLinks returns a character vector of the links.
getHTMLExternalFiles returns a character vector.
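To see the return values without network access, here is a minimal sketch that parses a small inline page. The HTML snippet and the variable names are made up for illustration. It assumes the XML package is installed and relies on its `htmlParse()` with `asText = TRUE`.

```r
library(XML)

# Hypothetical page used only for this illustration.
html <- paste0(
  "<html><head><link href='style.css' rel='stylesheet'/></head><body>",
  "<a href='http://example.org/page'>external page</a>",
  "<a href='#top'>internal anchor</a>",
  "<a href='notes.html'>relative page</a>",
  "<img src='logo.png'/>",
  "</body></html>"
)

# Parse the string itself rather than treating it as a file name or URL.
doc <- htmlParse(html, asText = TRUE)

# Default externalOnly = TRUE: references of the form #name are left out.
links <- getHTMLLinks(doc)

# externalOnly = FALSE keeps the internal anchors as well.
all_links <- getHTMLLinks(doc, externalOnly = FALSE)

# Files the page refers to through the default xpQuery expressions.
files <- getHTMLExternalFiles(doc)

print(links)
print(all_links)
print(files)
```

Both functions also accept a file name or URL in place of a parsed document, as the calls with a URL further down show.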
getXIncludes
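library(XML)
# Links found on a page (requires network access)
getHTMLLinks("http://www.omegahat.org")
getHTMLLinks("http://www.omegahat.org/RSXML")
# Distinct external files referenced by the page
unique(getHTMLExternalFiles("http://www.omegahat.org"))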