ralger (version 2.3.0)

scrap: Simple website scraping

Description

Scrapes the content matching a single HTML or CSS selector from a web page.

Usage

scrap(link, node, clean = FALSE, askRobot = FALSE)

Value

A character vector containing the scraped content.

Arguments

link

the link of the web page to scrape

node

the HTML or CSS element to consider. The SelectorGadget tool is highly recommended for identifying it.

clean

logical. Should the function clean the extracted vector? Default is FALSE.

askRobot

logical. Should the function check the site's robots.txt to see whether scraping the web page is allowed? Default is FALSE.
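
To make the node argument more concrete, the following is a minimal sketch of how a CSS selector picks elements out of a page. It uses rvest directly on a made-up inline HTML snippet rather than scrap() itself, so it runs without network access. The HTML content and movie names are hypothetical, and this is an illustration of selector matching in general, not necessarily how ralger performs the lookup internally.

```r
library(rvest)

# Hypothetical inline page standing in for a real web page
page <- minimal_html('
  <h3 class="ipc-title__text">1. First Movie</h3>
  <h3 class="ipc-title__text">2. Second Movie</h3>
  <p class="other">Not a title</p>
')

# The same kind of selector you would pass as `node`:
# every <h3> element carrying the class "ipc-title__text"
node <- "h3.ipc-title__text"

# Select the matching elements and extract their text
titles <- html_text(html_elements(page, node))
titles
```

Only the two h3 elements match the selector; the paragraph is ignored because neither its tag nor its class fits.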

Examples

# \donttest{
# Extract the IMDb Top 250 movie titles
library(ralger)
link <- "https://www.imdb.com/chart/top/"
node <- "h3.ipc-title__text"
scrap(link, node)
# }
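
The optional arguments can be switched on in the same call. The sketch below reuses the link and node from the example above and only sets the two documented flags. It needs network access, and both the result and the selector's validity depend on the live page, so no output is shown. The tryCatch wrapper and the variable name titles are additions for illustration, not part of the package.

```r
library(ralger)

link <- "https://www.imdb.com/chart/top/"
node <- "h3.ipc-title__text"

# clean = TRUE asks scrap() to clean the extracted vector;
# askRobot = TRUE asks it to consult robots.txt before scraping.
# tryCatch guards against network failures in this sketch.
titles <- tryCatch(
  scrap(link, node, clean = TRUE, askRobot = TRUE),
  error = function(e) NULL
)

# Peek at the first few entries, if anything was returned
head(titles)
```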