titles_scrap

the link of the web page to scrape

link

filter the titles according to a character string provided.

contain

logical. Should the contain argument be case sensitive ? defaults to FALSE

case_sensitive

logical. Should the function ask the robots.txt if we're allowed or not to scrape the web page ? Default is FALSE

askRobot

This function is used to scrape titles (h1, h2 &amp; h3 html tags) from a website. Useful for scraping daily electronic newspapers' titles.

The goal of 'ralger' is to facilitate web scraping in R.

Mohamed Fodil Ihaddaden

ralger

Easy Web Scraping

Mohamed El Fodil Ihaddaden

Ezekiel Ogundepo

Romain François

titles_scrap function

<dl><dt>link</dt>
<dd>the link of the web page to scrape</dd>
<dt>contain</dt>
<dd>filter the titles according to a character string provided.</dd>
<dt>case_sensitive</dt>
<dd>logical. Should the contain argument be case sensitive ? defaults to FALSE</dd>
<dt>askRobot</dt>
<dd>logical. Should the function ask the robots.txt if we're allowed or not to scrape the web page ? Default is FALSE</dd></dl>

Arguments

Website title scraping — titles_scrap

<dl>

<dt>link</dt>
<dd>the link of the web page to scrape</dd>


<dt>contain</dt>
<dd>filter the titles according to a character string provided.</dd>


<dt>case_sensitive</dt>
<dd>logical. Should the contain argument be case sensitive ? defaults to FALSE</dd>


<dt>askRobot</dt>
<dd>logical. Should the function ask the robots.txt if we're allowed or not to scrape the web page ? Default is FALSE</dd>

</dl>

titles_scrap: Website title scraping

Description

Usage

Value

Arguments

Examples