paragraphs_scrap

The URL of the web page to scrape.

link

Character string used to filter the paragraphs; only paragraphs containing it are kept.

contain

Logical. Should the contain argument be case sensitive? Defaults to FALSE.

case_sensitive

If TRUE, the paragraphs are collapsed into a single element and the contain argument is ignored.

collapse

Logical. Should the function check the site's robots.txt to see whether scraping the web page is allowed? Defaults to FALSE.

askRobot

This function is used to scrape text paragraphs from a website.

The goal of 'ralger' is to facilitate web scraping in R.

Mohamed Fodil Ihaddaden

ralger

Easy Web Scraping

Mohamed El Fodil Ihaddaden

Ezekiel Ogundepo

Romain François

paragraphs_scrap function


Arguments

Website text paragraph scraping — paragraphs_scrap

<dl>

<dt>link</dt>
<dd>The URL of the web page to scrape.</dd>

<dt>contain</dt>
<dd>Character string used to filter the paragraphs; only paragraphs containing it are kept.</dd>

<dt>case_sensitive</dt>
<dd>Logical. Should the contain argument be case sensitive? Defaults to FALSE.</dd>

<dt>collapse</dt>
<dd>If TRUE, the paragraphs are collapsed into a single element and the contain argument is ignored.</dd>

<dt>askRobot</dt>
<dd>Logical. Should the function check the site's robots.txt to see whether scraping the web page is allowed? Defaults to FALSE.</dd>

</dl>
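
How `contain`, `case_sensitive`, and `collapse` interact can be sketched in base R on an in-memory character vector. This only illustrates the behaviour described in the argument list and is not the package's implementation: the helper `filter_paragraphs()` and the sample text are hypothetical, and both the literal matching (`fixed = TRUE`) and the single-space separator used when collapsing are assumptions.

```r
# Hypothetical helper that mimics the documented argument semantics.
# It is NOT part of 'ralger'; it works on a plain character vector.
filter_paragraphs <- function(paragraphs, contain = NULL,
                              case_sensitive = FALSE, collapse = FALSE) {
  if (collapse) {
    # collapse = TRUE: return one element and ignore `contain`
    # (the " " separator is an assumption)
    return(paste(paragraphs, collapse = " "))
  }
  if (is.null(contain)) {
    # No filter requested: return every paragraph
    return(paragraphs)
  }
  if (case_sensitive) {
    keep <- grepl(contain, paragraphs, fixed = TRUE)
  } else {
    # Compare lower-cased copies so that case is ignored
    keep <- grepl(tolower(contain), tolower(paragraphs), fixed = TRUE)
  }
  paragraphs[keep]
}

# Made-up sample paragraphs
sample_paragraphs <- c("Web Scraping in R",
                       "scraping text is easy",
                       "Unrelated paragraph")

strict   <- filter_paragraphs(sample_paragraphs, contain = "Scraping",
                              case_sensitive = TRUE)
relaxed  <- filter_paragraphs(sample_paragraphs, contain = "Scraping")
together <- filter_paragraphs(sample_paragraphs, contain = "Scraping",
                              collapse = TRUE)
```

Here `strict` keeps only the first paragraph, `relaxed` keeps the first two, and `together` is a single string built from all three paragraphs because `collapse = TRUE` overrides the filter.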

paragraphs_scrap: Website text paragraph scraping

Description

Usage

Value

Arguments

Examples
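
A minimal usage sketch, assuming the 'ralger' package is installed and the machine has network access. The URL and the search term are placeholders chosen for illustration and do not come from the package documentation; the calls use only the arguments listed above.

```r
# Placeholder URL; substitute the page you want to scrape.
link <- "https://www.r-project.org/about.html"

# Run the calls only when 'ralger' is available.
if (requireNamespace("ralger", quietly = TRUE)) {
  # Every paragraph on the page, one element per paragraph
  all_paragraphs <- ralger::paragraphs_scrap(link = link)

  # Only the paragraphs mentioning "software", ignoring case
  about_software <- ralger::paragraphs_scrap(
    link = link,
    contain = "software",
    case_sensitive = FALSE
  )

  # All paragraphs collapsed into one element, after consulting robots.txt
  whole_text <- ralger::paragraphs_scrap(
    link = link,
    collapse = TRUE,
    askRobot = TRUE
  )
}
```

Setting `askRobot = TRUE` is a reasonable habit for pages you do not control, since the default of FALSE skips the robots.txt check.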