Learn R Programming

⚠️There's a newer version (0.6.0) of this package.Take me there.

splashr (version 0.4.0)

Tools to Work with the 'Splash' 'JavaScript' Rendering and Scraping Service

Description

'Splash' is a 'JavaScript' rendering service. It is a lightweight web browser with an 'HTTP' API, implemented in 'Python' using 'Twisted' and 'QT' and provides some of the core functionality of the 'RSelenium' or 'seleniumPipes' R packages in a lightweight footprint. Some of 'Splash' features include the ability to process multiple web pages in parallel; retrieving 'HTML' results and/or take screen shots; disabling images or use 'Adblock Plus' rules to make rendering faster; executing custom 'JavaScript' in page context; getting detailed rendering info in 'HAR' format.

Copy Link

Version

Install

install.packages('splashr')

Monthly Downloads

7

Version

0.4.0

License

AGPL

Issues

Pull Requests

Stars

Forks

Maintainer

Bob Rudis

Last Published

August 29th, 2017

Functions in splashr (0.4.0)

get_har_entry

Retrieve an entry by index from a HAR object
as_har

Turn a generic Splash HAR response into a HAR object
get_request_url

Retrieve request URL
get_content_size

Retrieve size of content | body | headers
as_httr_req

Create an httr verb request function from an HAR request
get_request_type

Retrieve or test request type
render_har

Return information about Splash interaction with a website in HAR format.
get_response_body

Retrieve the body content of a HAR entry
render_jpeg

Return a image (in JPEG format) of the javascript-rendered page.
render_html

Return the HTML of the javascript-rendered page.
render_json

Return a json-encoded dictionary with information about javascript-rendered webpage.
get_content_type

Retrieve or test content type of a HAR request object
splash_active

Test if a Splash server is up
render_png

Return a image (in PNG format) of the javascript-rendered page.
splash_add_lua

Add raw lua code into DSL call chain
splash_har_reset

Drops all internally stored HAR records.
splash_html

Return a HTML snapshot of a current page.
splash_history

Get information about requests/responses for the pages loaded
splash_images

Enable/disable images
splash_version

Get Splash version information
splash

Configure parameters for connecting to a Splash server
splash_private_mode

Enable or disable execution of JavaSript code embedded in the page.
as_response

Return a HAR entry response as an httr::response object
splash_wait

Wait for a period time
splash_go

Go to an URL.
har_entries

Retrieve just the HAR entries from a splashr request
splash_har

Return information about Splash interaction with a website in HAR format.
splash_release

Trigger mouse release event in web page.
har_entry_count

Retrieves number of HAR entries in a response
execute_lua

Execute a custom rendering script and return a result.
splash_click

Trigger mouse click event in web page.
splash_png

Return a screenshot of a current page in PNG format.
splash_press

Trigger mouse press event in web page.
install_splash

Retrieve the Docker image for Splash
splashr-exports

splashr exported operators
splash_debug

Retrieve debug-level info for a Splash server
splash_response_body

Enable or disable response content tracking.
splash_send_keys

Send keyboard events to page context.
splashr

Tools to Work with the 'Splash' JavaScript Rendering Service
splash_send_text

Send text as input to page context, literally, character by character.
render_file

Return the HTML or image (png) of the javascript-rendered page in a local file
splash_user_agent

Overwrite the User-Agent header for all further requests.
splash_enable_javascript

Enable or disable execution of JavaSript code embedded in the page.
splash_focus

Focus on a document element provided by a CSS selector
splash_perf_stats

Get Splash performance-related statistics
splash_plugins

Enable or disable browser plugins (e.g. Flash).
start_splash

Start a Splash server Docker container
stop_splash

Stop a running a Splash server Docker container
killall_splash

Prune all dead and running Splash Docker containers
json_fromb64

Convert a Base64 encoded string into an R object