A newer version (0.6.0) of this package is available.

splashr (version 0.4.0)

Tools to Work with the 'Splash' 'JavaScript' Rendering and Scraping Service

Description

'Splash' is a 'JavaScript' rendering service. It is a lightweight web browser with an 'HTTP' API, implemented in 'Python' using 'Twisted' and 'QT', and it provides some of the core functionality of the 'RSelenium' or 'seleniumPipes' R packages with a smaller footprint. 'Splash' features include processing multiple web pages in parallel; retrieving 'HTML' results and/or taking screenshots; disabling images or using 'Adblock Plus' rules to make rendering faster; executing custom 'JavaScript' in the page context; and getting detailed rendering information in 'HAR' format.

Install

install.packages('splashr')

Monthly Downloads

19

Version

0.4.0

License

AGPL

Maintainer

Bob Rudis

Last Published

August 29th, 2017

Functions in splashr (0.4.0)

get_har_entry

Retrieve an entry by index from a HAR object
as_har

Turn a generic Splash HAR response into a HAR object
get_request_url

Retrieve request URL
get_content_size

Retrieve size of content | body | headers
as_httr_req

Create an httr verb request function from a HAR request
get_request_type

Retrieve or test request type
render_har

Return information about Splash interaction with a website in HAR format.
get_response_body

Retrieve the body content of a HAR entry
render_jpeg

Return an image (in JPEG format) of the JavaScript-rendered page.
render_html

Return the HTML of the JavaScript-rendered page.
render_json

Return a JSON-encoded dictionary with information about the JavaScript-rendered webpage.
get_content_type

Retrieve or test content type of a HAR request object
splash_active

Test if a Splash server is up
render_png

Return an image (in PNG format) of the JavaScript-rendered page.
splash_add_lua

Add raw Lua code to the DSL call chain
splash_har_reset

Drops all internally stored HAR records.
splash_html

Return an HTML snapshot of the current page.
splash_history

Get information about requests/responses for the pages loaded
splash_images

Enable/disable images
splash_version

Get Splash version information
splash

Configure parameters for connecting to a Splash server
splash_private_mode

Enable or disable the browser's private mode (incognito mode).
as_response

Return a HAR entry response as an httr::response object
splash_wait

Wait for a period of time
splash_go

Go to a URL.
har_entries

Retrieve just the HAR entries from a splashr request
splash_har

Return information about Splash interaction with a website in HAR format.
splash_release

Trigger mouse release event in web page.
har_entry_count

Retrieve the number of HAR entries in a response
execute_lua

Execute a custom rendering script and return a result.
splash_click

Trigger mouse click event in web page.
splash_png

Return a screenshot of the current page in PNG format.
splash_press

Trigger mouse press event in web page.
install_splash

Retrieve the Docker image for Splash
splashr-exports

splashr exported operators
splash_debug

Retrieve debug-level info for a Splash server
splash_response_body

Enable or disable response content tracking.
splash_send_keys

Send keyboard events to page context.
splashr

Tools to Work with the 'Splash' JavaScript Rendering Service
splash_send_text

Send text as input to page context, literally, character by character.
render_file

Return the HTML or image (PNG) of the JavaScript-rendered page in a local file
splash_user_agent

Overwrite the User-Agent header for all further requests.
splash_enable_javascript

Enable or disable execution of JavaScript code embedded in the page.
splash_focus

Focus on a document element provided by a CSS selector
splash_perf_stats

Get Splash performance-related statistics
splash_plugins

Enable or disable browser plugins (e.g. Flash).
start_splash

Start a Splash server Docker container
stop_splash

Stop a running Splash server Docker container
killall_splash

Prune all dead and running Splash Docker containers
json_fromb64

Convert a Base64 encoded string into an R object
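
Example

The functions listed above fall into two groups: one-shot render_* calls, and splash_* calls that are chained into a DSL script. The following is a minimal sketch of how they might be combined, not an excerpt from the package documentation. It assumes a Splash server is already listening on localhost at the default port; the host name and the URL are placeholders, and argument order follows the common pattern of passing the Splash connection object first.

```r
library(splashr)

# Assumption: a Splash server is reachable on "localhost" (placeholder host).
sp <- splash("localhost")

# Only talk to the server if it responds.
if (splash_active(sp)) {

  # One-shot rendering: HTML of the page after JavaScript has run.
  # The URL is a placeholder.
  pg <- render_html(sp, "https://example.com/")

  # DSL chain: navigate, wait briefly for rendering, then take a screenshot.
  shot <- sp %>%
    splash_go("https://example.com/") %>%
    splash_wait(2) %>%
    splash_png()
}
```

The splash_active() guard keeps the script from issuing render requests when no server is running. If Docker is available, install_splash() and start_splash() from the list above are the package's route to getting a local server.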