splashr (version 0.6.0)

Tools to Work with the 'Splash' 'JavaScript' Rendering and Scraping Service

Description

'Splash' is a 'JavaScript' rendering service: a lightweight web browser with an 'HTTP' API, implemented in 'Python' using 'Twisted' and 'Qt'. It provides some of the core functionality of the 'RSelenium' or 'seleniumPipes' R packages with a smaller footprint. 'Splash' features include processing multiple web pages in parallel; retrieving 'HTML' results and/or taking screenshots; disabling images or using 'Adblock Plus' rules to make rendering faster; executing custom 'JavaScript' in the page context; and returning detailed rendering information in 'HAR' format.

Install

install.packages('splashr')
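
Once the package is installed, a first rendering call might look like the sketch below. It assumes a Splash server is already listening on localhost:8050 (for example one launched with start_splash() or directly through Docker); the host, port, and target URL are placeholders rather than values from this page. The call is guarded so that nothing is fetched when no server is reachable.

```r
library(splashr)

# Assumption: a Splash server is reachable at localhost:8050.
sp <- splash("localhost", port = 8050L)

# splash_active() tests whether the server is up; the tryCatch() is a
# defensive wrapper in case the connection attempt raises an error.
up <- tryCatch(isTRUE(splash_active(sp)), error = function(e) FALSE)

if (up) {
  # render_html() returns the HTML of the JavaScript-rendered page
  # as a parsed document.
  pg <- render_html(sp, url = "https://example.com/")

  # Pull the page title out of the rendered document with xml2.
  title_node <- xml2::xml_find_first(pg, "//title")
  print(xml2::xml_text(title_node))
} else {
  message("No Splash server found; start one before rendering.")
}
```

The same connection object can be passed to the other render_* functions listed below, such as render_png() or render_har().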

Monthly Downloads

15

Version

0.6.0

License

MIT + file LICENSE

Maintainer

Bob Rudis

Last Published

February 26th, 2019

Functions in splashr (0.6.0)

as_data_frame.harentry

Turn a "HAR"-like object into a data frame (tibble)
install_splash

Retrieve the Docker image for Splash
json_fromb64

Convert a Base64 encoded string into an R object
render_har

Return information about Splash interaction with a website in HAR format.
render_html

Return the HTML of the JavaScript-rendered page.
splash_perf_stats

Get Splash performance-related statistics
splash_plugins

Enable or disable browser plugins (e.g. Flash).
splash_version

Get Splash version information
splash_wait

Wait for a period of time
get_har_entry

Retrieve an entry by index from a HAR object
get_content_type

Retrieve or test content type of a HAR request object
har_entries

Retrieve just the HAR entries from a splashr request
har_entry_count

Retrieve the number of HAR entries in a response
splash_active

Test if a Splash server is up
splash_add_lua

Add raw Lua code into DSL call chain
splash_enable_javascript

Enable or disable execution of JavaScript code embedded in the page.
get_header_val

Retrieve the value of a specific response header
get_headers

Retrieve response headers as a data frame
killall_splash

Prune all dead and running Splash Docker containers
splash_focus

Focus on a document element provided by a CSS selector
get_request_type

Retrieve or test request type
get_request_url

Retrieve request URL
splash_click

Trigger mouse click event in web page.
print.splashr

Print for splashr
render_jpeg

Return an image (in JPEG format) of the JavaScript-rendered page.
splash_debug

Retrieve debug-level info for a Splash server
as_response

Return a HAR entry response as an httr::response object
as_httr_req

Create an httr verb request function from a HAR request
splash_har_reset

Drops all internally stored HAR records.
render_json

Return a JSON-encoded dictionary with information about the JavaScript-rendered web page.
splash_go

Go to a URL.
splash_har

Return information about Splash interaction with a website in HAR format.
splash_response_body

Enable or disable response content tracking.
splash_history

Get information about requests/responses for the pages loaded
splash_png

Return a screenshot of the current page in PNG format.
splash_html

Return an HTML snapshot of the current page.
splash_press

Trigger mouse press event in web page.
splash_send_keys

Send keyboard events to page context.
start_splash

Start a Splash server Docker container
stop_splash

Stop a running Splash server Docker container
splash_images

Enable/disable images
splash_private_mode

Enable or disable the browser's private (incognito) mode.
execute_lua

Execute a custom rendering script and return a result.
splash_release

Trigger mouse release event in web page.
get_content_size

Retrieve size of content | body | headers
splashr-exports

splashr exported operators
splashr

Tools to Work with the 'Splash' JavaScript Rendering Service
get_response_body

Retrieve the body content of a HAR entry
render_png

Return an image (in PNG format) of the JavaScript-rendered page.
get_response_url

Retrieve response URL
splash

Configure parameters for connecting to a Splash server
splash_send_text

Send text as input to page context, literally, character by character.
splash_user_agent

Overwrite the User-Agent header for all further requests.
as_har

Turn a generic Splash HAR response into a HAR object
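
Several of the splash_* functions above are meant to be chained into a single scripted interaction (the "DSL call chain" mentioned under splash_add_lua). The following is a minimal sketch of such a chain, assuming a Splash server on localhost:8050 and a placeholder URL. The argument values and the exact shape of the returned objects are assumptions to check against each function's help page.

```r
library(splashr)

# Assumption: a Splash server is reachable at localhost:8050.
sp <- splash("localhost", port = 8050L)
up <- tryCatch(isTRUE(splash_active(sp)), error = function(e) FALSE)

if (up) {
  # Build the call chain: track response bodies, load the page,
  # pause for two seconds, then return the interaction as HAR.
  har <- sp %>%
    splash_response_body(TRUE) %>%
    splash_go("https://example.com/") %>%
    splash_wait(2) %>%
    splash_har()

  # Inspect the result with the HAR helpers listed above.
  print(har_entry_count(har))          # number of recorded entries
  entries <- har_entries(har)          # just the HAR entries
  first_entry <- get_har_entry(har, 1) # a single entry by index
  str(first_entry, max.level = 1)
} else {
  message("No Splash server found; start one before running the chain.")
}
```

Ending the chain with splash_html() or splash_png() instead of splash_har() would return a page snapshot or screenshot rather than the HAR log.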