urltools

A package for elegantly handling and parsing URLs from within R.

Authors: Oliver Keyes, Jay Jacobs
License: MIT
Status: Stable

Description

URLs in R are often treated as nothing more than part of data retrieval - they're used for making connections and reading data. With web analytics and research, however, URLs can be the data, and R's default handlers are not best suited to handle vectorised operations over large datasets. urltools is intended to solve this.

It contains drop-in replacements for R's URLdecode and URLencode functions, along with new functionality such as a URL parser and parameter value extractor. In all cases, the functions are designed to be content-safe (not breaking on unexpected values) and fully vectorised, resulting in a dramatic speed improvement over existing implementations - crucial for large datasets. For more information, see the urltools vignette.
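
The three capabilities mentioned above (decoding, parsing, and parameter extraction) can be sketched as follows. This is a minimal illustration, assuming urltools is installed; the example URLs are chosen only for demonstration and are not taken from the package documentation.

```r
# Minimal sketch; assumes install.packages("urltools") has already been run.
library(urltools)

# Illustrative URLs (an assumption made for this example).
urls <- c(
  "https://en.wikipedia.org/wiki/File:Vice_City_Public_Radio_%28logo%29.jpg",
  "https://en.wikipedia.org/w/index.php?title=Main_Page&action=history"
)

# Vectorised percent-decoding, the counterpart to base R's URLdecode.
decoded <- url_decode(urls)

# Split every URL into its components; the result has one row per URL.
parsed <- url_parse(urls)

# Pull out the values of selected query parameters; URLs that lack a
# parameter get NA in the corresponding column.
params <- param_get(urls, c("title", "action"))

print(decoded)
print(parsed)
print(params)
```

Each call takes the whole character vector at once, so no explicit loop or sapply() is needed.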

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.

Installation

The latest CRAN version can be obtained via:

install.packages("urltools")

To get the development version:

devtools::install_github("ironholds/urltools")

Monthly Downloads: 40,094
Version: 1.7.1
License: MIT + file LICENSE

Maintainer: Oliver Keyes
Last Published: August 3rd, 2018

Functions in urltools (1.7.1)

scheme: Get or set a URL's scheme
port: Get or set a URL's port
puny_encode: Encode or decode internationalised domains
url_parse: Split URLs into their component parts
urltools: Tools for handling URLs
path: Get or set a URL's path
url_compose: Recompose parsed URLs
suffix_dataset: Dataset of public suffixes
param_set: Set the value associated with a parameter in a URL's query
suffix_refresh: Retrieve a public suffix dataset
tld_dataset: Dataset of top-level domains (TLDs)
fragment: Get or set a URL's fragment
tld_extract: Extract TLDs
parameters: Get or set a URL's parameters
strip_credentials: Get or remove user authentication credentials
domain: Get or set a URL's domain
host_extract: Extract hosts
param_get: Get the values of a URL's parameters
tld_refresh: Retrieve a TLD dataset
param_remove: Remove key-value pairs from query strings
url_decode: Encode or decode a URI
suffix_extract: Extract the suffix from domain names
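
Several of the functions listed above are described as "get or set" accessors. A brief sketch of that pattern, together with param_set, might look like the following. It assumes urltools is installed, and the URLs are illustrative only.

```r
# Minimal sketch of the get/set accessors; assumes urltools is installed.
library(urltools)

# Illustrative URL (an assumption made for this example).
url <- "http://cran.r-project.org/submit.html"

# Getter form: read individual components.
original_scheme <- scheme(url)
host <- domain(url)

# Setter form: replace a component in place.
scheme(url) <- "https"

# Add (or overwrite) a query parameter, then read it back.
api <- param_set("https://en.wikipedia.org/api.php", key = "pageid", value = "12")
page_id <- param_get(api, "pageid")

print(url)
print(api)
print(page_id)
```

The setter form modifies the vector it is applied to, which keeps component edits readable when working through a column of URLs.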