urltools v1.7.1

Vectorised Tools for URL Handling and Parsing

A toolkit for URL handling: encoding and decoding, parsing, and parameter extraction and modification. All functions are designed to be both fast and fully vectorised. It is intended for people working with web-related datasets, such as server-side logs, although it may also be useful in other situations involving large sets of URLs.

urltools

A package for elegantly handling and parsing URLs from within R.

Authors: Oliver Keyes, Jay Jacobs
License: MIT
Status: Stable

Description

URLs in R are often treated as nothing more than part of data retrieval - they're used for making connections and reading data. With web analytics and research, however, URLs can be the data, and R's default handlers are not best suited to handle vectorised operations over large datasets. urltools is intended to solve this.

It contains drop-in replacements for R's URLdecode and URLencode functions, along with new functionality such as a URL parser and parameter value extractor. In all cases, the functions are designed to be content-safe (not breaking on unexpected values) and fully vectorised, resulting in a dramatic speed improvement over existing implementations - crucial for large datasets. For more information, see the urltools vignette.
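As a rough illustration of the functionality described above, the sketch below decodes, parses and extracts parameters from a small vector of URLs. The sample URLs are made up for this example, and the calls assume urltools is installed from CRAN; treat it as a minimal sketch rather than a complete tour of the package.

```r
library(urltools)

# Hypothetical sample data; the second URL is made up for illustration.
urls <- c(
  "https://en.wikipedia.org/wiki/File:Vice_City_Public_Radio_%28logo%29.jpg",
  "http://example.com/search?q=urltools&page=2"
)

# Vectorised percent-decoding: one call handles the whole character vector.
decoded <- url_decode(urls)

# Split every URL into its components; the result is a data.frame
# with one row per input URL.
parsed <- url_parse(urls)

# Extract the values of named query parameters; URLs that lack a
# parameter get NA in the corresponding column.
params <- param_get(urls, c("q", "page"))

print(decoded)
print(parsed[, c("scheme", "domain", "path")])
print(params)
```

Because every function takes and returns whole vectors, the same calls should work unchanged on a column of millions of logged URLs without an explicit loop.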

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.

Installation

The latest CRAN version can be obtained via:

install.packages("urltools")

To get the development version:

devtools::install_github("ironholds/urltools")

Dependencies

Functions in urltools

Name               Description
scheme             Get or set a URL's scheme
port               Get or set a URL's port
puny_encode        Encode or decode internationalised domains
url_parse          Split URLs into their component parts
urltools           Tools for handling URLs
path               Get or set a URL's path
url_compose        Recompose parsed URLs
suffix_dataset     Dataset of public suffixes
param_set          Set the value associated with a parameter in a URL's query
suffix_refresh     Retrieve a public suffix dataset
tld_dataset        Dataset of top-level domains (TLDs)
fragment           Get or set a URL's fragment
tld_extract        Extract TLDs
parameters         Get or set a URL's parameters
strip_credentials  Get or remove user authentication credentials
domain             Get or set a URL's domain
host_extract       Extract hosts
param_get          Get the values of a URL's parameters
tld_refresh        Retrieve a TLD dataset
param_remove       Remove key-value pairs from query strings
url_decode         Encode or decode a URI
suffix_extract     Extract the suffix from domain names
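
Several of the functions listed above are component getters and setters. The following sketch, using a made-up example URL, shows how they might be combined to inspect and modify a URL; it assumes the CRAN release of urltools and is an illustration rather than a reference for every argument.

```r
library(urltools)

# Hypothetical URL used only for illustration.
url <- "https://www.example.com/path/page.html?lang=en&debug=1"

# Getters return one value per input URL.
url_scheme <- scheme(url)
url_domain <- domain(url)
url_path   <- path(url)

# Modify the query string: change one parameter, then drop another.
modified <- param_set(url, key = "lang", value = "fr")
modified <- param_remove(modified, keys = "debug")

# Setters use R's replacement-function syntax.
domain(modified) <- "example.org"

# Parse into a data.frame of components and recompose into a URL string.
parts <- url_parse(modified)
recomposed <- url_compose(parts)

print(c(url_scheme, url_domain, url_path))
print(modified)
print(recomposed)
```

Like the rest of the package, these functions accept character vectors, so `url` could equally be a vector of many URLs.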

Vignettes of urltools

Name
urltools.Rmd

Details

Type Package
Date 2018-07-26
License MIT + file LICENSE
LazyData TRUE
LinkingTo Rcpp
URL https://github.com/Ironholds/urltools/
BugReports https://github.com/Ironholds/urltools/issues
VignetteBuilder knitr
RoxygenNote 6.0.1
Encoding UTF-8
NeedsCompilation yes
Packaged 2018-07-26 18:36:17 UTC; ironholds
Repository CRAN
Date/Publication 2018-08-03 08:20:02 UTC
