urltools v1.7.3
Vectorised Tools for URL Handling and Parsing
A toolkit for all URL-handling needs, including encoding and decoding,
parsing, parameter extraction and modification. All functions are
designed to be both fast and entirely vectorised. It is intended to be
useful for people dealing with web-related datasets, such as server-side
logs, although it may also be useful in other situations involving large sets of
URLs.
Readme
urltools
A package for elegantly handling and parsing URLs from within R.
Author: Oliver Keyes, Jay Jacobs
License: MIT
Status: Stable
Description
URLs in R are often treated as nothing more than part of data retrieval -
they're used for making connections and reading data. With web analytics
and research, however, URLs can be the data, and R's default handlers
are not best suited to handle vectorised operations over large datasets. urltools is intended to solve this.
It contains drop-in replacements for R's URLdecode and URLencode functions, along with new functionality such as a URL parser and parameter value extractor. In all cases, the functions are designed to be content-safe (not breaking on unexpected values) and fully vectorised, resulting in a dramatic speed improvement over existing implementations - crucial for large datasets. For more information, see the urltools vignette.
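As a rough illustration of the functionality described above, the sketch below runs decoding, encoding, parsing and parameter extraction over a small character vector. The URLs and the parameter names `q` and `page` are made up for the example; only the urltools functions themselves come from the package.

```r
library(urltools)

# Hypothetical example URLs; each function below takes the whole vector at once
urls <- c(
  "https://www.example.org/docs/index.html?lang=en#intro",
  "http://example.com:8080/search?q=url%20tools&page=2"
)

# Vectorised counterparts to base R's URLdecode and URLencode
decoded <- url_decode(urls)
encoded <- url_encode(decoded)

# Split every URL into its component parts, one row per URL
parsed <- url_parse(urls)

# Pull out the values of the named query parameters, one column per name;
# URLs that lack a parameter get NA in that column
params <- param_get(urls, c("q", "page"))
```

Because the results line up with the input vector, they can be bound directly onto an existing data frame of log entries.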
Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.
Installation
The latest CRAN version can be obtained via:
install.packages("urltools")
To get the development version:
devtools::install_github("ironholds/urltools")
Dependencies
- R. Doy.
- Rcpp
Functions in urltools
Name | Description |
port | Get or set a URL's port |
param_get | Get the values of a URL's parameters |
tld_refresh | Retrieve a TLD dataset |
url_compose | Recompose parsed URLs |
puny_encode | Encode or decode internationalised domains |
tld_extract | Extract TLDs |
url_parse | Split URLs into their component parts |
suffix_refresh | Retrieve a public suffix dataset |
suffix_extract | Extract the suffix from domain names |
tld_dataset | Dataset of top-level domains (TLDs) |
scheme | Get or set a URL's scheme |
suffix_dataset | Dataset of public suffixes |
urltools | Tools for handling URLs |
strip_credentials | Get or remove user authentication credentials |
domain | Get or set a URL's domain |
host_extract | Extract hosts |
param_remove | Remove key-value pairs from query strings |
param_set | Set the value associated with a parameter in a URL's query |
parameters | Get or set a URL's parameters |
url_decode | Encode or decode a URI |
path | Get or set a URL's path |
fragment | Get or set a URL's fragment |
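Several of the functions listed above are "get or set" accessors. The following is a minimal sketch of how they might be used together with `param_set` and `param_remove`; the URLs and parameter names are invented for illustration.

```r
library(urltools)

# Hypothetical URL used only for this example
url <- "https://www.example.org:443/docs/index.html?lang=en&ref=home#intro"

# Accessors read one component from each URL in a character vector
url_scheme   <- scheme(url)
url_domain   <- domain(url)
url_port     <- port(url)
url_path     <- path(url)
url_query    <- parameters(url)
url_fragment <- fragment(url)

# The same names also work as replacement functions
domain(url) <- "www.example.net"

# Query-string edits: overwrite one parameter, then drop another
query_url <- "https://www.example.org/search?lang=en&ref=home"
query_url <- param_set(query_url, key = "lang", value = "fr")
query_url <- param_remove(query_url, keys = "ref")
```

The accessors and setters take character vectors, so the same calls apply unchanged to a whole column of URLs.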
Vignettes of urltools
Name |
urltools.Rmd |
Details
Type | Package |
Date | 2019-04-14 |
License | MIT + file LICENSE |
LazyData | TRUE |
LinkingTo | Rcpp |
URL | https://github.com/Ironholds/urltools/ |
BugReports | https://github.com/Ironholds/urltools/issues |
VignetteBuilder | knitr |
RoxygenNote | 6.1.1 |
Encoding | UTF-8 |
NeedsCompilation | yes |
Packaged | 2019-04-14 22:25:08 UTC; ironholds |
Repository | CRAN |
Date/Publication | 2019-04-14 23:02:47 UTC |
Suggests | knitr, testthat |
Imports | methods, Rcpp, triebeard |
Depends | R (>= 2.10) |
Contributors | Mark Greenaway, Jay Jacobs, Drew Schmidt, Bob Rudis, Alex Pinto, Maryam Khezrzadeh, Adam Costello, Jeff Bezanson, Peter Meilstrup, Xueyuan Jiang, Os Keyes |