tld_extract

a vector of domains, retrieved through <code><a rd-options="" href="/link/url_parse?package=urltools&version=1.7.1" data-mini-rdoc="urltools::url_parse">url_parse</a></code> or
<code><a rd-options="" href="/link/domain?package=urltools&version=1.7.1" data-mini-rdoc="urltools::domain">domain</a></code>.

domains

a dataset of TLDs. If NULL (the default), <code>tld_extract</code> relies
on urltools' <code><a rd-options="" href="/link/tld_dataset?package=urltools&version=1.7.1" data-mini-rdoc="urltools::tld_dataset">tld_dataset</a></code>; otherwise, you can pass in the result of
<code><a rd-options="" href="/link/tld_refresh?package=urltools&version=1.7.1" data-mini-rdoc="urltools::tld_refresh">tld_refresh</a></code>.

tlds

<code>tld_extract</code> extracts the top-level domain (TLD) from
a vector of domain names. This is distinct from the suffixes, extracted with
<code><a rd-options="" href="/link/suffix_extract?package=urltools&version=1.7.1" data-mini-rdoc="urltools::suffix_extract">suffix_extract</a></code>; TLDs are top level, while suffixes are just
domains through which internet users can publicly register domains (the difference
between <code>.org.uk</code> and <code>.uk</code>).

A toolkit for all URL-handling needs, including encoding and decoding,
parsing, parameter extraction and modification. All functions are
designed to be both fast and entirely vectorised. It is intended to be
useful for people dealing with web-related datasets, such as server-side
logs, although may be useful for other situations involving large sets of
URLs.

Oliver Keyes

urltools

Vectorised Tools for URL Handling and Parsing

tld_extract function

a vector of domains, retrieved through <code><a rd-options='' href='url_parse'>url_parse</a></code> or
<code><a rd-options='' href='domain'>domain</a></code>.

a dataset of TLDs. If NULL (the default), <code>tld_extract</code> relies
on urltools' <code><a rd-options='' href='tld_dataset'>tld_dataset</a></code>; otherwise, you can pass in the result of
<code><a rd-options='' href='tld_refresh'>tld_refresh</a></code>.

<code>tld_extract</code> extracts the top-level domain (TLD) from
a vector of domain names. This is distinct from the suffixes, extracted with
<code><a rd-options='' href='suffix_extract'>suffix_extract</a></code>; TLDs are top level, while suffixes are just
domains through which internet users can publicly register domains (the difference
between <code>.org.uk</code> and <code>.uk</code>).

tld_extract: Extract TLDs

Description

Usage

Arguments

Value

See Also

Examples