urltools (version 1.2.0)

suffix_extract: extract the suffix from domain names

Description

Domain names have suffixes - common endings under which people can (or could once) register domains. These include simple entries like ".org", but also multi-part suffixes like ".edu.co". A plain Top Level Domain list, as a result, probably won't cut it.

suffix_extract takes the list of public suffixes, as maintained by Mozilla (see suffix_dataset) and a vector of domain names, and produces a data.frame containing the suffix that each domain uses, and the remaining fragment.

Usage

suffix_extract(domains)

Arguments

domains
a vector of domains, obtained from domain or url_parse. Alternately, full URLs can be provided; these will first be run through domain internally.

Value

  • a data.frame of two columns, "domain_body" and "suffix". "domain_body" contains the part of the domain name that came before the matched suffix, and "suffix" contains the matched suffix itself. If a suffix cannot be extracted, domain_body will contain the entire domain, and suffix the string "Invalid".
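
To illustrate the return value described above, a minimal sketch (assuming the urltools package is installed and the columns behave as documented for this version):

```r
library(urltools)

# A registrable domain alongside a hostname with no public suffix.
domains <- c("en.wikipedia.org", "localhost")
result <- suffix_extract(domains)

# Per the documentation above, "domain_body" holds the part before the
# matched suffix; "suffix" holds the match itself. For "localhost", no
# suffix can be matched, so the row falls into the "Invalid" case.
str(result)
```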

See Also

suffix_dataset for the dataset of suffixes, and suffix_refresh for refreshing it.

Examples

# Using url_parse
domain_name <- url_parse("http://en.wikipedia.org")$domain
suffix_extract(domain_name)

# Using domain()
domain_name <- domain("http://en.wikipedia.org")
suffix_extract(domain_name)

# Using internal parsing
suffix_extract("http://en.wikipedia.org")
