Learn R Programming

dodgr: Distances on Directed Graphs in R

dodgr is an R package for efficient calculation of many-to-many pairwise distances on dual-weighted directed graphs, for aggregation of flows throughout networks, and for highly realistic routing through street networks (time-based routing considering incline, turn-angles, surface quality, everything).

Note that most dodgr algorithms implement parallel computation with the RcppParallel library, and by default use the maximal number of available cores or threads. If you do not wish dodgrto use all available threads, please reduce the number manually by first specifying a value via

RcppParallel::setThreadOptions (numThreads = 1L) # or desired number

What’s so special?

Four aspects. First, while other packages exist for calculating distances on directed graphs, notably igraph, even that otherwise fabulous package does not (readily) permit analysis of dual-weighted graphs. Dual-weighted graphs have two sets of weights for each edge, so routing can be evaluated with one set of weights, while distances can be calculated with the other. A canonical example is a street network, where weighted distances are assigned depending on mode of transport (for example, weighted distances for pedestrians on multi-lane vehicular roads are longer than equivalent distances along isolated walking paths), yet the desired output remains direct, unweighted distances. Accurate calculation of distances on street networks requires a dual-weighted representation. In R, dodgr is currently the only package that offers this functionality (without excessive data wrangling).

Second, while igraph and almost all other routing packages are primarily designed for one-to-one routing, dodgr is specifically designed for many-to-many routing, and will generally outperform equivalent packages in large routing tasks.

Third, dodgr goes beyond the functionality of comparable packages through including routines to aggregate flows throughout a network, through specifying origins, destinations, and flow densities between each pair of points. Alternatively, flows can be aggregated according to a network dispersal model from a set of origin points and associated densities, and a user-specified dispersal model.

Fourth and finally, dodgr implements highly realistic and fully-customisable profiles for routing through street networks with various modes of transport, and using either distance- or time-based routing. Routing can include such factors as waiting times at traffic lights, delays for turning across oncoming traffic, access restrictions, and the effects of elevation on both cyclists and pedestrians. See the dedicated vignette on street networks and time-based routing for more detail.

Installation

You can install latest stable version of dodgr from CRAN with:

install.packages ("dodgr") # current CRAN version

Alternatively, current development versions can be installed using any of the following options:

# install.packages("remotes")
remotes::install_git ("https://git.sr.ht/~mpadge/dodgr")
remotes::install_git ("https://codeberg.org/UrbanAnalyst/dodgr")
remotes::install_bitbucket ("UrbanAnalyst/dodgr")
remotes::install_gitlab ("UrbanAnalyst/dodgr")
remotes::install_github ("UrbanAnalyst/dodgr")

Then load with

library (dodgr)
packageVersion ("dodgr")
#> [1] '0.2.21'

Important Note

While dodgr works with any arbitrary networks, it also includes numerous functions explicitly intended to be applied to geodesic coordinates, which are identified whenever input data have columns labelled “longitude” and “latitude”, or similar. Coordinates for such data must be in the EPSG:4326 (WGS84) coordinate system. dodgr treats coordinates as numbers only, and it is up to the user to ensure appropriate transformation to WGS84 coordinates prior to submitting data to dodgr functions.

Usage: Sample Data and dodgr networks

To illustrate functionality, the package includes an example data set containing the Open Street Map network for Hampi, India (a primarily pedestrian village in the middle of a large World Heritage zone). These data are in Simple Features (sf) format, as a collection of LINESTRING objects. dodgr represents networks as a simple rectangular graph, with each row representing an edge segment between two points or vertices. sf-format objects can be converted to equivalent dodgr representations with the weight_streetnet() function:

class (hampi)
#> [1] "sf"         "data.frame"
dim (hampi)
#> [1] 236  15
graph <- weight_streetnet (hampi, wt_profile = "foot")
class (graph)
#> [1] "data.frame"      "dodgr_streetnet"
dim (graph)
#> [1] 6813   15

The sf-format network contained 236 LINESTRING objects, with the weight_streetnet() function decomposing these into 6,813 distinct edges, indicating that the sf representation had around 29 edges or segments in each LINESTRING object. The dodgr network then looks like this:

head (graph)
geom_numedge_idfrom_idfrom_lonfrom_latto_idto_lonto_latdd_weightedhighwayway_idcomponenttimetime_weighted
1133931850076.4749115.3416733931850276.4761215.34173130.000241130.000241path28565950193.60017493.600174
1233931850276.4761215.3417333931850076.4749115.34167130.000241130.000241path28565950193.60017493.600174
1333931850276.4761215.34173239895802876.4762115.341748.8906228.890622path2856595016.4012486.401248
14239895802876.4762115.3417433931850276.4761215.341738.8906228.890622path2856595016.4012486.401248
15239895802876.4762115.34174142711607776.4762815.341799.3077369.307736path2856595016.7015706.701570
16142711607776.4762815.34179239895802876.4762115.341749.3077369.307736path2856595016.7015706.701570

The geom_num column maps directly onto the sequence of LINESTRING objects within the sf-formatted data. The highway column is taken directly from Open Street Map, and denotes the kind of “highway” represented by each edge. The component column is an integer value describing which of the connected components of the network each edge belongs to (with 1 always being the largest component; 2 the second largest; and so on).

Note that the d_weighted values are often greater than the geometric distances, d. In the example shown, service highways are not ideal for pedestrians, and so weighted distances are slightly greater than actual distances. Compare this with:

head (graph [graph$highway == "path", ])
geom_numedge_idfrom_idfrom_lonfrom_latto_idto_lonto_latdd_weightedhighwayway_idcomponenttimetime_weighted
1133931850076.4749115.3416733931850276.4761215.34173130.000241130.000241path28565950193.60017493.600174
1233931850276.4761215.3417333931850076.4749115.34167130.000241130.000241path28565950193.60017493.600174
1333931850276.4761215.34173239895802876.4762115.341748.8906228.890622path2856595016.4012486.401248
14239895802876.4762115.3417433931850276.4761215.341738.8906228.890622path2856595016.4012486.401248
15239895802876.4762115.34174142711607776.4762815.341799.3077369.307736path2856595016.7015706.701570
16142711607776.4762815.34179239895802876.4762115.341749.3077369.307736path2856595016.7015706.701570

A "path" offers ideal walking conditions, and so weighted distances are equal to actual distances.

Usage: Distances and Times

The many-to-many nature of dodgr means that the function to calculate distances, dodgr_distances() or, for street networks, times, dodgr_times(), accepts two vectors or matrices of routing points as inputs (describing origins and destinations), and returns a corresponding matrix of pairwise distances. If an input graph has columns for both distances and weighted distances, and/or times and weighted times, the weighted versions are used to determine the effectively shortest or fastest routes through a network, while actual distances or times are summed along the routes to calculate final values. It is of course also possible to calculate distances along fastest routes, times along shortest routes, or any combination thereof, as detailed in the package vignette on street networks and time-based routing.

Routing points can, for example, be randomly selected from the vertices of a graph. The vertices can in turn be extracted with the dodgr_vertices() function:

v <- dodgr_vertices (graph)
head (v)
idxycomponentn
133931850076.4749115.3416710
233931850276.4761215.3417311
4239895802876.4762115.3417412
6142711607776.4762815.3417913
8779971091676.4763415.3418414
1033931850376.4764115.3419015

For OSM data extracted with the osmdata package (or, equivalently, via the dodgr::dodgr_streetnet() function), each object (vertices, ways, and high-level relations between these objects) is assigned a unique identifying number. These are retained both in osmdata and dodgr, as the way_id column in the above graph, and as the id column in the vertices. Random vertices may be generated in this case through selecting id values:

from <- sample (v$id, size = 20)
to <- sample (v$id, size = 50)
d <- dodgr_dists (graph = graph, from = from, to = to)
dim (d)
#> [1] 20 50

Alternatively, the points may be specified as matrices of geographic coordinates:

from_x <- min (graph$from_lon) + runif (20) * diff (range (graph$from_lon))
from_y <- min (graph$from_lat) + runif (20) * diff (range (graph$from_lat))
to_x <- min (graph$from_lon) + runif (50) * diff (range (graph$from_lon))
to_y <- min (graph$from_lat) + runif (50) * diff (range (graph$from_lat))
d <- dodgr_dists (graph = graph, from = cbind (from_x, from_y), to = cbind (to_x, to_y))

In this case, the random points will be mapped on to the nearest points on the street network. This may, of course, map some points onto minor, disconnected components of the graph. This can be controlled either by reducing the graph to it’s largest connected component only:

graph <- graph [graph$component == 1, ]
nrow (graph)

or by explicitly using the match_points_to_verts() function with the option connected = TRUE:

from <- match_points_to_verts (v, cbind (from_x, from_y), connected = TRUE)
to <- match_points_to_verts (v, cbind (to_x, to_y), connected = TRUE)

This function returns an index into the result of dodgr_vertices, and so points to use for routing must then be extracted as follows:

from <- v$id [from] # or from <- v [from, c ("x", "y")]
to <- v$id [to]
d <- dodgr_dists (graph = graph, from = from, to = to)

Usage: Flow Aggregation

Flow aggregation refers to the procedure of routing along multiple ways according to specified densities of flow between defined origin and destination points, and aggregating flows along each edge of the network. The procedure is functionally similar to the above procedure for distances, with the addition of a matrix specifying pairwise flow densities between the input set of origin (from) and destination (to) points. The following example illustrates use with a random “flow matrix”:

flows <- array (runif (length (from) * length (to)), dim = c (length (from), length (to)))
length (from)
#> [1] 20
length (to)
#> [1] 50
dim (flows)
#> [1] 20 50
f <- dodgr_flows_aggregate (graph = graph, from = from, to = to, flows = flows)

The result is simply the input graph with an additional column quantifying the aggregate flows along each edge:

head (f)
geom_numedge_idfrom_idfrom_lonfrom_latto_idto_lonto_latdd_weightedhighwayway_idcomponenttimetime_weightedflow
1133931850076.4749115.3416733931850276.4761215.34173130.000241130.000241path28565950193.60017493.6001740.5815996
1233931850276.4761215.3417333931850076.4749115.34167130.000241130.000241path28565950193.60017493.6001742.3630118
1333931850276.4761215.34173239895802876.4762115.341748.8906228.890622path2856595016.4012486.4012480.5815996
14239895802876.4762115.3417433931850276.4761215.341738.8906228.890622path2856595016.4012486.4012482.3630118
15239895802876.4762115.34174142711607776.4762815.341799.3077369.307736path2856595016.7015706.7015700.5815996
16142711607776.4762815.34179239895802876.4762115.341749.3077369.307736path2856595016.7015706.7015702.3630118

An additional flow aggregation function can be applied in cases where only densities at origin points are known, and movement throughout a graph is dispersive:

f <- dodgr_flows_disperse (graph = graph, from = from, dens = runif (length (from)))

Further detail

For more detail, see the main package vignette, and the second vignette on street networks and time-based routing

Contributors

All contributions to this project are gratefully acknowledged using the allcontributors package following the allcontributors specification. Contributions of any kind are welcome!

Code

Issue Authors

Issue Contributors

Copy Link

Version

Install

install.packages('dodgr')

Monthly Downloads

884

Version

0.4.2

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Mark Padgham

Last Published

March 6th, 2025

Functions in dodgr (0.4.2)

dodgr_flows_si

Aggregate flows throughout a network using a spatial interaction model.
dodgr_dists

Calculate matrix of pair-wise distances between points.
dodgr_fundamental_cycles

Calculate fundamental cycles in a graph.
dodgr_dists_categorical

Cumulative distances along different edge categories
dodgr_full_cycles

Calculate fundamental cycles on a FULL (that is, non-contracted) graph.
dodgr_flows_disperse

Aggregate flows dispersed from each point in a network.
dodgr_flowmap

Create a map of dodgr flows.
dodgr_load_streetnet

Load a street network previously saved with dodgr_save_streetnet.
dodgr_flows_aggregate

Aggregate flows throughout a network.
dodgr_streetnet

Extract a street network in sf-format for a given location.
dodgr_sflines_to_poly

Convert sf LINESTRING objects to POLYGON objects representing all fundamental cycles within the LINESTRING objects.
dodgr_paths

Calculate lists of pair-wise shortest paths between points.
dodgr_save_streetnet

Save a weighted streetnet to a local file
dodgr_sample

Sample a random but connected sub-component of a graph
dodgr_isodists

Calculate isodistance contours from specified points.
dodgr_isochrones

Calculate isochrone contours from specified points.
dodgr_uncontract_graph

Re-expand a contracted graph.
dodgr_insert_vertex

Insert a new node or vertex into a network
dodgr_streetnet_sc

Extract a street network in silicate-format for a given location.
dodgr_times

Calculate matrix of pair-wise travel times between points.
dodgr_isoverts

Calculate isodistance or isochrone contours from specified points.
dodgr_to_sfc

Convert a dodgr graph into an equivalent sf::sfc object.
dodgr_to_igraph

Convert a dodgr graph to an igraph.
dodgr_to_sf

Convert a dodgr graph into an equivalent sf object.
match_points_to_graph

Alias for match_pts_to_graph
match_points_to_verts

Alias for match_pts_to_verts
dodgr_vertices

Extract vertices of graph, including spatial coordinates if included.
igraph_to_dodgr

Convert a igraph network to an equivalent dodgr representation.
dodgr_distances

Calculate matrix of pair-wise distances between points.
hampi

Sample street network from Hampi, India.
estimate_centrality_threshold

Estimate a value for the 'dist_threshold' parameter of the dodgr_centrality function.
match_pts_to_graph

Match spatial points to the edges of a spatial graph.
estimate_centrality_time

Estimate time required for a planned centrality calculation.
dodgr_to_tidygraph

Convert a dodgr graph to an tidygraph.
match_pts_to_verts

Match spatial points to the vertices of a spatial graph.
merge_directed_graph

Merge directed edges into equivalent undirected edges.
weight_streetnet

Weight a street network according to a specified weighting profile.
%>%

Pipe operator
summary.dodgr_dists_categorical

Transform a result from dodgr_dists_categorical to summary statistics
write_dodgr_wt_profile

Write dodgr weighting profiles to local file.
weight_railway

Weight a network for routing along railways.
weighting_profiles

Weighting profiles used to route different modes of transport.
os_roads_bristol

Sample street network from Bristol, U.K.
dodgr_contract_graph

Contract graph to junction vertices only.
dodgr_cache_off

Turn off all dodgr caching in current session.
dodgr_cache_on

Turn on all dodgr caching in current session.
dodgr_centrality

Calculate betweenness centrality for a 'dodgr' network.
dodgr_components

Identify connected components of graph.
dodgr

Distances On Directed GRaphs ("dodgr")
compare_heaps

Compare timings of different sort heaps for a given input graph.
dodgr_deduplicate_graph

Deduplicate edges in a graph
dodgr_dists_nearest

Calculate vector of shortest distances from a series of 'from' points to nearest one of series of 'to' points.
add_nodes_to_graph

Insert new nodes into a graph, breaking edges at point of nearest intersection.
clear_dodgr_cache

Remove cached versions of dodgr graphs.