The beeswarm geom is a convenient means to offset points within categories to reduce overplotting. Uses the beeswarm package.
geom_beeswarm(
mapping = NULL,
data = NULL,
stat = "identity",
...,
method = "swarm",
cex = 1,
side = 0L,
priority = "ascending",
fast = TRUE,
dodge.width = NULL,
corral = "none",
corral.width = 0.9,
preserve.data.axis = FALSE,
groupOnX = NULL,
orientation = NULL,
beeswarmArgs = list(),
na.rm = FALSE,
show.legend = NA,
inherit.aes = TRUE
)Set of aesthetic mappings created by aes or
aes_. If specified and inherit.aes = TRUE (the
default), is combined with the default mapping at the top level of the
plot. You only need to supply mapping if there isn't a mapping
defined for the plot.
A data frame. If specified, overrides the default data frame defined at the top level of the plot.
The statistical transformation to use on the data for this layer, as a string.
other arguments passed to ggplot2::layer params argument.
Method for arranging points.
Options are "swarm" (default), "compactswarm", "square", "hex", and "center".
See Details below.
Scaling for adjusting point spacing (see beeswarm::swarmx()).
Values between 1 (default) and 3 tend to work best.
Direction to perform jittering: 0: both directions; 1: to the right or upwards; -1: to the left or downwards.
Method used to perform point layout.
Options are "ascending" (default), "descending", "density", "random", or "none".
See Details below.
If TRUE (default), use compiled version of swarm algorithm.
This option is ignored for all methods except "swarm" and "compactswarm".
Amount by which points from different aesthetic groups will be dodged. This requires that one of the aesthetics is a factor.
Method used to adjust points that would be placed too wide
horizontally. Options are "none" (default), "gutter", "wrap", "random", and "omit".
See Details below.
Width of the corral, if not "none". Default is 0.9.
If TRUE, prevent method from modifying the data axis (default FALSE). See Details for more info.
The orientation (i.e., which axis to group on) is inferred from the data.
This can be overridden by setting orientation to either "x" or "y".
If FALSE, the default, missing values are removed with a warning.
If TRUE, missing values are silently removed.
logical. Should this layer be included in the legends?
NA, the default, includes if any aesthetics are mapped.
FALSE never includes, and TRUE always includes.
If FALSE, overrides the default aesthetics,
rather than combining with them. This is most useful for helper functions
that define both data and aesthetics and shouldn't inherit behaviour from
the default plot specification, e.g. borders.
method: specifies the algorithm used to avoid overlapping points. The
default "swarm" method places points in increasing order. If a point would
overlap with an existing point, it is shifted sideways (along the group axis)
by a minimal amount sufficient to avoid overlap.
While the "swarm" method places points in a predetermined
order, the "compactswarm" method uses a greedy strategy to determine which
point will be placed next. This often leads to a more tightly-packed layout.
The strategy is very simple: on each iteration, a point that can be placed as
close as possible to the non-data axis is chosen and placed. If there are two
or more equally good points, priority is used to break ties.
The other 3 methods first discretise the values along the data axis, in order
to create more efficient packing. The "square" method places points on a
square grid, whereas "hex" uses a hexagonal grid. "centre"/"center"
uses a square grid to produce a symmetric swarm. The number of break points
for discretisation is determined by a combination of the available plotting
area and the cex argument.
priority: controls the order in which points are placed, which generally
has a noticeable effect on the plot appearance. "ascending" gives the
'traditional' beeswarm plot. "descending" is the opposite. "density"
prioritizes points with higher local density. "random" places points in a
random order. "none" places points in the order provided.
corral: By default, swarms from different groups are not prevented from
overlapping, i.e. "corral = "none". Thus, datasets that are very large or
unevenly distributed may produce ugly overlapping beeswarms. To control
runaway points one can use the following methods. "gutter" collects runaway
points along the boundary between groups. "wrap" implement periodic boundaries.
"random" places runaway points randomly in the region. "omit" omits runaway
points.
#' preserve.data.axis: In some cases, the underlying method may attempt to change the position of points along the ungrouped, or data, axis, in order to keep them from overlapping. To prevent this, set this option to TRUE.
geom_quasirandom() an alternative method,
beeswarm::swarmx() how spacing is determined,
ggplot2::geom_point() for regular, unjittered points,
ggplot2::geom_jitter() for jittered points,
ggplot2::geom_boxplot() for another way of looking at the conditional
distribution of a variable
# Generate fake data
distro <- data.frame(
'variable'=rep(c('runif','rnorm'),each=100),
'value'=c(runif(100, min=-3, max=3), rnorm(100))
)
ggplot2::ggplot(distro,ggplot2::aes(variable, value)) +
geom_beeswarm(priority='density',size=2.5)
Run the code above in your browser using DataLab