Helper function for ggstatsplot::ggbarstats to apply this
function across multiple levels of a given factor and combining the
resulting plots using ggstatsplot::combine_plots.
grouped_ggbarstats(
data,
main,
condition,
counts = NULL,
grouping.var,
title.prefix = NULL,
output = "plot",
x = NULL,
y = NULL,
...,
plotgrid.args = list(),
title.text = NULL,
title.args = list(size = 16, fontface = "bold"),
caption.text = NULL,
caption.args = list(size = 10),
sub.text = NULL,
sub.args = list(size = 12)
)for use with formula, a data frame containing all the
data
The variable to use as the rows in the contingency table.
The variable to use as the columns in the contingency
table. Default is NULL. If NULL, one-sample proportion test (a goodness
of fit test) will be run for the x variable. Otherwise an appropriate
association test will be run. This argument can not be NULL for
ggbarstats function.
A string naming a variable in data containing counts, or NULL
if each row represents a single observation (Default).
A single grouping variable (can be entered either as a
bare name x or as a string "x").
Character string specifying the prefix text for the fixed
plot title (name of each factor level) (Default: NULL). If NULL, the
variable name entered for grouping.var will be used.
Can either be "null" (or "caption" or "H0" or "h0"),
which will return expression with evidence in favor of the null hypothesis,
or "alternative" (or "title" or "H1" or "h1"), which will return
expression with evidence in favor of the alternative hypothesis, or
"results", which will return a dataframe with results all the details).
The variable to use as the rows in the contingency table.
The variable to use as the columns in the contingency
table. Default is NULL. If NULL, one-sample proportion test (a goodness
of fit test) will be run for the x variable. Otherwise an appropriate
association test will be run. This argument can not be NULL for
ggbarstats function.
Arguments passed on to ggbarstats
labels.legendA character vector with custom labels for levels of
the x variable displayed in the legend.
xlabCustom text for the x axis label (Default: NULL, which
will cause the x axis label to be the x variable).
ylabCustom text for the y axis label (Default: NULL).
proportion.testDecides whether proportion test for main variable is
to be carried out for each level of y (Default: TRUE).
labelCharacter decides what information needs to be
displayed on the label in each pie slice. Possible options are
"percentage" (default), "counts", "both".
sample.size.labelLogical that decides whether sample size information
should be displayed for each level of the grouping variable y
(Default: TRUE).
ratioA vector of proportions: the expected proportions for the
proportion test (should sum to 1). Default is NULL, which means the null
is equal theoretical proportions across the levels of the nominal variable.
This means if there are two levels this will be ratio = c(0.5,0.5) or if
there are four levels this will be ratio = c(0.25,0.25,0.25,0.25), etc.
pairedLogical indicating whether data came from a within-subjects or
repeated measures design study (Default: FALSE). If TRUE, McNemar's
test subtitle will be returned. If FALSE, Pearson's chi-square test will
be returned.
results.subtitleDecides whether the results of statistical tests are
to be displayed as a subtitle (Default: TRUE). If set to FALSE, only
the plot will be returned.
perc.kNumeric that decides number of decimal places for percentage
labels (Default: 0).
label.argsAdditional aesthetic arguments that will be passed to
geom_label.
bf.messageLogical that decides whether to display a caption with
results from Bayes Factor test in favor of the null hypothesis (default:
FALSE).
sampling.planCharacter describing the sampling plan. Possible options
are "indepMulti" (independent multinomial; default), "poisson",
"jointMulti" (joint multinomial), "hypergeom" (hypergeometric). For
more, see ?BayesFactor::contingencyTableBF().
fixed.marginFor the independent multinomial sampling plan, which
margin is fixed ("rows" or "cols"). Defaults to "rows".
prior.concentrationSpecifies the prior concentration parameter, set
to 1 by default. It indexes the expected deviation from the null
hypothesis under the alternative, and corresponds to Gunel and Dickey's
(1974) "a" parameter.
subtitleThe text for the plot subtitle. Will work only if
results.subtitle = FALSE.
captionThe text for the plot caption.
conf.levelScalar between 0 and 1. If unspecified, the defaults return
95% lower and upper confidence intervals (0.95).
nbootNumber of bootstrap samples for computing confidence interval
for the effect size (Default: 100).
legend.titleTitle text for the legend.
kNumber of digits after decimal point (should be an integer)
(Default: k = 2).
ggthemeA function, ggplot2 theme name. Default value is
ggplot2::theme_bw(). Any of the ggplot2 themes, or themes from
extension packages are allowed (e.g., ggthemes::theme_fivethirtyeight(),
hrbrthemes::theme_ipsum_ps(), etc.).
ggstatsplot.layerLogical that decides whether theme_ggstatsplot
theme elements are to be displayed along with the selected ggtheme
(Default: TRUE). theme_ggstatsplot is an opinionated theme layer that
override some aspects of the selected ggtheme.
packageName of package from which the palette is desired as string or symbol.
paletteIf a character string (e.g., "Set1"), will use that named
palette. If a number, will index into the list of palettes of appropriate
type. Default palette is "Dark2".
directionEither 1 or -1. If -1 the palette will be reversed.
ggplot.componentA ggplot component to be added to the plot prepared
by ggstatsplot. This argument is primarily helpful for grouped_ variant
of the current function. Default is NULL. The argument should be entered
as a function.
messagesDecides whether messages references, notes, and warnings are
to be displayed (Default: TRUE).
A list of additional arguments to cowplot::plot_grid.
String or plotmath expression to be drawn as title for the combined plot.
A list of additional arguments
provided to title, caption and sub, resp.
String or plotmath expression to be drawn as the caption for the combined plot.
A list of additional arguments
provided to title, caption and sub, resp.
The label with which the combined plot should be annotated. Can be a plotmath expression.
A list of additional arguments
provided to title, caption and sub, resp.
Unlike a number of statistical softwares, ggstatsplot doesn't
provide the option for Yates' correction for the Pearson's chi-squared
statistic. This is due to compelling amount of Monte-Carlo simulation
research which suggests that the Yates' correction is overly conservative,
even in small sample sizes. As such it is recommended that it should not
ever be applied in practice (Camilli & Hopkins, 1978, 1979; Feinberg, 1980;
Larntz, 1978; Thompson, 1988).
For more about how the effect size measures and their confidence intervals
are computed, see ?rcompanion::cohenG, ?rcompanion::cramerV, and
?rcompanion::cramerVFit.
# NOT RUN {
# for reproducibility
set.seed(123)
# let's create a smaller dataframe
diamonds_short <- ggplot2::diamonds %>%
dplyr::filter(.data = ., cut %in% c("Very Good", "Ideal")) %>%
dplyr::filter(.data = ., clarity %in% c("SI1", "SI2", "VS1", "VS2")) %>%
dplyr::sample_frac(tbl = ., size = 0.05)
# plot
ggstatsplot::grouped_ggbarstats(
data = diamonds_short,
x = color,
y = clarity,
grouping.var = cut,
title.prefix = "Quality",
bar.label = "both",
plotgrid.args = list(nrow = 2)
)
# }
Run the code above in your browser using DataLab