A convenience wrapper for taxa_table() + stats_table().
taxa_stats(
biom,
regr = NULL,
stat.by = NULL,
rank = -1,
taxa = 6,
lineage = FALSE,
unc = "singly",
other = FALSE,
split.by = NULL,
transform = "none",
test = "emmeans",
fit = "gam",
at = NULL,
level = 0.95,
alt = "!=",
mu = 0,
p.adj = "fdr"
)A tibble data.frame with fields from the table below. This tibble
object provides the $code operator to print the R code used to generate
the statistics.
| Field | Description |
| .mean | Estimated marginal mean. See emmeans::emmeans(). |
| .mean.diff | Difference in means. |
| .slope | Trendline slope. See emmeans::emtrends(). |
| .slope.diff | Difference in slopes. |
| .h1 | Alternate hypothesis. |
| .p.val | Probability that null hypothesis is correct. |
| .adj.p | .p.val after adjusting for multiple comparisons. |
| .effect.size | Effect size. See emmeans::eff_size(). |
| .lower | Confidence interval lower bound. |
| .upper | Confidence interval upper bound. |
| .se | Standard error. |
| .n | Number of samples. |
| .df | Degrees of freedom. |
| .stat | Wilcoxon or Kruskal-Wallis rank sum statistic. |
| .t.ratio | .mean / .se |
| .r.sqr | Percent of variation explained by the model. |
| .adj.r | .r.sqr, taking degrees of freedom into account. |
| .aic | Akaike Information Criterion (predictive models). |
| .bic | Bayesian Information Criterion (descriptive models). |
| .loglik | Log-likelihood goodness-of-fit score. |
| .fit.p | P-value for observing this fit by chance. |
An rbiom object, such as from as_rbiom().
Any value accepted by as_rbiom() can also be given here.
Dataset field with the x-axis (independent; predictive)
values. Must be numeric. Default: NULL
Dataset field with the statistical groups. Must be
categorical. Default: NULL
What rank(s) of taxa to display. E.g. "Phylum",
"Genus", ".otu", etc. An integer vector can also be
given, where 1 is the highest rank, 2 is the second
highest, -1 is the lowest rank, -2 is the second
lowest, and 0 is the OTU "rank". Run biom$ranks to
see all options for a given rbiom object. Default: -1.
Which taxa to display. An integer value will show the top n
most abundant taxa. A value 0 <= n < 1 will show any taxa with that
mean abundance or greater (e.g. 0.1 implies >= 10%). A
character vector of taxa names will show only those named taxa.
Default: 6.
Include all ranks in the name of the taxa. For instance,
setting to TRUE will produce
Bacteria; Actinobacteria; Coriobacteriia; Coriobacteriales.
Otherwise the taxa name will simply be Coriobacteriales. You
want to set this to TRUE when unc = "asis" and you have taxa
names (such as Incertae_Sedis) that map to multiple higher
level ranks. Default: FALSE
How to handle unclassified, uncultured, and similarly ambiguous taxa names. Options are:
"singly" - Replaces them with the OTU name.
"grouped" - Replaces them with a higher rank's name.
"drop" - Excludes them from the result.
"asis" - To not check/modify any taxa names.
Abbreviations are allowed. Default: "singly"
Sum all non-itemized taxa into an "Other" taxa. When
FALSE, only returns taxa matched by the taxa
argument. Specifying TRUE adds "Other" to the returned set.
A string can also be given to imply TRUE, but with that
value as the name to use instead of "Other".
Default: FALSE
Dataset field(s) that the data should be split by prior to
any calculations. Must be categorical. Default: NULL
Transformation to apply. Options are:
c("none", "rank", "log", "log1p", "sqrt", "percent"). "rank" is
useful for correcting for non-normally distributions before applying
regression statistics. Default: "none"
Method for computing p-values: 'wilcox', 'kruskal',
'emmeans', or 'emtrends'. Default: 'emmeans'
How to fit the trendline. 'lm', 'log', or 'gam'.
Default: 'gam'
Position(s) along the x-axis where the means or slopes should be
evaluated. Default: NULL, which samples 100 evenly spaced positions
and selects the position where the p-value is most significant.
The confidence level for calculating a confidence interval.
Default: 0.95
Alternative hypothesis direction. Options are '!='
(two-sided; not equal to mu), '<' (less than mu), or '>'
(greater than mu). Default: '!='
Reference value to test against. Default: 0
Method to use for multiple comparisons adjustment of
p-values. Run p.adjust.methods for a list of available
options. Default: "fdr"
Other taxa_abundance:
sample_sums(),
taxa_boxplot(),
taxa_clusters(),
taxa_corrplot(),
taxa_heatmap(),
taxa_stacked(),
taxa_sums(),
taxa_table()
Other stats_tables:
adiv_stats(),
bdiv_stats(),
distmat_stats(),
stats_table()
library(rbiom)
biom <- rarefy(hmp50)
taxa_stats(biom, stat.by = "Body Site", rank = "Family")[,1:6]
Run the code above in your browser using DataLab