broom (version 0.4.1)

data.frame_tidiers: Tidiers for data.frame objects

Description

These perform tidy summaries of data.frame objects. tidy produces summary statistics about each column, while glance simply reports the number of rows and columns. Note that augment.data.frame will throw an error.

Usage

"tidy"(x, ...)
"augment"(x, data, ...)
"glance"(x, ...)

Arguments

x
A data.frame
...
extra arguments: for tidy, these are passed on to describe from psych package
data
data, not used

Value

tidy.data.frame produces a data frame with one row per original column, containing summary statistics of each:
column
name of original column
n
Number of valid (non-NA) values
mean
mean
sd
standard deviation
median
median
trimmed
trimmed mean, with trim defaulting to .1
mad
median absolute deviation (from the median)
min
minimum value
max
maximum value
range
range
skew
skew
kurtosis
kurtosis
se
standard error
glance returns a one-row data.frame with
nrow
number of rows
ncol
number of columns
complete.obs
number of rows that have no missing values
na.fraction
fraction of values across all rows and columns that are missing

Details

The tidy method calls the psych method describe directly to produce its per-columns summary statistics.

See Also

describe

Examples

Run this code

td <- tidy(mtcars)
td

glance(mtcars)

library(ggplot2)
# compare mean and standard deviation
ggplot(td, aes(mean, sd)) + geom_point() +
     geom_text(aes(label = column), hjust = 1, vjust = 1) +
     scale_x_log10() + scale_y_log10() + geom_abline()

Run the code above in your browser using DataLab