Provides a summary of data with the the following columns:
nameName of the column.
typeType of the column, equal to "numerical", "logical", "factor", "categorical", or "NA only".
distinct_valuesCount of distinct values. It ignores NA values. Thus, if a columns only has NAs, then the value of this field will be zero.
minimumMinimum of numerical columns excluding NA values.
medianMedian of numerical columns excluding NA values.
maximumMaximum of numerical columns excluding NA values.
meanMean of numerical variables. It ignores NAs.
sdStandard deviation of numerical variables. It ignores NAs.
na_proportionProportion of NAs.
countTally of values if the column has 5 values at
most. This value (5) can be modified with the parameter
limit2tally.
sample_valuesSample of (different) values in each column.
glance_data(x, limit2tally = 20)A dataframe with named columns.
One of the summaries is a tally of the distinct
values on each column. If there are too many different values
in a column, this summary would be meaningless. This
limit2tally is the limit of distinct values to
tally. If there are more than that it returns
"Too many unique values".
A tibble.
# NOT RUN {
glance_data(iris)
# }
Run the code above in your browser using DataLab