get_data_space

dataframe, with not more then two columns one of them numeric
containing importance measures and one character or factor column containing
corresponding variable names as found in training data.

integer, number of top important variables to select. For
plotting more than 4 will result in two many flows and the alluvial plot
will not be very readable, Default: 4

degree

integer, number of bins for numeric variables, and maximum number
of levels for factor variables, increasing this number might result in too
many flows, Default: 5

bins

integer, maximum number of levels per factor variable, Default: 10

max_levels

calculates a dataspace based on the modelling dataframe and the
 importance of the explanatory variables. It only considers the most
 important variables as defined by the degree parameter. It selects a number
 (defined by bins) of sensible single values spread over the range of the
 numeric variables and creates all possible value combinations among the most
 important variables. The values of the remaining variables are set to
 mode(factors) or median(numerics).

Alluvial plots are similar to sankey diagrams and visualise categorical data
over multiple dimensions as flows. (Rosvall M, Bergstrom CT (2010) Mapping Change in
Large Networks. PLoS ONE 5(1): e8694. <doi:10.1371/journal.pone.0008694>
Their graphical grammar however is a bit more complex then that of a regular x/y
plots. The 'ggalluvial' package made a great job of translating that grammar into
'ggplot2' syntax and gives you many options to tweak the appearance of an alluvial
plot, however there still remains a multi-layered complexity that makes it difficult
to use 'ggalluvial' for explorative data analysis. 'easyalluvial' provides a simple
interface to this package that allows you to produce a decent alluvial plot from any
dataframe in either long or wide format from a single line of code while also handling
continuous data. It is meant to allow a quick visualisation of entire dataframes
with a focus on different colouring options that can make alluvial plots a great
tool for data exploration.

get_data_space: calculate data space

Description

Usage

Arguments

Value

Details

See Also

Examples