distr

data

Variables. Main (target variable) and secondary (values
variable) to group by (if needed).

Integer. 1 for both plots, 2 for counter plot only, 3 for
percentages plot only.

type

Boolean. Show a reference line if levels = 2? Quite useful
when data is unbalanced (not 50/50) because a reference line is drawn.

note

Integer. Filter and plot the most n frequent for categorical values.

Integer. Number of splits for numerical values.

breaks

Boolean. Ignore <code>NA</code>s if needed.

na.rm

Character. Force class on the values data. Choose between 'none',
'character', 'numeric', 'date'

force

Integer. Trim labels until the nth character for categorical values
(applies for both, target and values)

trim

Boolean. Use <code>cleanText()</code> for categorical values (applies
for both, target and values)

clean

Boolean. Do you wish to sort by alphabetical order?

Boolean. Use custom colours function?

custom_colours

Boolean. Return a plot? Otherwise, a table with results

plot

chords

Boolean. Save the output plot in our working directory

save

Character. Into which subdirectory do you wish to save the plot to?

subdir

Compare the distribution of a target variable vs another variable. This
function automatically splits into quantiles for numerical variables.
Custom and tidyverse friendly.

Auxiliary package for better/faster analytics, visualization, data mining, and machine learning
tasks. With a wide variety of family functions, like Machine Learning, Data Wrangling,
Exploratory, and Scrapper, it helps the analyst or data scientist to get quick and robust
results, without the need of repetitive coding or extensive programming skills.

State of Data and AI Literacy Report 2025

distr: Compare Variables with their Distributions

Description

Usage

Arguments

Value

See Also

Examples