run_rf

This function trains a random forest model (ranger) on the
specified training dataset and makes predictions on the test dataset in a
counterfactual scenario. The model uses meteorological variables and temporal features.

Analyzes the impact of external conditions on air quality using counterfactual approaches, featuring methods for data preparation, modeling, and visualization.

Imke Voss

ubair

Effects of External Conditions on Air Quality

Raphael Franke

run_rf function

<dl><dt>train</dt>
<dd>Dataframe of train data as returned by the <code>split_data_counterfactual()</code>
function.</dd>
<dt>test</dt>
<dd>Dataframe of test data as returned by the <code>split_data_counterfactual()</code>
function.</dd>
<dt>model_params</dt>
<dd>list of hyperparameters to use in ranger call. See <code>ranger:ranger()</code> for options.</dd>
<dt>alpha</dt>
<dd>Confidence level of the prediction interval between 0 and 1.</dd>
<dt>calc_shaps</dt>
<dd>Boolean value. If TRUE, calculate SHAP values for the
method used and format them so they can be visualised with <code>shapviz:sv_importance()</code> and
<code>shapviz:sv_dependence()</code>.
The SHAP values are generated for a subset (or all, depending on the size of the dataset) of the
test data.</dd></dl>

Arguments

Run random forest model with ranger — run_rf

<dl>

<dt>train</dt>
<dd>Dataframe of train data as returned by the <code>split_data_counterfactual()</code>
function.</dd>


<dt>test</dt>
<dd>Dataframe of test data as returned by the <code>split_data_counterfactual()</code>
function.</dd>


<dt>model_params</dt>
<dd>list of hyperparameters to use in ranger call. See <code>ranger:ranger()</code> for options.</dd>


<dt>alpha</dt>
<dd>Confidence level of the prediction interval between 0 and 1.</dd>


<dt>calc_shaps</dt>
<dd>Boolean value. If TRUE, calculate SHAP values for the
method used and format them so they can be visualised with <code>shapviz:sv_importance()</code> and
<code>shapviz:sv_dependence()</code>.
The SHAP values are generated for a subset (or all, depending on the size of the dataset) of the
test data.</dd>

</dl>

run_rf: Run random forest model with ranger

Description

Usage

Value

Arguments

Details