This function compares two models based on the subset of forecasts for which
both models have made a prediction. It gets called from
pairwise_comparison_one_group, which handles the comparison of multiple models
on a single set of forecasts (i.e. there are no subsets of forecasts to be
distinguished). pairwise_comparison_one_group in turn gets called from
pairwise_comparison, which can handle pairwise comparisons for a set of
forecasts with multiple subsets, e.g. pairwise comparisons for one set of
forecasts, done separately for two different forecast targets.
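To show where compare_two_models sits in that call chain, here is a minimal
sketch of the typical entry point. forecast_data is a placeholder data set, and
the metric name "interval_score" as well as the exact arguments of
pairwise_comparison are assumptions that may differ between package versions:

library(scoringutils)

# forecast_data is a hypothetical data.frame in the input format expected by
# eval_forecasts; the resulting scores are unsummarised (one row per forecast)
scores <- eval_forecasts(forecast_data)

# high-level entry point: handles each subset (e.g. each forecast target)
# separately, calling pairwise_comparison_one_group per subset, which in turn
# calls compare_two_models for every pair of models
results <- pairwise_comparison(scores, metric = "interval_score")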
compare_two_models(scores, name_model1, name_model2, metric, test_options, by)
scores: A data.frame of unsummarised scores as produced by eval_forecasts.

name_model1: character, name of the first model.

name_model2: character, name of the model to compare against.

metric: A character vector of length one with the metric to do the comparison
on.

test_options: list with options to pass down to compare_two_models.
To change only one of the default options, just pass a list as input with
the name of the argument you want to change. All elements not included in the
list will be set to the default (so passing an empty list would result in the
default options). See the sketch after this argument list for an example.
by: character vector of columns to group scoring by. This should be the
lowest level of grouping possible, i.e. the unit of the individual
observation. This is important as many functions work on individual
observations. If you want a different level of aggregation, you should use
summarise_by to aggregate the individual scores.
Also note that the PIT will be computed using summarise_by instead of by.
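A minimal sketch of a direct call, using hypothetical model and column names;
the option name n_permutation inside test_options is illustrative only and not
taken from this page:

compare_two_models(
  scores = scores,                           # unsummarised scores from eval_forecasts
  name_model1 = "model_a",
  name_model2 = "model_b",
  metric = "interval_score",                 # assumed metric name
  test_options = list(n_permutation = 999),  # change one option, the rest keep their defaults
  by = c("target", "location", "forecast_date")  # illustrative unit of a single observation
)

Passing test_options = list() would leave all test options at their defaults,
as described above.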