
BayesianFROC (version 0.3.0)

validation.dataset_srsc: Error of estimates with respect to truth

Description

Let us denote a model parameter by \(\theta_0\), a number of images by \(N_I\), and a number of lesions by \(N_L\), which are specified by the user as arguments of the function.

(I) Replicate models for datasets \(D_1,D_2,...,D_k,...,D_K\).

Draw a dataset \(D_k\) from the likelihood (model), namely

\(D_k \sim \text{likelihood}(\cdot|\theta_0)\).

Draw MCMC samples \(\{ \theta_i (D_k)\}\) from the posterior, namely

\( \theta_i \sim \pi(\cdot|D_k)\).

Calculate the posterior mean, namely

\( \bar{\theta}(D_k) := \frac{1}{I}\sum_i \theta_i(D_k) \), where \(I\) denotes the number of MCMC samples.

Calculate the error for \(D_k\),

\(\epsilon_k := \) truth minus the posterior mean estimate of \(D_k\) \( = |\theta_0 - \bar{\theta}(D_k)|\) (or \( = \theta_0 - \bar{\theta}(D_k)\), according to the user-specified absolute.errors).

(II) Calculate the mean of errors,

mean of errors \( \bar{\epsilon}(\theta_0,N_I,N_L) = \frac{1}{K} \sum_k \epsilon_k \).

Running this function, we can see that the error \( \bar{\epsilon}(\theta_0,N_I,N_L)\) decreases monotonically as the given number of images \(N_I\) or the given number of lesions \(N_L\) increases.

The scale of the error can also be seen. Thus this function shows how accurate our estimates are. The scale of the error differs for each component of the model parameters.
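
To make the scheme concrete, the following is a minimal self-contained sketch of steps (I) and (II). It uses a toy Gaussian model and the sample mean as stand-ins for the FROC model and its MCMC posterior mean; the actual computation is done internally by validation.dataset_srsc().

   set.seed(1)
   theta0 <- 0.6                                    # true parameter theta_0
   K      <- 3                                      # number of replicated datasets
   errors <- numeric(K)
   for (k in seq_len(K)) {
     D_k       <- rnorm(100, mean = theta0, sd = 1) # D_k ~ likelihood(.|theta_0), toy stand-in
     theta_bar <- mean(D_k)                         # stand-in for the posterior mean of D_k
     errors[k] <- abs(theta0 - theta_bar)           # epsilon_k (absolute.errors = TRUE)
   }
   mean(errors)                                     # mean of errors, (1/K) * sum_k epsilon_k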

Revised 2019 August 28

Usage

validation.dataset_srsc(
  replicate.datset = 3,
  ModifiedPoisson = FALSE,
  mean.truth = 0.6,
  sd.truth = 5.3,
  z.truth = c(-0.8, 0.7, 2.38),
  NL = 259,
  NI = 57,
  ite = 1111,
  cha = 1,
  summary = TRUE,
  serial.number = 1,
  base_size = 0,
  absolute.errors = TRUE
)

Arguments

replicate.datset

A positive integer indicating how many datasets are replicated.

ModifiedPoisson

Logical, that is TRUE or FALSE.

If ModifiedPoisson = TRUE, then the Poisson rate of false alarms is calculated per lesion, and the model is fitted so that the FROC curve is the expected curve of the points consisting of the pairs of TPF per lesion and FPF per lesion.

Similarly,

if ModifiedPoisson = FALSE, then the Poisson rate of false alarms is calculated per image, and the model is fitted so that the FROC curve is the expected curve of the points consisting of the pairs of TPF per lesion and FPF per image.

For more details of per image and per lesion, see the author's paper. (For details of the models, see the vignettes; they are currently omitted from this package because their size is too large.)

If ModifiedPoisson = TRUE, then the False Positive Fraction (FPF) is defined as follows (\(F_c\) denotes the number of false alarms with confidence level \(c\) )

$$ \frac{F_1+F_2+F_3+F_4+F_5}{N_L}, $$

$$ \frac{F_2+F_3+F_4+F_5}{N_L}, $$

$$ \frac{F_3+F_4+F_5}{N_L}, $$

$$ \frac{F_4+F_5}{N_L}, $$

$$ \frac{F_5}{N_L}, $$

where \(N_L\) is the number of lesions (signals). To emphasize its denominator \(N_L\), we also call it the False Positive Fraction (FPF) per lesion.

On the other hand,

if ModifiedPoisson = FALSE (Default), then False Positive Fraction (FPF) is given by

$$ \frac{F_1+F_2+F_3+F_4+F_5}{N_I}, $$

$$ \frac{F_2+F_3+F_4+F_5}{N_I}, $$

$$ \frac{F_3+F_4+F_5}{N_I}, $$

$$ \frac{F_4+F_5}{N_I}, $$

$$ \frac{F_5}{N_I}, $$

where \(N_I\) is the number of images (trial). To emphasize its denominator \(N_I\), we also call it the False Positive Fraction (FPF) per image.

The model is fitted so that the estimated FROC curve can be regarded as the expected pairs of FPF per image and TPF per lesion (ModifiedPoisson = FALSE)

or as the expected pairs of FPF per lesion and TPF per lesion (ModifiedPoisson = TRUE).

If ModifiedPoisson = TRUE, then the FROC curve means the expected pair of FPF per lesion and TPF.

On the other hand, if ModifiedPoisson = FALSE, then the FROC curve means the expected pair of FPF per image and TPF.

So the data of FPF and TPF change, and thus the fitted model also changes, according to whether ModifiedPoisson = TRUE or FALSE. Traditional FROC analysis uses only the per-image (trial) definition. Since we can divide one image into two or more images, the number of trials is not essential; what matters more is the per-signal definition. So the author also developed the FROC theory to consider FROC analysis per signal. One can see that the FROC curve is rigid with respect to changes in the number of images, so it does not matter whether ModifiedPoisson = TRUE or FALSE. This rigidity of the curves means that the number of images is a redundant parameter for the FROC trial, and thus the author tries to exclude it.
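
For instance, the following short sketch computes both versions of the FPF from made-up false alarm counts (the counts below are hypothetical and only illustrate the formulas above; NL and NI are the defaults of this function).

   Fc <- c(2, 4, 7, 12, 25)        # hypothetical false alarm counts F_1, ..., F_5
   NL <- 259                       # number of lesions
   NI <- 57                        # number of images
   cumF <- rev(cumsum(rev(Fc)))    # F_c + F_{c+1} + ... + F_5 for c = 1, ..., 5
   cumF / NL                       # FPF per lesion (ModifiedPoisson = TRUE)
   cumF / NI                       # FPF per image  (ModifiedPoisson = FALSE, default)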

Revised 2019 Dec 8 Revised 2019 Nov 25 Revised 2019 August 28

mean.truth

This is a parameter of the latent Gaussian assumption for the noise distribution.

sd.truth

This is a parameter of the latent Gaussian assumption for the noise distribution.

z.truth

This is a parameter of the latent Gaussian assumption for the noise distribution.

NL

Number of Lesions.

NI

Number of Images.

ite

A variable to be passed to the function rstan::sampling() of rstan, in which it is named iter. A positive integer representing the number of samples synthesized by the Hamiltonian Monte Carlo method. Default = 1111.

cha

A variable to be passed to the function rstan::sampling() of rstan, in which it is named chains. A positive integer representing the number of chains generated by the Hamiltonian Monte Carlo method. Default = 1.

summary

Logical: TRUE or FALSE. Whether to print a verbose summary. If TRUE, a verbose summary is printed in the R console. If FALSE, the output is minimal. I regret that this variable should have been named verbose.

serial.number

A positive integer or character. This exists for programming purposes; the author uses it to print the serial number of a validation. It is used in the validation function.

base_size

A numeric specifying the size of objects; this is for the package developer.

absolute.errors

A logical specifying whether the mean of errors is defined by

TRUE

\( \bar{\epsilon}(\theta_0,N_I,N_L)\)= \( \frac{1}{K} \sum | \epsilon_k | \)

FALSE

\( \bar{\epsilon}(\theta_0,N_I,N_L)\)= \( \frac{1}{K} \sum \epsilon_k \)
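
The difference matters because signed errors can cancel; a tiny illustration with made-up errors \(\epsilon_k\) (not output of this package):

   eps <- c(0.3, -0.2, 0.1)   # hypothetical errors epsilon_k
   mean(abs(eps))             # absolute.errors = TRUE,  gives 0.2
   mean(eps)                  # absolute.errors = FALSE, gives about 0.067; signs cancel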

Value

The return value is a list containing:

Stanfit objects

one for each replicated dataset

Errors

EAPs minus true values; in the above notation, \( \bar{\epsilon}(\theta_0,N_I,N_L)\)

Variances of estimators

The variance of the posterior means over all replicated datasets
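
For example, assuming the list component names used in the Examples below, the returned object can be inspected as follows (a minimal sketch):

   a <- validation.dataset_srsc(replicate.datset = 2, ite = 111)
   a$fit[[1]]        # Stanfit object fitted to the first replicated dataset
   a$error.of.AUC    # errors of the AUC estimates over the replicated datasets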

Examples

# NOT RUN {
#===========================    The first example  ======================================


#   It is sufficient to run the function with the default arguments.

   datasets <- validation.dataset_srsc()



#=============================  The second example ======================================

#   If the user is not familiar with the values of the thresholds, then
#   it would be better to use actual estimated values
#   as an example of the true parameters. In the following,
#   I explain this.

# First, to get estimates, we run the following:

  fit <- fit_Bayesian_FROC(dataList.Chakra.1,ite = 1111,summary =FALSE,cha=3)






#  Secondly, extract the expected a posteriori estimates (EAPs) from the object fit


  z <- rstan::get_posterior_mean(fit,par=c("z"))[,"mean-all chains"]





#  Thirdly, we use this z as the true values.


   datasets <- validation.dataset_srsc(z.truth = z)



#========================================================================================
#            1)             extract replicated fitted model object
#========================================================================================


    # Replicates models

    a <- validation.dataset_srsc(replicate.datset = 3,ite = 111)



    # Check convergence. In the above, the number of MCMC iterations is 111, which is
    # too small to obtain a converged MCMC chain, and thus the following is an example
    # of a non-convergent model in the R hat criterion.

    ConfirmConvergence( a$fit[[3]])


    # Check the trace plot to confirm whether the MCMC chains converge or not.

    stan_trace( a$fit[[3]],pars = "A")


   # Check p value
    ppp( a$fit[[3]])



    # In the above example, the posterior predictive p value is large enough,
    # but the model did not converge in the R hat criterion, which suggests
    # that the model does not fit the data. However, the p value says that
    # we cannot reject the null hypothesis that the model does fit.
    # The author thinks this contradiction arises because the
    # number of MCMC iterations is too small, which leads to an incorrect
    # Monte Carlo integral for the p value. Thus the p value is not correct.
    # Calculation of the p value relies on the law of large numbers, and thus,
    # to obtain a reliable posterior predictive p value, we need sufficiently
    # large MCMC samples. 2019 August 29






                                          # Revised in 2019 August 29





#========================================================================================
#            2)            Histogram of errors of posterior means for replicated datasets
#========================================================================================

  a<-   validation.dataset_srsc(replicate.datset = 100)
  hist(a$error.of.AUC,breaks = 111)
  hist(a$error.of.AUC,breaks = 30)






#========================================================================================
#                             absolute.errors = FALSE generates negative biases
#========================================================================================


 validation.dataset_srsc(absolute.errors = FALSE)



#========================================================================================
#                             absolute.errors = TRUE does not generate negative biases
#========================================================================================


 validation.dataset_srsc(absolute.errors = TRUE)


# }
