The SCF provides five implicates per survey wave, each a plausible version
of the population under a specific missing-data model. Analysts conduct the
same statistical procedure on each implicate, producing a set of five
estimates \( Q_1, Q_2, ..., Q_5 \). These are then combined using Rubin’s
Rules, a procedure to combine results across these implicates with an
attempt to account for:
For a scalar quantity \( Q \), the pooled estimate and
total variance are calculated as:
$$ \bar{Q} = \frac{1}{M} \sum Q_m $$
$$ \bar{U} = \frac{1}{M} \sum U_m $$
$$ B = \frac{1}{M - 1} \sum (Q_m - \bar{Q})^2 $$
$$ T = \bar{U} + \left(1 + \frac{1}{M} \right) B $$
Where:
\( M \) is the number of implicates (typically 5 for SCF)
\( Q_m \) is the estimate from implicate \( m \)
\( U_m \) is the sampling variance of \( Q_m \), accounting for replicate weights and design
The total variance \( T \) reflects both within-imputation uncertainty (sampling error)
and between-imputation uncertainty (missing-data imputation).
The standard error of the pooled estimate is \( \sqrt{T} \). Degrees of freedom are
adjusted using the Barnard-Rubin method:
$$ \nu = (M - 1) \left(1 + \frac{\bar{U}}{(1 + \frac{1}{M}) B} \right)^2 $$
The fraction of missing information (FMI) is also reported:
it reflects the proportion of total variance attributable to imputation uncertainty.