Intraclass correlation (ICC) as a measure of interrater agreement of log probabilities (treated as "ratings"/rows) among BERT language models (treated as "raters"/columns), with both rows and columns modeled as random effects ("two-way" random effects).
ICC_models(data, type = "agreement", unit = "average")
A data.table of ICC.
data: Raw data returned from FMAT_run.
type: Interrater "agreement" (default) or "consistency".
unit: Reliability of "average" scores (default) or "single" scores.
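
To illustrate what the type and unit options mean, here is a minimal base R sketch of a two-way random-effects ICC computed from a plain numeric matrix (rows as ratings, columns as raters). It is not the FMAT implementation: the helper name icc_two_way and the toy matrix are hypothetical, and the formulas are the standard mean-square expressions for two-way ICCs, assumed here to correspond to the options above.

```r
# Hypothetical helper (not part of FMAT): two-way random-effects ICC
# from a complete numeric matrix with n rows (ratings) and k columns (raters).
icc_two_way <- function(x,
                        type = c("agreement", "consistency"),
                        unit = c("average", "single")) {
  type <- match.arg(type)
  unit <- match.arg(unit)
  x <- as.matrix(x)
  n <- nrow(x)
  k <- ncol(x)
  grand <- mean(x)

  # Two-way ANOVA decomposition without replication
  ss_rows <- k * sum((rowMeans(x) - grand)^2)
  ss_cols <- n * sum((colMeans(x) - grand)^2)
  ss_err  <- sum((x - grand)^2) - ss_rows - ss_cols

  msr <- ss_rows / (n - 1)               # mean square for rows
  msc <- ss_cols / (k - 1)               # mean square for columns
  mse <- ss_err / ((n - 1) * (k - 1))    # residual mean square

  # "agreement" penalizes systematic differences between raters;
  # "consistency" ignores them.
  rater_term <- if (type == "agreement") (msc - mse) / n else 0

  if (unit == "single") {
    (msr - mse) / (msr + (k - 1) * mse + k * rater_term)
  } else {
    (msr - mse) / (msr + rater_term)
  }
}

# Toy data (made up): three "models" that differ only by a constant offset
ratings <- matrix(c(1, 2, 3, 4, 5,
                    2, 3, 4, 5, 6,
                    3, 4, 5, 6, 7), ncol = 3)

icc_two_way(ratings, type = "consistency", unit = "single")
icc_two_way(ratings, type = "agreement",   unit = "single")
icc_two_way(ratings, type = "agreement",   unit = "average")
```

In this toy case the columns rank the rows identically, so consistency is perfect, while agreement is lower because the columns differ in level. Averaging across raters yields a higher value than the single-rater estimate.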