Helper function that computes various association metrics for bigrams based on their probability distributions. Supports PMI (Pointwise Mutual Information), Dice's Coefficient, and G-score calculations.
calculate_metrics(bigram_probs, association)
A data frame containing the original probability columns plus requested association metrics:
pmi: Pointwise Mutual Information
dice_coeff: Dice's Coefficient
g_score: G-score
A data frame containing bigram probability data with columns:
p_xy Joint probability of bigram
p_x Marginal probability of first token
p_y Marginal probability of second token
Character vector specifying which metrics to calculate