Given a multitoken category (e.g., named entity ids), this function finds the most frequently occuring string in this category and adds it as a label for the category
add_multitoken_label(
tc,
colloc_id,
feature = "token",
new_feature = sprintf("%s_l", colloc_id),
pref_subset = NULL
)
a tcorpus object
the data column containing the unique id for multitoken tokens
the name of the feature column
the name of the new feature column
Optionally, a subset call, to specify a subset that has priority for finding the most frequently occuring string