This function sanitizes dictionary terms to ensure they're valid for entity extraction.
sanitize_dictionary(
  dictionary,
  term_column = "term",
  type_column = "type",
  validate_types = TRUE,
  verbose = TRUE
)

Value: A data frame with sanitized terms.
Arguments:
dictionary: A data frame containing dictionary terms.
term_column: The name of the column containing the terms to sanitize.
type_column: The name of the column containing entity types.
validate_types: Logical. If TRUE, validates terms against their claimed type.
verbose: Logical. If TRUE, prints information about the filtering process.
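
The following is a minimal usage sketch, not an example from the package itself. The dictionary contents, the column names "name" and "category", and the type labels are all hypothetical; they are chosen only to show how term_column and type_column point the function at non-default columns. Which rows are kept or dropped depends on the function's own rules, so no expected output is shown.

```r
# Hypothetical dictionary: column names and type labels are assumptions
# made for illustration, not values required by the function.
dictionary <- data.frame(
  name = c("Harvard University", "Jane Doe", "Boston"),
  category = c("organization", "person", "location"),
  stringsAsFactors = FALSE
)

# Call the function only if it is available in the current session,
# so this sketch runs without error when the package is not loaded.
if (exists("sanitize_dictionary", mode = "function")) {
  clean <- sanitize_dictionary(
    dictionary,
    term_column = "name",      # column holding the terms
    type_column = "category",  # column holding the entity types
    validate_types = TRUE,     # check terms against their claimed type
    verbose = FALSE            # suppress the filtering report
  )
  str(clean)
}
```

Setting verbose = FALSE keeps the call quiet in scripts; leaving it at the default TRUE prints information about the filtering process, which is useful when checking a new dictionary for the first time.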