build_corpus: Calculate word corpus for weighted Jaccard matching
Description
Calculates a word-frequency corpus from two vectors of names, for use in weighted Jaccard matching.
Usage
build_corpus(namelist1, namelist2)
Value
A data.table with columns for frequency, inverse frequency, and log inverse frequency for each word that appears in the two name vectors.
Arguments
- namelist1
character vector of names from dataset 1
- namelist2
character vector of names from dataset 2
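Examples
The sketch below is a rough base-R illustration of the kind of table described under Value, not the package's implementation. The helper `sketch_corpus`, its column names, the whitespace tokenization, and the choice of log(total words / word count) for the log inverse frequency are all assumptions made for illustration; the actual `build_corpus` returns a data.table and may define these quantities differently.

```r
# Hypothetical helper (not part of the package): approximates a word corpus
# from two character vectors of names using base R only.
sketch_corpus <- function(namelist1, namelist2) {
  # Pool both name vectors and split each name on whitespace (assumed tokenization).
  words <- unlist(strsplit(c(namelist1, namelist2), "\\s+"))
  words <- words[nzchar(words)]  # drop empty tokens left by leading whitespace

  # Count how often each distinct word occurs across both vectors.
  counts <- table(words)
  freq <- as.integer(counts)

  data.frame(
    word = names(counts),
    freq = freq,
    inv_freq = 1 / freq,
    # Assumption: log inverse frequency taken as log(total words / word count).
    log_inv_freq = log(length(words) / freq),
    stringsAsFactors = FALSE
  )
}

corpus <- sketch_corpus(
  c("acme bank", "first national bank"),
  c("acme corp", "bank of first")
)
print(corpus)
```

In this toy input, common words such as "bank" receive a lower inverse frequency than rare words such as "national", which is what lets a weighted Jaccard score discount matches on uninformative words.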