Converts a comparison matrix generated by pairwise_compare into a data frame of candidates for matches.
pairwise_candidates(m, directional = FALSE)
m: A matrix from pairwise_compare.
directional: Should be set to the same value as in pairwise_compare.
A data frame containing all the non-NA values from m. Columns a and b are the IDs from the original corpus as passed to the comparison function. Column score is the score returned by the comparison function.
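The shape of the return value can be illustrated with a minimal base R sketch. This is not the package's implementation: the matrix, the document IDs (doc1, doc2, doc3), and the scores are made up for illustration, and the sketch only assumes that the input is a square score matrix with document IDs as dimnames and NA in the cells that were not compared.

```r
# Hypothetical 3x3 score matrix standing in for pairwise_compare() output.
# The IDs and scores are invented for this illustration.
ids <- c("doc1", "doc2", "doc3")
m <- matrix(NA_real_, nrow = 3, ncol = 3, dimnames = list(ids, ids))
m["doc1", "doc2"] <- 0.50
m["doc1", "doc3"] <- 0.10
m["doc2", "doc3"] <- 0.25

# Locate the non-NA cells; arr.ind = TRUE returns their row/column positions.
idx <- which(!is.na(m), arr.ind = TRUE)

# Turn each non-NA cell into one row: the two IDs plus the score.
candidates <- data.frame(
  a = rownames(m)[idx[, "row"]],
  b = colnames(m)[idx[, "col"]],
  score = m[idx],
  stringsAsFactors = FALSE
)
candidates
```

Each row of candidates corresponds to one non-NA cell of the matrix, so the three filled cells above yield three rows.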
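library(textreuse)

# Load the sample legal texts that ship with the textreuse package.
dir <- system.file("extdata/legal", package = "textreuse")
corpus <- TextReuseCorpus(dir = dir)

# Directional comparison: pass the same directional value to both functions.
m1 <- pairwise_compare(corpus, ratio_of_matches, directional = TRUE)
pairwise_candidates(m1, directional = TRUE)

# Non-directional comparison: the default directional = FALSE applies to both.
m2 <- pairwise_compare(corpus, jaccard_similarity)
pairwise_candidates(m2)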