Match terms in C++
match(term_vector, pdf_tdm, term_tdm, thresholds, pdf_index, terms, sentences)
Index vector giving the position of each term in pdf_tdm: if the ith element of term_vector is j, then term i is at column j of pdf_tdm.
Term document matrix of words in the PDF
Term document matrix of the words in the terms and the PDF sentences.
Acceptance thresholds
Index of terms in PDF
List of terms used; this is the vector of column names of term_tdm.
Vector of sentences read in from the PDF.
List of matched terms.
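
The relationship between term_vector and pdf_tdm described above can be illustrated with a minimal C++ sketch. This is not the package's implementation. The helper name column_for_term, the dense row-major matrix layout, and the use of 0-based indices are all assumptions made for illustration; the real function may use 1-based indices passed from R and a sparse matrix type.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of the documented interface.
// pdf_tdm is assumed to be a dense matrix stored as a vector of rows,
// with one column per word. term_vector[i] == j is taken to mean that
// term i is found at column j of pdf_tdm (0-based here by assumption).
std::vector<double> column_for_term(
    const std::vector<std::vector<double>>& pdf_tdm,
    const std::vector<std::size_t>& term_vector,
    std::size_t i) {
  assert(i < term_vector.size());      // term index must be valid
  const std::size_t j = term_vector[i];  // column holding term i

  std::vector<double> column;
  column.reserve(pdf_tdm.size());
  for (const auto& row : pdf_tdm) {
    assert(j < row.size());            // column index must be in range
    column.push_back(row[j]);
  }
  return column;
}
```

With a two-row matrix {{1, 0, 2}, {0, 3, 4}} and term_vector {2, 0}, term 0 maps to column 2 and term 1 maps to column 0.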