ParseProbDup

An object of class <code>ProbDup</code>.

pdup

The maximum count of probable duplicate sets which are to be parsed to a data frame.

max.count

logical. If <code>TRUE</code>, inserts a row of <code>NAs</code> 
after each set.

insert.blanks

<code>ParseProbDup</code> converts an object of class <code>ProbDup</code> to a data frame for export.

Provides functions to aid the identification of probable/possible
duplicates in Plant Genetic Resources (PGR) collections using
'passport databases' comprising of information records of each constituent
sample. These include methods for cleaning the data, creation of a
searchable Key Word in Context (KWIC) index of keywords associated with
sample records and the identification of nearly identical records with
similar information by fuzzy, phonetic and semantic matching of keywords.

J Aravind

PGRdup

Discover Probable Duplicates in Plant Genetic Resources
Collections

`SET_NO`	The set number.
`TYPE`	The type of probable duplicate set. 'F' for fuzzy, 'P' for phonetic and 'S' for semantic matching sets.
`K`	The KWIC index or database of origin of the record. The `method` is specified within the square brackets in the column name.
`PRIM_ID`	The primary ID of the accession record from which the set could be identified.
`IDKW`	The 'matching' keywords along with the IDs.
`COUNT`	The number of elements in a set.

ParseProbDup: Parse an object of class `ProbDup` to a data frame.

Description

Usage

Arguments

Value

See Also

Examples