read.genesys
reads PGR data in a Darwin Core - germplasm zip
archive downloaded from genesys database and creates a flat file
data.frame
from it.
read.genesys(zip.genesys, scrub.names.space = TRUE, readme = TRUE)
TRUE
, all space characters are
removed from name field in names extension (see Details).TRUE
, the genesys zip file readme is printed
to console.data.frame
.All the space characters can be removed from the fields corresponding to
accession names such as acceNumb, collNumb, ACCENAME, COLLNUMB, DONORNUMB and
OTHERNUMB using the argument scrub.names.space
to facilitate creation
of KWIC index with KWIC
function and subsequent
matching operations to identify probable duplicates with
ProbDup
function.
The argument readme
can be used to print the readme file in the
archive to console, if required.
data.table