FormatCV

<code>data.frame</code> object that is for use only when
<code>cv.scheme</code> is provided. Contains the trial to be tested in subsequent
model training functions. The first column contains unique identifiers, the
second contains genotypes, and the third contains reference values, followed by
spectral columns. Do not include any other columns to the right of the spectra.
Column names of spectra must start with "X", the reference column must be named
"reference", and the genotype column must be named "genotype".

trial1
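The layout rules above can be checked before passing a trial to the function. The following is a minimal sketch in base R, not code from 'waves': the toy values, the first column name <code>unique.id</code>, and the wavelength labels are invented for illustration.

```r
# Hypothetical toy trial; identifiers, genotypes, and values are invented.
set.seed(1)
n <- 6
spectra <- matrix(
  runif(n * 4), nrow = n,
  dimnames = list(NULL, paste0("X", c(350, 351, 352, 353)))
)
trial1 <- data.frame(
  unique.id = paste0("plot_", seq_len(n)),  # column 1: unique identifiers
  genotype  = paste0("G", seq_len(n)),      # column 2: must be named "genotype"
  reference = rnorm(n, mean = 30, sd = 2),  # column 3: must be named "reference"
  spectra                                   # remaining columns: names start with "X"
)

# Check the documented layout: column order, names, and unique identifiers.
layout_ok <- names(trial1)[2] == "genotype" &&
  names(trial1)[3] == "reference" &&
  all(startsWith(names(trial1)[-(1:3)], "X")) &&
  !anyDuplicated(trial1[[1]])
```

If <code>layout_ok</code> is <code>FALSE</code>, the data.frame most likely has a misnamed column or extra columns to the right of the spectra.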

<code>data.frame</code> object that is for use only when
<code>cv.scheme</code> is provided. This data.frame contains a trial whose
genotypes overlap with those in <code>trial1</code> but that was grown in a different
site/year (different environment). Formatting must be consistent with
<code>trial1</code>.

trial2
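Two things are worth confirming for a second trial: that its columns match <code>trial1</code> and that some genotypes are shared. The sketch below uses base R only; <code>make_trial</code> is a hypothetical helper and all values are invented.

```r
# Hypothetical helper that builds a toy trial in the documented layout.
make_trial <- function(genotypes) {
  data.frame(
    unique.id = paste0("plot_", seq_along(genotypes)),
    genotype  = genotypes,
    reference = seq_along(genotypes) * 1.5,
    X350      = 0.1,
    X351      = 0.2
  )
}

trial1 <- make_trial(c("G1", "G2", "G3", "G4"))
trial2 <- make_trial(c("G3", "G4", "G5"))

# Formatting is treated as consistent here if column names and order match.
same_format <- identical(names(trial1), names(trial2))

# Genotypes present in both trials.
shared <- intersect(as.character(trial1$genotype), as.character(trial2$genotype))
```

Here <code>shared</code> holds the genotypes common to both toy trials, which is the overlap that the description of <code>trial2</code> refers to.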

<code>data.frame</code> object that is for use only when
<code>cv.scheme</code> is provided. This data.frame contains a trial that may or
may not contain genotypes that overlap with <code>trial1</code>. Formatting must
be consistent with <code>trial1</code>.

trial3

A cross validation (CV) scheme from Jarquín et al., 2017.
Options for cv.scheme include:<ul>
<li>"CV1": untested lines in tested environments</li>
<li>"CV2": tested lines in tested environments</li>
<li>"CV0": tested lines in untested environments</li>
<li>"CV00": untested lines in untested environments</li>
</ul>

cv.scheme
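The four schemes differ only in whether the line and the environment of a test record already appear in the training data. The sketch below encodes the definitions as listed; it is an illustration and not the logic used inside <code>FormatCV</code>. The helper names, the <code>env</code> column, and the toy data are hypothetical.

```r
# Map the two yes/no questions onto the scheme names listed above.
scheme_label <- function(line_tested, env_tested) {
  if (line_tested && env_tested) {
    "CV2"   # tested lines in tested environments
  } else if (!line_tested && env_tested) {
    "CV1"   # untested lines in tested environments
  } else if (line_tested && !env_tested) {
    "CV0"   # tested lines in untested environments
  } else {
    "CV00"  # untested lines in untested environments
  }
}

# Hypothetical helper: label one test record relative to a training set.
label_test_record <- function(train, genotype, env) {
  scheme_label(genotype %in% train$genotype, env %in% train$env)
}

# Toy training set: three lines grown across environments "A" and "B".
train <- data.frame(
  genotype = c("G1", "G2", "G3"),
  env      = c("A", "A", "B")
)

labels <- c(
  label_test_record(train, "G4", "A"),  # new line, known environment
  label_test_record(train, "G1", "B"),  # known line, known environment
  label_test_record(train, "G1", "C"),  # known line, new environment
  label_test_record(train, "G4", "C")   # new line, new environment
)
```

In this sketch, "tested" simply means that the genotype or environment occurs anywhere in the training set; the exact partitioning rules are described in Jarquín et al. (2017).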

Number used in the function <code>set.seed()</code> for reproducible
randomization. If <code>NULL</code>, no seed is set. Default is <code>NULL</code>.

seed
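The effect of the seed can be seen with a small base R sketch. <code>sample_train_rows</code> is a hypothetical helper, and the 0.7 training proportion is an assumption made for illustration. It sets the seed only when one is supplied, as described above.

```r
# Hypothetical helper: draw training row indices, seeding only if seed is non-NULL.
sample_train_rows <- function(n, proportion = 0.7, seed = NULL) {
  if (!is.null(seed)) {
    set.seed(seed)
  }
  sort(sample.int(n, size = round(proportion * n)))
}

# The same seed gives the same split on every call.
a <- sample_train_rows(20, seed = 42)
b <- sample_train_rows(20, seed = 42)
reproducible <- identical(a, b)
```

With <code>seed = NULL</code> the draw depends on the current state of the random number generator, so repeated calls will generally return different rows.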

Boolean that, if <code>TRUE</code>, removes the "genotype"
column from the output <code>data.frame</code>. Default is
<code>FALSE</code>.

remove.genotype
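The option's effect on the output can be pictured with a base R sketch. <code>drop_genotype</code> is a hypothetical stand-in rather than a 'waves' function, and the toy values are invented.

```r
# Toy output in the documented layout (hypothetical values).
out <- data.frame(
  unique.id = c("p1", "p2"),
  genotype  = c("G1", "G2"),
  reference = c(30.1, 28.4),
  X350      = c(0.11, 0.12)
)

# Hypothetical stand-in: drop the "genotype" column only when requested.
drop_genotype <- function(df, remove.genotype = FALSE) {
  if (remove.genotype) {
    df[, names(df) != "genotype", drop = FALSE]
  } else {
    df
  }
}

kept    <- names(drop_genotype(out))
removed <- names(drop_genotype(out, remove.genotype = TRUE))
```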

Standalone function that is also used within
 <code><a rd-options="" href="/link/TrainSpectralModel?package=waves&version=0.1.0" data-mini-rdoc="waves::TrainSpectralModel">TrainSpectralModel</a></code> to divide trials or studies into training and test
 sets based on overlap in trial environments and genotype entries

Originally designed for application in the context of resource-limited plant research
and breeding programs, 'waves' provides an open-source solution to spectral data processing
and model development by bringing useful packages together into a streamlined pipeline.
This package is a wrapper for functions related to the analysis of point visible and
near-infrared reflectance measurements. It includes visualization, filtering, aggregation,
preprocessing, cross-validation set formation, model training, and prediction functions to
enable open-source association of spectral and reference data.
Specialized cross-validation schemes are described in detail in Jarquín et al. (2017)
<doi:10.3835/plantgenome2016.12.0130>. Example data is from Ikeogu et al. (2017)
<doi:10.1371/journal.pone.0188918>.

Jenna Hershberger

waves

Vis-NIR Spectral Analysis Wrapper

Michael Gore

NSF BREAD IOS-1543958 

FormatCV function


Format multiple trials with or without overlapping genotypes into
  training and test sets according to user-provided cross validation scheme — FormatCV

