Learn R Programming

⚠️There's a newer version (3.2.3) of this package.Take me there.

R Package for Core Hunter 3

Latest release

Development snapshot

Core Hunter is a tool to sample diverse, representative subsets from large germplasm collections, with minimum redundancy. Such so-called core collections have applications in plant breeding and genetic resource management in general. Core Hunter can construct cores based on genetic marker data, phenotypic traits or precomputed distance matrices, optimizing one of many provided evaluation measures depending on the precise purpose of the core (e.g. high diversity, representativeness, or allelic richness). In addition, multiple measures can be simultaneously optimized as part of a weighted index to bring the different perspectives closer together. The Core Hunter library is implemented in Java 8 as an open source project (see http://www.corehunter.org).

Version 3 has been recoded from scratch using the JAMES framework which provides the applied optimization algorithms. Requirements

A [Java Runtime Environment] (http://www.oracle.com/technetwork/java/javase/downloads/jre8-downloads-2133155.html) (JRE) version 8 or later is required to run Core Hunter.

Getting started

The package corehunter can be installed from CRAN with

> install.packages("corehunter")

Afterwards, load the package

> library(corehunter)

and add your data, e.g.

> my.genotypes <- genotypes(file = "path/to/file")

Sampling a core collection is then as easy as

> sampleCore(my.genotypes)

There are numerous options when sampling a core. For example, you can change the size of the core (defaults to 20%), optimize a specific measure (defaults to average entry-to-nearest-entry distance), maximize a weighted index including multiple measures, change stop conditions (by default, the algorithm stops when it was unable to further improve the core during the last 10 seconds), etc. All functions have detailed documentation, for example try

> ?sampleCore
> ?objective
> ?genotypes
> ?phenotypes
> ?distances
> ?coreHunterData

Many examples are included in the package as well.

Supported data types

Core Hunter 3 supports multiple types of genetic marker data, phenotypic traits and precomputed distance matrices. See http://www.corehunter.org for more details. Data can be loaded from files, data frames and matrices.

Evaluation measures

One of the main strengths of Core Hunter is that it can directly optimize a number of different evaluation measures. If desired, multiple measures can be simultaneously optimized as part of a weighted index. The measures included in Core Hunter 3 are listed below.

Distance based measures

  • Average entry-to-nearest-entry distance (diversity)
  • Average accession-to-nearest-entry distance (representativeness)
  • Average entry-to-entry distance (provided for historical reasons, not preferred)

Gower's distance is used to compute distances from phenotypic traits, and both the Modified Roger's as well as Cavalli-Sforza & Edwards distances are supported for genetic marker data. Alternatively, a precomputed distance matrix can be used.

Allelic richness

  • Shannon's index
  • Expected heterozygosity
  • Allele coverage

Available for genetic marker data only.

Copy Link

Version

Install

install.packages('corehunter')

Monthly Downloads

393

Version

3.0.1

License

MIT + file LICENSE

Maintainer

Herman De Beukelaer

Last Published

October 4th, 2016

Functions in corehunter (3.0.1)

getAlleleFrequencies

Get Allele frequency matrix.
setRange

Set the normalization range of the given objective.
sampleCore

Sample a core collection from the given data.
getNormalizationRanges

Determine normalization ranges of all objectives in a multi-objective configuration.
distances

Create Core Hunter distance data from matrix or file.
evaluateCore

Evaluate a core collection using the specified objective.
phenotypes

Create Core Hunter phenotype data from data frame or file.
read.autodelim

Read delimited file.
exampleData

Example dataset with 218 individuals.
genotypes

Create Core Hunter genotype data from data frame, matrix or file.
wrapData

Wrap distances, genotypes or phenotypes in Core Hunter data.
corehunter

Core Hunter 3
coreHunterData

Initialize Core Hunter data.