Package Overview

pooledpeaks is designed for analyzing genetic data obtained from Fragment Analysis output files (.fsa) of pooled biological samples. It provides functions for a comprehensive analysis pipeline, from processing .fsa files and cleaning the peak data to conducting population genetic analyses. Some features are listed below, and a usage example of the entire pipeline is included as a vignette. Please check out the Contributing Guidelines for information on how to add to this package.

Installation Instructions

You can install the package from GitHub or from CRAN.

Open R and copy the relevant code into your console:

Install devtools and pooledpeaks from GitHub

install.packages("devtools")
devtools::install_github("kmkuesters/pooledpeaks")

Install pooledpeaks directly from CRAN

install.packages("pooledpeaks")

Features

For a detailed example of how to apply the functions contained in this package, please see the vignette Introduction to Using the pooledpeaks Workflow. Example data, including .fsa files and a formatted "Multiplex_frequencies.txt" file for the Genetic Analysis portion, can be found on GitHub in the inst/extdata folder.

  • Peak Scoring: Process .fsa files and score peaks contained therein.
check_fsa_v_batch()
fsa_metadata()
fsa_batch_imp()
associate_dyes()
score_markers_rev3()
  • Data Manipulation: Clean and prepare peak data for downstream analyses.
clean_scores()
lf_to_tdf()
data_manipulation()
Rep_check()
PCDM()
LoadData()
  • Population Genetics Analysis:
    • Calculate Gene Identity Matrix and Genetic Distance Matrix
    • Calculate diversity indices
    • Calculate differentiation indices
    • Perform cluster analysis
TypedLoci()
GeneIdentityMatrix()
GeneticDistanceMatrix()
GST()
JostD()
cluster()
  • Visualization: Visualize the peak scoring and genetic analysis results.
MDSplot()
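
To make the population genetics quantities listed above more concrete, here is a minimal base R sketch of a gene identity matrix, Nei's GST, and Jost's D for a single locus. It does not call pooledpeaks and is not the package's implementation: the allele frequencies and all variable names are made up for illustration, and it uses the textbook formulas without sample-size corrections. See the vignette for the actual arguments of GeneIdentityMatrix(), GST(), and JostD().

```r
# Hypothetical allele frequencies at one locus (made-up numbers):
# rows are populations, columns are alleles, and each row sums to 1.
freqs <- rbind(
  popA = c(0.70, 0.20, 0.10),
  popB = c(0.10, 0.30, 0.60),
  popC = c(0.40, 0.40, 0.20)
)

# Gene identity between populations i and j is the sum over alleles of
# p_i * p_j. The diagonal holds the within-population identities.
gene_identity <- freqs %*% t(freqs)

n_pops <- nrow(freqs)

# HS: mean within-population heterozygosity.
hs <- mean(1 - diag(gene_identity))

# HT: heterozygosity of the pooled (averaged) allele frequencies.
ht <- 1 - sum(colMeans(freqs)^2)

# Nei's GST: proportion of total diversity found among populations.
gst <- (ht - hs) / ht

# Jost's D: differentiation rescaled by the within-population diversity
# and the number of populations.
jost_d <- ((ht - hs) / (1 - hs)) * (n_pops / (n_pops - 1))

print(round(gene_identity, 3))
print(c(HS = hs, HT = ht, GST = gst, JostD = jost_d))
```

Both indices are 0 when all populations share the same allele frequencies. Jost's D reaches 1 when no alleles are shared, whereas GST is bounded above by the within-population diversity.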

Sample Data

The sample .fsa files included in this package are provided for demonstration purposes and originate from two sources:

  • Schistosoma haematobium laboratory isolates, used for preliminary testing of the pooledpeaks workflow. These data contain no identifiable or human subject information.

  • De-identified Schistosoma mansoni samples from three studies conducted in Brazil, extracted from discarded human waste. These files were originally used for genetic analysis and are shared here in anonymized form to illustrate compatibility with additional species and data sources. These studies are described in detail by Long et al. (2022), available at https://www.nature.com/articles/s41598-022-04776-0.

These files are intended solely to demonstrate the functionality of the pooledpeaks package and are not for diagnostic or clinical use.

To access the example .fsa files included with the package, use the following path within R:

system.file("extdata", package = "pooledpeaks")
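
As a rough sketch, the directory returned by the call above can be passed to base R's list.files() to see which example files are available. The variable names are illustrative. The exact layout inside extdata is an assumption, so the search is recursive, and the location of "Multiplex_frequencies.txt" at the top level of extdata is likewise assumed.

```r
# Locate the installed package's extdata directory.
# system.file() returns "" when the package or directory is not found.
extdata_dir <- system.file("extdata", package = "pooledpeaks")

if (nzchar(extdata_dir)) {
  # List the bundled .fsa files with full paths, searching subfolders too.
  fsa_files <- list.files(
    extdata_dir,
    pattern = "\\.fsa$",
    full.names = TRUE,
    recursive = TRUE,
    ignore.case = TRUE
  )
} else {
  # Package not installed: fall back to an empty character vector.
  fsa_files <- character(0)
}

# Assumed location of the formatted frequency file; this is "" if the file
# is not present at that path in the installed package.
freq_file <- system.file("extdata", "Multiplex_frequencies.txt",
                         package = "pooledpeaks")

print(basename(fsa_files))
print(freq_file)
```

Checking nzchar() first avoids listing the current working directory by accident when the package is not installed.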

The pooledpeaks package was developed by the Blanton Lab as part of Kathleen Kuesters' dissertation.

Version: 1.2.1
License: GPL (>= 3)
Maintainer: Kathleen Kuesters
Last Published: May 1st, 2025
Monthly Downloads: 344

Functions in pooledpeaks (1.2.1)

  • pooledpeaks-package: pooledpeaks: Genetic Analysis of Pooled Samples
  • data_manipulation: Data Manipulation for Marker Data
  • fsa_batch_imp: Batch Import of .fsa files
  • preJostD: Calculate Pre-Jost's D
  • score_markers_rev3: Score Markers Wrapper
  • clean_scores: Clean Scores Data
  • cluster: K-means Clustering
  • preGST: Pre GST Calculation
  • fsa_metadata: Retrieve Metadata
  • RWCDistanceMatrix: Random Walk Covariance Distance Matrix
  • lf_to_tdf: Transform LF to TDF
  • ClusterFromSamples: Cluster From Samples
  • GeneIdentityMatrix: Gene Identity Matrix
  • AlRich: Calculate Allelic Richness
  • BootStrap3: Perform Bootstrap Analysis
  • GeneticDistanceMatrix: Genetic Distance Matrix
  • DistCor: Distance Correlation
  • JostD: Calculate Jost's D
  • GST: Nei's GST
  • EmpiricalSE: Calculate Empirical Standard Error
  • JostD_KK: Pairwise Jost D between replicates
  • PCDM: Post-consolidation Data Manipulation
  • TwoLevelGST: Calculate Two-Level GST
  • Rep_check: Replicate Check for Duplicate Samples
  • SampleOfLoci: Sample Of Loci
  • TypedLoci: Typed Loci
  • associate_dyes: Associate Dye Names in Batch Import Output
  • MDSplot: Multi Dimensional Scaling (MDS) Plot
  • LoadData: Load Genetic Data
  • check_fsa_v_batch: Check .fsa Version and Batch Information