Learn R Programming

prewas: data pre-processing for more informative bacterial GWAS

Introduction

The prewas R package allows users to create a binary SNP matrix from a whole genome alignment. The SNP matrix includes the following features: (1) multiple line representation of multiallelic sites, (2) multiple line representation for SNPs present in overlapping genes, and (3) choice over the reference allele. Additionally, users can collapse SNPs into genes so the output is a binary gene matrix. Output from the prewas package should be used as the input to bacterial GWAS tools such as hogwash.

Installation

To install prewas follow these commands in R:

install.packages("prewas")
library(prewas)

Note: this package depends on R (>= 3.5.0).

Documentation

prewas is described in the paper: "prewas: data pre-processing for more informative bacterial GWAS". The Rscripts and data for the paper's figures and analyses can be found in the manuscript analysis repository.

A tutorial explaining how to use the package can be found in the vignette.

Contributors

Katie Saund, Stephanie Thiede, and Zena Lapp contributed to this code.

Copy Link

Version

Install

install.packages('prewas')

Monthly Downloads

43

Version

1.1.1

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Katie Saund

Last Published

April 2nd, 2021

Functions in prewas (1.1.1)

results

Results from running prewas() on the example data.
collapse_snps_into_genes

Collapse SNPs into the gene(s) which they are from
collapse_snps_into_genes_by_impact

Collapse SNPs into the gene(s) which they are from and by snpeff impcat
prewas

Preprocess SNPs before bGWAS
outgroup

Name of outgroup in the phylogenetic tree.
snpeff_vcf

Nucleotide variants in example genome samples with snpeff annotations.
gff

GFF3 file for example genomes
vcf

Nucleotide variants in example genome samples
tree

Phylogenetic tree of example genomes