cleanDB

Data table with columns 'compoundname, structure, baseformula, charge, description'

db.formatted

parallel::makeCluster object for multithreading

silent

How many compounds to process per 'block'? A higher number means larger memory spikes but faster processing.

blocksize

SMILES format, Default: 'Canonical'

smitype
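
The arguments above can be illustrated with a minimal base R sketch. It builds a toy table with the five listed columns and shows one way block-wise processing with an optional cluster could look. The helper names `split_blocks` and `process_blocks` are hypothetical and not part of the package, a plain `data.frame` stands in for the data table, and the splitting logic is an assumption about the general idea, not the actual internals of `cleanDB`.

```r
# Hypothetical toy input with the five columns named in the argument description.
db.formatted <- data.frame(
  compoundname = c("Water", "Ethanol", "Glucose"),
  structure    = c("O", "CCO", "OCC1OC(O)C(O)C(O)C1O"),
  baseformula  = c("H2O", "C2H6O", "C6H12O6"),
  charge       = c(0, 0, 0),
  description  = c("Solvent", "Simple alcohol", "Hexose sugar"),
  stringsAsFactors = FALSE
)

# Split row indices 1..n into consecutive blocks of at most `blocksize` rows.
split_blocks <- function(n, blocksize) {
  idx <- seq_len(n)
  split(idx, ceiling(idx / blocksize))
}

# Apply `fun` to each block of rows; use a parallel cluster if one is supplied.
process_blocks <- function(db, blocksize, fun, cl = NULL) {
  blocks <- split_blocks(nrow(db), blocksize)
  worker <- function(rows) fun(db[rows, , drop = FALSE])
  if (is.null(cl)) {
    lapply(blocks, worker)
  } else {
    parallel::parLapply(cl, blocks, worker)
  }
}

# Count the rows in each block, processed sequentially.
block_sizes <- process_blocks(db.formatted, blocksize = 2, fun = nrow)
```

To run the blocks in parallel, one could create a cluster with `parallel::makeCluster(2)`, pass it as `cl`, and release it afterwards with `parallel::stopCluster(cl)`. Larger blocks mean fewer calls but more rows held in memory at once, which matches the trade-off described for `blocksize`.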

This is a wrapper function that takes a 'raw' input data table with compound information and uniformizes the SMILES.

Provides parsing functionality for over 30 metabolomics databases, most of which are available without creating an account on the respective websites.
Once parsed, the package calculates the requested adducts and isotope patterns and inserts them into a single database that can be used to annotate unknown m/z values.
Furthermore, formulas can be predicted for a given m/z, and these can be matched to ChemSpider, PubChem, SUPERNATURAL II, KNApSAcK and ChemIDplus for further annotation.
Current databases available: HMDB, ChEBI, LMDB, BMDB, MCDB, ECMDB, Wikidata, mVOC, VMH, T3DB, Exposome Explorer, FooDB, MetaCyc (requires account), DrugBank (requires account), ReSPECT, MaConDa, Blood Exposome DB, KEGG, SMPDB, LIPID MAPS, MetaboLights, DimeDB, Phenol Explorer, MassBank, YMDB, PAMDB, ANPDB, Metabolomics Workbench, PharmGKB, Reactome and STOFF.
Featured in the 'MetaboShiny' package (Wolthuis, J. (2019) <doi:10.1007/s11306-020-01717-8>).
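
As a small worked illustration of what calculating an adduct involves, the sketch below computes the m/z of a singly protonated ion, [M+H]+, from a neutral monoisotopic mass. The function name `mz_protonated` is hypothetical, the atomic masses are standard values rounded to six decimals, and this is not a description of how the package itself computes adducts or isotope patterns.

```r
# Mass of a proton in Da, rounded to six decimals.
proton_mass <- 1.007276

# m/z of a singly protonated ion [M+H]+ from a neutral monoisotopic mass.
mz_protonated <- function(monoisotopic_mass) {
  monoisotopic_mass + proton_mass
}

# Monoisotopic mass of glucose (C6H12O6) from 12C, 1H and 16O masses.
glucose_mass <- 6 * 12.000000 + 12 * 1.007825 + 6 * 15.994915

# m/z that a [M+H]+ entry for glucose would carry in an annotation database.
glucose_mz <- mz_protonated(glucose_mass)
```

An unknown m/z value can then be annotated by looking for database entries whose adduct m/z falls within a chosen tolerance of the measured value.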

cleanDB: Uniformize database and remove invalid formulas/SMILES

Description

Usage

Arguments

Value

See Also

Examples