Computes chemical similarity between two (or more) input molecules.
ms.compute (molA, molB, format = 'smiles', standardize = TRUE, explicitH = FALSE,
sim.method = 'tanimoto', fp.type = 'extended', fp.mode = 'bit', fp.depth = 6,
fp.size = 1024, fpCached = FALSE)
ms.compute.sim.matrix (molA, format = 'smiles', standardize = TRUE, explicitH = FALSE,
sim.method = 'tanimoto', fp.type = 'extended', fp.mode = 'bit', fp.depth = 6,
fp.size = 1024, clearCache = TRUE)
ms.compute.PCA(molA, format = 'smiles', standardize = TRUE, explicitH = FALSE,
fp.type = 'extended', fp.mode = 'bit', fp.depth = 6, fp.size = 1024,
clearCache = TRUE)Returns similarity value(s).
ms.computereturns a similarity value.
ms.compute.sim.matrixreturns a \(m \times m\) symmetric matrix of similarity values. \(m\) is the length of the input list.
ms.compute.PCAreturns prcomp object.
input molecule in SMILES format or name (with path) of MDL MOL file. ms.compute.sim.matrix accepts list of molecules as input.
input molecule in SMILES format or name (with path) of MDL MOL file.
specifies format of input molecule(s). Molecule(s) can be provided in one of following formats: 'SMILES' (default) or 'MOL'.
suppresses all explicit hydrogen if set as TRUE (default).
converts all implicit hydrogen to explicit if set as TRUE. It is set as FALSE by default.
similarity metric to be used to evaluate molecule similarity. Allowed types include:
'simple', 'jaccard', 'tanimoto' (default), 'russelrao', 'dice', 'rodgerstanimoto', 'achiai', 'cosine', 'kulczynski2', 'mt', 'baroniurbanibuser', 'tversky', 'robust', 'hamann', 'pearson', 'yule', 'mcconnaughey', 'simpson', 'jaccard-count' and 'tanimoto-count'.
fingerprint type to use. Allowed types include:
'standard', 'extended' (default), 'graph', 'estate', 'hybridization', 'maccs', 'pubchem', 'kr', 'shortestpath', 'signature' and 'circular'.
fingerprint mode to be used. It can either be set to 'bit' (default) or 'count'.
search depth for fingerprint construction. This argument is ignored for 'pubchem', 'maccs', 'kr' and 'estate' fingerprints.
length of the fingerprint bit string. This argument is ignored for 'pubchem', 'maccs', 'kr', 'estate', 'circular' (count mode) and 'signature' fingerprints.
boolean that enables fingerprint caching. It is set to FALSE by default.
boolean that resets the cache before (and after) processing molecule lists. It is set to TRUE by default. Cache can also be explicitly cleared by using rs.clearCache.
Varun Giri varungiri@gmail.com
See rs.compute functions, for details for fingerprints and similarity matrices.
ms.compute can use fingerprint caching by enabling fpCached option. ms.compute and ms.compute.sim.matrix use same cache as rs.compute and other functions in the package.
ms.compute.PCA computes PCA based on the fingerprints using prcomp funtion.
rs.compute, rs.clearCache
ms.compute('N', '[H]N([H])[H]', standardize = FALSE)
Run the code above in your browser using DataLab