eiInit(inputs,dir=".",format="sdf",descriptorType="ap",append=FALSE,
conn=defaultConn(dir,create=TRUE), updateByName = FALSE, cl = NULL, connSource = NULL)
format
format, or an SDFset. This can
also be a vector of filenames and if cl
is also specified and if you database
supports it (SQLite does not), it will load these file in parallel on the cluster.inputs
. Currenly only "sdf" and "smiles" is
supported.eiAdd
instead to add new compounds to a database.If false, we allow duplicate compound names to exist in the database, though not duplicate definitions. So identical compounds will not be re-added, but if a new version of an existing compound is added it will not update the existing one, it will add the modified one as a completely new compound with a new compound id.
This function can also be used to setup the envrionment on the cluster worker nodes. For example, you might need to re-load libraries like RSQLite and such.
eiQuery
.read.SDFset
command and then use that
instead of the filename.
EiInit will create a folder called
'data'. Commands should always be executed in the folder containing
this directory (ie, the parent directory of "data"), or else
specify the location of that directory with the dir
option.eiMakeDb
eiPerformanceTest
eiQuery
data(sdfsample)
dir=file.path(tempdir(),"init")
dir.create(dir)
eiInit(sdfsample,dir=dir)
Run the code above in your browser using DataLab