Having the generative probabilities from step1 (generative.prob() or generative.prob.nucl()), we could proceed directly with the PT MCMC to explore the state space. Typically the number of total potential species is large. Therefore we reduce the size of the state-space, by decreasing the number of species to the low hundreds. We achieve this by fitting a Mixture Model with as many categories as all the potential species. Post fitting, we retain only the species categories that are not empty, that is categories that have at least one read assigned to them.
reduce.space.explicit is the same function as reduce.space but with more involved syntax.
reduce.space(step1, read.cutoff = 1, EMiter = 500, seed = 1)
reduce.space.explicit(pij.sparse.mat, ordered.species, read.weights, outDir, gen.prob.unknown, read.cutoff = 1, EMiter = 500, seed = 1)## See vignette for more details.
## Not run:
# # Either load the object created by previous step
# data(step1) ## example output of step1, i.e generative.prob()
# step2 <- reduce.space(step1=step1)
#
# # or alternatively point to the location of the step1.RData object
# step2 <- reduce.space(step1="/pathtoFile/step1.RData")
# ## End(Not run)
Run the code above in your browser using DataLab