Compare sets of regions via jaccard and relative distance using permutation to get an empirical p-value.
test.region.similarity(
x,
y,
n = 1000,
stratify.by.chr = FALSE,
species = "human",
build = "hg19",
mask.gaps = FALSE,
mask.repeats = FALSE,
gaps.file = NULL,
repeats.file = NULL,
check.zero.based = TRUE,
check.chr = TRUE,
check.valid = TRUE,
verbose = TRUE
)
A list of results
first region to be compared. this is the region that is permuted
second region to be compared
the number of iterations to permute
Should the permutation be happen separetely for each chromosome. That is are chromosomes exchangeable.
species
the build of the reference
should the gaps (Ns) in the human reference be ignored as potential start points. This drammatically increases memory and run time but is more appropriate in almost all settings. By default it's off.
should the repeats from repeatMasker be ignored as potential start points. This drammatically increases memory and run time but is more appropriate in almost all settings. By default it's off.
database file of gaps. Defaults to Homo sapiens Hg19 gap.txt.gz file available through UCSC
database file of repeats as supplied by UCSC containing RepMasker data e.g rmsk.txt.gz
should 0 based coordinates be checked
should chr prefix be checked
should the region be checkded for integerity
should log messages and checking take place
Daryl Waggott
Iteratively permutes intervals and recalculates jaccard and reldist statistics.