pre5.genos2numeric
, only leaves genotypes the way they are, without categorizing them into 3 levels. Removes all SNPs that have missing or bad values. Intended to be done after imputation, to ensure consistency. Geno values should use letters A, T, C, G if letter.encoding
=TRUE.genos.clean(file.ped, ending.ped = ".txt", dir.ped, file.dat, ending.dat = ".dat",
dir.dat = dir.ped, dir.out, num.nonsnp.col = 2, num.nonsnp.last.col = 1,
letter.encoding = TRUE, save.ids.name = "")
file.ped
, should contain the dot '.', if file has no ending, use an empty string "". This is needed to name the output file as file.ped
is without ending.
file.ped
can be found.
file.dat
, should contain the dot '.'. This is needed to name the output file as file.dat
is without ending.
file.dat
can be found. Defaults to dir.ped
.
letter.encoding
= True. The reason for calling this function, and not pre5.genos2numeric
is because you might wish to call other software packages on the fully imputed data, which will not need the data categorized into 3 levels. Outputs the following files:
_clean - in dir.out directory, the resultant file: the SNP columns + last columns (but no user IDs will be recorded). _clean.dat - in dir.out directory, the corresponding .dat file, will be different from original if any bad SNPs get removed. - the column of patient IDs, if save.ids.name is not empty "".
pre5.genos2numeric
, pre5.genos2numeric.batch
,
pre3.call.mach
, pre4.combine.case.control