- input
a data.frame
or a file or a directory contains copy number profile.
- pattern
an optional regular expression used to select part of files if
input
is a directory, more detail please see list.files()
function.
- ignore_case
logical. Should pattern-matching be case-insensitive?
- seg_cols
four strings used to specify chromosome, start position,
end position and copy number value in input
, respectively.
Default use names from ABSOLUTE calling result.
- samp_col
a character used to specify the sample column name. If input
is a directory and cannot find samp_col
, sample names will use file names
(set this parameter to NULL
is recommended in this case).
- add_loh
if TRUE
, add LOH labels to segments. NOTE a column
'minor_cn' must exist to indicate minor allele copy number value.
Sex chromosome will not be labeled.
- loh_min_len
The length cut-off for labeling a segment as 'LOH'.
Default is 10Kb
.
- loh_min_frac
When join_adj_seg
set to TRUE
, only the length fraction
of LOH region is larger than this value will be labeled as 'LOH'.
Default is 30%.
- join_adj_seg
if TRUE
(default), join adjacent segments with
same copy number value. This is helpful for precisely count the number of breakpoint.
When set use_all=TRUE
, the mean function will be applied to extra numeric columns
and unique string columns will be pasted by comma for joined records.
- skip_annotation
if TRUE
, skip annotation step, it may affect some analysis
and visualization functionality, but speed up reading data.
- use_all
default is FALSE
. If True
, use all columns from raw input.
- min_segnum
minimal number of copy number segments within a sample.
- max_copynumber
bigger copy number within a sample will be reset to this value.
- genome_build
genome build version, should be 'hg19', 'hg38', 'mm9' or 'mm10'.
- genome_measure
default is 'called', can be 'wg' or 'called'.
Set 'called' will use called segments size to compute total size for CNA burden calculation,
this option is useful for WES and target sequencing.
Set 'wg' will use autosome size from genome build, this option is useful for WGS, SNP etc..
- complement
if TRUE
, complement chromosome (except 'Y') does not show in input data
with normal copy 2.
- ...
other parameters pass to data.table::fread()