clusGap.
clustTrend(tcgs, expr, Subject_ID, TimePoint, threshold = 0.05, myproc = "BY", nbsimu_pval = 1e+06, baseline = NULL, only.signif = TRUE, group.var = NULL, Group_ID_paired = NULL, ref = NULL, group_of_interest = NULL, FUNcluster = NULL, clustering_metric = "euclidian", clustering_method = "ward", B = 100, max_trends = 4, aggreg.fun = "median", trend.fun = "median", methodOptiClust = "firstSEmax", indiv = "genes", verbose = TRUE)
"print"(x, ...)
"plot"(x, ...)clustTrend, or a
ClusteredTrends object for print.ClusteredTrends and
plot.ClusteredTrends.'Estimations' element of TcGSA.LR. See details.expr (when it is a dataframe) and that contains the patient
identifier of each sample.Subject_ID and the columns of expr (when it
is a dataframe), and that contains the time points at which gene expression
was measured.Bonferroni",
"Holm", "Hochberg", "SidakSS", "SidakSD",
"BH", "BY", "ABH", "TSBH". See
mt.rawp2adjp for details. Default is
"BY", the Benjamini & Yekutieli (2001) step-up FDR-controlling
procedure (general dependency structures). In order to control the FWER(in
case of an analysis that is more a hypothesis confirmation than an
exploration of the expression data), we recommand to use "Holm", the
Holm (1979) step-down adjusted p-values for strong control of the FWER.1e+06.TimePoint
that can be used as a baseline. Default is NULL, in which case no
timepoint is used as a baseline value for gene expression. Has to be
NULL when comparing two treatment groups.FALSE, all the gene sets from the
gmt object contained in x are clustered. Default is
TRUE.Timepoint,
Subject_ID and the columns of expr. It indicates to which
treatment group each sample belongs to. Default is NULL, which means
that there is only one treatment group.Timepoint, Subject_ID, group.var and the
columns of expr. This argument must not be NULL in the case of
a paired analysis, and must be NULL otherwise. Default is
NULL.NULL, which means that reference is the
first group in alphabetical order of the labels of group.var.NULL,
which means that group of interest is the second group in alphabetical order
of the labels of group.var.FUNcluster is NULL. The currently available
options are "euclidean" and "manhattan". Default is
"euclidean". See agnes. Also, a "sts" option
is available in TcGSA. It implements the 'Short Time Series' distance
[Moller-Levet et al., Fuzzy CLustering of short time series and unevenly distributed
sampling points, Advances in Intelligent Data Analysis V:330-340 Springer, 2003]
designed specifically for clustering time series.FUNcluster is
NULL. The six methods implemented are "average" ([unweighted
pair-]group average method, UPGMA), "single" (single linkage),
"complete" (complete linkage), "ward" (Ward's method),
"weighted" (weighted average linkage). Default is "ward". See
agnes.500. See
clusGap.4."mean", "median"
or the name of any other defined statistics function that returns a single
numeric value. It specifies the function used to aggregate the observations
before the clustering. Default is to median. Default is
"median"."mean", "median" or
the name of any other function that returns a single numeric value. It
specifies the function used to calculate the trends of the identified
clustered. Default is to median."globalmax", "firstmax",
"Tibs2001SEmax", "firstSEmax" and "globalSEmax".
Default is "firstSEmax". See 'method' in
clusGap, Details and Tibshirani et al.,
2001 in References.aggreg.fun) before the clustering. Possible
values are "genes" or "patients". Default is "genes".TRUE.ClusteredTrends'.NsClust (the
number of analysed gene sets). Each element of the list is a data frame, in
which there is as many column as the optimal number of clusters for the
corresponding gene setsfor each cluster. Each column of the data frame
contains the median trend values for the corresponding cluster.
NsClust (the
number of analysed gene sets). Each element of the list is a vector which
gives the partition of the genes inside the corresponding gene set.
'max_trends'.
expr is a matrix or a dataframe, then the genes dynamics are
clustered on the "original" data. On the other hand, if expr is a
list returned in the 'Estimations' element of TcGSA.LR,
then the dynamics are computed on the estimations made by the
TcGSA.LR function.This function uses the Gap statistics to determine the optimal number of
clusters in the plotted gene set. See
clusGap.
plot1GS, TcGSA.LR,
clusGap
data(data_simu_TcGSA)
tcgsa_sim_1grp <- TcGSA.LR(expr=expr_1grp, gmt=gmt_sim, design=design,
subject_name="Patient_ID", time_name="TimePoint",
time_func="linear", crossedRandom=FALSE)
CT <- clustTrend(tcgsa_sim_1grp,
expr=expr_1grp, Subject_ID=design$Subject_ID, TimePoint=design$TimePoint)
CT
plot(CT)
CT$NbClust
CT$NbClust["Gene set 5"]
CT$ClustMeds[["Gene set 4"]]
CT$ClustMeds[["Gene set 5"]]
Run the code above in your browser using DataLab