Look back to the data model described in "dat
":
$$Y_{ij}= \alpha_0 + \sum_{m=1}^{t}\theta_m Clin_{ijm} + \sum_{u=1}^{q}\alpha_u E_{iju} + \sum_{v=1}^{p}\eta_v^\top Z_{ijv}+\epsilon_{ij},$$
where \(Z_{ijv}\) contains the \(v\)th genetic main factor and its interactions with the \(q\) environment factors for the \(j\)th measurement on the \(i\)th subject
and \(\eta_{v}\) is the corresponding coefficient vector of length \(1+q\).
When structure="bilevel", variable selection for genetic main effects and gene-environment interactions under the longitudinal response will be conducted on both individual and group levels (bi-level selection):
Group-level selection: by determining whether \(||\eta_{v}||_{2}=0\), we can know if the \(v\)th genetic variant has any effect at all.
Individual-level selection: investigate whether the \(v\)th genetic variant has main effect, G\(\times\)E interaction or both, by determining which components in \(\eta_{v}\) has non-zero values.
If structure="group", only group-level selection will be conducted on \(||\eta_{v}||_{2}\); if structure="individual", only individual-level selection will be conducted on each \(\eta_{vu}\), (\(u=1,\ldots,q\)).
This function also provides choices for the framework that is used. If func="QIF", variable selection will be conducted within the quadratic inference functions framework; if func="GEE", variable selection will be
conducted within the generalized estimating equation framework.
There are three options for the choice of the working correlation. If corr="exchangeable", the exchangeable working correlation will be applied; if corr="AR-1", the AR-1 working correlation will be adopted; if corr="independence",
the independence working correlation will be used.
Please check the references for more details.