get_cluster_sequences

For each cluster, extract all sequence of length <code>k</code> from the ordered observations grouped by individual
IDs. Returns a list of sequences per cluster.

matrix

state

Censored

Provides basic tools and wrapper functions for computing clusters of instances described by multiple time-to-event censored endpoints. From long-format datasets, where one instance is described by one or more dated records, the main function, `make_state_matrices()`, creates state matrices. Based on these matrices, optimised procedures using the Jaccard distance between instances enable the construction of longitudinal typologies. The package is under active development, with additional tools for graphical representation of typologies planned. For methodological details, see our accompanying paper: `Delord M, Douiri A (2025) <doi:10.1186/s12874-025-02476-7>`.

Marc Delord

MSCA

Unsupervised Clustering of Multiple Censored Time-to-Event
Endpoints

get_cluster_sequences function

<dl><dt>dt</dt>
<dd>A <code>data.table</code> or data.frame containing the data in a long format.</dd>
<dt>cl_col</dt>
<dd>Name of the column containing cluster labels.</dd>
<dt>id_col</dt>
<dd>Name of the column identifying individual trajectories (e.g. patient ID).</dd>
<dt>event_col</dt>
<dd>Name of the column containing ordered events (e.g. diagnoses, prescriptions).</dd>
<dt>k</dt>
<dd>Integer specifying the sequence length (recomended 2).</dd></dl>

Arguments

Author

Extract sequences of length k within clusters — get_cluster_sequences

<dl>

<dt>dt</dt>
<dd>A <code>data.table</code> or data.frame containing the data in a long format.</dd>


<dt>cl_col</dt>
<dd>Name of the column containing cluster labels.</dd>


<dt>id_col</dt>
<dd>Name of the column identifying individual trajectories (e.g. patient ID).</dd>


<dt>event_col</dt>
<dd>Name of the column containing ordered events (e.g. diagnoses, prescriptions).</dd>


<dt>k</dt>
<dd>Integer specifying the sequence length (recomended 2).</dd>

</dl>

get_cluster_sequences: Extract sequences of length k within clusters

Description

Usage

Value

Arguments

Author

References

See Also