divide_samples

A global variable used in multiple functions.
This utility function divides a sequence of sample indices into <code>num</code> segments
ensuring that each segment meets a specified minimum size. It optionally
extracts a subset of each segment based on predefined selection logic:<ul>
<li>For a single group (<code>num = 1</code>): selects a random contiguous sub-vector
comprising between 10% and 55% of the total samples.</li>
<li>For multiple groups (<code>num &gt; 1</code>): selects a contiguous sub-vector
comprising approximately 75% of each segment.</li>
</ul>

Provides tools to simulate multi-omics datasets with predefined signal structures. The generated data can be used for testing, validating, and benchmarking integrative analysis methods such as factor models and clustering approaches. This version includes enhanced signal customization, visualization tools (scatter, histogram, 3D), MOFA-based analysis pipelines, PowerPoint export, and statistical profiling of datasets. Designed for both method development and teaching, SUMO supports real and synthetic data pipelines with interpretable outputs. Tini, Giulia, et al (2019) <doi:10.1093/bib/bbx167>.

Bernard Isekah Osang'ir

SUMO

Generating Multi-Omics Datasets for Testing and Benchmarking

Ziv Shkedy

Surya Gupta

Jürgen Claesen

divide_samples function

<dl><dt>n_samples</dt>
<dd>Integer. Total number of samples to divide.</dd>
<dt>num</dt>
<dd>Integer. Number of desired segments or latent factors.</dd>
<dt>min_size</dt>
<dd>Integer. Minimum size (length) allowed for each segment.</dd></dl>

Arguments

Global Variable — divide_samples

<dl>

<dt>n_samples</dt>
<dd>Integer. Total number of samples to divide.</dd>


<dt>num</dt>
<dd>Integer. Number of desired segments or latent factors.</dd>


<dt>min_size</dt>
<dd>Integer. Minimum size (length) allowed for each segment.</dd>

</dl>

divide_samples: Global Variable

Description

Usage

Value

Arguments

Details

Examples