seqCluster

Graph clustering based on distances between sequences

A comprehensive framework for bioinformatics exploratory analysis of bulk and single-cell
T-cell receptor and antibody repertoires. It provides seamless data loading, analysis and
visualisation for AIRR (Adaptive Immune Receptor Repertoire) data, both bulk immunosequencing (RepSeq)
and single-cell sequencing (scRNAseq). Immunarch implements most of the widely used AIRR analysis methods,
such as: clonality analysis, estimation of repertoire similarities in distribution of clonotypes
and gene segments, repertoire diversity analysis, annotation of clonotypes using external immune receptor
databases and clonotype tracking in vaccination and cancer studies. A successor to our
previously published 'tcR' immunoinformatics package (Nazarov 2015) <doi:10.1186/s12859-015-0613-1>.

Vadim I. Nazarov

immunarch

Bioinformatics Analysis of T-Cell and B-Cell Immune Repertoires

Vasily O. Tsvetkov

Siarhei Fiadziushchanka

Eugene Rumynskiy

Aleksandr A. Popov

Ivan Balashov

Maria Samokhina

Anna Lorenc

Daniel J. Moore

Victor Greiff

ImmunoMind 

seqCluster function

<dl><dt>.data</dt>
<dd>The data which was used to caluculate .dist object. Can be <a href="/link/data.frame?package=immunarch&version=0.9.1" data-mini-rdoc="immunarch::data.frame">data.frame</a>,
data.table, or a list of these objects.
Every object must have columns in the immunarch compatible format immunarch_data_format</dd>
<dt>.dist</dt>
<dd>List of distance objects produced with seqDist function.</dd>
<dt>.perc_similarity</dt>
<dd>Numeric value between 0 and 1 specifying the maximum acceptable weight of an edge in a graph.
This threshold depends on the length of sequences.</dd>
<dt>.nt_similarity</dt>
<dd>Numeric between 0-sequence length specifying
the threshold of allowing a 1 in n nucleotides mismatch in sequencies.</dd>
<dt>.fixed_threshold</dt>
<dd>Numeric specifying the threshold on the maximum weight of an edge in a graph.</dd></dl>

Arguments

Function for assigning clusters based on sequences similarity — seqCluster

<dl>

<dt>.data</dt>
<dd>The data which was used to caluculate .dist object. Can be <a href='https://rdrr.io/r/base/data.frame.html'>data.frame</a>,
data.table, or a list of these objects.
Every object must have columns in the immunarch compatible format immunarch_data_format</dd>


<dt>.dist</dt>
<dd>List of distance objects produced with seqDist function.</dd>


<dt>.perc_similarity</dt>
<dd>Numeric value between 0 and 1 specifying the maximum acceptable weight of an edge in a graph.
This threshold depends on the length of sequences.</dd>


<dt>.nt_similarity</dt>
<dd>Numeric between 0-sequence length specifying
the threshold of allowing a 1 in n nucleotides mismatch in sequencies.</dd>


<dt>.fixed_threshold</dt>
<dd>Numeric specifying the threshold on the maximum weight of an edge in a graph.</dd>

</dl>

Function for assigning clusters based on sequences similarity

seqCluster: Function for assigning clusters based on sequences similarity

Description

Usage

Value

Arguments

Examples