A data.frame
with sequence and lineage information, including the
the parent nucleotide sequence in the lineage tree(parent_sequence
),
an internal parent identifier (parent
), and additional rows for germline
sequence and inferred intermediate sequences.
Values in the sequence_id
field are renamed to numeric values,
prefixed with the clonal grouping identifier and labeled as either "Inferred"
or "Germline"
if they are not an observed sequence. For example, for a lineage
with clone_id = 34
the new identifiers would be of the form:
"34_Germline"
, "34_Inferred1"
, "34_1"
, "34_2"
, etc.
Note that the original sequence identifier is preserved in the orig_sequence_id
field
and the original parent sequence identifier is retained in orig_parent
.