
This function concatenates completely (index incidence = 100 index incidence < 100 k-mer position or are adjacent to each other and generate the CCS/HCS sequence in either CSv or FASTA format
concat_conserved_kmer(
data,
conservation_level = "HCS",
kmer = 9,
threshold_pct = NULL
)
A list wit csv and fasta dataframes
DiMA JSON converted csv file data
CCS (completely conserved) / HCS (highly conserved)
size of the k-mer window
manually set threshold of index.incidence for HCS
csv<-concat_conserved_kmer(proteins_1host)$csv
csv_2hosts<-concat_conserved_kmer(protein_2hosts, conservation_level = "CCS")$csv
fasta <- concat_conserved_kmer(protein_2hosts, conservation_level = "HCS")$fasta
Run the code above in your browser using DataLab