Compute similarity scores between sequences of integers
seq_sim(a, b, method = c("osa", "lv", "dl", "hamming", "lcs", "qgram",
"cosine", "jaccard", "jw"), q = 1, ...)
list
of integer
vectors (target)
list
of integer
vectors (source). Optional for
seq_distmatrix
.
Method for distance calculation. The default is "osa"
,
see stringdist-metrics
.
Size of the method='qgram'
, 'jaccard'
or 'cosine'
.
additional arguments are passed on to seq_dist
.
A numeric
vector of length max(length(a),length(b))
. If one of the
entries in a
or b
is NA_integer_
, all comparisons with that
element result in NA
. Missings occurring within the sequences are treated
as an ordinary number (the representation of NA_integer_
).
# NOT RUN {
L1 <- list(1:3,2:4)
L2 <- list(1:3)
seq_sim(L1,L2,method="osa")
# note how missing values are handled (L2 is recycled over L1)
L1 <- list(c(1L,NA_integer_,3L),2:4,NA_integer_)
L2 <- list(1:3)
seq_sim(L1,L2)
# }
Run the code above in your browser using DataLab