Learn R Programming

tlsh (version 0.1.0)

minhash_v2: Function to create a matrix of minhashed signatures

Description

Function to create a matrix of minhashed signatures

Usage

minhash_v2(
  shingled_records,
  p,
  do_one_hash_and_record = do_one_hash_and_record
)

Arguments

shingled_records

Shingled records

p

Number of permutations to be applied to the hash function

do_one_hash_and_record

Combination of one hash and one record

Value

Computes an integer-valued matrix of minhash signatures with one row per permutation and one column per record

Examples

Run this code
# NOT RUN {
head(data <- RLdata500[-c(2,4)])
minidata <- data[1:2,]
head(all_the_shingles <- apply(minidata,1,shingles,k=8))
head(minhash.minidata <- minhash_v2(all_the_shingles, p=10))
# }

Run the code above in your browser using DataLab