Cosine
Implements Cosine Similarity between strings.
The strings are first transformed into vectors of occurrence counts of k-shingles (sequences of k characters). In this n-dimensional space, where n is the number of distinct k-shingles, the similarity between the two strings is the cosine of the angle between their vectors.
The cosine distance is computed as 1 - cosine similarity.
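The computation described above can be illustrated with a minimal Python sketch. This is not the library's implementation: the function names (shingle_profile, cosine_similarity, cosine_distance), the default k = 3, and the choice to return a similarity of 0 when a string is shorter than k are all assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def shingle_profile(s: str, k: int) -> Counter:
    """Count how often each k-shingle (substring of k characters) occurs in s."""
    return Counter(s[i:i + k] for i in range(len(s) - k + 1))


def cosine_similarity(s1: str, s2: str, k: int = 3) -> float:
    """Cosine of the angle between the k-shingle count vectors of s1 and s2."""
    p1, p2 = shingle_profile(s1, k), shingle_profile(s2, k)
    # Dot product over shared shingles; a Counter yields 0 for missing keys.
    dot = sum(count * p2[shingle] for shingle, count in p1.items())
    norm1 = sqrt(sum(c * c for c in p1.values()))
    norm2 = sqrt(sum(c * c for c in p2.values()))
    if norm1 == 0 or norm2 == 0:
        # Assumption: a string shorter than k has no shingles, so report 0.
        return 0.0
    return dot / (norm1 * norm2)


def cosine_distance(s1: str, s2: str, k: int = 3) -> float:
    """Cosine distance, defined as 1 - cosine similarity."""
    return 1.0 - cosine_similarity(s1, s2, k)
```

For example, with k = 2 the strings "abcd" and "abce" share two of their three shingles ("ab" and "bc"), so the similarity is 2/3 and the distance is 1/3, up to floating-point rounding.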
Parameters
k
length of the k-shingles, in characters