Jaccard Similarity Coefficient
Compare the overlap between two sets β the intersection divided by the union of character bigrams.
Tilores uses Jaccard similarity in production β so you can automate matching with rules you configure.
Try it yourself
How it works
The Jaccard similarity coefficient measures the overlap between two sets by dividing the size of their intersection by the size of their union. When applied to strings (using character bigrams as set elements), it measures how many character pairs the strings share. A score of 1 means identical sets, 0 means no overlap. Jaccard similarity is particularly useful for comparing addresses and multi-word strings where word order may vary.
Use cases in entity resolution
Related tools
Don't implement this yourself
Tilores Studio runs the full matching engine β this algorithm plus configurable rules and real-time entity resolution β locally on your machine. Free, no account, no cloud. Load your own data and see it working in minutes.