πŸ‡»πŸ‡ͺ Venezuela Te Busca: deduplicamos el registro de personas desaparecidas. Ayuda gratuita para reunir familias.Saber mΓ‘s

← All Tools
Free Tool

Jaccard Similarity Coefficient

Compare the overlap between two sets β€” the intersection divided by the union of character bigrams.

Tilores uses Jaccard similarity in production β€” so you can automate matching with rules you configure.

Try it yourself

Similarity Score
0.6000
No match

How it works

The Jaccard similarity coefficient measures the overlap between two sets by dividing the size of their intersection by the size of their union. When applied to strings (using character bigrams as set elements), it measures how many character pairs the strings share. A score of 1 means identical sets, 0 means no overlap. Jaccard similarity is particularly useful for comparing addresses and multi-word strings where word order may vary.

Use cases in entity resolution

Address matching
Multi-word string comparison
Set overlap analysis
Document deduplication

Don't implement this yourself

Tilores Studio runs the full matching engine β€” this algorithm plus configurable rules and real-time entity resolution β€” locally on your machine. Free, no account, no cloud. Load your own data and see it working in minutes.