πŸ‡»πŸ‡ͺ Venezuela Te Busca: deduplicamos el registro de personas desaparecidas. Ayuda gratuita para reunir familias.Saber mΓ‘s

← All Tools
Free Tool

Q-gram (N-gram) Similarity

Divide strings into substrings of length Q and compare them to determine similarity.

Tilores uses Q-gram similarity in production β€” so you can automate matching with rules you configure.

Try it yourself

Similarity Score
0.5000
No match

How it works

Q-gram similarity (also known as N-gram similarity) works by splitting each string into overlapping substrings of length Q (typically 2 or 3) and comparing the resulting sets. The similarity is the ratio of shared Q-grams to total Q-grams. This method is robust against character transpositions and is commonly used as a pre-filtering step in entity resolution systems to quickly identify candidate matches before applying more expensive algorithms.

Use cases in entity resolution

Candidate pair generation
Pre-filtering for entity resolution
Approximate string matching
Plagiarism detection

Don't implement this yourself

Tilores Studio runs the full matching engine β€” this algorithm plus configurable rules and real-time entity resolution β€” locally on your machine. Free, no account, no cloud. Load your own data and see it working in minutes.