Q-gram (N-gram) Similarity
Divide strings into substrings of length Q and compare them to determine similarity.
Tilores uses Q-gram similarity in production β so you can automate matching with rules you configure.
Try it yourself
How it works
Q-gram similarity (also known as N-gram similarity) works by splitting each string into overlapping substrings of length Q (typically 2 or 3) and comparing the resulting sets. The similarity is the ratio of shared Q-grams to total Q-grams. This method is robust against character transpositions and is commonly used as a pre-filtering step in entity resolution systems to quickly identify candidate matches before applying more expensive algorithms.
Use cases in entity resolution
Related tools
Don't implement this yourself
Tilores Studio runs the full matching engine β this algorithm plus configurable rules and real-time entity resolution β locally on your machine. Free, no account, no cloud. Load your own data and see it working in minutes.