Free Tool

Q-gram (N-gram) Similarity

Divide strings into substrings of length Q and compare them to determine similarity.

Tilores uses Q-gram similarity in production, so you can automate matching with rules you configure.

Download Tilores Studio (Free)

How it works

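Q-gram similarity (also known as N-gram similarity) splits each string into overlapping substrings of length Q (typically 2 or 3) and compares the resulting sets. For example, with Q = 2 the string "night" yields the Q-grams "ni", "ig", "gh", and "ht". The similarity is the number of Q-grams the two strings share relative to the total number of Q-grams they contain; the exact denominator depends on the variant, with the Jaccard form dividing by the union of both sets and the Sørensen-Dice form dividing twice the shared count by the sum of both set sizes. A single typo or character transposition changes only the few Q-grams that overlap the edited position, so the rest of the string still matches and the method tolerates small errors. It is commonly used as a pre-filtering step in entity resolution systems to quickly identify candidate matches before applying more expensive algorithms.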
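
The splitting and comparison described above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: it uses the Jaccard-style ratio (shared Q-grams divided by all distinct Q-grams across both strings), lowercases the input, and applies no padding. The function names `qgrams` and `qgram_similarity` are hypothetical, and the formula is not necessarily the one used by the calculator on this page or by Tilores.

```python
def qgrams(text: str, q: int = 2) -> set[str]:
    """Return the set of overlapping substrings of length q."""
    text = text.lower()  # Assumption: comparison is case-insensitive.
    if len(text) < q:
        # Assumption: a non-empty string shorter than q is its own single gram.
        return {text} if text else set()
    return {text[i:i + q] for i in range(len(text) - q + 1)}


def qgram_similarity(a: str, b: str, q: int = 2) -> float:
    """Shared q-grams divided by all distinct q-grams (Jaccard-style ratio)."""
    grams_a, grams_b = qgrams(a, q), qgrams(b, q)
    if not grams_a and not grams_b:
        return 1.0  # Assumption: two empty strings count as identical.
    return len(grams_a & grams_b) / len(grams_a | grams_b)


if __name__ == "__main__":
    # 5 shared bigrams out of 8 distinct bigrams.
    print(qgram_similarity("jonathan", "jonathon"))  # 0.625
```

A transposition such as "martha" versus "marhta" keeps only the bigrams "ma" and "ar" in common, so this sketch scores it 0.25. Different denominators, padding, or larger Q values will give different scores for the same pair.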

Use cases in entity resolution

Candidate pair generation

Pre-filtering for entity resolution

Approximate string matching

Plagiarism detection
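
To illustrate the candidate pair generation use case, the sketch below builds an inverted index from Q-grams to record positions and keeps only pairs that share a minimum number of Q-grams, so a more expensive comparison needs to run on far fewer pairs. It is an illustrative sketch, not how Tilores implements pre-filtering; the names `candidate_pairs` and `min_shared` and the threshold of 2 are assumptions made for the example.

```python
from collections import defaultdict
from itertools import combinations


def qgrams(text: str, q: int = 2) -> set[str]:
    """Return the set of overlapping lowercase substrings of length q."""
    text = text.lower()
    return {text[i:i + q] for i in range(len(text) - q + 1)}


def candidate_pairs(records: list[str], q: int = 2,
                    min_shared: int = 2) -> set[tuple[int, int]]:
    """Return index pairs of records sharing at least min_shared q-grams."""
    # Inverted index: q-gram -> positions of the records that contain it.
    index: dict[str, list[int]] = defaultdict(list)
    for record_id, text in enumerate(records):
        for gram in qgrams(text, q):
            index[gram].append(record_id)

    # Count shared q-grams only for records that co-occur under some gram.
    shared: dict[tuple[int, int], int] = defaultdict(int)
    for ids in index.values():
        for pair in combinations(ids, 2):
            shared[pair] += 1

    return {pair for pair, count in shared.items() if count >= min_shared}


if __name__ == "__main__":
    names = ["Jon Smith", "John Smith", "Alice Wong"]
    print(candidate_pairs(names))  # {(0, 1)}
```

Records that share no Q-gram never appear in the same index bucket, so they are never compared at all. Very common Q-grams produce large buckets, and real systems usually cap or skip those.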

Related tools

Cosine Similarity Calculator

Measure the cosine of the angle between two strings represented as vectors. Used in RAG, recommendation systems, and information retrieval.

Jaccard Similarity Coefficient

Compare the overlap between two sets of character bigrams: the size of the intersection divided by the size of the union.

Sørensen-Dice Coefficient

Don't implement this yourself

Tilores Studio runs the full matching engine — this algorithm plus configurable rules and real-time entity resolution — locally on your machine. Free, no account, no cloud. Load your own data and see it working in minutes.
