Jaro Similarity
Calculate the Jaro similarity between two strings — a foundational metric in record linkage and deduplication.
How it works
Jaro similarity scores how alike two strings are, based on how many characters they share and whether those characters appear in the same order. Two characters count as a match only if they are equal and sit at nearby positions in the two strings. Matched characters that appear in a different order are counted as transpositions, which lower the score. The result ranges from 0 to 1, where 0 means the strings share no matching characters and 1 means they are identical. Jaro similarity is the basis for the Jaro-Winkler metric and is widely used in census data processing and medical record linkage.
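To make the description above concrete, here is a minimal Python sketch of the textbook Jaro formula. It is an illustration only, not the code behind this page or any Tilores API. The function name jaro_similarity is hypothetical, and the match window of floor(max(len1, len2) / 2) - 1 follows the commonly cited definition; some implementations differ in edge cases such as empty strings.

```python
def jaro_similarity(s1: str, s2: str) -> float:
    """Textbook Jaro similarity in [0, 1]; illustrative sketch only."""
    if s1 == s2:
        return 1.0  # assumption: identical strings (including two empty ones) score 1
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Characters match only if equal and no further apart than this window.
    window = max(max(len1, len2) // 2 - 1, 0)

    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        start = max(0, i - window)
        end = min(len2, i + window + 1)
        for j in range(start, end):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break

    if matches == 0:
        return 0.0

    # Walk the matched characters of both strings in order and count
    # positions where they disagree; half of that is the transposition count.
    out_of_order = 0
    k = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    transpositions = out_of_order // 2

    return (
        matches / len1
        + matches / len2
        + (matches - transpositions) / matches
    ) / 3


print(round(jaro_similarity("MARTHA", "MARHTA"), 4))   # 0.9444
print(round(jaro_similarity("DIXON", "DICKSONX"), 4))  # 0.7667
```

In the first example all six characters match and the swapped "TH"/"HT" pair counts as one transposition, giving (1 + 1 + 5/6) / 3. In the second, "X" falls outside the match window, so only four characters match.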
Use cases in entity resolution
Don't implement this yourself
Tilores combines all these algorithms with configurable matching rules and production-ready infrastructure. Skip the implementation — see it working with your own data.