String similarity metrics in Python [duplicate]

I realize it’s not the same thing, but this is close enough:

>>> import difflib
>>> a="Hello, All you people"
>>> b = 'hello, all You peopl'
>>> seq=difflib.SequenceMatcher(a=a.lower(), b=b.lower())
>>> seq.ratio()
0.97560975609756095

You can make this as a function

def similar(seq1, seq2):
    return difflib.SequenceMatcher(a=seq1.lower(), b=seq2.lower()).ratio() > 0.9

>>> similar(a, b)
True
>>> similar('Hello, world', 'Hi, world')
False

More Related Contents:

Find common substring between two strings
Is the time-complexity of iterative string append actually O(n^2), or O(n)?
Find longest repetitive sequence in a string
Python string ‘in’ operator implementation algorithm and time complexity
When splitting an empty string in Python, why does split() return an empty list while split(‘\n’) returns [”]?
How to test if one string is a subsequence of another? [duplicate]
Python: find closest string (from a list) to another string
How can I interleave or create unique permutations of two strings (without recursion)
How to handle special cases in my Python code?
How to extract the substring between two markers?
Remove unwanted parts from strings in a column
What is the difference between a string and a byte string?
Good Python modules for fuzzy string comparison? [closed]
Python regex match OR operator
How to extract an IP address from an HTML string?
Regex for existence of some words whose order doesn’t matter
How would I get everything before a : in a string Python
What is the simplest way to swap each pair of adjoining chars in a string with Python?
Why python’s list slicing doesn’t produce index out of bound error? [duplicate]
How to explain the str.maketrans function in Python 3.6?
How to remove symbols from a string with Python? [duplicate]
Detecting Vowels vs Consonants In Python [duplicate]
Python: Choose random line from file, then delete that line
Print without b’ prefix for bytes in Python 3
How to check a string for a special character?
Python: Best Way to remove duplicate character from string
Python: powerset of a given set with generators [duplicate]
Print “\n” or newline characters as part of the output on terminal
Most Efficient Way to Find Whether a Large List Contains a Specific String (Python)
Why do I get “TypeError: not all arguments converted during string formatting” trying to substitute a placeholder like {0} using %?

More Related Contents:

Leave a Comment Cancel reply