2010-07-13 20:15:08 +00:00
|
|
|
=========
|
|
|
|
jellyfish
|
|
|
|
=========
|
|
|
|
|
2015-05-28 23:28:37 +00:00
|
|
|
.. image:: https://travis-ci.org/jamesturk/jellyfish.svg?branch=master
|
|
|
|
:target: https://travis-ci.org/jamesturk/jellyfish
|
2014-07-15 19:29:44 +00:00
|
|
|
|
2015-05-28 23:28:37 +00:00
|
|
|
.. image:: https://coveralls.io/repos/jamesturk/jellyfish/badge.png?branch=master
|
|
|
|
:target: https://coveralls.io/r/jamesturk/jellyfish
|
2014-08-11 18:59:49 +00:00
|
|
|
|
2015-05-28 23:28:37 +00:00
|
|
|
.. image:: https://img.shields.io/pypi/v/jellyfish.svg
|
2014-07-15 19:29:44 +00:00
|
|
|
:target: https://pypi.python.org/pypi/jellyfish
|
|
|
|
|
2015-02-27 06:05:26 +00:00
|
|
|
.. image:: https://readthedocs.org/projects/jellyfish/badge/?version=latest
|
|
|
|
:target: https://readthedocs.org/projects/jellyfish/?badge=latest
|
|
|
|
:alt: Documentation Status
|
|
|
|
|
2015-06-15 19:02:13 +00:00
|
|
|
.. image:: https://ci.appveyor.com/api/projects/status/t5o03rqcusxhhe41/branch/master?svg=true
|
2015-04-22 23:55:12 +00:00
|
|
|
:target: https://ci.appveyor.com/project/jamesturk/jellyfish/
|
|
|
|
|
2010-07-13 20:15:08 +00:00
|
|
|
Jellyfish is a python library for doing approximate and phonetic matching of strings.
|
|
|
|
|
2015-06-15 19:02:13 +00:00
|
|
|
Written by James Turk <james.p.turk@gmail.com> and Michael Stephens.
|
2010-07-13 20:15:08 +00:00
|
|
|
|
2015-05-28 23:28:37 +00:00
|
|
|
See https://github.com/jamesturk/jellyfish/graphs/contributors for contributors.
|
2012-01-26 16:13:53 +00:00
|
|
|
|
2015-05-28 23:28:37 +00:00
|
|
|
Source is available at http://github.com/jamesturk/jellyfish.
|
2010-07-13 20:15:08 +00:00
|
|
|
|
|
|
|
Included Algorithms
|
|
|
|
===================
|
|
|
|
|
|
|
|
String comparison:
|
|
|
|
|
|
|
|
* Levenshtein Distance
|
|
|
|
* Damerau-Levenshtein Distance
|
|
|
|
* Jaro Distance
|
|
|
|
* Jaro-Winkler Distance
|
|
|
|
* Match Rating Approach Comparison
|
|
|
|
* Hamming Distance
|
|
|
|
|
|
|
|
Phonetic encoding:
|
|
|
|
|
|
|
|
* American Soundex
|
|
|
|
* Metaphone
|
|
|
|
* NYSIIS (New York State Identification and Intelligence System)
|
|
|
|
* Match Rating Codex
|
|
|
|
|
|
|
|
Example Usage
|
|
|
|
=============
|
|
|
|
|
|
|
|
>>> import jellyfish
|
|
|
|
>>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish')
|
|
|
|
2
|
|
|
|
>>> jellyfish.jaro_distance('jellyfish', 'smellyfish')
|
|
|
|
0.89629629629629637
|
|
|
|
>>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')
|
|
|
|
1
|
|
|
|
|
|
|
|
>>> jellyfish.metaphone('Jellyfish')
|
|
|
|
'JLFX'
|
|
|
|
>>> jellyfish.soundex('Jellyfish')
|
|
|
|
'J412'
|
|
|
|
>>> jellyfish.nysiis('Jellyfish')
|
|
|
|
'JALYF'
|
|
|
|
>>> jellyfish.match_rating_codex('Jellyfish')
|
2012-01-26 16:13:53 +00:00
|
|
|
'JLLFSH'
|