Tal Einat
|
c873b767e7
|
fix Levenshtein ngrams search sometimes returning too short matches
|
2020-05-07 16:45:52 +03:00 |
Tal Einat
|
a0d69f1e33
|
include the matched part of the sequence in Match objects
|
2020-01-14 15:40:39 +02:00 |
Tal Einat
|
c89a1af061
|
first working version of find_near_matches_in_file()
|
2019-04-30 20:51:19 +03:00 |
Tal Einat
|
0ea72ee2b5
|
refactor: extract overlapping match consolidation from search funcs
|
2019-04-29 16:07:09 +03:00 |
Tal Einat
|
06b1744df5
|
use attrs for Match and LevenshteinSearchParams
|
2019-04-26 17:31:08 +03:00 |
Tal Einat
|
a8219a04ab
|
fix edge case: max dist >= sub-sequence length
|
2019-04-22 01:02:29 +03:00 |
Tal Einat
|
7893b54e8a
|
implement _expand_short() in Cython
|
2018-12-06 23:06:52 +02:00 |
Tal Einat
|
f8e19fdc31
|
finalize reworking of _expand(), including more test cases
|
2018-12-06 21:52:43 +02:00 |
Tal Einat
|
d4d4c9d068
|
improve testing of Levenshtein-only search
|
2018-12-06 09:19:48 +02:00 |
Tal Einat
|
3b03a808f9
|
add tests with lists of words as sequences
|
2017-09-07 14:45:45 +03:00 |
Tal Einat
|
573bdf3934
|
remove find_near_matches() test which was testing the implementation method
|
2017-09-07 12:55:08 +03:00 |
Tal Einat
|
3ee959ddc2
|
stop using TestCase.assertListEqual(); use assertEqual() instead
|
2014-05-06 22:19:56 +03:00 |
Tal Einat
|
ef4168edcc
|
fixed test for Levenshtein search
|
2014-03-27 15:30:06 +02:00 |
Tal Einat
|
08655a4c3f
|
added tests for levenshtein._expand() and changed test code to assertEqual(actual, expected) convention
|
2014-03-20 14:50:53 +02:00 |
Tal Einat
|
c6930e2712
|
added finding near matches with only substitutions
also added a super-generic find_near_matches() utility function
|
2014-03-15 19:36:13 +02:00 |