Commit Graph

15 Commits

Author SHA1 Message Date
Tal Einat c873b767e7
fix Levenshtein ngrams search sometimes returning too short matches 2020-05-07 16:45:52 +03:00
Tal Einat a0d69f1e33
include the matched part of the sequence in Match objects 2020-01-14 15:40:39 +02:00
Tal Einat c89a1af061 first working version of find_near_matches_in_file() 2019-04-30 20:51:19 +03:00
Tal Einat 0ea72ee2b5 refactor: extract overlapping match consolidation from search funcs 2019-04-29 16:07:09 +03:00
Tal Einat 06b1744df5 use attrs for Match and LevenshteinSearchParams 2019-04-26 17:31:08 +03:00
Tal Einat a8219a04ab fix edge case: max dist >= sub-sequence length 2019-04-22 01:02:29 +03:00
Tal Einat 7893b54e8a implement _expand_short() in Cython 2018-12-06 23:06:52 +02:00
Tal Einat f8e19fdc31 finalize reworking of _expand(), including more test cases 2018-12-06 21:52:43 +02:00
Tal Einat d4d4c9d068 improve testing of Levenshtein-only search 2018-12-06 09:19:48 +02:00
Tal Einat 3b03a808f9 add tests with lists of words as sequences 2017-09-07 14:45:45 +03:00
Tal Einat 573bdf3934 remove find_near_matches() test which was testing the implementation method 2017-09-07 12:55:08 +03:00
Tal Einat 3ee959ddc2 stop using TestCase.assertListEqual(); use assertEqual() instead 2014-05-06 22:19:56 +03:00
Tal Einat ef4168edcc fixed test for Levenshtein search 2014-03-27 15:30:06 +02:00
Tal Einat 08655a4c3f added tests for levenshtein._expand() and changed test code to assertEqual(actual, expected) convention 2014-03-20 14:50:53 +02:00
Tal Einat c6930e2712 added finding near matches with only substitutions
also added a super-generic find_near_matches() utility function
2014-03-15 19:36:13 +02:00