Commit Graph

12 Commits

Author SHA1 Message Date
Max Bachmann 375c13e436 Release v1.0.0 (#68)
- all normalized string metrics can now be used as a scorer for `process.extract`/`process.extractOne`
- completely refactored the implementation of the C++ wrapper to make it easier to add more scorers, processors and string matching algorithms in the future
- increased test coverage, which has already helped to fix some bugs and will help to prevent regressions in the future
- improved docstrings of functions

- Added bitparallel implementation of the Levenshtein distance for the weights (1,1,1) and (1,1,2).
- Added a specialized implementation of the Levenshtein distance for cases with a small maximum edit distance, which is even faster than the bitparallel implementation.
- Improved performance of `fuzz.partial_ratio`
-> Since `fuzz.ratio` and `fuzz.partial_ratio` are used in most scorers, this improves the overall performance.
- Improved performance of `process.extract` and `process.extractOne`
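
To illustrate what a bitparallel Levenshtein implementation for the uniform weights (1,1,1) can look like, here is a minimal pure-Python sketch in the style of the Myers/Hyyrö algorithm. It is not the library's C++ code: the function name `levenshtein_bitparallel` is hypothetical, and a single arbitrary-precision Python integer stands in for the machine words a real implementation would use.

```python
def levenshtein_bitparallel(s1, s2):
    """Uniform-cost Levenshtein distance, one DP column encoded as bit vectors."""
    m = len(s1)
    if m == 0:
        return len(s2)

    # For each character of s1, a bitmask of the positions where it occurs.
    peq = {}
    for i, ch in enumerate(s1):
        peq[ch] = peq.get(ch, 0) | (1 << i)

    mask = (1 << m) - 1   # keeps every vector at exactly m bits
    last = 1 << (m - 1)   # bit belonging to the last row of the DP matrix
    vp, vn = mask, 0      # vertical +1 / -1 deltas of the current column
    dist = m              # distance between s1 and the empty prefix of s2

    for ch in s2:
        pm = peq.get(ch, 0)
        # Cells whose diagonal delta is zero.
        d0 = ((((pm & vp) + vp) ^ vp) | pm | vn) & mask
        hp = (vn | ~(d0 | vp)) & mask   # horizontal +1 deltas
        hn = d0 & vp                    # horizontal -1 deltas
        # The last row's horizontal delta updates the running distance.
        if hp & last:
            dist += 1
        if hn & last:
            dist -= 1
        hp = ((hp << 1) | 1) & mask     # row 0 always grows by one
        hn = (hn << 1) & mask
        vp = (hn | ~(d0 | hp)) & mask
        vn = hp & d0
    return dist
```

Each character of `s2` is processed with a constant number of bitwise operations instead of a full pass over `s1`, which is where the speedup over the classic matrix-filling approach comes from.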

- the `rapidfuzz.levenshtein` module is now deprecated and will be removed in v2.0.0
  Its functions have moved to `rapidfuzz.string_metric`, where `distance`, `normalized_distance`, `weighted_distance` and `weighted_normalized_distance` are combined into `levenshtein` and `normalized_levenshtein`.
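
As a rough illustration of how one function with a weights parameter can cover both the plain and the weighted distance, here is a minimal dynamic-programming sketch. The name `weighted_levenshtein` is hypothetical, and the weight order (insertion, deletion, substitution) is an assumption made for this example, not something stated in this commit message.

```python
def weighted_levenshtein(s1, s2, weights=(1, 1, 1)):
    """Edit distance from s1 to s2 with per-operation costs (sketch only)."""
    insert_cost, delete_cost, replace_cost = weights

    # previous[j] holds the cost of turning the current prefix of s1 into s2[:j].
    previous = [j * insert_cost for j in range(len(s2) + 1)]
    for i, ch1 in enumerate(s1, start=1):
        current = [i * delete_cost]
        for j, ch2 in enumerate(s2, start=1):
            substitute = previous[j - 1] + (0 if ch1 == ch2 else replace_cost)
            current.append(min(
                previous[j] + delete_cost,      # drop ch1
                current[j - 1] + insert_cost,   # add ch2
                substitute,                     # keep or replace
            ))
        previous = current
    return previous[-1]
```

With `weights=(1, 1, 1)` this behaves like the plain distance. With `weights=(1, 1, 2)` a substitution costs as much as a deletion plus an insertion, so `weighted_levenshtein("kitten", "sitting", (1, 1, 2))` evaluates to 5 rather than 3.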

- added normalized version of the hamming distance in `string_metric.normalized_hamming`
- added `process.extract_iter` as a generator that yields the similarity of all elements that have a similarity >= score_cutoff
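
The two additions above can be sketched in plain Python. Everything here is illustrative: the names `normalized_hamming_sketch` and `extract_iter_sketch` are hypothetical, and the 0 to 100 similarity scale and the yielded `(choice, score, index)` tuples are assumptions, not details taken from this commit message.

```python
def normalized_hamming_sketch(s1, s2):
    """Similarity in [0, 100] based on the Hamming distance (assumed scale)."""
    if len(s1) != len(s2):
        raise ValueError("Hamming distance needs sequences of equal length")
    if not s1:
        return 100.0
    # Count the positions at which the two strings differ.
    mismatches = sum(a != b for a, b in zip(s1, s2))
    return 100.0 - 100.0 * mismatches / len(s1)


def extract_iter_sketch(query, choices, scorer, score_cutoff=0):
    """Lazily yield (choice, score, index) for choices scoring >= score_cutoff."""
    for index, choice in enumerate(choices):
        score = scorer(query, choice)
        if score >= score_cutoff:
            yield choice, score, index
```

Because the second function is a generator, results are produced one at a time, so a caller can stop early without scoring the remaining choices.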

- fixed multiple bugs in `extractOne` when used with a scorer that's not from RapidFuzz
- fixed bug in `token_ratio`
- fixed bug in result normalisation causing zero division
2021-02-12 16:48:10 +01:00
maxbachmann 789941dc40 replace difflib 2020-09-29 00:18:24 +02:00
maxbachmann 10946dfac0 add python 2.7 support 2020-08-22 23:06:05 +02:00
maxbachmann eae941a647 further reduce tarball size 2020-06-27 12:49:32 +02:00
maxbachmann cceb7cb2ea do not include boost 2020-05-22 18:26:43 +02:00
maxbachmann 3137df9e96 remove boost::optional dependency 2020-05-22 14:38:13 +02:00
maxbachmann 15c6dbb6fb reduce string copies and tarball size 2020-05-22 13:28:38 +02:00
maxbachmann e4006839fc add missing files to tarball 2020-04-15 23:17:35 +02:00
maxbachmann 4da4234f73 fix string view usage 2020-04-05 02:48:44 +02:00
maxbachmann ab8e98bc2d add missing files to tarball 2020-04-04 06:51:45 +02:00
maxbachmann e1b5f323b8 add license to package 2020-03-23 13:10:37 +01:00
maxbachmann e157e11fa7 complete basic implementation of rapidfuzz 2020-03-18 21:34:32 +01:00