Commit Graph

18 Commits

Author SHA1 Message Date
Max Bachmann 4444f3411f Fix Indel.normalized_similarity 2022-02-11 16:02:00 +01:00
Max Bachmann 5f0b08e8ab restructure documentation 2022-01-29 19:28:26 +01:00
Max Bachmann 5fc5ca7857 update documentation theme 2022-01-25 12:29:44 +01:00
Max Bachmann e6008d0a4f replace setuptools with scikit-build 2021-12-19 15:50:40 +01:00
Max Bachmann 3314798a7e fix crash in editops 2021-12-13 23:28:17 +01:00
Max Bachmann 460d291a38 improve performance and memory usage of editops 2021-12-11 19:54:14 +01:00
Max Bachmann df568cafba Add wheels for Python3.6 2021-11-19 09:15:20 +01:00
Max Bachmann c2c4516f7e Release v1.8.2 2021-10-27 14:07:37 +02:00
Max Bachmann 2422d8abba Fix incorrect editops results 2021-10-21 19:45:52 +02:00
Max Bachmann 1deba44d3b skip Python3.10 on MacOs for now 2021-10-20 12:43:10 +02:00
Max Bachmann 2accb88ef8 Release v1.7.1 2021-10-02 09:30:47 +02:00
Max Bachmann bbc4a74a3d Release v1.7.0 2021-09-24 01:18:13 +02:00
Max Bachmann e40580dd26 Release v1.6.2 2021-09-15 09:37:52 +02:00
Max Bachmann 8a293f1545 Release v1.6.1 2021-09-11 12:27:39 +02:00
Max Bachmann 1aed654d4f
improve performance of partial_ratio (#121) 2021-09-10 02:08:08 +02:00
Max Bachmann 52a91de4f8 Release v1.5.1 2021-09-01 01:49:05 +02:00
Max Bachmann 9562d4b69b add jaro_similarity/jaro_winkler_similarity to docs 2021-08-20 12:56:26 +02:00
Max Bachmann 375c13e436 Release v1.0.0 (#68)
- all normalized string_metrics can now be used as scorer for process.extract/extractOne
- Implementation of the C++ Wrapper completely refactored to make it easier to add more scorers, processors and string matching algorithms in the future.
- increased test coverage, that already helped to fix some bugs and help to prevent regressions in the future
- improved docstrings of functions

- Added bitparallel implementation of the Levenshtein distance for the weights (1,1,1) and (1,1,2).
- Added specialized implementation of the Levenshtein distance for cases with a small maximum edit distance, that is even faster, than the bitparallel implementation.
- Improved performance of `fuzz.partial_ratio`
-> Since `fuzz.ratio` and `fuzz.partial_ratio` are used in most scorers, this improves the overall performance.
- Improved performance of `process.extract` and `process.extractOne`

- the `rapidfuzz.levenshtein` module is now deprecated and will be removed in v2.0.0
  These functions are now placed in `rapidfuzz.string_metric`. `distance`, `normalized_distance`, `weighted_distance` and `weighted_normalized_distance` are combined into `levenshtein` and `normalized_levenshtein`.

- added normalized version of the hamming distance in `string_metric.normalized_hamming`
- process.extract_iter as a generator, that yields the similarity of all elements, that have a similarity >= score_cutoff

- multiple bugs in extractOne when used with a scorer, thats not from RapidFuzz
- fixed bug in `token_ratio`
- fixed bug in result normalisation causing zero division
2021-02-12 16:48:10 +01:00