Commit Graph

12 Commits

Author SHA1 Message Date
Max Bachmann 5ecd72eb39 extend duration 2021-09-24 02:24:06 +02:00
Max Bachmann d0ec89e9f9
cleanup cdist implementation 2021-09-23 22:41:10 +02:00
Max Bachmann a9e7bd703f
add back legacy python support (#122) 2021-09-11 12:25:31 +02:00
Max Bachmann 56f062b063 add cdist implementation 2021-09-10 13:37:40 +02:00
Max Bachmann 1aed654d4f
improve performance of partial_ratio (#121) 2021-09-10 02:08:08 +02:00
Max Bachmann 9fd6d08655 add levenshtein_editops 2021-08-21 03:08:50 +02:00
Max Bachmann e3e04da293 use keyword only arguments 2021-08-19 23:02:00 +02:00
Max Bachmann 05f907bf2b
add distance support to process.*
## Changed
- added processor support to `levenshtein` and `hamming`
- added distance support to extract/extractOne/extract_iter

## Fixes
- incorrect results of `normalized_hamming` and `normalized_levenshtein` when used with `utils.default_process` as processor
2021-03-29 19:09:22 +02:00
Max Bachmann 853681f7cf fix bug in mbleven implementation 2021-03-20 12:04:12 +01:00
Max Bachmann 2d80120b21 add more benchmarks to documentation 2021-03-07 17:50:39 +01:00
Max Bachmann 5383d286b2
Release v1.1.0 (#75)
## Changed
- string_metric.normalized_levenshtein supports now all weights
- when different weights are used for Insertion and Deletion the strings can not be swapped inside the Levenshtein implementation anymore. So different weights for Insertion and Deletion are now supported.
- replace C++ implementation with a Cython implementation. This has the following advantages:
  - The implementation is less error prone, since a lot of the complex things are done by Cython
  - slighly faster than the current implementation (up to 10% for some parts)
  - about 33% smaller binary size
  - reduced compile time
- Added **kwargs argument to process.extract/extractOne/extract_iter that is passed to the scorer
- Add max argument to hamming distance
- Add support for whole Unicode range to utils.default_process

## Performance
- replaced Wagner Fischer usage in the normal Levenshtein distance with a bitparallel implementation
2021-02-21 19:42:36 +01:00
Max Bachmann 426fbb24e9
implement process.extractOne in C++ (#53)
* start to simplify complexion

* start implementation

* add extractOne to C++

* fix a couple of bugs in the implementation

* start adressing performance issues
2020-11-15 20:18:46 +01:00