Commit Graph

26 Commits

Author SHA1 Message Date
Max Bachmann 4444f3411f Fix Indel.normalized_similarity 2022-02-11 16:02:00 +01:00
Max Bachmann f1017ab114 Activate MSVC 2022-02-05 23:30:47 +01:00
Max Bachmann 05fbfdad27 update to new capi 2022-01-29 17:07:02 +01:00
Max Bachmann d58ce6d66d fix performance regression in extractOne 2022-01-25 05:26:19 +01:00
Max Bachmann 0d97eec1a0 make editops/opcodes immutable for now 2022-01-24 22:59:29 +01:00
Max Bachmann 6ece2b94de update external libraries 2022-01-23 02:16:11 +01:00
Max Bachmann a861fc980a fix missing symbol 2022-01-20 07:14:29 +01:00
Max Bachmann dd26483b5f properly link to subprojects 2022-01-17 16:53:58 +01:00
Max Bachmann 143b394566 fix some bugs in Editops/Opcodes class 2022-01-15 23:46:33 +01:00
Max Bachmann 7fc3411646 fix incorrect src_pos/dest_pos 2022-01-15 21:34:16 +01:00
Max Bachmann ba3deb93f2 add reverse function to Editops/Opcodes 2022-01-15 20:55:40 +01:00
Max Bachmann 5b7dd075d0 add more functionality to Editops/Opcodes 2022-01-15 20:43:15 +01:00
Max Bachmann 39fe0f99a8 implement editops for lcs 2022-01-10 02:55:53 +01:00
Max Bachmann 27dc52e145 use improved algorithm for editops calculation 2022-01-09 20:20:22 +01:00
Max Bachmann 6f1ac40e2e change order of backtracing algorithm 2022-01-05 13:43:23 +01:00
Max Bachmann 2e489fde96 deprecate rapidfuzz.string_metric 2021-12-30 23:52:18 +01:00
Max Bachmann 1371bd93b1 standardize RapidFuzz C-Api 2021-12-30 14:11:53 +01:00
Max Bachmann 85af15c363 change error result of hamming/levenshtein 2021-12-19 15:38:10 +01:00
Max Bachmann 3314798a7e fix crash in editops 2021-12-13 23:28:17 +01:00
Max Bachmann 460d291a38 improve performance and memory usage of editops 2021-12-11 19:54:14 +01:00
Max Bachmann 7edf52150a fix manifest 2021-11-07 19:46:21 +01:00
Max Bachmann 0afb49d28f move submodules into common location 2021-11-06 20:29:21 +01:00
Max Bachmann 5383d286b2
Release v1.1.0 (#75)
## Changed
- string_metric.normalized_levenshtein now supports all weights
- when different weights are used for Insertion and Deletion, the strings are no longer swapped inside the Levenshtein implementation, so different weights for Insertion and Deletion are now supported.
- replace C++ implementation with a Cython implementation. This has the following advantages:
  - The implementation is less error prone, since a lot of the complex things are done by Cython
  - slightly faster than the current implementation (up to 10% for some parts)
  - about 33% smaller binary size
  - reduced compile time
- Added **kwargs argument to process.extract/extractOne/extract_iter that is passed to the scorer
- Add max argument to hamming distance
- Add support for whole Unicode range to utils.default_process

## Performance
- replaced Wagner-Fischer usage in the normal Levenshtein distance with a bit-parallel implementation
2021-02-21 19:42:36 +01:00
Max Bachmann d54fefc292
boost is no longer required for rapidfuzz-cpp 2020-10-09 09:40:26 +02:00
maxbachmann 789941dc40 replace difflib 2020-09-29 00:18:24 +02:00
maxbachmann 15c6dbb6fb
reduce string copies and tarball size 2020-05-22 13:28:38 +02:00