Commit Graph

48 Commits

Author SHA1 Message Date
Max Bachmann 4444f3411f Fix Indel.normalized_similarity 2022-02-11 16:02:00 +01:00
Max Bachmann 0f23bdbe5e Finalize v2.0.0 2022-02-09 01:06:31 +01:00
Max Bachmann 6b79b02f1e update docstrings 2022-02-03 20:36:43 +01:00
Max Bachmann 5f0b08e8ab restructure documentation 2022-01-29 19:28:26 +01:00
Max Bachmann 87edcd8026 fix image location 2022-01-26 16:24:31 +01:00
Max Bachmann 1eeb085733 add missing normalized metrics 2022-01-25 12:59:46 +01:00
Max Bachmann 5fc5ca7857 update documentation theme 2022-01-25 12:29:44 +01:00
Max Bachmann e6008d0a4f replace setuptools with scikit-build 2021-12-19 15:50:40 +01:00
Max Bachmann 3314798a7e fix crash in editops 2021-12-13 23:28:17 +01:00
Max Bachmann 460d291a38 improve performance and memory usage of editops 2021-12-11 19:54:14 +01:00
Max Bachmann df568cafba Add wheels for Python3.6 2021-11-19 09:15:20 +01:00
Max Bachmann c2c4516f7e Release v1.8.2 2021-10-27 14:07:37 +02:00
Max Bachmann 2422d8abba Fix incorrect editops results 2021-10-21 19:45:52 +02:00
Max Bachmann 1deba44d3b skip Python3.10 on MacOs for now 2021-10-20 12:43:10 +02:00
Max Bachmann 2accb88ef8 Release v1.7.1 2021-10-02 09:30:47 +02:00
Max Bachmann a90d6a736b add multiprocessing to cdist 2021-09-26 21:44:23 +02:00
Max Bachmann bbc4a74a3d Release v1.7.0 2021-09-24 01:18:13 +02:00
Max Bachmann e40580dd26 Release v1.6.2 2021-09-15 09:37:52 +02:00
Max Bachmann f25fe290f7 move cdist to separate module 2021-09-15 03:50:30 +02:00
Max Bachmann 8a293f1545 Release v1.6.1 2021-09-11 12:27:39 +02:00
Max Bachmann 56f062b063 add cdist implementation 2021-09-10 13:37:40 +02:00
Max Bachmann 1aed654d4f improve performance of partial_ratio (#121) 2021-09-10 02:08:08 +02:00
Max Bachmann 52a91de4f8 Release v1.5.1 2021-09-01 01:49:05 +02:00
Max Bachmann c41abbfe1c Update documentation to clone submodule 2021-08-31 13:51:37 +02:00
Max Bachmann 9fd6d08655 add levenshtein_editops 2021-08-21 03:08:50 +02:00
Max Bachmann 9562d4b69b add jaro_similarity/jaro_winkler_similarity to docs 2021-08-20 12:56:26 +02:00
Max Bachmann 3e1776ccd4 update benchmarks 2021-03-23 07:50:04 +01:00
Max Bachmann e124f4f32e improve performance of Levenshtein distance 2021-03-08 01:15:13 +01:00
Max Bachmann 2d80120b21 add more benchmarks to documentation 2021-03-07 17:50:39 +01:00
Max Bachmann 53b8e3bd61 update build mechanism 2021-03-07 17:45:24 +01:00
Max Bachmann 0e6466d835 rename master branch to main 2021-02-14 15:00:57 +01:00
Max Bachmann 375c13e436 Release v1.0.0 (#68)
- all normalized string_metrics can now be used as scorer for process.extract/extractOne
- Implementation of the C++ Wrapper completely refactored to make it easier to add more scorers, processors and string matching algorithms in the future.
- increased test coverage, which already helped to fix some bugs and will help to prevent regressions in the future
- improved docstrings of functions

- Added bitparallel implementation of the Levenshtein distance for the weights (1,1,1) and (1,1,2).
- Added specialized implementation of the Levenshtein distance for cases with a small maximum edit distance, which is even faster than the bitparallel implementation.
- Improved performance of `fuzz.partial_ratio`
-> Since `fuzz.ratio` and `fuzz.partial_ratio` are used in most scorers, this improves the overall performance.
- Improved performance of `process.extract` and `process.extractOne`

- the `rapidfuzz.levenshtein` module is now deprecated and will be removed in v2.0.0
  These functions are now placed in `rapidfuzz.string_metric`. `distance`, `normalized_distance`, `weighted_distance` and `weighted_normalized_distance` are combined into `levenshtein` and `normalized_levenshtein`.

- added normalized version of the hamming distance in `string_metric.normalized_hamming`
- added `process.extract_iter` as a generator that yields the similarity of all elements that have a similarity >= score_cutoff

- fixed multiple bugs in extractOne when used with a scorer that's not from RapidFuzz
- fixed bug in `token_ratio`
- fixed bug in result normalisation causing zero division
2021-02-12 16:48:10 +01:00
maxbachmann 91694448cf update return types of processors 2020-11-16 17:40:31 +01:00
Max Bachmann b3af7641a4 fix documentation of process.extractOne 2020-10-26 18:46:56 +01:00
Max Bachmann a0a84f99c4 update installation guide for the C++ version
The installation is now possible using cmake
2020-10-12 01:52:10 +02:00
maxbachmann 3712ba4a87 update installation guide 2020-09-30 13:25:31 +02:00
maxbachmann 3d2cfe8b4a update rapidfuzz-cpp and add support for tuple scores in processors 2020-08-14 11:45:07 +02:00
maxbachmann 303fcd1118 separate documentation for fuzz and process 2020-05-30 22:41:35 +02:00
TrigonaMinima 00c4a95479 document: default processor in fuzz.*ratio functions 2020-05-31 00:52:42 +05:30
TrigonaMinima 68ce218958 document: common desc for fuzz.*ratio functions + python/c++ tabs 2020-05-30 20:34:19 +05:30
TrigonaMinima 5ce2172b13 document: parameters and return values for process.extract* functions 2020-05-30 12:08:04 +05:30
TrigonaMinima d5a7dd6e75 document: parameters and return values for fuzz.*ratio functions 2020-05-30 11:59:56 +05:30
Max Bachmann eab5dcb518 update failure message 2020-05-27 16:25:20 +02:00
maxbachmann bbf2de840e add documentation 2020-05-27 14:16:12 +02:00
maxbachmann aaa3b34948 add tabs 2020-05-27 09:52:31 +02:00
Max Bachmann a2b4dd3b5e Merge pull request #31 from TrigonaMinima/docs
Read the Docs Setup
2020-05-27 02:34:55 -04:00
maxbachmann 1a1be29bd7 add first flowchart 2020-05-13 21:27:08 +02:00
TrigonaMinima f21b6e58dc Read the Docs setup + Basic documentation files 2020-05-10 01:38:40 +05:30