Commit Graph

57 Commits

Author SHA1 Message Date
Max Bachmann 426fbb24e9
implement process.extractOne in C++ (#53)
* start to simplify complexion

* start implementation

* add extractOne to C++

* fix a couple of bugs in the implementation

* start addressing performance issues
2020-11-15 20:18:46 +01:00
Max Bachmann b3af7641a4 fix documentation of process.extractOne 2020-10-26 18:46:56 +01:00
Max Bachmann 9b64ad2fee
add wheels for Python2.7 on Windows (#47) 2020-10-22 05:54:39 +02:00
Max Bachmann 06d4484d8a
increment version 2020-10-09 10:06:01 +02:00
maxbachmann 865fbf0d8a
fix inconsistency towards fuzzywuzzy 2020-10-01 22:42:58 +02:00
maxbachmann 82e77dbb41
reduce template complexity for msvc 2020-09-30 18:02:34 +02:00
maxbachmann 789941dc40 replace difflib 2020-09-29 00:18:24 +02:00
maxbachmann 13a828ce1b
fix partial_token_set_ratio returning 100 all the time 2020-09-22 18:24:34 +02:00
maxbachmann 588f73c2ef
add version, author and license to __init__.py 2020-09-12 18:29:16 +02:00
maxbachmann 6efaf59dc1
do not auto junk in partial ratio 2020-09-01 02:46:38 +02:00
maxbachmann 10946dfac0 add python 2.7 support 2020-08-22 23:06:05 +02:00
maxbachmann 3d2cfe8b4a update rapidfuzz-cpp and add support for tuple scores in processors 2020-08-14 11:45:07 +02:00
maxbachmann eae941a647
further reduce tarball size 2020-06-27 12:49:32 +02:00
maxbachmann 293caa1242
fix inconsistency from #32 2020-05-24 08:19:28 +02:00
maxbachmann 3137df9e96
remove boost::optional dependency 2020-05-22 14:38:13 +02:00
maxbachmann f0f8247d02
allow any object with items 2020-05-21 08:39:13 +02:00
maxbachmann a4bfbeb2f5
exit early when exact match was found 2020-05-19 18:31:24 +02:00
maxbachmann 46cf20aa4e
remove intermediate python function to improve performance 2020-05-12 08:56:28 +02:00
maxbachmann 3e7c410c44
use common interface for all fuzzy ratios 2020-05-07 14:40:37 +02:00
maxbachmann d5995e2f18
help pylint find members better 2020-05-07 11:07:22 +02:00
maxbachmann 3121457e42
exit early in token_sort_ratio 2020-04-29 13:32:11 +02:00
maxbachmann 8f596ae7c7
manually check punctuation for #29 2020-04-28 09:27:56 +02:00
maxbachmann 96b5660720
increment version 2020-04-24 20:31:43 +02:00
maxbachmann 7fa6c88a3d
support choice: match_choice dict 2020-04-22 19:00:46 +02:00
maxbachmann 438be93cb6
fix #25 2020-04-16 10:13:01 +02:00
maxbachmann e4006839fc
add missing files to tarball 2020-04-15 23:17:35 +02:00
maxbachmann 66fbf1f574
increment version 2020-04-13 10:31:07 +02:00
maxbachmann cc872fdd3a
sort by largest 2020-04-09 16:26:08 +02:00
maxbachmann 044fd229a9
fix performance degradation and use the same interface everywhere 2020-04-09 09:32:29 +02:00
maxbachmann f0adc8da49
start fixing performance issues 2020-04-09 00:32:38 +02:00
maxbachmann b4509b8cd1
add extractIndices 2020-04-08 18:50:26 +02:00
maxbachmann 92820476f1
fix preprocessor for cyrillic 2020-04-07 18:28:52 +02:00
maxbachmann 99bb5f4328
fix a couple of bugs 2020-04-05 17:53:56 +02:00
maxbachmann a9c4865b83
replace non alphanumeric characters 2020-04-05 13:16:41 +02:00
maxbachmann 72763a1acf
fix msvc build error 2020-04-05 03:12:43 +02:00
maxbachmann 4da4234f73
fix string view usage 2020-04-05 02:48:44 +02:00
maxbachmann c74480ca60
fix some issues with msvc 2015 2020-04-04 13:15:25 +02:00
maxbachmann ab8e98bc2d
add missing files to tarball 2020-04-04 06:51:45 +02:00
maxbachmann 839e19a359
release 0.6.0 2020-04-04 06:15:37 +02:00
maxbachmann 69e27b0ca4
fix another bug with embedded null characters in strings 2020-04-03 02:33:07 +02:00
maxbachmann 485b68d841
fix a bug extracting the args and kwargs from the C API 2020-04-03 00:18:03 +02:00
maxbachmann 6cd0f94e06
fix windows build for conda 2020-04-01 14:49:22 +02:00
maxbachmann 1715bbac44
implement extract method and cleanup the codebase 2020-04-01 13:35:40 +02:00
maxbachmann c225979db3 release version 0.4
- levenshtein module that provides access to weighted distance and normalized weighted distance

- fix extract returning a list of tuples of empty string and 0 when not enough elements had a ratio
  bigger than score_cutoff
- fix wrong sorting of results causing wrong results to be removed in extract
2020-03-29 20:04:30 +02:00
maxbachmann ab0b5729b5
Improve CI integration and add some aliases (#10) 2020-03-29 16:36:13 +02:00
maxbachmann f94d54a1f4
move default_process into python 2020-03-27 15:37:26 +01:00
maxbachmann 0b81415484
release version 0.3.0
- when using score_cutoff, the result is now guaranteed to be 0 when result < score_cutoff
- each function now has a preprocess argument to specify whether strings should be preprocessed
- the default preprocessing now lower-cases and trims
- extract and extractOne now accept a custom processor method instead of utils.default_processor

- QRatio was removed since it now does exactly the same as the normal ratio
2020-03-26 17:23:06 +01:00
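The behaviour described in the 0.3.0 notes above can be illustrated with a small, self-contained sketch. This is not the library's implementation: the names `default_process`, `ratio` and `extract_one` below are hypothetical stand-ins, and Python's `difflib.SequenceMatcher` is used as an assumed placeholder for the real scorer. It only shows the score_cutoff guarantee, the preprocess flag, and a custom processor.

```python
from difflib import SequenceMatcher


def default_process(s):
    """Lower-case and trim, mirroring the default preprocessing in the notes."""
    return s.lower().strip()


def ratio(s1, s2, score_cutoff=0, preprocess=True):
    """Hypothetical scorer: similarity in [0, 100], or 0 when below score_cutoff."""
    if preprocess:
        s1, s2 = default_process(s1), default_process(s2)
    # Assumption: SequenceMatcher stands in for the real similarity metric.
    score = 100 * SequenceMatcher(None, s1, s2).ratio()
    return score if score >= score_cutoff else 0


def extract_one(query, choices, scorer=ratio, processor=default_process,
                score_cutoff=0):
    """Return (choice, score) for the best match, or None if none reaches the cutoff."""
    processed_query = processor(query) if processor else query
    best = None
    for choice in choices:
        processed_choice = processor(choice) if processor else choice
        # Strings are already processed here, so the scorer must not repeat it.
        score = scorer(processed_query, processed_choice,
                       score_cutoff=score_cutoff, preprocess=False)
        if score >= score_cutoff and (best is None or score > best[1]):
            best = (choice, score)
    return best
```

For example, `ratio("Hello", "hello", preprocess=False, score_cutoff=90)` yields 0 in this sketch because the raw strings differ in case, while the same call with preprocessing enabled yields 100.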
maxbachmann 7d564d2bab
add scorer argument to provide custom scorer function 2020-03-24 15:25:46 +01:00
maxbachmann e1b5f323b8
add license to package 2020-03-23 13:10:37 +01:00
maxbachmann 510a0f190e
automatically build python Wheels (#5) 2020-03-22 23:12:51 +01:00