Commit Graph

31 Commits

Author SHA1 Message Date
maxbachmann 0c7ee10415
replace std::wstring_view with boost::wstring_view to add C++14 support 2020-04-03 14:38:34 +02:00
maxbachmann c16ca9d8ea
add bitmap based ratio to improve performance 2020-04-03 14:06:23 +02:00
maxbachmann eae6f41e14
fix TypeError on embedded null characters (#19) 2020-04-03 02:05:16 +02:00
maxbachmann 6cd0f94e06
fix windows build for conda 2020-04-01 14:49:22 +02:00
maxbachmann 1715bbac44
implement extract method and cleanup the codebase 2020-04-01 13:35:40 +02:00
maxbachmann 84e7b2283a
adjust ci build 2020-04-01 00:39:46 +02:00
maxbachmann 18528aed03
implement extractOne using C API 2020-04-01 00:14:56 +02:00
maxbachmann 7ee4808cf9
start replacing pybind11 with the python C API 2020-03-31 19:42:48 +02:00
maxbachmann 028db547d1
reduce template usage to a minimum 2020-03-31 15:16:03 +02:00
maxbachmann d665e3b961
start some cleanup 2020-03-31 12:32:53 +02:00
maxbachmann d00462cd81
add custome weights for levenshtein 2020-03-31 11:47:18 +02:00
maxbachmann 996cac3417
improve runtime for named arguments 2020-03-31 08:51:42 +02:00
maxbachmann c225979db3 release version 0.4
- levenshtein module that provides access to weighted distance and normalized weighted distance

- extract returned a list of tuples of empty string and 0 when not enough elements had a ratio
  bigger than score_cutoff
- fix wrong sorting of results causing wrong results to be removed in extract
2020-03-29 20:04:30 +02:00
maxbachmann ab0b5729b5
Improve CI integration and add some aliases (#10) 2020-03-29 16:36:13 +02:00
maxbachmann 0b81415484
release version 0.3.0
- When using score_cutoff there is now a guarantee that it returns 0 when result < score_cutoff
- each function has now a preprocess argument to specify whether strings should be preprocessed
- the default preprocessing does now lower case and trim
- extract and extractOne accept a custom processor method instead of utils.default_processor now

- QRatio was removed since it does now exactly the same as normal ratio
2020-03-26 17:23:06 +01:00
maxbachmann 510a0f190e
automatically build python Wheels (#5) 2020-03-22 23:12:51 +01:00
maxbachmann e9e5732653
lowercase strings before processing 2020-03-22 14:45:08 +01:00
maxbachmann aecc429207
add more benchmarks 2020-03-22 12:34:43 +01:00
maxbachmann 8ddfc2c35e
add token_ratio and partial_token_ratio in python
add benchmarks for fuzz
2020-03-22 01:45:57 +01:00
maxbachmann fa7dd2d390
fix tests and add more benchmarks 2020-03-22 01:07:31 +01:00
maxbachmann 097365692a
make levenshtein work string_view and wstring_view 2020-03-21 19:41:34 +01:00
maxbachmann 4d2eb0cc98 templatise utils to support multiple string types 2020-03-21 14:08:35 +01:00
maxbachmann 6e8081465e
fix broken score_cutoff 2020-03-20 23:51:10 +01:00
maxbachmann f54c51aa49
use wide strings to improve unicode handling 2020-03-20 22:57:02 +01:00
maxbachmann 439db23e28
add token_set and token_sort methods 2020-03-20 18:19:59 +01:00
maxbachmann 628e434fbe
add partial_ratio to interface 2020-03-20 16:26:12 +01:00
maxbachmann 3a46dbdc89
add limit argument to extract 2020-03-19 11:51:50 +01:00
maxbachmann 74af424dd4
add conversions between iterables and list 2020-03-19 00:48:47 +01:00
maxbachmann e157e11fa7
complete basic implementation of rapidfuzz 2020-03-18 21:34:32 +01:00
maxbachmann cb84c0521c add setup.py to build python version 2020-02-29 18:17:00 +01:00
maxbachmann d0f2de09e1 initialise c++ version 2020-02-29 15:45:15 +01:00