Søren Lind Kristiansen | bef735aef7 | Fix Danish abbreviation 'm.h.t.' | 2017-12-21 09:24:31 +01:00
Søren Lind Kristiansen | 7a2f2f6f94 | Fix formatting. | 2017-12-20 18:37:37 +01:00
Søren Lind Kristiansen | 15d13efafd | Tune Danish tokenizer to more closely match tokenization in Universal Dependencies. | 2017-12-20 17:36:52 +01:00
Søren Lind Kristiansen | ef03e9ea53 | Remove unused import. | 2017-11-25 13:04:02 +01:00
Søren Lind Kristiansen | 6aa241bcec | Add day of month tokenizer exceptions for Danish. | 2017-11-24 15:03:24 +01:00
Søren Lind Kristiansen | 0c276ed020 | Add weekday abbreviations and remove ambiguous month abbreviations for Danish. | 2017-11-24 14:43:29 +01:00
Søren Lind Kristiansen | 056547e989 | Add multiple tokenizer exceptions for Danish. | 2017-11-24 11:51:26 +01:00
Søren Lind Kristiansen | ac8116510d | Fix tokenization of 'i.' for Danish. | 2017-11-24 11:16:53 +01:00
ines | 819e30a26e | Tidy up tokenizer exceptions | 2017-11-01 23:02:45 +01:00
ines | 7e424a1804 | Don't copy exception dicts if not necessary and tidy up | 2017-10-31 21:05:29 +01:00
mollerhoj | e8f40ceed8 | Add short names of months to tokenizer_exceptions | 2017-07-03 15:49:51 +02:00
ines | bb8be3d194 | Add Danish language data | 2017-05-10 21:15:12 +02:00