Matthew Honnibal
|
63a267b34d
|
Fix #2073: Token.set_extension not working
|
2018-03-27 13:36:20 +02:00 |
Thomas Opsomer
|
fbf48b3f9f
|
lemma property to return hash instead of unicode
|
2018-03-14 17:03:00 +01:00 |
4altinok
|
ca8728035d
|
added new lex feat to token
|
2018-02-11 18:55:48 +01:00 |
Matthew Honnibal
|
ccb51a9f36
|
Make .similarity() return 1.0 if all orth attrs match
|
2018-01-15 16:29:48 +01:00 |
Matthew Honnibal
|
b904d81e9a
|
Fix rich comparison against None objects. Closes #1757
|
2018-01-15 15:51:25 +01:00 |
Matthew Honnibal
|
0cb090e526
|
Fix infinite recursion in token.sent_start. Closes #1640
|
2018-01-14 15:02:15 +01:00 |
Matthew Honnibal
|
5cbe913b6f
|
Don't raise deprecation warning in property. Closes #1813, #1712
|
2018-01-14 14:55:58 +01:00 |
Matthew Honnibal
|
fb26b2cb12
|
Use lookup lemmatizer if lemma unset
|
2017-11-23 12:31:58 +00:00 |
Matthew Honnibal
|
144a93c2a5
|
Back-off to tensor for similarity if no vectors
|
2017-11-03 20:56:33 +01:00 |
ines
|
9659391944
|
Update deprecated methods and add warnings
|
2017-11-01 16:49:42 +01:00 |
Matthew Honnibal
|
9e0ebee81c
|
Add Token.is_sent_start property, so can deprecate Token.sent_start
|
2017-11-01 13:27:14 +01:00 |
Matthew Honnibal
|
86eba61fae
|
Fix token.vector when vectors are missing
|
2017-11-01 00:47:35 +01:00 |
ines
|
544a407b93
|
Tidy up Doc, Token and Span and add missing docs
|
2017-10-27 17:07:26 +02:00 |
ines
|
6a0483b7aa
|
Tidy up and document Doc, Token and Span
|
2017-10-27 15:41:45 +02:00 |
Matthew Honnibal
|
b66b8f028b
|
Fix #1375 -- out-of-bounds on token.nbor()
|
2017-10-24 12:10:39 +02:00 |
Matthew Honnibal
|
e0a9b02b67
|
Merge Span._ and Span.as_doc methods
|
2017-10-09 22:00:15 -05:00 |
ines
|
3fc4fe61d2
|
Fix typo
|
2017-10-10 04:15:14 +02:00 |
Matthew Honnibal
|
080afd4924
|
Add ternary value setting to Token.sent_start
|
2017-10-08 23:51:58 +02:00 |
Matthew Honnibal
|
668a0ea640
|
Pass extensions into Underscore class
|
2017-10-07 18:56:01 +02:00 |
Matthew Honnibal
|
d55d6e1cfa
|
Fix comparison of Token from different docs. Closes #1257
|
2017-08-19 16:39:32 +02:00 |
Matthew Honnibal
|
f4662e9218
|
Fix vector linkage for token
|
2017-06-04 14:19:58 -05:00 |
Matthew Honnibal
|
498ad85309
|
Try using tensor for vector/similarity methdos
|
2017-05-30 23:35:17 +02:00 |
Matthew Honnibal
|
fe11564b8e
|
Finish stringstore change. Also xfail vectors tests
|
2017-05-28 15:10:22 +02:00 |
Matthew Honnibal
|
2445707f3c
|
Re-delegate vectors to vocab
|
2017-05-28 11:46:10 +02:00 |
Matthew Honnibal
|
01e59e4e6e
|
* Add Token.sent_start property, re Issue #235
|
2017-05-23 18:41:11 +02:00 |
ines
|
7ed8a92ed1
|
Update docstrings and API docs for Token
|
2017-05-20 15:13:33 +02:00 |
ines
|
a804045597
|
Use is_ancestor instead of deprecated is_ancestor_of
|
2017-05-19 20:23:40 +02:00 |
ines
|
e9e62b01b0
|
Update docstrings and API docs for Token
|
2017-05-19 18:47:56 +02:00 |
ines
|
9d85cda8e4
|
Fix models error message and use about.__docs_models__ (see #1051)
|
2017-05-13 13:05:47 +02:00 |
ines
|
6b942763f0
|
Tidy up imports
|
2017-05-13 13:04:40 +02:00 |
Matthew Honnibal
|
6a4221a6de
|
Allow lemma to be set from Python. Re #973
|
2017-04-16 18:07:53 +02:00 |
ines
|
0739ae7b76
|
Tidy up and fix formatting and imports
|
2017-04-15 13:05:15 +02:00 |
ines
|
e71a1f4bd0
|
Fix download commands in error messages (see #946)
|
2017-04-01 10:20:57 +02:00 |
Matthew Honnibal
|
fc3900e5b2
|
Allow ent_id to be set in Token
|
2017-03-31 14:00:14 +02:00 |
ines
|
66c1f194f9
|
Use consistent unicode declarations
|
2017-03-12 13:07:28 +01:00 |
Roman Inflianskas
|
66e1109b53
|
Add support for Universal Dependencies v2.0
|
2017-03-03 13:17:34 +01:00 |
Matthew Honnibal
|
e7f8e13cf3
|
Make Token hashable. Fixes #743
|
2017-01-16 13:27:57 +01:00 |
Matthew Honnibal
|
12cd27b821
|
Amend 8ae8b443f: Handle comparison with None tokens.
|
2017-01-11 13:03:32 +01:00 |
Matthew Honnibal
|
8ae8b443f1
|
Add richcmp method to Token. Closes #631
|
2017-01-09 19:30:31 +01:00 |
Matthew Honnibal
|
404019ad2f
|
Fix issue #672: ent_iob_ was a string, not unicode, due to missing unicode_literals statement.
|
2016-12-18 22:33:53 +01:00 |
Matthew Honnibal
|
293c79c09a
|
Fix #595: Lemmatization was incorrect for base forms, because morphological analyser wasn't adding morphology properly.
|
2016-11-04 00:29:07 +01:00 |
Matthew Honnibal
|
05a8b752a2
|
Fix Issue #600: Missing setters for Token attribute.
|
2016-11-02 23:28:59 +01:00 |
Matthew Honnibal
|
11664b9f20
|
Fix variable error in token
|
2016-11-01 13:28:00 +01:00 |
Matthew Honnibal
|
b86f8af0c1
|
Fix doc strings
|
2016-11-01 12:25:36 +01:00 |
Matthew Honnibal
|
5d5742b773
|
Add sentiment field to doc, rename getters_for_tokens and getters_for_spans, add user_hooks field to Doc.
|
2016-10-19 20:54:22 +02:00 |
Matthew Honnibal
|
7fd98fc91c
|
Remove deprecation shim around str/bytes in Token.
|
2016-10-17 14:02:47 +02:00 |
Matthew Honnibal
|
c1abc8f6ed
|
Fix deprecation stuff in Token: Remove the shim for the str/unicode semantics, and raise for has_repvec and repvec
|
2016-10-17 11:18:41 +02:00 |
Matthew Honnibal
|
5d10e2005c
|
Defer some attributes to Doc, via getters_for_tokens attribute.
|
2016-10-17 02:44:49 +02:00 |
Matthew Honnibal
|
ca32a1ab01
|
Revert "Work on Issue #285: intern strings into document-specific pools, to address streaming data memory growth. StringStore.__getitem__ now raises KeyError when it can't find the string. Use StringStore.intern() to get the old behaviour. Still need to hunt down all uses of StringStore.__getitem__ in library and do testing, but logic looks good."
This reverts commit 8423e8627f .
|
2016-09-30 20:20:22 +02:00 |
Matthew Honnibal
|
6736977d82
|
Revert "Changes to Doc and Token for new string store scheme"
This reverts commit 99de44d864 .
|
2016-09-30 20:11:15 +02:00 |