Matthew Honnibal
|
ccd2ab1a62
|
Merge pull request #1443 from ramananbalakrishnan/develop-get-lca-matrix
Add LCA matrix for spans and docs
|
2017-10-24 11:22:46 +02:00 |
Ramanan Balakrishnan
|
d2fe56a577
|
Add LCA matrix for spans and docs
|
2017-10-20 23:58:00 +05:30 |
Ramanan Balakrishnan
|
0726946563
|
cleanup to_array implementation using fixes on master
|
2017-10-20 17:09:37 +05:30 |
Ramanan Balakrishnan
|
b3ab124fc5
|
Support strings for attribute list in doc.to_array
|
2017-10-20 11:46:57 +05:30 |
Ramanan Balakrishnan
|
7b9b1be44c
|
Support single value for attribute list in doc.to_array
|
2017-10-19 17:00:41 +05:30 |
Matthew Honnibal
|
394633efce
|
Make doc pickling support hooks
|
2017-10-17 19:44:09 +02:00 |
Matthew Honnibal
|
cdb0c426d8
|
Improve deserialization of user_data, esp. for Underscore
|
2017-10-17 19:29:20 +02:00 |
Matthew Honnibal
|
32a8564c79
|
Fix doc pickling
|
2017-10-17 18:20:24 +02:00 |
Matthew Honnibal
|
92c1eb2d6f
|
Fix Doc pickling. This also removes need for Binder class
|
2017-10-17 16:11:13 +02:00 |
Matthew Honnibal
|
a002264fec
|
Remove caching of Token in Doc, as caused cycle.
|
2017-10-16 19:34:21 +02:00 |
ines
|
e0ff145a8b
|
Merge branch 'develop' into feature/dot-underscore
|
2017-10-11 11:57:05 +02:00 |
Matthew Honnibal
|
3b527fa52b
|
Call morphology.assign_untagged when pushing token to Doc
|
2017-10-11 03:23:57 +02:00 |
Matthew Honnibal
|
e0a9b02b67
|
Merge Span._ and Span.as_doc methods
|
2017-10-09 22:00:15 -05:00 |
Matthew Honnibal
|
e938bce320
|
Adjust parsing transition system to allow preset sentence segments.
|
2017-10-08 23:53:34 +02:00 |
Matthew Honnibal
|
668a0ea640
|
Pass extensions into Underscore class
|
2017-10-07 18:56:01 +02:00 |
ines
|
2480f8f521
|
Add missing return in Doc.from_disk() (closes #1330)
|
2017-09-18 15:32:00 +02:00 |
Matthew Honnibal
|
03b5b9727a
|
Fix Doc.vector for empty doc objects
|
2017-08-22 19:52:19 +02:00 |
Matthew Honnibal
|
0551b7b03a
|
Fix doc.vector
|
2017-08-22 19:46:52 +02:00 |
Matthew Honnibal
|
8b7ac77c23
|
Allow span label to be string in Doc.char_span
|
2017-08-19 16:18:09 +02:00 |
Matthew Honnibal
|
80236116a6
|
Add Doc.char_span method, to get a span by character offset
|
2017-08-19 12:21:09 +02:00 |
Matthew Honnibal
|
a6a2159969
|
Add slot for text categories to Doc
|
2017-07-22 00:34:15 +02:00 |
Matthew Honnibal
|
2a3bd5ee90
|
Fix fetching of noun chunk iterator
|
2017-06-04 15:53:05 -05:00 |
Matthew Honnibal
|
92ae36f84e
|
Improve way noun chunks iterator is looked up
|
2017-06-04 21:53:39 +02:00 |
Matthew Honnibal
|
675f448313
|
Fix vector linkage on Doc
|
2017-06-04 14:25:30 -05:00 |
ines
|
459a1e8470
|
Fix whitespace
|
2017-06-03 11:31:18 +02:00 |
ines
|
5109bba910
|
Port over fix from #1070
|
2017-06-03 11:31:11 +02:00 |
Matthew Honnibal
|
498ad85309
|
Try using tensor for vector/similarity methdos
|
2017-05-30 23:35:17 +02:00 |
Matthew Honnibal
|
4ddff020c3
|
Fix compile error
|
2017-05-28 23:30:40 +02:00 |
Matthew Honnibal
|
6d3caeadd2
|
Fix type check for long
|
2017-05-28 23:22:45 +02:00 |
Matthew Honnibal
|
7996d21717
|
Fixes for new StringStore
|
2017-05-28 11:09:27 -05:00 |
Matthew Honnibal
|
fe11564b8e
|
Finish stringstore change. Also xfail vectors tests
|
2017-05-28 15:10:22 +02:00 |
Matthew Honnibal
|
84e66ca6d4
|
WIP on stringstore change. 27 failures
|
2017-05-28 14:06:40 +02:00 |
ines
|
66088851dc
|
Add Doc.to_disk() and Doc.from_disk() methods
|
2017-05-24 11:58:17 +02:00 |
Matthew Honnibal
|
d44b1eafc4
|
Fix conflict artefacts
|
2017-05-23 18:47:11 +02:00 |
Matthew Honnibal
|
d68dd1f251
|
Add SENT_START attribute, for custom sentence boundary detection
|
2017-05-23 18:37:58 +02:00 |
ines
|
23f9a3ccc8
|
Update docstrings and API docs for Doc
|
2017-05-19 18:47:39 +02:00 |
ines
|
8455cb1327
|
Update docstring for Doc.__getitem__
|
2017-05-19 00:30:51 +02:00 |
ines
|
b687ad109d
|
Update docstrings and API docs for Doc class
|
2017-05-18 23:59:44 +02:00 |
ines
|
b87066ff10
|
Update docstrings and API docs for Doc class
|
2017-05-18 22:17:41 +02:00 |
ines
|
9d85cda8e4
|
Fix models error message and use about.__docs_models__ (see #1051)
|
2017-05-13 13:05:47 +02:00 |
ines
|
6b942763f0
|
Tidy up imports
|
2017-05-13 13:04:40 +02:00 |
ines
|
b9dea345e5
|
Remove old import
|
2017-05-13 12:32:11 +02:00 |
ines
|
293ee359c5
|
Fix formatting
|
2017-05-13 12:32:06 +02:00 |
Matthew Honnibal
|
ee1d35bdb0
|
Fix merge conflict
|
2017-05-13 03:20:19 +02:00 |
Matthew Honnibal
|
b2540d2379
|
Merge Kengz's tree_print patch
|
2017-05-13 03:18:49 +02:00 |
Matthew Honnibal
|
4efb391994
|
Fix serializer
|
2017-05-09 18:45:18 +02:00 |
Matthew Honnibal
|
1166b0c491
|
Implement Doc.to_bytes and Doc.from_bytes methods
|
2017-05-09 18:11:34 +02:00 |
Matthew Honnibal
|
9e167b7bb6
|
Strip serializer from code
|
2017-05-09 17:28:50 +02:00 |
ines
|
0739ae7b76
|
Tidy up and fix formatting and imports
|
2017-04-15 13:05:15 +02:00 |
ines
|
e71a1f4bd0
|
Fix download commands in error messages (see #946)
|
2017-04-01 10:20:57 +02:00 |