Commit Graph

5474 Commits

Author SHA1 Message Date
Matthew Honnibal 877f83807f Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-05-17 12:09:29 +02:00
Matthew Honnibal 793430aa7a Get spaCy train command working with neural network
* Integrate models into pipeline
* Add basic serialization (maybe incorrect)
* Fix pickle on vocab
2017-05-17 12:04:50 +02:00
Matthew Honnibal 3bf4a28d8d Use tag in CoNLL converter, not POS 2017-05-17 12:04:33 +02:00
ines 11f52b8b83 Add headline to installation details and move aside 2017-05-17 12:04:03 +02:00
ines 533bb63816 Implement quickstart widget 2017-05-17 12:04:03 +02:00
ines 7b9466f625 Add mixins and styles for quickstart widget 2017-05-17 12:04:03 +02:00
ines 2e875c40a8 Add quickstart.js 2017-05-17 12:04:03 +02:00
ines 9df9a87d03 Add visualizer usage example 2017-05-17 12:04:03 +02:00
ines 6364a9be9d Add What's new and spaCy 101 stubs 2017-05-17 12:04:03 +02:00
ines f4ae1e8750 Add section on adding titles to documents 2017-05-17 12:04:03 +02:00
ines 9979901b6f Fix formatting 2017-05-17 12:04:03 +02:00
ines 452d16d7a9 Rename API menu item to "Reference" 2017-05-17 12:04:03 +02:00
ines 02a4841e7b Move CLI docs to API reference 2017-05-17 12:04:03 +02:00
ines 95307d1e3c Add mixin to display help icon with tooltip 2017-05-17 12:04:03 +02:00
ines fec918ba2c Update icons 2017-05-17 12:04:03 +02:00
ines fb23799114 Add tooltips component 2017-05-17 12:04:03 +02:00
ines 1a05078c79 Add language-specific syntax iterators to en and de 2017-05-17 12:04:03 +02:00
ines 3cc6fe1484 Add pip to requirements.txt and setup.py 2017-05-17 12:04:03 +02:00
Matthew Honnibal c9a5d5d24b Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-05-16 16:22:05 +02:00
Matthew Honnibal 8cf097ca88 Redesign training to integrate NN components
* Obsolete .parser, .entity etc names in favour of .pipeline
* Components no longer create models on initialization
* Models created by loading method (from_disk(), from_bytes() etc), or
    .begin_training()
* Add .predict(), .set_annotations() methods in components
* Pass state through pipeline, to allow components to share information
    more flexibly.
2017-05-16 16:17:30 +02:00
Matthew Honnibal 221b4c1ee8 Fix test for Python 3 2017-05-16 13:06:30 +02:00
Matthew Honnibal 5211645af3 Get data flowing through pipeline. Needs redesign 2017-05-16 11:21:59 +02:00
Matthew Honnibal 1d7c18e58a Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-05-15 21:53:47 +02:00
Matthew Honnibal a9edb3aa1d Improve integration of NN parser, to support unified training API 2017-05-15 21:53:27 +02:00
ines accf05b0a9 Update visualizers docs 2017-05-15 14:37:01 +02:00
ines d7244ae72d Add docs on collapse_punct option 2017-05-15 13:51:33 +02:00
ines 98354be150 Only get user_data if it exists on doc 2017-05-15 13:39:47 +02:00
ines 6d7986b7bc Update docs 2017-05-15 01:46:33 +02:00
ines c6e8d55dcb Update NER workflow with new displaCy 2017-05-15 01:42:11 +02:00
ines 860a60e251 Fix explanation 2017-05-15 01:31:11 +02:00
ines 5c044cb670 Add visualizers usage docs 2017-05-15 01:25:18 +02:00
ines c33bdeb564 Use uppercase for entity types 2017-05-15 01:24:57 +02:00
ines 3d37564a09 Remove resources from navigation for now
Not sure what to do with this page... maybe merge it with something
else?
2017-05-14 23:29:58 +02:00
ines cf7e5ed534 Use American spelling for "visualizers"
Kinda sucks because we normally use British spelling, but it just looks
weird and confusing otherwise... same with tokenizer and all other
library internals. So this is sort of the "official policy" for now.
2017-05-14 23:29:36 +02:00
ines fe5a5086e1 Fix typo 2017-05-14 23:27:56 +02:00
ines 4aaa607b8d Add xmlns:xlink so SVGs are rendered properly as individual files 2017-05-14 19:54:13 +02:00
ines 1ae07da18f Add API docs for spacy.displacy (see #1058) 2017-05-14 19:31:23 +02:00
ines 9dd13cd76a Update docstrings 2017-05-14 19:30:47 +02:00
ines a04550605a Add Jupyter notebook support (see #1058) 2017-05-14 18:39:01 +02:00
ines c31792aaec Add displaCy visualisers (see #1058) 2017-05-14 17:50:23 +02:00
ines 844d64298d Fix formatting 2017-05-14 01:31:16 +02:00
ines b462076d80 Merge load_lang_class and get_lang_class 2017-05-14 01:31:10 +02:00
ines 36bebe7164 Update docstrings 2017-05-14 01:30:29 +02:00
Matthew Honnibal 48de4ed49f Require thinc 6.6, and compile the nn_parser module 2017-05-14 01:20:28 +02:00
Matthew Honnibal 4b9d69f428 Merge branch 'v2' into develop
* Move v2 parser into nn_parser.pyx
* New TokenVectorEncoder class in pipeline.pyx
* New spacy/_ml.py module

Currently the two parsers live side-by-side, until we figure out how to
organize them.
2017-05-14 01:10:23 +02:00
Matthew Honnibal 5cac951a16 Move new parser to nn_parser.pyx, and restore old parser, to make tests pass. 2017-05-14 00:55:01 +02:00
Matthew Honnibal f8c02b4341 Remove cupy imports from parser, so it can work on CPU 2017-05-14 00:37:53 +02:00
Matthew Honnibal 613ba79e2e Fiddle with sizings for parser 2017-05-13 17:20:23 -05:00
Matthew Honnibal e6d71e1778 Small fixes to parser 2017-05-13 17:19:04 -05:00
Matthew Honnibal 188c0f6949 Clean up unused import 2017-05-13 17:18:27 -05:00