Commit Graph

7951 Commits

Author SHA1 Message Date
ines 5eaa61c2b8 Fix formatting 2017-12-07 10:23:09 +01:00
ines 24e80c51b8 Document init-model command 2017-12-07 10:14:37 +01:00
Matthew Honnibal c91f451b0f Fix imports and CLI in init-model 2017-12-07 10:03:07 +01:00
ines 82e80ff928 Rename model command to init_model and fix formatting 2017-12-07 09:59:23 +01:00
Ines Montani 2feeb428d6
Merge pull request #1646 from GreenRiverRUS/master
Added model command to create models from raw data
2017-12-07 08:54:26 +00:00
Matthew Honnibal 6373d2580d Increment version to v2.0.5.dev0 2017-12-07 09:53:59 +01:00
Matthew Honnibal 36b47e3fa6 Fix (and test) vector pickling 2017-12-07 09:53:30 +01:00
Ines Montani 2ae4755def
Merge pull request #1689 from mpuels/patch-3
doc: Replace 'is not' with '!=' in code example
2017-12-07 06:10:28 +00:00
mpuels e3af19a076
doc: Replace 'is not' with '!=' in code example
The function `dependency_labels_to_root(token)` defined in section *Get syntactic dependencies* does not terminate. Here is a complete example:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head is not token:
            dep_labels.append(token.dep)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

Replacing `is not` with `!=` solves the issue:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head != token:
            dep_labels.append(token.dep)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

The output is

    ['cc', 'nsubj']
2017-12-06 20:08:42 +01:00
Ines Montani 050a94867b
Merge pull request #1688 from mpuels/patch-2
doc: Fix assert statement in Lightning Tour
2017-12-06 16:17:43 +00:00
mpuels 82e575ebfb
doc: Fix assert statement in Lightning Tour
Python 3 throws an error message on the original assert statement. Also, according to the Python documentation regarding the assert statement (https://docs.python.org/3/reference/simple_stmts.html#the-assert-statement), `assert` takes at least one argument and at most two. In the two-argument form the second argument is meant as an error message to be displayed when the assertion fails. I don't think this is intended in this case.
2017-12-06 16:40:51 +01:00
Ines Montani 798c4c14a7
Merge pull request #1687 from mpuels/patch-1
doc: Add missing *-operator to nlp.disable_pipes() in Lightning Tour
2017-12-06 14:36:29 +00:00
mpuels 662601f01c
doc: Add missing *-operator to nlp.disable_pipes()
I'm using SpaCy version 2.0.3. If I don't use the *-operator in the example, Python throws an error message. With the operator it works fine. Also according to the documentation of the function `nlp.disable_pipes()`, it expects one or more strings as arguments and not one argument being a list of strings.
2017-12-06 15:26:43 +01:00
ines b078e276e6 Document offsets_from_biluo_tags 2017-12-06 13:40:51 +01:00
ines fb663f9b7d Add Russian to list of languages 2017-12-06 13:40:32 +01:00
Matthew Honnibal 05f41ff587 Set version to 2.0.4 2017-12-06 13:24:02 +01:00
Matthew Honnibal 2b2ce125d5 Fix thinc version pin 2017-12-06 13:23:35 +01:00
Matthew Honnibal 04c38f7e87 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-06 12:15:52 +01:00
Matthew Honnibal 361944e512 If no rules are set, lemmatize by lookup 2017-12-06 12:12:11 +01:00
Matthew Honnibal 2ab0f2d186
Merge pull request #1664 from jimregan/italian-lemmatizer
BOM in Italian lemmatiser
2017-12-06 11:09:04 +01:00
Matthew Honnibal 3f247119d3
Merge pull request #1668 from sorenlind/da_morph
Add more Danish morph rules and clean up existing ones
2017-12-06 11:08:09 +01:00
Matthew Honnibal 04a92bd75e Pin msgpack-numpy requirement 2017-12-06 03:24:24 +01:00
Matthew Honnibal b2f1cf8775 Try to fix travis locale problem 2017-12-06 01:50:03 +01:00
Matthew Honnibal b051c11da7 Try to fix travis locale problem 2017-12-06 01:45:22 +01:00
Matthew Honnibal 8d27f091c8 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 17:01:35 +01:00
Matthew Honnibal beb5ad24c5 Update travis 2017-12-05 17:01:14 +01:00
ines 58a19518cf Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 13:17:58 +01:00
ines 7ade336ab7 Add "Unknown locale" issue to troubleshooting guide (see #1684, #1641, #1517) 2017-12-05 13:17:55 +01:00
Matthew Honnibal b712de774e Fix vectors pickling 2017-12-05 12:45:24 +01:00
Matthew Honnibal a59518df4f Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 11:33:20 +01:00
Matthew Honnibal 5719fd1ce9 Require dev1 of thinc 2017-12-05 10:52:48 +01:00
Matthew Honnibal 04650e38c7 Set version to 2.0.4.dev0 2017-12-05 10:52:31 +01:00
Ines Montani 6c95187915
Merge pull request #1683 from mkdynamic/patch-1
Fix link to CLEAR Style dependency labels PDF
2017-12-05 08:57:38 +00:00
Mark Dodwell 9d4c185860
Fix link to CLEAR Style dependency labels PDF 2017-12-04 23:28:06 -08:00
Matthew Honnibal 07acb43a85 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-04 14:42:52 +01:00
Matthew Honnibal f5cae1f598
Merge pull request #1674 from twerkmeister/patch-1
fix setup.py spacy req string for packaging
2017-12-04 14:04:09 +01:00
Thomas Werkmeister 94eac75b7c
fix setup.py spacy req string for packaging
Requirement should be `spacy>=2.0.2` instead of `spacy2.0.2`
2017-12-03 04:16:28 -06:00
ines 40638b7cdf Update resources 2017-12-02 04:16:03 +01:00
ines f2ea6d4713 Add Dutch example sentences (see #1107) 2017-12-01 23:36:05 +01:00
ines 9ea8a7cf0c Add spacy_cld to extensions 2017-12-01 23:21:33 +01:00
Canbey Bilgili 86ac8ea5ba Adds Canbey Bilgili's Contributor Agreement 2017-12-01 17:27:41 +03:00
Canbey Bilgili abe098b255 Adds Turkish Lemmatization 2017-12-01 17:04:32 +03:00
Ines Montani a07b44fb47
Merge pull request #1667 from GreenRiverRUS/russian_bugfixies
Bugfixies in russian support
2017-11-30 20:37:43 +00:00
Søren Lind Kristiansen d86b537a38 Enable morph rules for Danish 2017-11-30 15:58:02 +01:00
Søren Lind Kristiansen 13a988adc3 Remove 'Number[psor]' 2017-11-30 15:55:04 +01:00
Søren Lind Kristiansen dd6fde18a9 Add more Danish morph rules and clean up existing ones 2017-11-30 11:17:19 +01:00
Vadim Mazaev 495eacf470 Merge branch 'model_command' 2017-11-30 12:30:26 +03:00
Vadim Mazaev 4ba7ddf651 Bugfixies 2017-11-30 12:29:38 +03:00
Matthew Honnibal 6bc0f4d29f
Merge pull request #1611 from fsonntag/master
Solving #1494
2017-11-29 23:11:23 +01:00
Matthew Honnibal f9ed9ea529
Merge pull request #1624 from GreenRiverRUS/russian
Add support for Russian
2017-11-29 23:10:01 +01:00