Commit Graph

1336 Commits

Author SHA1 Message Date
Ines Montani 75f3234404
💫 Refactor test suite (#2568)
## Description

Related issues: #2379 (should be fixed by separating model tests)

* **total execution time down from > 300 seconds to under 60 seconds** 🎉
* removed all model-specific tests that could only really be run manually anyway – those will now live in a separate test suite in the [`spacy-models`](https://github.com/explosion/spacy-models) repository and are already integrated into our new model training infrastructure
* changed all relative imports to absolute imports to prepare for moving the test suite from `/spacy/tests` to `/tests` (it'll now always test against the installed version)
* merged old regression tests into collections, e.g. `test_issue1001-1500.py` (about 90% of the regression tests are very short anyways)
* tidied up and rewrote existing tests wherever possible

### Todo

- [ ] move tests to `/tests` and adjust CI commands accordingly
- [x] move model test suite from internal repo to `spacy-models`
- [x] ~~investigate why `pipeline/test_textcat.py` is flakey~~
- [x] review old regression tests (leftover files) and see if they can be merged, simplified or deleted
- [ ] update documentation on how to run tests


### Types of change
enhancement, tests

## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [ ] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-07-24 23:38:44 +02:00
kororo b1ec827ee0 Fix typo (#2579)
Update slogan, desc and code snippet to latest version
2018-07-24 22:47:33 +02:00
ines cd687091fb Remove nl examples from widget for now [ci skip]
Restore for next spaCy version when path to example sentences is fixed
2018-07-24 22:41:20 +02:00
ines 2d8ffb8bcd Fix formatting 2018-07-24 22:40:49 +02:00
ines 1b3da8d2ae Update website for v2.0.12 [ci skip] 2018-07-24 21:04:22 +02:00
ines ae5ed2d698 Update docs for v2.0.12 [ci skip] 2018-07-21 15:51:44 +02:00
ines d517dd4297 Document remove_extension methods 2018-07-21 15:51:28 +02:00
ines 153f41a5cc Use better examples for Doc extension methods 2018-07-21 15:51:11 +02:00
ines 3c30d1763c Merge branch 'master' into develop 2018-07-21 15:34:18 +02:00
kororo 2784babef9 Add ExcelCy into Universe list (#2572)
Hi guys,

This is my first spaCy extension. I am excited to able to do this. Please do let me know if there is any suggestions or modifications I need to do. Feel free to use/contribute the repo that I made.

## Description
ExcelCy is a SpaCy toolkit to help improve the data training experiences. It provides easy annotation using Excel file format. It has helper to pre-train entity annotation with phrase and regex matcher pipe.

### Types of change
Update to Universe list in website.

## Checklist
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-07-19 19:28:33 +02:00
ines 80e7485630 Merge branch 'master' into develop 2018-07-18 17:28:47 +02:00
Xiang Ji 19a5ef1c58 Fix venv command examples (#2560) [ci skip]
* Fix venv command examples

The documentation refers to `venv`, which is native to Python3.
However, the command examples are as if they were still `virtualenv`,
which is a package independent of `venv`:

- It doesn't need to be installed via `pip`. In fact `pip install venv` would
return an error.
- The correct way to invoke `venv` is `python3 -m venv`, not `venv`, which would
return command not found.

See https://docs.python.org/3/library/venv.html

I suspect the documentation simply replaced all occurrences of `virtualenv` with
`venv`. However they are different modules and are used differently.

* Update comment [ci skip]
2018-07-18 10:31:24 +02:00
ines 50c367ee96 Update meta [ci skip] 2018-07-10 13:51:45 +02:00
ines 3a321e79ac Merge branch 'master' into develop 2018-07-10 13:49:08 +02:00
ines 71bfc92913 Exclude models for non-stable versions [ci skip] 2018-07-10 13:44:55 +02:00
ines b5200962c0 Adjust formatting [ci skip] 2018-07-09 18:35:46 +02:00
Alex Villarreal bd35bf7f09 Guidance to handle binary files in git in Windows (#2526)
Adds guidance on what to do if users encounter the error described in [1634](https://github.com/explosion/spaCy/issues/1634), which probably only happens in Windows environments.
2018-07-09 18:31:37 +02:00
ines f575b01595 Update language and license meta [ci skip] 2018-07-04 15:09:36 +02:00
ines 63666af328 Merge branch 'master' into develop 2018-07-04 14:52:25 +02:00
Matthew Honnibal a85620a731 Note CoreNLP tokenizer correction on website 2018-07-02 11:35:31 +02:00
ines 06c6dc6fbc Update Juniper [ci skip] 2018-06-28 11:48:17 +02:00
Nipun Sadvilkar 741ba80bd5 Train model command n_iteration 20 -> 30 (#2454)
In source code `train.py` default Number of iterations  is 30
2018-06-18 11:57:08 +02:00
ines 53a2bc8c8d Only scroll sidebar item into view if needed [ci skip] 2018-06-12 10:58:50 +02:00
ines 65713a6593 Increment versions [ci skip] 2018-06-12 10:49:50 +02:00
Ines Montani 968f6f0bda
💫 Document Cython API (#2433)
## Description

This PR adds the most relevant documentation of spaCy's Cython API.

(Todo for when we publish this: rewrite `/api/#section-cython` and `/api/#cython` to `/api/cython#conventions`.)

### Types of change
docs

## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-06-11 17:47:46 +02:00
GolanLevy 72d7e80f94 adding a missing apostrophe (#2436) 2018-06-11 17:47:24 +02:00
ines 778e5f4da3 Merge branch 'master' into develop 2018-06-11 00:38:04 +02:00
himkt 57311d5d47 replace janome with mecab in the documentation and the test (#2415)
* Add links to Reddit data (see #2401)

* replace janome with mecab in the documentation and the test

* add the assignment
2018-06-11 00:33:13 +02:00
ines effb55d591 Adjust formatting [ci skip] 2018-06-11 00:29:13 +02:00
Nathan Breit ba6d2cf393 Add EpiTator to Universe (#2429) 2018-06-11 00:24:13 +02:00
himkt 1a568f2e08 fix wrong documentations (#2423) 2018-06-11 00:21:06 +02:00
Bohdan Moskalevskyi d66292f767 fix UD data file extensions (#2425)
* fix UD data files extension

* add contributor agreement for msklvsk
2018-06-08 14:26:11 +02:00
ines a0017e4909 Merge branch 'master' into develop 2018-05-30 14:10:47 +02:00
ines 0baaf836cf Update formatting [ci skip] 2018-05-30 13:32:49 +02:00
ines 3913e18201 Add self-attentive-parser to universe (see #59) 2018-05-30 13:31:28 +02:00
ines 4a62486340 Merge branch 'master' into develop 2018-05-30 13:01:01 +02:00
ines 605c663a4c Fix HTML merger examples (see #2390) 2018-05-30 12:22:32 +02:00
ines d0b16aa014 Update list of languages 2018-05-26 18:56:26 +02:00
Samuel Pouyt 5f988b8e9c Update _custom.jade (#2372)
It seems based on the doc and trying out that the `en` or `[lang]` is missing from the `spacy model-init`
2018-05-26 18:17:12 +02:00
ines d84a830d79 Merge branch 'master' of https://github.com/explosion/spaCy 2018-05-26 17:57:05 +02:00
ines fb923b31ea Fix bad HTML example (see #2376) and turn it into section on matcher + components
Avoid problems caused by merging while matching (e.g. index errors). Creating a Matcher component also better reflects the recommended best practices.
2018-05-26 17:57:02 +02:00
Shantam Raj 592834183a corrected spelling (#2359)
changed **interpretted** to **interpreted**
2018-05-24 13:29:52 +02:00
ines 8adb967e0c Fix from source quickstart instructions for Windows
See: https://stackoverflow.com/a/50478036/6400719
2018-05-24 12:42:16 +02:00
Shantam Raj 1a4682dd0b Update _training.jade (#2340)
* Update _training.jade

Correcting grammar. Replacing "The" with "To".

* Create armsp.md

* Update armsp.md
2018-05-21 11:09:33 +02:00
ines ff1082d8e4 Add version tag in CLI docs [ci skip] 2018-05-21 01:17:49 +02:00
Ines Montani d4cc736b7c 💫 Improve model downloads: check for existing install, customise pip and use requests library again (#2346)
* Go back to using requests instead of urllib (closes #2320)

Fewer dependencies are good, but this one was simply causing too many other problems around SSL verification and Python 2/3 compatibility. requests is a popular enough package that it's okay for spaCy to depend on it – and this will hopefully make model downloads less flakey.

* Only download model if not installed (see #1456)

Use #egg=model==version to allow pip to check for existing installations. The download is only started if no installation matching the package/version is found. Fixes a long-standing inconvenience.

* Pass additional options to pip when installing model (resolves #1456)

Treat all additional arguments passed to the download command as pip options to allow user to customise the command. For example:

python -m spacy download en --user

* Add CLI option to enable installing model package dependencies

* Revert "Add CLI option to enable installing model package dependencies"

This reverts commit 9336ffe695.

* Update documentation
2018-05-20 20:26:56 +02:00
vishnumenon ae3719ece5 Fix the code for FACILITIY entities (#2324)
* Fix the code for FACILITIY entities

As far as I can tell, the default models all use "FAC" rather than "FACILITY"

* Added my Contributor Agreement

* Rename vishnumenon to vishnumenon.md
2018-05-12 15:19:17 +02:00
ines ac25bc4016 Add docs section on sentence segmentation [ci skip] 2018-05-07 21:25:20 +02:00
ines 14148cd147 Fix formatting and wording 2018-05-07 21:24:35 +02:00
ines f803da609f Add scattertext [ci skip] 2018-05-07 19:10:23 +02:00
ines c9547b7b8b Update Juniper (see #2293) 2018-05-03 15:36:02 +02:00
Alex Villarreal 647f2544c5 Fix code sample for span.set_extension (#2286) 2018-05-03 00:39:22 +02:00
Alex Villarreal 13d562e1a4 Fix code sample for Doc.set_extension (#2282)
* Fix code sample for `set_extension`

The previous sample code for `set_extension` fails the assertion at the end, because `city_getter` it checked if the whole document text matches any of the city names. Now it checks if any of the city names is contained in the document text.

* Contributor agreement
2018-05-02 10:16:05 +02:00
Shirish Kadam d98a90440f Added Adam project to spaCy Universe (#2275)
* Added 5hirish to contributors

* Added Adam Qas Project to spaCy Universe

* Remove $ from code example
2018-04-30 22:25:01 +02:00
ines 56e7faf16b Fix spacing 2018-04-30 22:24:40 +02:00
ines 6efb4cdf88 Use Juniper and tidy up 2018-04-30 18:48:35 +02:00
ines 45bb8d75a5 Fix overflow issues on small screens [ci skip] 2018-04-29 03:17:36 +02:00
Ines Montani 49cee4af92
💫 Interactive code examples, spaCy Universe and various docs improvements (#2274)
* Integrate Python kernel via Binder

* Add live model test for languages with examples

* Update docs and code examples

* Adjust margin (if not bootstrapped)

* Add binder version to global config

* Update terminal and executable code mixins

* Pass attributes through infobox and section

* Hide v-cloak

* Fix example

* Take out model comparison for now

* Add meta text for compat

* Remove chart.js dependency

* Tidy up and simplify JS and port big components over to Vue

* Remove chartjs example

* Add Twitter icon

* Add purple stylesheet option

* Add utility for hand cursor (special cases only)

* Add transition classes

* Add small option for section

* Add thumb object for small round thumbnail images

* Allow unset code block language via "none" value

(workaround to still allow unset language to default to DEFAULT_SYNTAX)

* Pass through attributes

* Add syntax highlighting definitions for Julia, R and Docker

* Add website icon

* Remove user survey from navigation

* Don't hide GitHub icon on small screens

* Make top navigation scrollable on small screens

* Remove old resources page and references to it

* Add Universe

* Add helper functions for better page URL and title

* Update site description

* Increment versions

* Update preview images

* Update mentions of resources

* Fix image

* Fix social images

* Fix problem with cover sizing and floats

* Add divider and move badges into heading

* Add docstrings

* Reference converting section

* Add section on converting word vectors

* Move converting section to custom section and fix formatting

* Remove old fastText example

* Move extensions content to own section

Keep weird ID to not break permalinks for now (we don't want to rewrite URLs if not absolutely necessary)

* Use better component example and add factories section

* Add note on larger model

* Use better example for non-vector

* Remove similarity in context section

Only works via small models with tensors so has always been kind of confusing

* Add note on init-model command

* Fix lightning tour examples and make excutable if possible

* Add spacy train CLI section to train

* Fix formatting and add video

* Fix formatting

* Fix textcat example description (resolves #2246)

* Add dummy file to try resolve conflict

* Delete dummy file

* Tidy up [ci skip]

* Ensure sufficient height of loading container

* Add loading animation to universe

* Update Thebelab build and use better startup message

* Fix asset versioning

* Fix typo [ci skip]

* Add note on project idea label
2018-04-29 02:06:46 +02:00
ines a512fa60ef Remove upcoming option from docs for now 2018-04-28 23:32:18 +02:00
ines 6fb6371670 Add collapse_phrases option to displacy (closes #2266) 2018-04-28 23:06:50 +02:00
Matt Upson 87cc6b3599 Add missing comma to NN example in docs (#2255)
Also add a completed contributor agreement.
2018-04-28 14:56:00 +02:00
ines 4a3bea00c7 Update resources [ci skip] 2018-04-26 22:10:34 +02:00
Pradeep Kumar Tippa df389e5b74 spacy-101 vocab doc giving valid variable names (#2236) 2018-04-18 14:54:26 -07:00
ines ce63f8997b Update init-model docs 2018-04-10 21:42:54 +02:00
ines 0e847d7fe5 Fix typo 2018-04-09 14:51:14 +02:00
ines de137fba84 Add TensorBoard examples to examples overview [ci skip] 2018-04-03 16:01:52 +02:00
ines 6d87b28f15 Add Vietnamese to language overview [ci skip] 2018-04-03 16:01:36 +02:00
ines 9615ed5ed7 Update emoji/hashtag matcher example (resolves #2156) [ci skip] 2018-03-28 18:41:28 +02:00
ines ce6071ca89 Remove ftfy dependency and update docs 2018-03-28 12:09:42 +02:00
ines 5ecc60cf3b Add book to resources [ci skip] 2018-03-24 17:12:56 +01:00
ines 53680642af Port over docs changes [ci skip] 2018-03-24 17:12:48 +01:00
Matthew Honnibal f9f46e5a07 Revert matcher fixes from GregDubbin 2018-02-18 10:59:28 +01:00
ines 612c79a4f5 Update first matcher example and match_id (resolves #1989) 2018-02-17 11:57:38 +01:00
ines ca56fb53d1 Add user survey to navigation [ci skip] 2018-02-15 12:14:30 +01:00
ines cab5b775e7 Document ENT_TYPE matcher attribute [ci skip] 2018-02-15 12:14:19 +01:00
Pradeep Kumar Tippa 416cd021ce Added TAG from spacy symbols which used below 2018-02-09 19:16:59 +05:30
Pradeep Kumar Tippa 01cc9cd9c0 assert statement syntax fix in doc 2018-02-09 19:16:25 +05:30
Pradeep Kumar Tippa a78062e466 Merge remote-tracking branch 'upstream/master' into web-doc-patches 2018-02-09 19:13:19 +05:30
ines ab33e274f5 Add more details on symlink error & Windows solution (resolves #1941) [ci skip] 2018-02-09 10:43:33 +01:00
ines 8eaa934382 Merge branch 'master' of https://github.com/explosion/spaCy 2018-02-09 10:23:36 +01:00
ines e9f67be04d Fix regex flag matcher example (resolves #1950) 2018-02-09 10:23:33 +01:00
ines fc4ae04c55 Document LENGTH attribute in matcher 2018-02-09 10:23:03 +01:00
Pradeep Kumar Tippa 8a7467b26e Merge remote-tracking branch 'upstream/master' into web-doc-patches 2018-02-09 13:54:26 +05:30
Orion Montoya 24af6375db
update link to Honnibal and Johnson 2015
aclweb.org is throwing a gateway timeout on the link as `https`+`aclweb.org`, but is fine with `https`+`www.aclweb.org` (also with `http`+`aclweb.org`, but let's keep it in `https`, shall we?
2018-02-08 10:49:09 -08:00
Pradeep Kumar Tippa 03113d6779 Fixing navigating parse tree doc under dependency parse 2018-02-08 19:34:15 +05:30
ines a3b965b29d Remove UPPER from Matcher attributes docs (resolves #1949) 2018-02-08 11:29:27 +01:00
ines 696ae87b47 Fix whitespace 2018-02-08 11:28:54 +01:00
ines 26bc75134d Fix typo 2018-02-08 11:28:44 +01:00
Pradeep Kumar Tippa da9d687e75
Fixing typo from taining to training 2018-02-07 16:49:25 +05:30
Pradeep Kumar Tippa ed7d268e93
Fixing vocab doc
Replacing "like" with "love", coffee suffix should be "fee" but not "ffe"
2018-02-07 14:55:12 +05:30
ines f377c483e4 Add note on manual entity order in displaCy [ci skip] 2018-02-07 01:08:42 +01:00
ines 58eb178667 Update Doc.char_span docs [ci skip] 2018-02-07 01:08:30 +01:00
sayf eddine hammemi 86e7727855 Fix typo in the word build. 2018-02-04 20:48:45 +01:00
ines 901bc0e85f Add Persian to list of languages [ci skip] 2018-02-01 04:47:34 +01:00
Hassan Shamim a0b912c528 fix broken link to test suite models 2018-01-30 15:01:01 -08:00
greg daefed0a34 Correct documentation of '+' and '*' ops 2018-01-22 15:55:44 -05:00
ines 67ba73351d Fix typo and use better serialization example (resolves #1851) [ci skip] 2018-01-16 18:42:03 +01:00
ines 7943a8e90c Add spacy-lookup by @mpuig [ci skip] 2018-01-16 00:28:46 +01:00
ines 5684206154 Add LanguageCrunch by @artpar [ci skip] 2018-01-15 16:14:26 +01:00
Mateusz Tatusko dda0e58c11
Update _pos-tags.jade
really small changes to English tags description, but might help some people while working on projects
1) -PRB- should be -RRB- instead 
2) space gets tagged as _SP, and not SP
2018-01-15 12:01:51 +09:00
ines 0536e91564 Add note on Tagger.tag_names vs. Tagger.labels (see #1666) [ci skip] 2018-01-14 14:37:19 +01:00
ines bbee48080d Clarify hyperparameters and alias usage in spacy train (resolves #1838) [ci skip] 2018-01-14 14:32:50 +01:00
ines 4daba3abda Add regex section to rule-based matching docs (see #1567, #1833) [ci skip] 2018-01-14 14:22:13 +01:00
Ines Montani 36f426fe0a
Merge pull request #1808 from fucking-signup/master
Fix issue #1769
2018-01-12 21:12:02 +00:00
ines cfac5b955f Fix aligment issues with newsletter signup form 2018-01-12 22:06:44 +01:00
ines 65babd9e2e Fix typo, formatting and operator descriptions (resolves #1820) 2018-01-12 22:06:27 +01:00
Matthew Honnibal a2a06dce24
Merge pull request #1792 from explosion/feature-improve-model-download
💫 Improve model downloading and linking
2018-01-11 20:02:08 +01:00
Ines Montani 11676b47f2
Merge pull request #1828 from wrathagom/patch-1
Small Grammar Fix to _basics.jade
2018-01-11 17:27:23 +00:00
pbnsilva 4cfd848bc3 Fixes typo in PhraseMatcher API docs 2018-01-11 17:35:59 +01:00
Caleb M. Keller e68f6bf890
Small Grammar Fix to _basics.jade
Fixed an incorrect word order.
2018-01-11 09:26:47 -05:00
Matthew Honnibal 7ca49c2061
Merge branch 'master' into feature-improve-model-download 2018-01-10 18:21:55 +01:00
Kit db6e4ba72e
Update code example according to new changes 2018-01-08 03:45:56 +01:00
ines ef210c73dd Update cli.download and cli.validate docs 2018-01-03 21:34:03 +01:00
ines cc9df10e69 Document util.set_lang_class (see #1737) 2018-01-03 20:13:25 +01:00
Ines Montani 874f174ab1
Merge pull request #1790 from nirdesh37/patch-1
Update goldparse.jade
2018-01-03 18:37:07 +00:00
ines 1fa6ba8130 Fix Doc.from_array example to make it work (see #1527) 2018-01-03 16:59:38 +01:00
ines 49635350f0 Add .from_disk() to pipeline component init example (resolves #1728) 2018-01-03 16:50:24 +01:00
ines 95063ba26b Update tests documentation (resolves #1781) 2018-01-03 16:42:26 +01:00
nirdesh37 67fdceed6a
Update goldparse.jade 2018-01-03 17:25:21 +05:30
Martin Andrews e4355dade2
Documentation example fix : token.head needs '==' rather than 'is'
(similar change to #1689, it seems).
2017-12-18 18:12:10 +08:00
Kristofer Berggren 1cb8c997fb
Fix typo Span -> Token on Token API page
Change Span.vector_norm to Token.vector_norm.
2017-12-17 20:32:19 +08:00
Ines Montani 4befd8bd44
Merge pull request #1724 from mpuels/patch-7
doc: Fix minor mistakes
2017-12-17 12:09:17 +00:00
ines 21482b391b Fix head 2017-12-16 13:48:19 +01:00
mpuels b3df2a2ffd
doc: Fix minor mistakes 2017-12-14 20:55:59 +01:00
mpuels 3f7bedadee
doc: Fix minor mistakes 2017-12-13 11:37:24 +01:00
ines 24e80c51b8 Document init-model command 2017-12-07 10:14:37 +01:00
mpuels e3af19a076
doc: Replace 'is not' with '!=' in code example
The function `dependency_labels_to_root(token)` defined in section *Get syntactic dependencies* does not terminate. Here is a complete example:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head is not token:
            dep_labels.append(token.dep)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

Replacing `is not` with `!=` solves the issue:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head != token:
            dep_labels.append(token.dep)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

The output is

    ['cc', 'nsubj']
2017-12-06 20:08:42 +01:00
mpuels 82e575ebfb
doc: Fix assert statement in Lightning Tour
Python 3 throws an error message on the original assert statement. Also, according to the Python documentation regarding the assert statement (https://docs.python.org/3/reference/simple_stmts.html#the-assert-statement), `assert` takes at least one argument and at most two. In the two-argument form the second argument is meant as an error message to be displayed when the assertion fails. I don't think this is intended in this case.
2017-12-06 16:40:51 +01:00
mpuels 662601f01c
doc: Add missing *-operator to nlp.disable_pipes()
I'm using SpaCy version 2.0.3. If I don't use the *-operator in the example, Python throws an error message. With the operator it works fine. Also according to the documentation of the function `nlp.disable_pipes()`, it expects one or more strings as arguments and not one argument being a list of strings.
2017-12-06 15:26:43 +01:00
ines b078e276e6 Document offsets_from_biluo_tags 2017-12-06 13:40:51 +01:00
ines fb663f9b7d Add Russian to list of languages 2017-12-06 13:40:32 +01:00
ines 58a19518cf Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 13:17:58 +01:00
ines 7ade336ab7 Add "Unknown locale" issue to troubleshooting guide (see #1684, #1641, #1517) 2017-12-05 13:17:55 +01:00
Mark Dodwell 9d4c185860
Fix link to CLEAR Style dependency labels PDF 2017-12-04 23:28:06 -08:00
ines 40638b7cdf Update resources 2017-12-02 04:16:03 +01:00
ines 9ea8a7cf0c Add spacy_cld to extensions 2017-12-01 23:21:33 +01:00
ines 8d3f29322f Add spacy_hunspell to resources (see #315) 2017-11-29 09:33:22 +01:00
atomobianco f6a82da907
Corrected char index instead of token index
Changed the index used to add the label because `displacy.render` apparently uses char index
2017-11-26 23:55:25 +01:00
ines bda6e2a816 Add training example to lightning tour 2017-11-26 18:04:18 +01:00
ines 89f8b1fba0 Update example documents 2017-11-26 18:04:04 +01:00
ines 65d66b81f1 Fix typo 2017-11-26 18:03:44 +01:00
ines e4ee666be5 Fix biluo_tags_from_offsets example and docs 2017-11-26 16:37:32 +01:00
ines 434030e0d0 Fix requirements.txt example (see #1638) 2017-11-26 15:53:19 +01:00
Matthew Honnibal 6bc9917a0e
Another small fix to component docs 2017-11-23 11:47:20 +01:00
markulrich c9b63c0dfc Use correct local parameter in example MyComponent (and added markulrich.md contributor file) 2017-11-22 15:59:08 -08:00
ines 4f7e64e371 Update resources 2017-11-18 02:53:00 +01:00
ines c3051e95f7 Add note on attribute extension defaults (resolves #1587) 2017-11-17 19:14:29 +01:00
ines 954f8cc6d1 Update syntax theme (should move the modifications out to an extension sometime) 2017-11-17 19:13:53 +01:00
Raphaël Bournhonesque a0793fd4cc
Fix typo 2017-11-17 17:57:55 +01:00
Martino Mensio ce1aade41e small typo on docs 2017-11-17 16:20:22 +01:00
pavillet ad2935f0c3
Update _spacy.jade
Doc example gives 'object is not subscriptable' error.
Correcting as an attribuet
2017-11-17 00:02:20 +01:00
ines 40c4e8fc09 Remove "optional" from dev_data arg and add more info (see #1578) 2017-11-14 20:26:05 +01:00
KMLDS d5b20ac3b6
Update span.jade 2017-11-13 19:27:20 -05:00
ines bc79274706 Fix typo 2017-11-13 17:00:03 +01:00
ines 7a7b01feb1 Update links 2017-11-13 08:30:06 +01:00
ines b3e502a076 Add videos section to resources 2017-11-13 08:29:57 +01:00
ines f2b6b98b75 Fix typo in code example (resolves #1556) 2017-11-13 08:29:16 +01:00
ines ceb2c596f1 Update conda details 2017-11-11 13:07:00 +01:00
ines 4a97def06a Update features 2017-11-10 19:05:10 +01:00
ines dea5636d6c Fix broken links 2017-11-10 13:06:38 +01:00
Wahib Faizi 0da56f8ef8
Fix typo. Add missing '='. 2017-11-10 14:51:24 +03:00
ines 4c5d2c80d5 Re-add python -m to commands, too brittle :( (see #1536) 2017-11-10 02:30:55 +01:00
ines ee5697a1cd Fix training tips 2017-11-10 00:19:42 +01:00
ines 6ae0ebfa3a Update training tips 2017-11-10 00:17:10 +01:00
ines b20779bac4 Update resources 2017-11-09 23:05:37 +01:00
ines ed84688935 Remove old link 2017-11-09 15:34:12 +01:00
Ines Montani e5b9ccdb5c
Merge pull request #1526 from mcsalgado/fix-typos
fix typos
2017-11-09 15:33:55 +01:00
Victor Salgado fe1d969d5f fix typos 2017-11-09 10:55:13 -02:00
Mathias Deschamps 25b26f0d64
Fix similarity visual
Doc was showing similarity when dissimilar
2017-11-09 11:08:26 +01:00
ines 98767122a7 Fix typos 2017-11-09 04:13:03 +01:00
ines e87eb11beb Update package.json 2017-11-09 04:12:57 +01:00
ines 33b84f4c39 Change clear_vectors to reset_vectors (resolves #1516) 2017-11-08 18:11:23 +01:00
ines 97a5892347 Document Vectors.resize() and update v2 incompatibilities (resolves #1514) 2017-11-08 17:11:11 +01:00
ines c0a7a32bf8 Add en.stop_words change to v2 docs (resolves #1512) 2017-11-08 16:30:46 +01:00
ines 9b09b6b0cd Fix formatting 2017-11-08 16:30:23 +01:00
ines f0bdfb4471 Fix vector listing for core sm models in list overview (see #1513) 2017-11-08 16:24:27 +01:00
ines 94cd3d51db Update v2 docs and model info
Take out speed tables until we fix our benchmark tests on CPU and GPU
2017-11-08 11:43:00 +01:00
ines 14f97cfd20 Add note on stream processing to migration guide (see #1508) 2017-11-08 01:53:36 +01:00
ines 5d1162cf21 Improve nlp.update / training loop overview (see #1507) 2017-11-08 01:17:42 +01:00
ines 2229aba71c Update website 2017-11-08 01:06:30 +01:00
ines 1768703e1c Update website for v2.0 2017-11-07 14:48:17 +01:00
ines e4a05385d6 Update docs 2017-11-07 12:33:43 +01:00
ines a4662a31a9 Move model package templates to cli.package and update docs 2017-11-07 12:15:35 +01:00
ines a09c096d3c Get docs ready for v2.0.0 2017-11-07 12:00:43 +01:00
ines 173b1551af Update examples 2017-11-07 01:22:30 +01:00
ines c37837cad1 Update training docs 2017-11-07 01:06:31 +01:00
ines c7bda87b17 Update model docs and add tips section 2017-11-07 01:05:37 +01:00
ines a1261e8632 Fix formatting 2017-11-07 01:05:30 +01:00
ines 912c1b1821 Document "simple training style" 2017-11-07 00:23:19 +01:00
ines ad6438ccdf Update aside labels and under construction mixin 2017-11-07 00:23:00 +01:00
ines 8fb48b9b91 Update and document new util functions 2017-11-07 00:22:43 +01:00
ines 6447b8e396 Update v2 details 2017-11-06 21:15:36 +01:00
ines 008d7408cf Make vectors vs. tensors more explicit in 101 (see #1498) 2017-11-06 20:16:38 +01:00
ines 71852d3f25 Fix code mixins 2017-11-06 20:16:19 +01:00
ines 3b0699c9fe Update benchmarks and data table style 2017-11-06 19:36:02 +01:00
ines ddff7dc474 Update GPU install docs 2017-11-06 19:35:36 +01:00
ines 64d0f97c67 Update benchmarks and models 2017-11-06 18:19:00 +01:00
Matthew Honnibal 6fdffd7246
Merge pull request #1497 from explosion/feature/improve-optimizer-handling
💫 Improve optimizer handling
2017-11-06 16:41:15 +01:00
ines 972298e0c9 Update Pipe component docs and training API 2017-11-06 14:42:24 +01:00
ines f48e1973ed Fix accuracy table descriptions 2017-11-06 14:12:11 +01:00
ines 2d85ee6b5d Fix broken link 2017-11-06 13:27:30 +01:00
ines efb0a7e934 Fix broken links 2017-11-06 13:20:36 +01:00
ines 42a99eae02 Update troubleshooting guide 2017-11-06 13:17:09 +01:00
ines 2dca9e71a1 Add notes on catastrophic forgetting (see #1496) 2017-11-06 13:17:02 +01:00
ines e68d31bffa Update models quickstart usage example 2017-11-06 13:06:26 +01:00
ines 2fe2c4942f Update models directory and listing 2017-11-06 13:04:29 +01:00
ines df1bdc7173 Add Dutch model 2017-11-06 02:44:59 +01:00
ines 333bef482f Update pattern for Prism.js Python 2017-11-06 02:44:24 +01:00
ines 6b08aefd0c Update formatting and styleguide 2017-11-05 23:31:31 +01:00
ines e61a067c4b Update v2 docs 2017-11-05 21:41:56 +01:00
ines 86d6bd7503 Fix wording 2017-11-05 19:23:50 +01:00
ines 6742657c4d Fix website asset versioning 2017-11-05 19:23:45 +01:00
ines 2ca82d1f6e Take out pt_core_news_sm for now 2017-11-05 18:57:04 +01:00
ines a6ffa942bb Update UD schemes 2017-11-05 18:46:24 +01:00
ines 3fa8900a6b Don't include tag and label schemes in usage guide 2017-11-05 18:21:49 +01:00
ines 4810be4b44 Update POS scheme docs and add links for other schemes 2017-11-05 18:16:34 +01:00
ines e7d0641125 Update POS row mixins 2017-11-05 18:16:16 +01:00
ines 15de2bb01d Update and simplify other annotation scheme data 2017-11-05 16:09:48 +01:00
ines 2d59dd374b Use collapsible sections for pos/dep scheme and update
Will ensure better overview as we add more schemes for more languages
2017-11-05 16:09:30 +01:00
ines a9c77e01b4 Add accordion component (collapsible section) 2017-11-05 16:08:13 +01:00
ines 3d4dff1845 Remove comment 2017-11-05 16:07:14 +01:00
ines b53c2010db Add global focus style for links 2017-11-05 16:07:00 +01:00
ines f092506578 Use hidden attribute instead of style.display 2017-11-05 16:06:50 +01:00
ines 0e8157674a Add Portuguese and French 2017-11-04 23:07:21 +01:00
ines d9fa3c6054 Update adding languages example 2017-11-04 15:12:39 +01:00
ines c83fe54f0c Update venv docs in installation instructions 2017-11-04 14:27:55 +01:00
ines 2940938bd8 Use more distinct style for checkboxes in quickstart 2017-11-04 14:24:30 +01:00
ines 4793d56a3e Update commands for building from source 2017-11-04 14:24:14 +01:00
ines 177bf4ee39 Update GitHub topic links 2017-11-04 14:02:28 +01:00
ines 2639ecd5f8 Add docs note on custom tokenizer rules (see #1491) 2017-11-03 23:33:18 +01:00
ines 380f2441b4 Fix script includes 2017-11-03 18:51:03 +01:00
Abhinav Sharma c740277f9f
Minor typo [ nad => and ] 2017-11-03 16:30:44 +05:30
ines 1e16374687 Update models list to reflect spaCy v2.0.0a18 2017-11-03 11:29:34 +01:00
ines a62b0727d8 Tidy up and always use bundle in built site for now
Just to be safe
2017-11-03 11:29:21 +01:00
ines d0f88af5b6 Hide error earlier 2017-11-03 11:29:04 +01:00
ines 43512c68b2 Fix vector details in model overview 2017-11-02 20:04:13 +01:00
ines 9baab241b4 Add skeleton language data for Turkish 2017-11-02 16:32:24 +01:00
ines 31e349a62c Update model families 2017-11-02 16:13:38 +01:00
ines 15cbc61a6e Adjust rendering of large numbers
1234 -> 1.2k
12345 -> 12.3k
123456 -> 123k
1234567 -> 1.2m
2017-11-02 16:13:18 +01:00
ines 391fce09d9 Update licenses 2017-11-01 23:04:40 +01:00
ines c6fea3e5f6 Add Romanian and Croatian skeletons (experimental)
Add language data templates to make it easier for others to contribute to the language support
2017-11-01 23:04:28 +01:00
ines 408f450ce0 Tidy up 2017-11-01 23:01:12 +01:00
ines 2fa53b39d5 Add dev dependency 2017-11-01 23:01:06 +01:00
ines 1976fb157f Update licenses 2017-11-01 21:49:57 +01:00
ines 2ba4e4fc88 Fix broken links and add check_links shortcut script 2017-11-01 21:11:10 +01:00
ines e5a4c31bb4 Adjust code line height 2017-11-01 19:49:42 +01:00
ines 5dd0d6a383 Update lightning tour 2017-11-01 19:49:36 +01:00
ines 9b4c38fe9f Add button option to terminal component 2017-11-01 19:49:27 +01:00
ines 12954ab218 Don't document the tensorizer for now 2017-11-01 19:49:04 +01:00
ines a7a76ea8c5 Update backwards incompatibilities
Also add separate section for deprecated
2017-11-01 16:31:57 +01:00
ines 4f77bb8476 Fix error handling 2017-11-01 16:29:55 +01:00
ines 5ab4e96144 Update v2 guide and split into partials 2017-11-01 14:13:36 +01:00
ines 1c7313051f Document Token.is_sent_start 2017-11-01 14:13:22 +01:00
ines 9e429b5a8a Update formatting of deprecation note 2017-11-01 14:13:08 +01:00
ines 0fbab8160d Update GloVe vectors example 2017-11-01 13:14:43 +01:00
ines a6f6bd6c98 Adjust tag spacing 2017-11-01 02:04:00 +01:00
ines f84660986a Update example sentences for models quickstart 2017-11-01 01:57:33 +01:00
ines 3b7ec64caa Add PYTHONPATH to build from source quickstart 2017-11-01 01:52:45 +01:00
ines 092333afd4 Update vector details and number conversion 2017-11-01 01:47:31 +01:00
ines 5fd851a80b Log errors 2017-11-01 01:46:50 +01:00
ines 07d02c3304 Update vectors and similarity usage guide 2017-11-01 01:25:17 +01:00
ines 0d8f4a534b Update Vectors API docs 2017-11-01 00:56:54 +01:00
ines 9eb998443f Update language tokenizer dependencies 2017-11-01 00:56:35 +01:00
ines 0cde065ed9 Add Irish to list of languages (see #1152) 2017-11-01 00:56:21 +01:00
Ines Montani 3c8db3e4da
Merge pull request #1473 from explosion/refactor-javascript
Refactor website JS and add model comparison tool
2017-10-31 14:02:05 +01:00
ines be5b635388 Remove "needs model" and add info about models (see #1471) 2017-10-31 13:37:55 +01:00
ines 5af6c8b746 Update training docs 2017-10-30 20:28:00 +01:00
ines 8ad4f3f6e5 Take out JSON format include in tagger/parser 2017-10-30 19:48:35 +01:00
ines 33af6ac69a Use even smaller examle size
100 was still too much, so try 20 instead
2017-10-30 19:46:45 +01:00
ines f02b0af821 Fix path and use smaller example size
500 was too larger and caused laggy rendering
2017-10-30 19:44:35 +01:00
ines 18dde7869a Update training data docs and add vocab JSONL 2017-10-30 19:40:05 +01:00
ines 57534253e6 Move CLI docs to own page 2017-10-30 19:39:26 +01:00
ines ec657c1ddc Update vocab docs and document Vocab.prune_vectors 2017-10-30 19:35:41 +01:00
ines 12343e23fd Update CLI docs and document vocab command 2017-10-30 18:59:08 +01:00
ines 5598542055 Add link 2017-10-30 18:58:55 +01:00
ines abf8aa05d3 Populate --create-meta defaults from file if available
If meta.json is found in directory and user chooses to overwrite it, show existing data as defaults.
2017-10-30 18:39:38 +01:00
ines 3ffbb64ab6 Unify chart options and update styleguide 2017-10-30 17:25:49 +01:00
ines 14ad92d337 Ensure fallbacks / progressive enhancement if JS disabled 2017-10-30 16:16:19 +01:00
ines 1eb1ed0c7c Add tool for model comparison (experimental)
User can select two model and their meta is fetched from GitHub. Features, accuracy figures and speed benchmarks are displayed in a table, with an additional chart comparing the accuracy scores if available. Main use case: demonstrating and visualising trade-offs between larger and smaller models of the same type.
2017-10-30 14:09:43 +01:00
ines fb2710211b Integrate rollup into website build process 2017-10-30 14:08:26 +01:00
ines 38ef4274b6 Remove confusing icon for non-compatible models
ModelLoader will now output "not compatible" if no compatible version of model is found for a spaCy version
2017-10-30 14:07:42 +01:00
ines 8db3da3c3d Refactor JS, split into modules and add nomodule option
rollup.js will be compiled by the rollup package and Babel on build, and will be loaded if a browser doesn't yet support JS modules
2017-10-30 14:06:25 +01:00
ines 5453821a9f Update NER annotation scheme
Add note on training data sources and include coarse-grained Wikipedia scheme
2017-10-30 13:53:49 +01:00
ines df149455f9 Don't ever wrap navigation bar contents 2017-10-30 13:16:20 +01:00
ines 74dd0ee2c2 Prevent responsive tables form scrolling vertically 2017-10-30 13:16:06 +01:00
ines ae45446978 Remove comment 2017-10-30 13:15:46 +01:00
ines 25f6331550 Allow other style arguments on +grid-col 2017-10-30 13:15:30 +01:00
ines 08869c19fd Merge mixins and mixins-base
The distinction was never clear anyways and it was progressively getting messier. So all mixins live in one file now.
2017-10-30 13:15:13 +01:00
ines ae2ad5becc Remove charts from model direcory and add speed benchmarks
With speed benchmarks, charts ended up taking up too much space – and they were mostly data porn and not particularly useful anyways. Instead, we might add a "Compare" page that fetches all models and lets the user compare two or more models in terms of accuracy, speed etc.
2017-10-29 03:58:19 +01:00
ines 47fd254ba7 Combine table scroll shadows if row has only one cell 2017-10-29 03:56:37 +01:00
ines b11928abc2 Adjust labels, spacing and hack specificity 2017-10-29 03:56:09 +01:00
ines af0ba014d2 Document +code-new and +code-old 2017-10-29 03:54:13 +01:00
ines 9b6828bd83 Add height option to +chart and document 2017-10-29 03:53:59 +01:00
ines e18744823b Add placeholders for Italian and Portuguese models 2017-10-29 01:29:39 +02:00
ines 3b1cfa3455 Add GPL license link 2017-10-29 01:18:32 +02:00
ines 5147cdc468 Fix formatting and add missing v2 label 2017-10-29 01:18:09 +02:00
ines 53bfcdba31 Make tooltips/tags and old/new code blocks more accessible (see #(see #1471))
Always add tooltip text as hidden label. Use different tooltip icons for tags and inline help icons. Add labels to old/new code blocks and add option to customise label text.
2017-10-29 01:17:49 +02:00
ines 4a4f9666b2 Improve style/accessibility of yes/no/neutral icons (see #1471)
Use distinctive icons instead of only colour, add proper handling of labels (hidden or visible, but always present) with optional custom text.
2017-10-29 01:14:30 +02:00
ines a8e10f94e4 Tidy up Lexeme and update docs 2017-10-27 21:07:50 +02:00
ines 5167a0cce2 Tidy up Vectors and docs 2017-10-27 19:45:19 +02:00