Commit Graph

43 Commits

Author SHA1 Message Date
cedar101 58f06e6180 Korean support (#3901)
* start lang/ko

* add test codes

* using natto-py

* add test_ko_tokenizer_full_tags()

* spaCy contributor agreement

* external dependency for ko

* collections.namedtuple for python version < 3.5

* case fix

* tuple unpacking

* add jongseong(final consonant)

* apply mecab option

* Remove Pipfile for now


Co-authored-by: Ines Montani <ines@ines.io>
2019-07-09 22:23:16 +02:00
Ines Montani e597110d31
💫 Update website (#3285)

## Description

The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.

This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.


### Types of change
enhancement

## Checklist
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 19:31:19 +01:00
Ines Montani 75f3234404
💫 Refactor test suite (#2568)
## Description

Related issues: #2379 (should be fixed by separating model tests)

* **total execution time down from > 300 seconds to under 60 seconds** 🎉
* removed all model-specific tests that could only really be run manually anyway – those will now live in a separate test suite in the [`spacy-models`](https://github.com/explosion/spacy-models) repository and are already integrated into our new model training infrastructure
* changed all relative imports to absolute imports to prepare for moving the test suite from `/spacy/tests` to `/tests` (it'll now always test against the installed version)
* merged old regression tests into collections, e.g. `test_issue1001-1500.py` (about 90% of the regression tests are very short anyways)
* tidied up and rewrote existing tests wherever possible

### Todo

- [ ] move tests to `/tests` and adjust CI commands accordingly
- [x] move model test suite from internal repo to `spacy-models`
- [x] ~~investigate why `pipeline/test_textcat.py` is flaky~~
- [x] review old regression tests (leftover files) and see if they can be merged, simplified or deleted
- [ ] update documentation on how to run tests


### Types of change
enhancement, tests

## Checklist
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [ ] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-07-24 23:38:44 +02:00
ines c0b62ce13c Ignore pytest cache 2018-07-19 12:30:09 +02:00
Mathias Deschamps d82f868e1c Ignore pycharm project files 2017-11-13 17:46:05 +01:00
ines bfb512f45a Add website package.json and fix gitignore 2017-10-04 00:18:41 +02:00
ines 22dd929b65 Add models documentation 2017-10-03 14:28:03 +02:00
ines 371b21f82d Don't ignore /bin directory 2017-08-14 12:18:30 +02:00
ines c862527474 Add more variations of .env to gitignore 2017-06-02 21:08:39 +02:00
ines 57beef5d36 Tidy up .gitignore 2017-05-18 13:51:31 +02:00
Em 1bb364a3b5 Adding venv to .gitignore 2017-03-10 16:52:04 -08:00
Em 426d17167f Added string manipulation for spans 2017-03-10 16:50:02 -08:00
ines 00728a23f0 Fix path in gitignore 2017-02-24 18:26:32 +01:00
Ines Montani 427e942e84 Ignore temporary files 2016-11-24 19:21:27 +01:00
Mark Amery bc368e4237 Ignore entire data folder
Previously only some of its content was ignored, so running

    python -m spacy.en.download all

after installing from a local repo would create unstaged changes.
2016-11-20 20:33:23 +00:00
Mark Amery 094c51f496 Add cythonize.json to .gitignore
This gets generated for me when installing from the local repo with pip using `sudo pip3 install -e .` from within the spaCy folder. I figure it should be ignored.
2016-11-20 13:55:52 +00:00
Ines Montani f0868dfc6b Update .gitignore 2016-11-01 01:13:56 +01:00
Ines Montani 8cef8ebac5 Update .gitignore 2016-10-31 19:20:03 +01:00
Ines Montani 7615b41bff Update to new website 2016-10-31 19:04:15 +01:00
Matthew Honnibal ae29b9bdfd Fix travis and README conflicts 2016-10-19 00:16:11 +02:00
Ines Montani 504b80b6da Update gitignore 2016-10-03 20:19:05 +02:00
Matthew Honnibal 89174cda74 Ignore pyenv .python-version file 2016-09-30 20:44:52 +02:00
Matthew Honnibal ea6fda0e05 Add tmp/ folder to gitignore 2016-09-30 20:40:52 +02:00
Ines Montani f321272bee Update gitignore for website 2016-04-01 00:36:56 +11:00
Oleg Zdornyy a774131671 Added reloadable English() example for inv. count 2016-03-09 19:35:55 -08:00
maxirmx 59d85adff5 Added Windows file to .gitignore 2015-10-13 10:58:30 +03:00
maxirmx 8e03239ac5 Merge remote-tracking branch 'refs/remotes/honnibal/master'
Conflicts:
	setup.py
2015-10-10 17:38:06 +03:00
Matthew Honnibal 7820c504d7 * Add sass-cache to gitignore 2015-09-24 18:14:21 +10:00
Matthew Honnibal f9a6bea746 * Ignore keys and other things 2015-08-22 22:12:07 +02:00
Matthew Honnibal 221f7e51c7 * Ignore spacy/serialize/*.cpp 2015-07-17 01:36:49 +02:00
Matthew Honnibal ba9a22ae0b * Ignore cpp files in spacy/tokens 2015-07-13 22:30:15 +02:00
Jordan Suchow 3005c86682 Don't track generated data files 2015-04-19 13:25:42 -07:00
Matthew Honnibal c0a3e25b43 * Upd gitignore 2015-04-08 07:48:04 +02:00
Matthew Honnibal 49df1b7002 * Ignore .tgz files 2015-03-26 16:44:42 +01:00
Matthew Honnibal ef2493a3bd * Upd gitignore 2015-01-30 16:49:44 +11:00
Matthew Honnibal a68ecc50fa * Ignore cpp files within en dir 2014-12-23 15:19:01 +11:00
Matthew Honnibal e3b123e6e0 * Ignore cpp files from parser 2014-12-18 09:05:51 +11:00
Matthew Honnibal 66cb4f96e1 * Upd gitignore 2014-11-12 23:25:27 +11:00
Matthew Honnibal bebfd7940d * Upd gitignore 2014-08-20 17:04:33 +02:00
Matthew Honnibal 6050209f65 * Update gitignore 2014-08-02 22:06:30 +01:00
Matthew Honnibal 60a2dac13c * Upd gitignore 2014-07-07 05:10:19 +02:00
Matthew Honnibal 81f6491cb4 * Add gitignore 2014-07-05 20:50:01 +02:00
honnibal e3fad681c5 Initial commit 2014-07-04 01:15:40 +10:00