Rajat
8b8efa1b42
update spacy universe with my project ( #5497 )
...
* added contextualSpellCheck in spacy universe meta
* removed extra formatting by code
* updated with permanent links
* run json linter used by spacy
* filled SCA
* updated the description
2020-05-25 11:30:23 +02:00
Sofie Van Landeghem
ae1c179f3a
Remove the nested quote
2020-05-23 17:58:19 +02:00
Jannis
aa53ce6996
Documentation Typo Fix ( #5492 )
...
* Fix typo
Change 'realize' to 'realise'
* Add contributer agreement
2020-05-22 19:50:26 +02:00
Matthew Honnibal
f6078d866a
Merge pull request #5121 from adrianeboyd/bugfix/revert-token-match
...
Revert token_match priority changes from #4374 and extend token match options
2020-05-22 14:42:51 +02:00
Ines Montani
65c7e82de2
Auto-format and remove 2.3 feature [ci skip]
2020-05-22 13:50:30 +02:00
Adriane Boyd
e4a1b5dab1
Rename to url_match
...
Rename to `url_match` and update docs.
2020-05-22 12:41:03 +02:00
Adriane Boyd
730fa493a4
Merge remote-tracking branch 'upstream/master' into bugfix/revert-token-match
2020-05-22 12:18:00 +02:00
Ines Montani
ee027de032
Update universe and display of videos [ci skip]
2020-05-21 21:54:23 +02:00
Ines Montani
53da6bd672
Add course to landing [ci skip]
2020-05-21 20:45:33 +02:00
Kevin Lu
c7c4cd5fe1
Changed pyate code example in universe.json
2020-05-20 09:11:32 -07:00
Kevin Lu
0a5b140235
Update universe.json
2020-05-19 20:12:21 -07:00
Ines Montani
f333c2a011
Merge pull request #5386 from svlandeg/fix/nel-docs
2020-05-10 12:00:09 +02:00
Travis Hoppe
d4cc18b746
Added author information for NLPre ( #5414 )
...
* Add author links for NLPre and update category
* Add contributor statement
2020-05-08 11:28:54 +02:00
adrianeboyd
4a15b559ba
Clarify Token.pos as UPOS ( #5419 )
2020-05-08 10:36:25 +02:00
adrianeboyd
a2345618f1
Fix Token API docs from #5375 ( #5418 )
2020-05-08 10:25:02 +02:00
Adriane Boyd
565e0eef73
Add tokenizer option for token match with affixes
...
To fix the slow tokenizer URL (#4374 ) and allow `token_match` to take
priority over prefixes and suffixes by default, introduce a new
tokenizer option for a token match pattern that's applied after prefixes
and suffixes but before infixes.
2020-05-05 10:35:33 +02:00
Adriane Boyd
792c8af8cf
Merge remote-tracking branch 'upstream/master' into bugfix/revert-token-match
2020-05-05 09:25:57 +02:00
svlandeg
ebaed7dcfa
Few more updates to the EL documentation
2020-04-30 10:17:06 +02:00
adrianeboyd
bdff76dede
Various updates/additions to CLI scripts ( #5362 )
...
* `debug-data`: determine coverage of provided vectors
* `evaluate`: support `blank:lg` model to make it possible to just evaluate
tokenization
* `init-model`: add option to truncate vectors to N most frequent vectors
from word2vec file
* `train`:
* if training on GPU, only run evaluation/timing on CPU in the first
iteration
* if training is aborted, exit with a non-0 exit status
2020-04-29 12:56:46 +02:00
Sofie Van Landeghem
cfdaf99b80
Fix passing of component configuration ( #5374 )
...
* add kwargs to to_disk methods in docs - otherwise crashes on 'exclude' argument
* add fix and test for Issue 5137
2020-04-29 12:56:17 +02:00
Ines Montani
63885c1836
Remove u string and auto-format [ci skip]
2020-04-29 12:54:57 +02:00
Sofie Van Landeghem
f67343295d
Update NEL examples and documentation ( #5370 )
...
* simplify creation of KB by skipping dim reduction
* small fixes to train EL example script
* add KB creation and NEL training example scripts to example section
* update descriptions of example scripts in the documentation
* moving wiki_entity_linking folder from bin to projects
* remove test for wiki NEL functionality that is being moved
2020-04-29 12:53:53 +02:00
adrianeboyd
a6e521cd79
Add is_sent_end token property ( #5375 )
...
Reconstruction of the original PR #4697 by @MiniLau.
Removes unused `SENT_END` symbol and `IS_SENT_END` from `Matcher` schema
because the Matcher is only going to be able to support `IS_SENT_START`.
2020-04-29 12:53:16 +02:00
Ines Montani
a77754120d
Merge pull request #5177 from nlptechbook/patch-5
2020-04-29 12:52:21 +02:00
Ines Montani
1cbb272a6b
Update website/meta/universe.json
2020-04-29 12:51:44 +02:00
Ines Montani
732629b0dd
Update website/meta/universe.json
2020-04-29 12:51:37 +02:00
adrianeboyd
90ce34db42
Add cuda101 and cuda102 options to setup ( #5377 )
...
* Add cuda101 and cuda102 options to setup
* Update cudaNNN options in docs
2020-04-29 12:51:12 +02:00
Louis Guitton
a27c4014f5
Add mlflow to spaCy universe ( #5352 )
...
* Add mlflow to universe
* Use mlflow black logo
2020-04-29 10:18:03 +02:00
adrianeboyd
792aa7b6ab
Remove references to textcat spans ( #5360 )
...
Remove references to unimplemented `TextCategorizer` span labels in
`GoldParse` and `Doc`.
2020-04-27 18:01:12 +02:00
adrianeboyd
90c754024f
Update nlp.vectors to nlp.vocab.vectors ( #5357 )
2020-04-27 10:53:05 +02:00
Mike
481574cbc8
[minor doc change] embedding vis. link is broken in `website/docs/usage/examples.md` ( #5325 )
...
* The embedding vis. link is broken
The first link seems to be reasonable for now unless someone has an updated embedding vis they want to share?
* contributor agreement
* Update Mlawrence95.md
* Update website/docs/usage/examples.md
Co-Authored-By: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2020-04-21 20:35:12 +02:00
Ines Montani
b919844fce
Tidy up and fix alignment of landing cards ( #5317 )
2020-04-20 20:33:13 +02:00
laszabine
fb73d4943a
Amend documentation to Language.evaluate ( #5319 )
...
* Specified usage of arguments to Language.evaluate
* Created contributor agreement
2020-04-16 20:00:18 +02:00
Thomas Thiebaud
1eef60c658
Add spacy_fastlang to universe ( #5271 )
...
* Add spacy_fastlang to universe
* Sign SCA
2020-04-15 13:50:46 +02:00
Sofie Van Landeghem
a3965ec13d
tag-map-path since 2.2.4 instead of 2.2.3 ( #5289 )
2020-04-14 14:53:47 +02:00
Marek Grzenkowicz
6a8a52650f
[ Closes #5292 ] Fix typo in option name "--n-save_every" ( #5293 )
...
* Sign contributor agreement for chopeen
* Fix typo in option name and close #5292
2020-04-11 23:35:01 +02:00
Sofie Van Landeghem
7ad0fcf01d
fix json ( #5267 )
2020-04-08 12:58:09 +02:00
vincent d warmerdam
f329d5663a
add "whatlies" to spaCy universe ( #5252 )
...
* Add "whatlies"
We're releasing it on our side officially on the 16th of April. If possible, let's announce around the same time :)
* sign contributor thing
* Added fancy gif
as the image
* Update universe.json
Spellin error and spaCy clarification.
2020-04-06 11:29:30 +02:00
nlptechbook
ddf3c2430d
Update universe.json
2020-04-03 12:10:03 -04:00
Sofie Van Landeghem
1137420840
Small doc fixes ( #5250 )
...
* fix link
* torchtext instead tochtext
2020-04-03 13:01:43 +02:00
Nikhil Saldanha
d1ddfa1cb7
update docs for EntityRecognizer.predict
...
return type was wrongly written as a tuple, changed to syntax.StateClass
2020-03-28 18:13:02 +01:00
Tiljander
e53232533b
Describing priority rules for overlapping matches ( #5197 )
...
* Describing priority rules for overlapping matches
* Create Tiljander.md
* Describing priority rules for overlapping matches
* Update website/docs/api/entityruler.md
Co-Authored-By: Ines Montani <ines@ines.io>
Co-authored-by: Ines Montani <ines@ines.io>
2020-03-26 13:13:22 +01:00
adrianeboyd
d88a377bed
Remove Vectors.from_glove ( #5209 )
2020-03-26 10:45:47 +01:00
nlptechbook
b52e1ab677
Update universe.json
...
A bot powered by Clarifai Predict API and spaCy. Can be found in Telegram messenger at @pic2phrase_bot
2020-03-21 11:39:15 -04:00
Baciccin
3b53617a69
Add Ligurian language
2020-03-19 21:37:01 -07:00
Ines Montani
80e7e1347e
Update universe.json [ci skip]
2020-03-17 22:21:34 +01:00
Ines Montani
eda6eff8b1
Update universe.json [ci skip]
2020-03-17 22:19:29 +01:00
Ines Montani
16e7301d34
Merge pull request #5161 from pmbaumgartner/master
...
add gobbli to spacy-universe 🥳
2020-03-17 22:18:30 +01:00
Peter B
b04057c204
add mentions of spaCy use
2020-03-17 15:03:43 -04:00
Ines Montani
b2b01a5c8b
Update universe.json [ci skip]
2020-03-17 19:53:31 +01:00