spaCy

Commit Graph

Author	SHA1	Message	Date
ines	607ba458e7	Fix whitespace	2017-05-08 15:42:31 +02:00
ines	60db497525	Add update_exc and expand_exc to util. Doesn't require separate language data util anymore.	2017-05-08 15:42:12 +02:00
ines	6e5bd4f228	Remove unused functions from deprecated	2017-05-08 15:40:16 +02:00
ines	f68e420bc0	Add PRON_LEMMA and DET_LEMMA to deprecated. Will be replaced with proper values across the language data later.	2017-05-08 15:35:30 +02:00
ines	bd6a7cf4f6	Simplify deprecated model downloading. Only relevant for spaCy < v1.7.0.	2017-05-08 15:32:10 +02:00
ines	95edd9e896	Let parse_package_meta take full path	2017-05-08 15:30:48 +02:00
ines	326746eb15	Add util function to resolve arg to model path: 1. check if in data dir or shortcut link; 2. check if installed as a pip package; 3. check if string is path to model; 4. check if Path or Path-like object	2017-05-08 15:29:47 +02:00
ines	a7801e7342	Update spacy.load(). path argument is now deprecated and name can either take a model name or path. Implement lazy loading by importing module and read Language class name off __all__.	2017-05-08 15:27:25 +02:00
ines	94697e9afc	Fix typo	2017-05-08 02:00:37 +02:00
ines	0ee2a22b67	Merge branch 'pr/1024' into develop	2017-05-08 01:12:44 +02:00
ines	c4492d260a	Fix kwargs	2017-05-08 01:05:24 +02:00
ines	b5a726c5cd	Tidy up deprecated.py	2017-05-07 23:29:22 +02:00
ines	59c3b9d4dd	Tidy up CLI and fix print functions	2017-05-07 23:25:29 +02:00
ines	311704674d	Add path2str compat function	2017-05-07 23:24:56 +02:00
ines	e34069db9f	Move is_package and get_model_package_path to util	2017-05-07 23:24:51 +02:00
ines	957ba676b4	Add model files base path to about.py	2017-05-07 23:22:35 +02:00
ines	8d8dd9ceb2	Don't set default value for model	2017-05-07 23:22:21 +02:00
ines	b1f22c5a10	Fix formatting	2017-05-03 20:11:02 +02:00
ines	a04b5be1b2	Add glossary for annotation scheme (closes #1034). Can be imported as explain from spacy.glossary, or called as spacy.explain(term)	2017-05-03 17:02:17 +02:00
Gregory Howard	929f2792a7	Rennaming cls in module. cls is now a class	2017-05-03 15:41:07 +02:00
Gregory Howard	0e8c41ea4f	Adding method lemmatizer for every class	2017-05-03 12:14:42 +02:00
Gregory Howard	32ca07989e	adding export japanese	2017-05-03 11:07:29 +02:00
Grégory Howard	f9d7144224	Merge branch 'master' into master	2017-05-03 11:04:51 +02:00
Gregory Howard	f2ab7d77b4	Lazy imports language	2017-05-03 11:01:42 +02:00
Ines Montani	3ea23a3f4d	Fix formatting	2017-05-03 09:44:38 +02:00
Ines Montani	d730eb0c0d	Raise custom ImportError if importing janome fails	2017-05-03 09:43:29 +02:00
Ines Montani	949ad6594b	Add newline	2017-05-03 09:38:43 +02:00
Ines Montani	d12ca587ea	Add newline	2017-05-03 09:38:29 +02:00
Ines Montani	8676cd0135	Add newline	2017-05-03 09:38:07 +02:00
Yasuaki Uechi	c8f83aeb87	Add basic japanese support	2017-05-03 13:56:21 +09:00
Gregory Howard	c0afcd22bb	Merge remote-tracking branch 'remotes/upstream/master'	2017-04-27 14:42:54 +02:00
Matthew Honnibal	31ec9e1371	Merge branch 'master' of https://github.com/explosion/spaCy	2017-04-27 13:21:39 +02:00
Matthew Honnibal	2da16adcc2	Add dropout optin for parser and NER. Dropout can now be specified in the `Parser.update()` method via the `drop` keyword argument, e.g. `nlp.entity.update(doc, gold, drop=0.4)`. This will randomly drop 40% of features, and multiply the value of the others by 1. / 0.4. This may be useful for generalising from small data sets. This commit also patches the examples/training/train_new_entity_type.py example, to use dropout and fix the output (previously it did not output the learned entity).	2017-04-27 13:18:39 +02:00
Gregory Howard	92f368f83b	Removing extra spaces	2017-04-27 12:02:14 +02:00
Gregory Howard	13b6957c8e	Adding unitest for tokenization in french (with title)	2017-04-27 11:53:44 +02:00
Gregory Howard	8ff4682255	correcting tokenizer exception. Adding tests for lemmatization	2017-04-27 11:52:14 +02:00
Ines Montani	7da9cefd25	Merge pull request #1022 from luvogels/master. Initial support for Norwegian Bokmål	2017-04-27 11:16:06 +02:00
Ines Montani	c9e592ae6c	Add newline	2017-04-27 11:15:41 +02:00
Ines Montani	5942adccc2	Add newline	2017-04-27 11:15:19 +02:00
Ines Montani	4cd9269aef	Add newline	2017-04-27 11:15:04 +02:00
Ines Montani	ccf13ecc21	Add newline	2017-04-27 11:14:42 +02:00
Ines Montani	03d2b0cc05	Add newline	2017-04-27 11:14:26 +02:00
Gregory Howard	44cb486849	Adding unitest for tokenization in french (with title)	2017-04-27 10:59:38 +02:00
Gregory Howard	ad8129cb45	Improvement of rules now title insentive and have same declaration format	2017-04-27 10:23:56 +02:00
luvogels	d12a0b6431	Hooked up tokenizer tests	2017-04-26 23:21:41 +02:00
Matthew Honnibal	f0e1606d27	Increment version	2017-04-26 20:25:41 +02:00
luvogels	b331929a7e	Merge branch 'master' of https://github.com/luvogels/spaCy	2017-04-26 19:15:48 +02:00
luvogels	8de59ce3b9	Added tokenizer tests	2017-04-26 19:10:18 +02:00
Matthew Honnibal	4d98511db7	Make Span hashable. Closes #1019	2017-04-26 19:01:05 +02:00
Matthew Honnibal	24c4c51f13	Try to make test999 less flakey	2017-04-26 18:42:06 +02:00
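
The message for commit 326746eb15 above lists a four-step lookup order for turning an argument into a model path. The sketch below illustrates that order only; it is not the code from the commit. The function name `resolve_model_path`, the `data_dir` parameter, and the choice to return `None` on failure are all assumptions made for illustration.

```python
import importlib.util
import os
from pathlib import Path


def resolve_model_path(name, data_dir):
    """Hypothetical helper: return a Path for the model, or None if not found."""
    data_dir = Path(data_dir)
    if isinstance(name, str):
        # 1. A directory or shortcut link with that name inside the data dir.
        candidate = data_dir / name
        if candidate.exists():
            return candidate
        # 2. An importable package with that name (e.g. installed via pip).
        #    Only plain identifiers are tried, so path-like strings skip this step.
        if name.isidentifier():
            try:
                spec = importlib.util.find_spec(name)
            except (ImportError, ValueError):
                spec = None
            if spec is not None and spec.submodule_search_locations:
                return Path(list(spec.submodule_search_locations)[0])
        # 3. The string itself is a path to a model.
        if Path(name).exists():
            return Path(name)
    # 4. A Path or Path-like object.
    elif isinstance(name, os.PathLike) and Path(name).exists():
        return Path(name)
    return None
```

Checking the data directory first means a shortcut link takes precedence over an installed package of the same name, which matches the order given in the commit message.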

2836 Commits