.. spaCy documentation master file, created by
   sphinx-quickstart on Tue Aug 19 16:27:38 2014.
   You can adapt this file completely to your liking, but it should at least
   contain the root `toctree` directive.

spaCy NLP Tokenizer and Lexicon
================================

spaCy is a library for industrial-strength NLP in Python and Cython. Its core
values are efficiency, accuracy and minimalism.

* Efficiency: spaCy is

It does not attempt to be comprehensive or to provide lavish syntactic sugar.
This isn't a library that covers 43 known algorithms to do X. You get one, the
best one, with a simple, low-level interface.
For commercial users, the code is free but the data isn't. For researchers, both
are free and always will be.

Comparison
----------

+-------------+-------------+---------------+--------------+
| POS taggers | Speed (w/s) | % Acc. (news) | % Acc. (web) |
+-------------+-------------+---------------+--------------+
| spaCy       |             |               |              |
+-------------+-------------+---------------+--------------+
| Stanford    | 16,000      |               |              |
+-------------+-------------+---------------+--------------+
| NLTK        |             |               |              |
+-------------+-------------+---------------+--------------+

.. toctree::
    :hidden:
    :maxdepth: 3

    what/index.rst
    why/index.rst
    how/index.rst