💫 Industrial-strength Natural Language Processing (NLP) in Python
Matthew Honnibal 1629b33082 * Fix copying of tokenizer data in init_model 2015-04-12 04:45:31 +02:00
bin * Fix copying of tokenizer data in init_model 2015-04-12 04:45:31 +02:00
contributors * Fix nested lists 2015-03-25 14:38:59 +01:00
docs * Fix quickstart 2015-03-19 00:09:39 -04:00
lang_data/en * Bug fixes to tokenization, and support for times 2015-03-26 16:44:48 +01:00
spacy * Remove Spans class 2015-04-12 04:07:29 +02:00
tests * Add test for Issue #44 2015-04-07 06:05:18 +02:00
.gitignore * Upd gitignore 2015-04-08 07:48:04 +02:00
.travis.yml * Upd travis.yml 2015-01-31 13:50:30 +11:00
LICENSE.txt * Add license file 2015-01-26 03:07:18 +11:00
MANIFEST.in * Add manifest file 2015-01-30 16:49:02 +11:00
README.md Update README.md 2015-02-01 18:38:22 +11:00
bootstrap_python_env.sh * Add bootstrap script 2015-03-16 14:01:36 -04:00
dev_setup.py * Upd dev_setup 2015-01-03 21:02:03 +11:00
fabfile.py * Add train function to fabfile 2015-04-08 22:47:59 +02:00
requirements.txt * Require advanced version of cymem 2015-02-01 17:04:59 +11:00
setup.py * Move Span/Spans to separate files. Currently duplicates lots of Tokens functionality. Should probably be integrated into Tokens 2015-03-26 16:44:48 +01:00
wordnet_license.txt * Add WordNet license file 2015-02-01 16:11:53 +11:00

README.md

spaCy


http://honnibal.github.io/spaCy

Fast, state-of-the-art natural language processing pipeline. Commercial licenses available, or use under AGPL.

Version 0.40 released

2015-02-01

  • Several bug fixes have been pushed to master.
  • Tests fail on some platforms, including Travis CI, due to memory errors.
  • Tests pass on my local OS X and Ubuntu machines (for Python 2.7 and Python 3.4).

The problem is likely due to non-portable usage of the Py_UNICODE data type in my Cython code, or possibly to the binary file formats of lexemes.bin, vec.bin, or the model file read by thinc.learner.LinearModel.
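To illustrate why both suspects can be platform dependent, here is a minimal sketch. It is not taken from the spaCy code base and is not a diagnosis of the actual bug. The function names `unicode_storage_report`, `pack_code_points`, and `unpack_code_points` are hypothetical, and the code assumes Python 3. It reports how the running interpreter stores text and native integers, then shows one way to write code points in a fixed-width, fixed-byte-order layout that does not depend on the width of Py_UNICODE.

```python
import struct
import sys


def unicode_storage_report():
    """Collect facts about this interpreter build that affect binary portability."""
    return {
        # 0xFFFF on "narrow" Python 2 builds (2-byte Py_UNICODE), 0x10FFFF otherwise.
        "maxunicode": sys.maxunicode,
        "narrow_build": sys.maxunicode == 0xFFFF,
        # Size of a native C long: differs between platforms and compilers.
        "native_long_bytes": struct.calcsize("l"),
        # Standard size with an explicit byte order: 4 bytes on every platform.
        "portable_uint32_bytes": struct.calcsize("<I"),
        "byteorder": sys.byteorder,
    }


def pack_code_points(text):
    """Serialize text as little-endian unsigned 32-bit code points."""
    code_points = [ord(ch) for ch in text]
    return struct.pack("<%dI" % len(code_points), *code_points)


def unpack_code_points(data):
    """Inverse of pack_code_points: rebuild the string from 4-byte code points."""
    count = len(data) // 4
    code_points = struct.unpack("<%dI" % count, data)
    return "".join(chr(cp) for cp in code_points)


if __name__ == "__main__":
    for key, value in sorted(unicode_storage_report().items()):
        print(key, value)
    payload = pack_code_points("spaCy")
    print(len(payload), unpack_code_points(payload))
```

A file written this way has the same bytes on every machine. A file produced by dumping native Py_UNICODE buffers or C structs may differ between a narrow and a wide build, or between platforms with different integer sizes and byte orders.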

I'm trying to reproduce the problem. Once it is fixed and the docs are updated, I will push version 0.40 to PyPI.

I have a flight from Sydney to New York in 24 hours, so this problem may remain unfixed for a few days.

Supports:

  • CPython 2.7
  • CPython 3.4
  • OS X
  • Linux

Want to support:

  • Windows

Difficult to support:

  • PyPy 2.7
  • PyPy 3.4