* Work on API docs

2015-07-07 21:35:22 +02:00 · 2015-07-07 21:35:22 +02:00 · 68eff957a5
parent 1d2deb4616
commit 68eff957a5
1 changed files with 25 additions and 23 deletions
--- a/docs/source/api.rst
+++ b/docs/source/api.rst
@ -31,35 +31,37 @@ e.g. `spacy.en.English`. The pipeline class reads the data from disk, from a
 specified directory.  By default, spaCy installs data into each language's
 package directory, and loads it from there.
 .. autoclass:: spacy.en.English
  :members:
 .. code::
-The class `spacy.en.English` is the main entry-point for the English pipeline
+  class English(object):    
-(other languages to come).
+      ...
      def __init__(self,
        data_dir=path.join(path.dirname(__file__), 'data'),
        Tokenizer=Tokenizer.from_dir,
        Tagger=EnPosTagger,
        Parser=Createarser(ArcEager),
        Entity=CreateParser(BiluoNER),
        load_vectors=True
      ):
-  +------------+----------------------------------------+-------------+--------------------------+
+data\_dir
-  | Attribute  | Type                                   | Attr API    | Notes                    |
+  Usually left default. The data directory.  May be None, to disable any data loading (including
-  +============+========================================+=============+==========================+
+  the vocabulary).
  | strings    | :py:class:`strings.StringStore`        | __getitem__ | string <-> int  mapping  |
  +------------+----------------------------------------+-------------+--------------------------+
  | vocab      | :py:class:`vocab.Vocab`                | __getitem__ | Look up Lexeme object    |
  +------------+----------------------------------------+-------------+--------------------------+
  | tokenizer  | :py:class:`tokenizer.Tokenizer`        | __call__    | Get Tokens given unicode |
  +------------+----------------------------------------+-------------+--------------------------+
  | tagger     | :py:class:`en.pos.EnPosTagger`         | __call__    | Set POS tags on Tokens   |
  +------------+----------------------------------------+-------------+--------------------------+
  | parser     | :py:class:`syntax.parser.GreedyParser` | __call__    | Set parse on Tokens      |
  +------------+----------------------------------------+-------------+--------------------------+
  | entity     | :py:class:`syntax.parser.GreedyParser` | __call__    | Set entities on Tokens   |
  +------------+----------------------------------------+-------------+--------------------------+
  | mwe_merger | :py:class:`multi_words.RegexMerger`    | __call__    | Apply regex for units    |
  +------------+----------------------------------------+-------------+--------------------------+
 Tokenizer
  Usually left default. A class/function that creates the tokenizer.
  Its signature should be:
    :code:`(Vocab vocab, unicode data_dir)(unicode) --> Tokens`
-.. autoclass:: spacy.en.English
+Tagger / Parser / Entity
-  :members:
+  Usually left default. A class/function that creates the part-of-speech tagger /
  syntactic dependency parser / named entity recogniser.
  May be None or False, to disable tagging. Otherwise, its signature should be:
    :code:`(Vocab vocab, unicode data_dir)(Tokens) --> None`
 load_vectors
  A boolean value to control whether the word vectors are loaded.
 .. autoclass:: spacy.tokens.Tokens
  :members: