Fix list formatting

2016-05-05 00:18:25 +10:00 · 2016-05-05 00:18:25 +10:00 · 886bf55bd9
parent 1b8b888a57
commit 886bf55bd9
1 changed files with 16 additions and 2 deletions
--- a/README.rst
+++ b/README.rst
@ -37,26 +37,39 @@ The German model provides tokenization, POS tagging, sentence boundary detection

 Bugfixes
 --------
-* spaCy < 0.100.7 had a bug in the semantics of the Token.__str__ and Token.__unicode__
-built-ins: they included a trailing space.
+
+* spaCy < 0.100.7 had a bug in the semantics of the Token.__str__ and Token.__unicode__ built-ins: they included a trailing space.
 * Improve handling of "infixed" hyphens. Previously the tokenizer struggled with multiple hyphens, such as "well-to-do".
+
 * Improve handling of periods after mixed-case tokens
+
 * Improve lemmatization for English special-case tokens
+
 * Fix bug that allowed spaces to be treated as heads in the syntactic parse
+
 * Fix bug that led to inconsistent sentence boundaries before and after serialisation.
+
 * Fix bug from deserialising untagged documents.

 Features
 --------

 * Labelled dependency parsing (91.8% accuracy on OntoNotes 5)
+
 * Named entity recognition (82.6% accuracy on OntoNotes 5)
+
 * Part-of-speech tagging (97.1% accuracy on OntoNotes 5)
+
 * Easy to use word vectors
+
 * All strings mapped to integer IDs
+
 * Export to numpy data arrays
+
 * Alignment maintained to original string, ensuring easy mark up calculation
+
 * Range of easy-to-use orthographic features.
+
 * No pre-processing required. spaCy takes raw text as input, warts and newlines and all.

 Top Peformance
@ -64,6 +77,7 @@ Top Peformance

 * Fastest in the world: <50ms per document.  No faster system has ever been
  announced.
+
 * Accuracy within 1% of the current state of the art on all tasks performed
  (parsing, named entity recognition, part-of-speech tagging).  The only more
  accurate systems are an order of magnitude slower or more.