spaCy/spacy/tests/regression/test_issue595.py

# coding: utf-8
from __future__ import unicode_literals

from ...symbols import POS, VERB, VerbForm_inf
from ...vocab import Vocab
from ...lemmatizer import Lemmatizer
from ..util import get_doc

import pytest


def test_issue595():
    """Test lemmatization of base forms"""
    words = ["Do", "n't", "feed", "the", "dog"]
    tag_map = {'VB': {POS: VERB, VerbForm_inf: True}}
    rules = {"verb": [["ed", "e"]]}

    lemmatizer = Lemmatizer({'verb': {}}, {'verb': {}}, rules)
    vocab = Vocab(lemmatizer=lemmatizer, tag_map=tag_map)
    doc = get_doc(vocab, words)

    doc[2].tag_ = 'VB'
    assert doc[2].text == 'feed'
    assert doc[2].lemma_ == 'feed'
Tidy up regression tests 2017-01-10 18:24:10 +00:00			`# coding: utf-8`
Change test595 to mock data, instead of requiring model. 2016-12-18 12:28:51 +00:00			`from __future__ import unicode_literals`
Test #595 -- Bug in lemmatization of base forms. 2016-11-03 23:27:32 +00:00
Change test595 to mock data, instead of requiring model. 2016-12-18 12:28:51 +00:00			`from ...symbols import POS, VERB, VerbForm_inf`
			`from ...vocab import Vocab`
			`from ...lemmatizer import Lemmatizer`
Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 21:00:37 +00:00			`from ..util import get_doc`
Test #595 -- Bug in lemmatization of base forms. 2016-11-03 23:27:32 +00:00
Tidy up regression tests 2017-01-10 18:24:10 +00:00			`import pytest`

Test #595 -- Bug in lemmatization of base forms. 2016-11-03 23:27:32 +00:00
Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 21:00:37 +00:00			`def test_issue595():`
			`"""Test lemmatization of base forms"""`
			`words = ["Do", "n't", "feed", "the", "dog"]`
Fix regression test 2017-03-25 21:35:07 +00:00			`tag_map = {'VB': {POS: VERB, VerbForm_inf: True}}`
Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 21:00:37 +00:00			`rules = {"verb": [["ed", "e"]]}`
Change test595 to mock data, instead of requiring model. 2016-12-18 12:28:51 +00:00
Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 21:00:37 +00:00			`lemmatizer = Lemmatizer({'verb': {}}, {'verb': {}}, rules)`
			`vocab = Vocab(lemmatizer=lemmatizer, tag_map=tag_map)`
			`doc = get_doc(vocab, words)`
Change test595 to mock data, instead of requiring model. 2016-12-18 12:28:51 +00:00
Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 21:00:37 +00:00			`doc[2].tag_ = 'VB'`
			`assert doc[2].text == 'feed'`
			`assert doc[2].lemma_ == 'feed'`