diff --git a/bin/wiki_entity_linking/README.md b/bin/wiki_entity_linking/README.md
index 56d0c1415..4e4af5c21 100644
--- a/bin/wiki_entity_linking/README.md
+++ b/bin/wiki_entity_linking/README.md
@@ -2,7 +2,7 @@
 
 ### Step 1: Create a Knowledge Base (KB) and training data
 
-Run `wikipedia_pretrain_kb.py`
+Run `wikidata_pretrain_kb.py`
 * This takes as input the locations of a **Wikipedia and a Wikidata dump**, and produces a **KB directory** + **training file**
 * WikiData: get `latest-all.json.bz2` from https://dumps.wikimedia.org/wikidatawiki/entities/
 * Wikipedia: get `enwiki-latest-pages-articles-multistream.xml.bz2` from https://dumps.wikimedia.org/enwiki/latest/ (or for any other language)