From 83662fd9a1a673f94805a4d8c0e64749a79f323a Mon Sep 17 00:00:00 2001
From: LE DUC ANH
Date: Wed, 8 Jul 2020 11:41:37 +0900
Subject: [PATCH 1/5] Update README.md

---
 README.md | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 4f2e463..bb83d23 100644
--- a/README.md
+++ b/README.md
@@ -6,9 +6,12 @@ OCR system for recognizing modern Japanese magazines
 This repo contains an OCR sytem for converting modern Japanese images to text. This is a result of [N2I project](http://codh.rois.ac.jp/collaboration/#n2i) for digitization of modern Japanese documents.
-The system has 2 main modules: text line extraction and text line recognition. The overall architechture is shown in the below figure.
+The system has 2 main modules: text line extraction and text line recognition. The overall architechture is shown in the below figures.
+![alt text](https://github.com/ducanh841988/Kindai-OCR/blob/master/images/TextlineExtraction.jpg "text line extraction")
 For text line extraction, we retrain the CRAFT (Character Region Awareness for Text Detection) on 1000 annotated images provided by Center for Research and Development of Higher Education, The University of Tokyo.
+![alt text](https://github.com/ducanh841988/Kindai-OCR/blob/master/images/TextlineRecognition.jpg "text line recognition")
+
 For text line recognition, we employ the attention-based encoder-decoder on our previous publication. We train the text line recognition on 1000 annotated images and 1600 unannotated images provided by Center for Research and Development of Higher Education and National Institute for Japanese Language and Linguistics, respectively.

From dcb9fc86d490f357f18af50b37a8ebaf438c3e85 Mon Sep 17 00:00:00 2001
From: LE DUC ANH
Date: Wed, 8 Jul 2020 12:19:17 +0900
Subject: [PATCH 2/5] Update README.md

---
 README.md | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index bb83d23..3d88e06 100644
--- a/README.md
+++ b/README.md
@@ -24,7 +24,8 @@ For text line recognition, we employ the attention-based encoder-decoder on our
 ## Running Kindai OCR
-- You should first download the pre_trained models and put them into ./pretrain/ folder.
+- You should first download the pre_trained models and put them into ./pretrain/ folder.
+[VGG model](https://drive.google.com/file/d/1_A1dEFKxyiz4Eu1HOCDbjt1OPoEh90qr/view?usp=sharing), [CRAFT model](https://drive.google.com/file/d/1-9xt_jjs4btMrz5wzrU1-kyp2c6etFab/view?usp=sharing), [OCR model](https://drive.google.com/file/d/1mibg7D2D5rvPhhenLeXNilSLMBloiexl/view?usp=sharing)
 - Copy your images into ./data/test/ folder
 - run the following script to recognize images: `python test.py`

From 18b50939b2bc5729fb533e46aede1186965f7063 Mon Sep 17 00:00:00 2001
From: LE DUC ANH
Date: Wed, 8 Jul 2020 12:23:07 +0900
Subject: [PATCH 3/5] Update README.md

---
 README.md | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/README.md b/README.md
index 3d88e06..88fef93 100644
--- a/README.md
+++ b/README.md
@@ -32,4 +32,7 @@ For text line recognition, we employ the attention-based encoder-decoder on our
 - The recognized text transcription is in ./data/result.xml and the result images are in ./data/result/
 - If you may have to check the path to Japanese font in test.py for correct visualization results.
 `fontPIL = '/usr/share/fonts/truetype/fonts-japanese-gothic.ttf' # japanese font`
+
+ - An example result from our OCR system
+![alt text]( https://github.com/ducanh841988/Kindai-OCR/blob/master/data/result/res_k188701_021_39.jpg "OCR result")

From da54e14bcf8a866fe7663d36ea021b44ae05f292 Mon Sep 17 00:00:00 2001
From: LE DUC ANH
Date: Wed, 8 Jul 2020 12:25:25 +0900
Subject: [PATCH 4/5] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 88fef93..36503bb 100644
--- a/README.md
+++ b/README.md
@@ -34,5 +34,5 @@ For text line recognition, we employ the attention-based encoder-decoder on our
 `fontPIL = '/usr/share/fonts/truetype/fonts-japanese-gothic.ttf' # japanese font`
  - An example result from our OCR system
-![alt text]( https://github.com/ducanh841988/Kindai-OCR/blob/master/data/result/res_k188701_021_39.jpg "OCR result")
+

From dc3e36fb6b00766bdaaab698fb79c5ba2489a2b9 Mon Sep 17 00:00:00 2001
From: LE DUC ANH
Date: Wed, 8 Jul 2020 14:34:16 +0900
Subject: [PATCH 5/5] Update README.md

---
 README.md | 8 +++++++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 36503bb..26462e5 100644
--- a/README.md
+++ b/README.md
@@ -20,7 +20,13 @@ For text line recognition, we employ the attention-based encoder-decoder on our
 ## Installing Kindai OCR
-
+python==3.7.4
+torch==1.4.0
+torchvision==0.2.1
+opencv-python==3.4.2.17
+scikit-image==0.14.2
+scipy==1.1.0
+Polygon3
 ## Running Kindai OCR
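
The run steps touched by these patches (models in `./pretrain/`, input images in `./data/test/`, a Japanese font path set in `test.py`) could be checked before calling `python test.py`. The following is only a minimal sketch, not part of the repository: the helper name `preflight` and its parameters are hypothetical, and because the patches do not name the downloaded model files, it only checks that the folders are non-empty. The default font path is the one shown in the README.

```python
from pathlib import Path

# Font path quoted in the README; adjust it to match your system.
DEFAULT_FONT = "/usr/share/fonts/truetype/fonts-japanese-gothic.ttf"


def preflight(root=".", font_path=DEFAULT_FONT):
    """Return a list of human-readable problems; an empty list means all checks passed.

    Hypothetical helper: it mirrors the README steps, not any code in the repo.
    """
    root = Path(root)
    problems = []

    # Step 1: pre-trained models are expected in ./pretrain/
    pretrain = root / "pretrain"
    if not pretrain.is_dir() or not any(p.is_file() for p in pretrain.iterdir()):
        problems.append(f"no model files found in {pretrain}")

    # Step 2: images to recognize are expected in ./data/test/
    test_dir = root / "data" / "test"
    if not test_dir.is_dir() or not any(p.is_file() for p in test_dir.iterdir()):
        problems.append(f"no input images found in {test_dir}")

    # Step 3: the Japanese font used for the result visualization
    if not Path(font_path).is_file():
        problems.append(f"font file not found: {font_path}")

    return problems


if __name__ == "__main__":
    issues = preflight()
    for issue in issues:
        print("WARNING:", issue)
    if not issues:
        print("All checks passed; you can run: python test.py")
```

Run from the repository root, the script prints one warning per failed check and changes nothing on disk.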