Merge branch 'master' of https://github.com/ducanh841988/Kindai-OCR
commit 7ff9246af5
17 README.md
@@ -6,9 +6,12 @@ OCR system for recognizing modern Japanese magazines
This repo contains an OCR system for converting images of modern Japanese documents to text.
This is a result of the [N2I project](http://codh.rois.ac.jp/collaboration/#n2i) for the digitization of modern Japanese documents.

The system has two main modules: text line extraction and text line recognition. The overall architecture is shown in the figures below.
![Text line extraction](https://github.com/ducanh841988/Kindai-OCR/blob/master/images/TextlineExtraction.jpg "text line extraction")

For text line extraction, we retrain CRAFT (Character Region Awareness for Text Detection) on 1000 annotated images provided by the Center for Research and Development of Higher Education, The University of Tokyo.
![Text line recognition](https://github.com/ducanh841988/Kindai-OCR/blob/master/images/TextlineRecognition.jpg "text line recognition")

For text line recognition, we employ the attention-based encoder-decoder from our previous publication. We train the text line recognizer on 1000 annotated images and 1600 unannotated images provided by the Center for Research and Development of Higher Education and the National Institute for Japanese Language and Linguistics, respectively.
@@ -17,11 +20,18 @@ For text line recognition, we employ the attention-based encoder-decoder on our
## Installing Kindai OCR
- python==3.7.4
- torch==1.4.0
- torchvision==0.2.1
- opencv-python==3.4.2.17
- scikit-image==0.14.2
- scipy==1.1.0
- Polygon3
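
The pinned versions above can be turned into a pip requirements file. The following is a minimal sketch of one way to do that, assuming the packages are installed with pip. The helper `split_pins` and the file name `requirements.txt` are illustrative and are not part of this repository. The `python` pin is separated out because the interpreter itself is not a pip package.

```python
# Pins copied from the list above. `split_pins` is a hypothetical helper,
# not something provided by Kindai OCR.
PINS = """\
python==3.7.4
torch==1.4.0
torchvision==0.2.1
opencv-python==3.4.2.17
scikit-image==0.14.2
scipy==1.1.0
Polygon3
"""


def split_pins(text):
    """Return (python_version, pip_requirements) parsed from the pin list."""
    python_version = None
    requirements = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        name, _, version = line.partition("==")
        if name.lower() == "python":
            # Interpreter version: choose it when creating the environment.
            python_version = version
        else:
            # Everything else is passed to pip unchanged.
            requirements.append(line)
    return python_version, requirements


if __name__ == "__main__":
    version, reqs = split_pins(PINS)
    print("Create an environment with Python", version)
    print("Suggested contents of requirements.txt (assumed file name):")
    print("\n".join(reqs))
```

Saving the printed lines to `requirements.txt` and running `pip install -r requirements.txt` inside a Python 3.7.4 environment should then install the listed packages, provided those versions are still available for your platform.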

## Running Kindai OCR
- First download the pre-trained models and put them into the ./pretrain/ folder:
  [VGG model](https://drive.google.com/file/d/1_A1dEFKxyiz4Eu1HOCDbjt1OPoEh90qr/view?usp=sharing), [CRAFT model](https://drive.google.com/file/d/1-9xt_jjs4btMrz5wzrU1-kyp2c6etFab/view?usp=sharing), [OCR model](https://drive.google.com/file/d/1mibg7D2D5rvPhhenLeXNilSLMBloiexl/view?usp=sharing)
- Copy your images into the ./data/test/ folder.
- Run the following script to recognize the images:
  `python test.py`
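
Before running `python test.py`, it can help to confirm that the two folders named in the steps above are populated. The sketch below is a hypothetical pre-flight check and is not part of the repository. It assumes the models are stored as regular files directly inside `./pretrain/` and that the input images use one of the listed suffixes; adjust both assumptions to your setup.

```python
from pathlib import Path

# Assumed input formats; the README does not say which formats test.py reads.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def preflight(root="."):
    """Return a list of problems that would likely make test.py fail."""
    root = Path(root)
    problems = []

    # The downloaded models are expected in ./pretrain/.
    pretrain = root / "pretrain"
    if not pretrain.is_dir() or not any(p.is_file() for p in pretrain.iterdir()):
        problems.append("no model files found in ./pretrain/")

    # The images to recognize are expected in ./data/test/.
    test_dir = root / "data" / "test"
    images = []
    if test_dir.is_dir():
        images = [p for p in test_dir.iterdir()
                  if p.suffix.lower() in IMAGE_SUFFIXES]
    if not images:
        problems.append("no images found in ./data/test/")

    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("warning:", problem)
```

An empty list means both folders contain at least one candidate file; it does not verify that the model files are the right ones.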
@@ -29,3 +39,6 @@ For text line recognition, we employ the attention-based encoder-decoder on our
- You may have to check the path to the Japanese font in test.py to get correct visualization results:
  `fontPIL = '/usr/share/fonts/truetype/fonts-japanese-gothic.ttf' # japanese font`

- An example result from our OCR system:
  <img src="https://github.com/ducanh841988/Kindai-OCR/blob/master/data/result/res_k188701_021_39.jpg" width="700">
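
The font path mentioned above is specific to one Linux layout, so it may not exist on other machines. One possible way to handle that is to try several candidate paths and use the first one that exists. The sketch below is an illustration only: `first_existing` is a hypothetical helper, and every candidate except the first (which is the path from test.py) is an assumed example location that may not match your system.

```python
import os

# The first entry is the path used in test.py. The remaining entries are
# assumed example locations and should be replaced with real paths.
CANDIDATE_FONTS = [
    "/usr/share/fonts/truetype/fonts-japanese-gothic.ttf",
    "/usr/share/fonts/opentype/noto/NotoSansCJK-Regular.ttc",
    "/Library/Fonts/Arial Unicode.ttf",
]


def first_existing(paths):
    """Return the first path that is an existing file, or None."""
    for path in paths:
        if os.path.isfile(path):
            return path
    return None


fontPIL = first_existing(CANDIDATE_FONTS)
if fontPIL is None:
    print("No Japanese font found; edit CANDIDATE_FONTS for your system.")
```

If none of the candidates exists, `fontPIL` is `None` and the visualization step would still need a valid font path supplied by hand.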