mirror of https://github.com/ttop32/JMTrans.git
58 lines
2.9 KiB
Markdown
58 lines
2.9 KiB
Markdown
# JMTrans - Japanese Manga Translator
|
|
get japanese manga from url to translate manga image using SickZil(tensorflow model), google ocr and googletrans
|
|
|
|
This project code is still under processing on the colab environment. For use, run colab in below url.
|
|
https://colab.research.google.com/drive/1XbR7fNXtT4TGlLI1FBcCQv7Gj5mlDvwb?usp=sharing
|
|
|
|
# result
|
|
![result](doc/result1.png)
|
|
![result](doc/result2.png)
|
|
![result](doc/result3.png)
|
|
![result](doc/result4.png)
|
|
![result](doc/result5.png)
|
|
![result](doc/result6.png)
|
|
![result](doc/result7.png)
|
|
|
|
# Workflow
|
|
- use gallery-dl to get managa from url
|
|
- do text segmentation from manga image using SickZil
|
|
- use opencv to crop text image based on text segmentation results
|
|
- get text from image using pytesseract ocr and nhocr
|
|
- translating using googletrans
|
|
- use pil to place translated text
|
|
|
|
# Supported Languages (for translation destination)
|
|
For translating, it uses google trans. It supports :
|
|
afrikaans, albanian, amharic, arabic, armenian, azerbaijani, basque, belarusian, bengali, bosnian, bulgarian, catalan, chichewa, chinese (simplified), chinese (traditional), corsican, croatian, czech, danish, dutch, english, esperanto, estonian, filipino, filipino, finnish, french, frisian, galician, georgian, german, greek, gujarati, haitian creole, hausa, hawaiian, hindi, hungarian, icelandic, igbo, indonesian, irish, italian, japanese, kazakh, khmer, korean, kurdish (kurmanji), kyrgyz, lao, latin, latvian, lithuanian, luxembourgish, macedonian, malagasy, malay, malayalam, maltese, maori, marathi, mongolian, myanmar (burmese), nepali, norwegian, pashto, persian, polish, portuguese, punjabi, romanian, russian, samoan, scots gaelic, serbian, sesotho, shona, sindhi, sinhala, slovak, slovenian, somali, spanish, sundanese, swahili, swedish, tajik, tamil, thai, turkish, ukrainian, urdu, uzbek, vietnamese, welsh, xhosa, yiddish, yoruba, zulu
|
|
|
|
# Supported url
|
|
gallery-dl is used to download. Its support sites are:
|
|
- [supported site list](https://github.com/mikf/gallery-dl/blob/master/docs/supportedsites.rst)
|
|
|
|
# Acknowledgement and References
|
|
- [SickZil-Machine](https://github.com/KUR-creative/SickZil-Machine)
|
|
- [OpenCV with Python wrapper](https://pypi.org/project/opencv-python/)
|
|
- [Google Translate API for Python](https://pypi.org/project/googletrans/)
|
|
- [Tesseract](https://github.com/tesseract-ocr/tesseract)
|
|
- [Pytesseract](https://pypi.python.org/pypi/pytesseract)
|
|
- [nhocr](https://github.com/fireae/nhocr)
|
|
- [text-detection](https://github.com/qzane/text-detection)
|
|
- [ehnd](https://github.com/sokcuri/ehnd)
|
|
- [cefpython](https://github.com/cztomczak/cefpython)
|
|
|
|
|
|
# pip requirements
|
|
python3.5
|
|
pip install tensorflow==1.14.0
|
|
pip install googletrans
|
|
pip install tqdm funcy PyQt5 opencv-python Pillow
|
|
pip install --upgrade google-api-python-client google-auth-httplib2 google-auth-oauthlib
|
|
pip install pyinstaller
|
|
pip install cefpython3
|
|
pip install pywin32
|
|
pip install matplotlib
|
|
pip install imageio
|
|
pip install pyinstaller
|
|
|
|
|