mirror of https://github.com/ttop32/JMTrans.git
41 lines
2.4 KiB
Markdown
41 lines
2.4 KiB
Markdown
# manga translator
|
|
get japanese manga from url to translate manga image using SickZil(tensorflow model), ocr(pytesseract ocr and nhocr) and googletrans
|
|
|
|
This project code is still under processing on the colab environment. For use, run colab in below url.
|
|
https://colab.research.google.com/drive/1XbR7fNXtT4TGlLI1FBcCQv7Gj5mlDvwb?usp=sharing
|
|
|
|
# result
|
|
![result](doc/result1.png)
|
|
![result](doc/result2.png)
|
|
![result](doc/result3.png)
|
|
![result](doc/result4.png)
|
|
![result](doc/result5.png)
|
|
![result](doc/result6.png)
|
|
![result](doc/result7.png)
|
|
|
|
# Workflow
|
|
use gallery-dl to get managa from url
|
|
do text segmentation from manga image using SickZil
|
|
get text from image using pytesseract ocr and nhocr
|
|
translating using googletrans
|
|
use pil to place translated text
|
|
|
|
# Supported Languages (for destination translation)
|
|
For translating, it uses google trans. It supports :
|
|
afrikaans, albanian, amharic, arabic, armenian, azerbaijani, basque, belarusian, bengali, bosnian, bulgarian, catalan, chichewa, chinese (simplified), chinese (traditional), corsican, croatian, czech, danish, dutch, english, esperanto, estonian, filipino, filipino, finnish, french, frisian, galician, georgian, german, greek, gujarati, haitian creole, hausa, hawaiian, hindi, hungarian, icelandic, igbo, indonesian, irish, italian, japanese, kazakh, khmer, korean, kurdish (kurmanji), kyrgyz, lao, latin, latvian, lithuanian, luxembourgish, macedonian, malagasy, malay, malayalam, maltese, maori, marathi, mongolian, myanmar (burmese), nepali, norwegian, pashto, persian, polish, portuguese, punjabi, romanian, russian, samoan, scots gaelic, serbian, sesotho, shona, sindhi, sinhala, slovak, slovenian, somali, spanish, sundanese, swahili, swedish, tajik, tamil, thai, turkish, ukrainian, urdu, uzbek, vietnamese, welsh, xhosa, yiddish, yoruba, zulu
|
|
|
|
# Supported url
|
|
gallery-dl is used to download. Its support sites are:
|
|
- [text-detection](https://github.com/mikf/gallery-dl/blob/master/docs/supportedsites.rst)
|
|
|
|
# Acknowledgement and References
|
|
- [SickZil-Machine](https://github.com/KUR-creative/SickZil-Machine)
|
|
- [OpenCV with Python wrapper](https://pypi.org/project/opencv-python/)
|
|
- [Google Translate API for Python](https://pypi.org/project/googletrans/)
|
|
- [Tesseract](https://github.com/tesseract-ocr/tesseract)
|
|
- [Pytesseract](https://pypi.python.org/pypi/pytesseract)
|
|
- [nhocr](https://github.com/fireae/nhocr)
|
|
- [text-detection](https://github.com/qzane/text-detection)
|
|
|
|
|