Tutorial OCR Chinese(english) movies hardsub

DarkSpiderVenom · May 1, 2021

Optical Character Recognition (OCR) for natural images/scenes

他也不是故意要那么做的

Another option to recognize text from images, it can recognize directly from natural images, and it recognizes better than tesseract

Requirements:

Windows 10 64 bits
3 GB hard disk
8 GB RAM
Internet connection, to download the required languages only once

Software

Python

Download Python

The official home of the Python Programming Language

www.python.org

Pytorch

PyTorch

PyTorch Foundation is the deep learning community home for the open source PyTorch framework and ecosystem.

pytorch.org

EasyOCR (IA)

GitHub - JaidedAI/EasyOCR: Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc. - JaidedAI/EasyOCR

github.com

videosubfinder

VideoSubFinder

Download VideoSubFinder for free. The main purpose of this program is to provide functionality for extract hardcoded subtitles (hardsub) from video. It provides two main features: 1) Autodetection of frames with hardcoded text (hardsub) on video with saving info about timing positions.

sourceforge.net

Installation

Download and Install Python 3.9.4 in C:\python39
python-3.9.4

Run PIP for Python
Open line command win+R and "cmd"

Bash:

C:\Python29\Scripts\pip.exe install easyocr

There are two options, if you have the Nvidia video card run step a), if you have only AMD/intel video card run step b)
a) Only for video cards CUDA/NVIDIA 11.1

Bash:

C:\Python29\Scripts\pip3.exe install torch==1.8.1+cu111 torchvision==0.9.1+cu111 torchaudio===0.8.1 -f https://download.pytorch.org/whl/torch_stable.html

b) FOR CPU (not video cards CUDA/NVIDIA)

Code:

C:\Python29\Scripts\pip3.exe install torch==1.8.1+cpu torchvision==0.9.1+cpu torchaudio===0.8.1 -f https://download.pytorch.org/whl/torch_stable.html

Download and install in C:\VideoSubFinder5x64 (rename of release_x64)
VideoSubfinder 5.5
For working of this program will be required "Microsoft Visual C++ Redistributable for Visual Studio 2015, 2017 and 2019"

Usage

Run VideoSubFinderWXW.exe
Open video and ajust area video(optional, but fast)
Press Button "Run Search"

Optional.- Manually delete images (explorer windows) that do not have text in folder C:\VideoSubFinder5x64\RGBImages

Run script ( Download )

Bash:

C:\Python29\python.exe easyOcrImage.py

or

Code:

easyOcrImage.py -l ch_tra -d "c:\youDirectoyImages"

At the end of the script, the text files(OCR) are generated in the folder TXTResults
And now just
Press the button "Create Sub From TXTResults" (save subtitle srt)

Python:

directoryDefault=r'C:\VideoSubFinder5x64\RGBImages'
extensions=[".jpg",".png",".jpeg",".bmp"]
languagesDefault="ch_tra"
import os
import argparse
def main():
    parser = argparse.ArgumentParser(formatter_class=argparse.RawDescriptionHelpFormatter,description=r"easyOcrImage.py -l en,ch_tra -d " + directoryDefault,epilog=codeLanguages)
    parser.add_argument('-l','--langs',dest="langs",default=languagesDefault,help="Separated by (,) \"en,ch_tra\" for mix langs english & Traditional Chinese")
    parser.add_argument('-d','--directory',dest="directory", default=directoryDefault,help='directory help')
    args = parser.parse_args()
    if not os.path.isdir(args.directory):
        print ("Not exists directory: " + args.directory )
        return
    parentDirectory = os.path.dirname(args.directory)
    directoryTXTResults = os.path.join(parentDirectory, "TXTResults")
    if os.path.isdir( directoryTXTResults ):
        directoryTxt=directoryTXTResults
    else:
        directoryTxt=args.directory
    os.system("title OCR for " + args.directory + " - " + args.langs)
    import easyocr
    reader = easyocr.Reader( args.langs.replace(" ","").split(",") )

    files = [x for x in os.listdir(args.directory) if os.path.splitext(x)[1] in extensions]
    for i,x in enumerate(files):
        os.system("title OCR {}/{} Processed".format(i,len(files)) )
        fileImage = os.path.join(args.directory,x)
        fileTxt = os.path.join(directoryTxt,x)
        result = reader.readtext(fileImage,detail=0, paragraph=True)
        with open(fileTxt+".txt", "w", encoding="utf-8") as f:
            f.write( " ".join(result) )

codeLanguages="""Languages
Code Name
--- ----
abq    Abaza
ady    Adyghe
af    Afrikaans
ang    Angika
ar    Arabic
as    Assamese
ava    Avar
az    Azerbaijani
be    Belarusian
bg    Bulgarian
bh    Bihari
bho    Bhojpuri
bn    Bengali
bs    Bosnian
ch_sim    Simplified Chinese
ch_tra    Traditional Chinese
che    Chechen
cs    Czech
cy    Welsh
da    Danish
dar    Dargwa
de    German
en    English
es    Spanish
et    Estonian
fa    Persian (Farsi)
fr    French
ga    Irish
gom    Goan Konkani
hi    Hindi
hr    Croatian
hu    Hungarian
id    Indonesian
inh    Ingush
is    Icelandic
it    Italian
ja    Japanese
kbd    Kabardian
kn    Kannada
ko    Korean
ku    Kurdish
la    Latin
lbe    Lak
lez    Lezghian
lt    Lithuanian
lv    Latvian
mah    Magahi
mai    Maithili
mi    Maori
mn    Mongolian
mr    Marathi
ms    Malay
mt    Maltese
ne    Nepali
new    Newari
nl    Dutch
no    Norwegian
oc    Occitan
pl    Polish
pt    Portuguese
ro    Romanian
ru    Russian
rs_cyrillic    Serbian (cyrillic)
rs_latin    Serbian (latin)
sck    Nagpuri
sk    Slovak (need revisit)
sl    Slovenian
sq    Albanian
sv    Swedish
sw    Swahili
ta    Tamil
tab    Tabassaran
te    Telugu
th    Thai
tl    Tagalog
tr    Turkish
ug    Uyghur
uk    Ukranian
ur    Urdu
uz    Uzbek
vi    Vietnamese (need revisit)"""
if __name__ == "__main__":
    main()

NEW NEW NEW February 2025
OCR WeChat
It offers OCR with automatic language detection for Chinese, Japanese and English (HORIZONTAL).

UMI-OCR

It has GUI and a variety of offline OCR engines, PaddleOCR, RapidOCR,Tesseract, wechatocr,

Installation

Install wechat on your pc with windows 10

WeChat - Connects with over 1 billion users | Chats · Calls · Life Services

WeChat is a social communication app serving over 1 billion users, supporting free chat, HD voice and video calls, Moments, and mobile payments, making communication and life more convenient.

www.wechat.com

微信 Windows 版

一款跨平台的通讯工具。支持单人、多人参与。通过手机网络发送语音、图片、视频和文字。

pc.weixin.qq.com

TEST IN 3.9.12.17 x64

https://dldir1v6.qq.com/weixin/Windows/WeChatSetup.exe

Install Python 3.xx

Download Python

The official home of the Python Programming Language

www.python.org

Download PLUGIN (4 files).

wechatocr.py

1. wechatocr.py.-It can operate independently. only wcocr.pyd is needed in the same directory
2. wcocr.pyd. is needed (dll)
3. wechat_config.py
4. __init__.py

NOTE:
wcocr.pyd.- This only works in version 3, in the 4 you can stop working, the dll you need is wcocr.pyd, and this can be updated on the link:

https://github.com/swigger/wechat-ocr

Install UMI-OCR (GUI)

Copy plugin in
C:\Program Files\Umi-OCR_v2.1.4\UmiOCR-data\plugins
Run Umi-OCR.exe, y change settings path

C:\Program Files\Tencent\WeChat\[3.9.12.17]
C:\Users\YOURNAME\AppData\Roaming\Tencent\WeChat\XPlugin\Plugins\WeChatOCR\7079\extracted\WeChatOCR.exe

The only important files are 4, and the directory containing the models (60 MB). It is recommended copied in single folder. And change the location, either at wechatocr.py if you only take care of this one, or/and also change in UMIOCR settings.

mmmojo.dll
mmmojo_64.dll
WeChatOCR.exe
x64.config
Model/

FPOCRRecog.xnet

OCRDetFP32.xnet.nas

OCRParaDetV1.1.0.26.xnet

OCRRecogFP32V1.1.0.26.xnet

RecDict

sohu_simp.txt

NOTE:
OTHER LINKS

GitHub - hiroi-sora/Umi-OCR_plugins: Umi-OCR 插件库

Umi-OCR 插件库. Contribute to hiroi-sora/Umi-OCR_plugins development by creating an account on GitHub.

github.com

Releases · eaeful/WechatOCR_umi_plugin

适用于 Umi-OCR 文字识别工具的 WeChatOCR 插件，调用微信OCR进行本地离线ocr识别文字的插件。本项目有两个版本，一个是插件自带微信ocr文件的版本，另外一个是手动填写微信安装目录和wechatocr.exe的版本 - eaeful/WechatOCR_umi_plugin

github.com

https://github.com/eaeful/WechatOCR_umi_plugin/releases/download/v1.0/WechatOCR_umi_plugin_full.zip
This is another wechatOCR plugin, with the full OCR files, but the plugin is a little unstable, it is recommended that only the EXE, DLL and models are used.

New addition of a plugin forUMI-OCR, works with or without UMIOCR. (required wcocr.pyd)

Python:

import os
import base64
directoryDefault=r'C:\VideoSubFinder5x64\RGBImages'
pathlocal = {
    "path": r"C:\Program Files\Tencent\WeChat\[3.9.12.17]",
    # C:\Users\yourname\AppData\Roaming\Tencent\WeChat\XPlugin\Plugins\WeChatOCR\7079\extracted\WeChatOCR.exe
    "pathOCR": os.path.join( os.getenv("APPDATA") , r"Tencent\WeChat\XPlugin\Plugins\WeChatOCR\7079\extracted\WeChatOCR.exe")
}

def main():
    import argparse
    parser = argparse.ArgumentParser(formatter_class=argparse.RawDescriptionHelpFormatter,description=r"wechatocr -d directory/image")
    parser.add_argument('-d','--file_dir',default=directoryDefault, help='Directory or image')
    args = parser.parse_args()
    if os.path.isfile(args.file_dir):
        files=[args.file_dir]
        directory=os.path.dirname(args.file_dir)
    elif os.path.isdir(args.file_dir):
        directory=args.file_dir
        files = [x for x in os.listdir(directory) if os.path.splitext(x)[1] in [".jpg",".png",".jpeg",".bmp"] ]
    else:
        return
    parentDirectory = os.path.dirname(directory)
    directoryTXTResults = os.path.join(parentDirectory, "TXTResults")
    if os.path.isdir( directoryTXTResults ):
        directoryTxt=directoryTXTResults
    else:
        directoryTxt=directory
    os.system("title OCR for " + directory)
    ocrApi=Api(pathlocal)
    ocrApi.start()
      
    print("Processing in: " + directory)
    for i,x in enumerate(files):
        os.system("title OCR {}/{} Processed {}".format(i,len(files),x) )
        fileImage = os.path.join(directory,x)
        fileTxt = os.path.join(directoryTxt,x)
        ocrApi.runPath(fileImage)
        results = ocrApi.runPath(fileImage)
        # breakpoint()
        text=[x['text'] for x in results["data"]]
        with open(fileTxt+".txt", "w", encoding="utf-8") as f:
            f.write( " ".join(text) )

def cxybox(x):
    return [ [int(x["left"]),int(x["top"])], [int(x["right"]),int(x["top"])], [int(x["right"]),int(x["bottom"])], [int(x["left"]),int(x["bottom"])] ]
class Api:
    def __init__(self, globalArgd):
        self.lang = ""
        self.path = globalArgd["path"]
        self.pathOCR = globalArgd["pathOCR"]
    def start(self, argd=None):
        try:
            from . import wcocr
        except:
            try:
                import wcocr
            except:
                return "[Error] In Module wcocr (wcocr.pyd)"

        self.wcocr=wcocr
        if self.wcocr.init(self.pathOCR, self.path):
            return ""
        else:
            err = "[Error] Initialization OCR (Not Found path)"
            print(err)
            return err

    def stop(self):
        self.wcocr=None

    def runPath(self, imgPath: str):
        return self._ocr(imgPath)

    def runBytes(self, imageBytes):
        imgPath="temp_image.png"
        with open(imgPath, "wb") as f:
            f.write(imageBytes)
        return self._ocr(imgPath)

    def runBase64(self, imageBase64):
        try:
            imageBytes = base64.b64decode(imageBase64)
            return self.runBytes(imageBytes)
        except Exception as e:
            return {"code": 102, "data": f"[Error] Base64：{str(e)}"}

    def _ocr(self,imgPath):
        results=self.wcocr.ocr(imgPath)
        if "ocr_response" in results:
            res = {
                "code": 100,
                "data": [{"text":x['text'],"box":cxybox(x),"score":x['rate']} for x in results['ocr_response']],
            }
        else:
            res = {"code": 102, "data": "Error in results"}
      
        return res

if __name__ == "__main__":
    main()

Reminder, Download link 28,000+ Subtitle pack! (2001-2021)
https://www.akiba-online.com/thread...not-a-sub-request-thread.1920331/post-4193115

frostieff · May 14, 2021

thank you soo much for taking the time to create this tutorial however im new to python and im facing issues when im trying to run the bash or the script.

C:\Python39\python.exe: can't open file 'C:\Users\easyOcrImage.py': [Errno 2] No such file or directory

I have downloaded the latest python, pip and easyocr have been installed when i used C:\Python39\Scripts\pip.exe install easyocr

Any help will be much appreciated.

ironfevers · May 15, 2021

Nice guide. Do you have a recommended image viewer to delete blanks? It's kinda tedious to manually delete blanks. I was thinking you can click Create TXTImages which automatically detects blanks. Then you can run a script to compare to see if the RGBImage filename is in the same list as TXTImages, if they're the same then keep the image. If not in the list, then move it to a BlankImages folder to manually double check.

DarkSpiderVenom · May 15, 2021

frostieff said:
thank you soo much for taking the time to create this tutorial however im new to python and im facing issues when im trying to run the bash or the script.

C:\Python39\python.exe: can't open file 'C:\Users\fadyf\easyOcrImage.py': [Errno 2] No such file or directory

I have downloaded the latest python, pip and easyocr have been installed when i used C:\Python39\Scripts\pip.exe install easyocr

Any help will be much appreciated.

you can put screenshots of your command line

DarkSpiderVenom · May 15, 2021

ironfevers said:
Nice guide. Do you have a recommended image viewer to delete blanks? It's kinda tedious to manually delete blanks. I was thinking you can click Create TXTImages which automatically detects blanks. Then you can run a script to compare to see if the RGBImage filename is in the same list as TXTImages, if they're the same then keep the image. If not in the list, then move it to a BlankImages folder to manually double check.

The old photo viewer is very good at removing images (fast). It is only a matter of activating it in windows 10 through the registry
https://www.howtogeek.com/225844/ho...ewer-your-default-image-viewer-on-windows-10/

And the other is to eliminate the subtitle lines, I use aegisub, it is very fast to eliminate and make corrections

frostieff · May 20, 2021

Am i supposed to extract easyocr in the python folder i have?

mei2 · May 25, 2021

Nice tutorial. Thanks for the good job!
Are Chinese subtitles (say in Sukebei) in simplified or traditional Chinese mostly?

Taako · Nov 16, 2021

I guess you have to be familiar with Python?
The steps should be clearer if you run into an issue.
It's possible also that it's need updating. Those are my only issue with this. Meanwhile, I'm still making subs regardless.
Also remember guys, just because it didn't work for me... doesn't mean it won't work

maload · Dec 8, 2021

hmm. red text mean its no good

julian102 · Feb 23, 2022

can this be done on Mac u think?

Taako · Feb 24, 2022

julian102 said:
can this be done on Mac u think?

Have no idea, bro. Personally gave up on it. The instructions for noobs who don't know python makes it difficult to learn.
I suppose the original creator had good intentions but it hasn't help increased subs. I think only a select few can run it

The... tech support might be able to help you

Good luck https://www.akiba-online.com/forums/tech-support.88/

frostieff · Mar 14, 2022

Taako said:
Have no idea, bro. Personally gave up on it. The instructions for noobs who don't know python makes it difficult to learn.
I suppose the original creator had good intentions but it hasn't help increased subs. I think only a select few can run it
The... tech support might be able to help you Good luck https://www.akiba-online.com/forums/tech-support.88/

Reddit (python for new comers) or StackExhcange is where we will have to go and ask

Taako · Mar 15, 2022

frostieff said:
Reddit (python for new comers) or StackExhcange is where we will have to go and ask

I will keep that in mind, but it seems some don't have patience when dealing with noobs lol

faebladezzz · May 11, 2022

is it just me or the total run time of the original JAV vs JAV that have chinese hardsub was different?
for example case for SPRD-1454 the original run time is 01:56:10 but the chinese hardsub version run time is 01:55:42.

i've try this method to extact the subtitle just now, but there is some problem here. If we use the sub (that we extract) with the chinese hardsub version video the timing was great, but when i use it at the original version the subtitle timing always miss -1 (increase every 3 minutes)

are they decreasing the frame rate of the video for the chinese hardsub version ?
is there's a way to fix it ?

Thank you in advance

SamKook · May 28, 2022

Subtitles use time to sync themselves to the video so even if the framerate was different, it wouldn't matter and still work.

With that said, it could be an issue during encoding if the encoder duplicated frames during encoding or stuff like that. The time difference could also be that the video is cut in a different way, like removing end credits or ads, stuff like that.

Can't say much more than that without more details.

whiteknightlulw · Sep 12, 2022

I noticed that recent chinese hardsub release have the sub appear in a scrolling manner moving from left to right making the detection kinda inaccurate. It also has different colored text that seems to also affect its accuracy. Any workarounds for this?

mei2 · Sep 12, 2022

For those who are interested to tackle Rainbow+Karaoke hardsubs, here is my VSF settings that works reasonably well [the key word being reasonably

]::

Sub detection (RGBImages):

- I have added ca 10 colour filters --it should work well with the current set of colours used;

- I have adjusted the frame rate and text timing to manage the current karaoke/typewriter speed;

- It uses gpu flag for nvidia. If you don't need it, juts deselect it in the settings menu;

- The good: I have tested it on 5 new releases. The sub detection generated about 90% of hardsubs right;

- The bad: the timing is not accurate --because I had to play with the frame rate timing to get the entire moving subs. In general ech sub line needs to be adjusted to start earlier about 100ms per character.

- The ugly: cleaning up RGBImages is still a pain / manual. Though this version of cfg has produced least amount of garbage for me;

Image OCR (TextImages):

- Using VSF inbuilt OCR has not produced the best results for me --about 80%;

- VSF specially fails when white background

- I have had the best results with Google Docs. But too manual so far for me. If someone has a good OCR solution, please let me know.

- PaddleOCR has been rather good too, but I don't have a good script for it yet --I tested the toolbox so far.

A big thank you to @MrKid to push me to get it done. Hope the others can share your settings and workarounds so we can speed up the (soft)sub production.

superman4207 · May 4, 2023

I think the problem people are running into with following these instructions is because the OP forgets to mention to download his custom script that he attached to his post: "easyOcrImage.zip." as a STEP.

Follow his instructions until after you've used VideoSubFinder to extract the images of the text.
- Then download his file and unzip to your C: drive (this is just easier, trust me).
- Open cmd.exe.
- Type

Code:

/cd

(if you need to) and press Enter. Just make sure you're at the C; drive.

Code:

C:\>

- Copy and paste his command, updating the directory to match whatever yours is actually called:

Code:

C:\>easyOcrImage.py -l ch_tra -d "c:\VideoSubFinder\RGBImages\"

And it will start to OCR scan your files that were extracted. With his instructions and script, you don't need to locate the directory of anything except for where your RGBImages are.

Important Note: If you are using VideoSubFinder's Create Cleared Text Images (RGBImages->TXTImages) feature, make sure to change the path to point easyOCR to the TXTImages directory instead.

Code:

C:\>easyOcrImage.py -l ch_tra -d "c:\VideoSubFinder\TXTImages\"

hobbies · May 14, 2023

superman4207 said:
I think the problem people are running into with following these instructions is because the OP forgets to mention to download his custom script that he attached to his post: "easyOcrImage.zip." as a STEP.

Follow his instructions until after you've used VideoSubFinder to extract the images of the text.
- Then download his file and unzip to your C: drive (this is just easier, trust me).
- Open cmd.exe.
- Type

Code:

/cd

(if you need to) and press Enter. Just make sure you're at the C; drive.

Code:

C:\>

- Copy and paste his command, updating the directory to match whatever yours is actually called:

Code:

C:\>easyOcrImage.py -l ch_tra -d "c:\VideoSubFinder\RGBImages\"

And it will start to OCR scan your files that were extracted. With his instructions and script, you don't need to locate the directory of anything except for where your RGBImages are.

Important Note: If you are using VideoSubFinder's Create Cleared Text Images (RGBImages->TXTImages) feature, make sure to change the path to point easyOCR to the TXTImages directory instead.

Code:

C:\>easyOcrImage.py -l ch_tra -d "c:\VideoSubFinder\TXTImages\"

i follow your instructions but i get
'/cd' is not recognized as an internal or external command,
operable program or batch file.

'C:\' is not recognized as an internal or external command,
operable program or batch file.

SamKook · May 15, 2023

The "C:\>" is not part of the command, it's the command prompt telling you where it's pointing to.

And the "\cd" I assume they meant type "cd \" to go to the root of the c drive since the default usually is "C:\Windows\system32>" meaning you're in the system32 folder in the Windows folder that's located in the C drive.

Tutorial OCR Chinese(english) movies hardsub

Was this tutorial helpful?

yes

no

Member

Optical Character Recognition (OCR) for natural images/scenes​

Requirements:​

Software​

Installation​

Usage​

Attachments

New Member

Active Member

Member

Member

New Member

Well-Known Member

Akiba Citizen

Well-Known Member

Attachments

New Member

Akiba Citizen

New Member

Akiba Citizen

New Member

Grand Wizard

New Member

Well-Known Member

Attachments

JAV Perv Enthusiast

New Member

Grand Wizard

Similar threads

Optical Character Recognition (OCR) for natural images/scenes

Requirements:

Software

Installation

Usage