Timeline for ffprobe OCR of a subtitle stream
Current License: CC BY-SA 4.0
4 events
when | what | by | license | comment | |
---|---|---|---|---|---|
Apr 30, 2022 at 14:45 | comment | added | Gyan | Add mpdecimate after the ocr filter to strip duplicates. | |
Apr 30, 2022 at 10:24 | vote | accept | Minty | ||
Apr 30, 2022 at 10:24 | comment | added | Minty | As I understand, this renders the subtitles on top of a black background of a size hd720 (I had to put hd1080 since my source is Subtitle: hdmv_pgs_subtitle, 1920x1080), and then OCRs the entire thing frame by frame, as I'm getting multiple reads of the same text. That's... pretty horrible. Beyond slow. But works, so thanks for showing me the way and an interesting trick! Since this appears to be the only way to do it with ffmpeg, I guess I'll stick to Subtitle Edit and maybe raw Tesseract at some point. PS I love your work. | |
Apr 30, 2022 at 4:41 | history | answered | Gyan | CC BY-SA 4.0 |
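
The "multiple reads of the same text" mentioned in the comments come from OCRing every frame while a subtitle stays on screen. The comment from Gyan suggests dropping the duplicate frames with mpdecimate inside the filter chain. As a rough illustration of a different approach, the sketch below collapses the repeats after the fact, in post-processing. It is not part of the answer: the function name `collapse_repeats` and the `(timestamp, text)` input shape are hypothetical, and it assumes the per-frame OCR text has already been collected by some other means.

```python
from typing import Iterable, List, Tuple


def collapse_repeats(
    reads: Iterable[Tuple[float, str]],
) -> List[Tuple[float, float, str]]:
    """Merge consecutive identical OCR reads into (start, end, text) cues.

    `reads` is a hypothetical sequence of (timestamp_seconds, ocr_text)
    pairs, one per frame, in presentation order.
    """
    cues: List[Tuple[float, float, str]] = []
    current = None  # [start, end, text] of the cue being built

    for timestamp, raw in reads:
        # Normalise whitespace so "Hello" and "Hello " count as the same read.
        text = " ".join(raw.split())

        if current is not None and text == current[2]:
            current[1] = timestamp  # same text again: extend the cue
            continue

        if current is not None:
            cues.append((current[0], current[1], current[2]))
            current = None

        if text:  # an empty read means no subtitle on screen
            current = [timestamp, timestamp, text]

    if current is not None:
        cues.append((current[0], current[1], current[2]))
    return cues
```

For example, reads of `"Hello"` at 0.0 s and 0.04 s, an empty read at 0.08 s, then `"World"` at 0.12 s, 0.16 s and 0.2 s would collapse to two cues. This only removes exact repeats after whitespace normalisation, so frames where the OCR result wobbles between slightly different spellings would still produce separate cues.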