Skip to main content

Timeline for ffprobe OCR of a subtitle stream

Current License: CC BY-SA 4.0

4 events
when toggle format what by license comment
Apr 30, 2022 at 14:45 comment added Gyan Add mpdecimate after the ocr filter to strip duplicates.
Apr 30, 2022 at 10:24 vote accept Minty
Apr 30, 2022 at 10:24 comment added Minty As I understand, this renders the subtitles on top of a black background of a size hd720 (I had to put hd1080 since my source is Subtitle: hdmv_pgs_subtitle, 1920x1080), and then OCRs the entire thing frame by frame, as I'm getting multiple reads of the same text. That's... pretty horrible. Beyond slow. But works, so thanks for showing me the way and an interesting trick! Since this appears to be the only way to do it with ffmpeg, I guess I'll stick to Subtitle Edit and maybe raw Tesseract at some point. PS I love your work.
Apr 30, 2022 at 4:41 history answered Gyan CC BY-SA 4.0