tesseract OCR giving no output and no diagnostics

Question

I've used tesseract for OCR successfully a few times in the past. This is on macOS, with tesseract installed via Homebrew.

Today, I did

WGroleau@MBP ~ % brew upgrade # to make sure everything is the latest and then …
WGroleau@MBP ~ % tesseract ~/Downloads/temp.jpg stdout -l chi_sim
福佳生活饶

The last (fifth) character was incorrect, so I made a minor graphic edit to that character and ran the same command. No output, no diagnostics. Running it in verbose mode still gave no diagnostics, only the list of libraries it used.

Cropped the edited character out of the file and tried. Still no output, no diagnostics.

What do I do next?

Here's the file after edit but before cropping: [image not preserved]

Update: If I tell it to use "Legacy engine only," I get:

Error: Tesseract (legacy) engine requested, but components are not present in /usr/local/share/tessdata/chi_sim.traineddata!!
Failed loading language 'chi_sim'
Tesseract couldn't load any languages!
Could not initialize tesseract.

Do you still have access to the original temp.jpg? What if you edit it in a different program? Or even re-save the current file using a different program. I mean it looks like the first program saved the jpg in a form that triggers some bug(?) in tesseract. If another program manages to produce a "more compatible" jpg then the simplest workaround will be to use it instead of the first program when editing for tesseract. — Kamil Maciorowski, Commented Aug 7, 2023 at 11:07
'temp.jpg' is the name I overwrite almost daily for throwaway items. I don't have another graphic editor. But I've used GraphicConverter many times this way with another OCR and twice successfully with tesseract. Doing it today does get a diagnostic—a usage screen as if no filename was provided. — WGroleau, Commented Aug 7, 2023 at 14:13

WGroleau · Accepted Answer · 2023-08-09 18:07:35Z

Adding the option --psm 13 made it work. I don't understand why it worked two or more times before on .jpg files saved by the same program. Nor do I understand why there is no diagnostic message when it fails silently (the first case), or why it prints a misleading one (the usage screen) in the second case.
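For anyone hitting the same silent failure, one way to find a mode that works is to try several page segmentation modes in turn. This is only a minimal sketch, assuming a POSIX shell and tesseract on the PATH. The function name try_psm_modes and the choice of modes (6, 7, 8, 10, 13) are illustrative and not something from the original run.

```shell
#!/bin/sh
# Sketch: run tesseract once per page segmentation mode and report
# which modes return any text. try_psm_modes is a hypothetical helper
# name, and the mode list is an assumption chosen for small crops.
try_psm_modes() {
    image=$1
    lang=$2
    for psm in 6 7 8 10 13; do
        # Discard stderr and strip any form-feed page separator so an
        # "empty" result really is an empty string.
        text=$(tesseract "$image" stdout -l "$lang" --psm "$psm" 2>/dev/null | tr -d '\f')
        if [ -n "$text" ]; then
            printf 'psm %s: %s\n' "$psm" "$text"
        else
            printf 'psm %s: (no output)\n' "$psm"
        fi
    done
}

# Example call, using the path and language from the question:
# try_psm_modes ~/Downloads/temp.jpg chi_sim
```

Modes 7 (single text line), 8 (single word), 10 (single character) and 13 (raw line) are the ones aimed at small images like this one; tesseract --help-psm lists them all.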
