How to do OCR on a PDF document? [duplicate]

Question

Possible Duplicate:
How to extract text with OCR from a PDF on Linux?

I have a few documents in English and Hebrew that I scanned in and converted to PDF format.

Is there some free or cheap utility that can process a scanned PDF and do OCR, at least in English, preferably also in Hebrew?

Thanks!

A couple of similar questions. superuser.com/questions/28426/… superuser.com/questions/64124/… superuser.com/questions/97470/… — heavyd, Commented Feb 16, 2010 at 16:47
The author of this question did not specify that he is running Linux. The so-called possible duplicate question is too localized, and may not apply at all to the author of this question. — eleven81, Commented Feb 16, 2010 at 17:03
Not only is this not a duplicate, it's still unanswered. All 3 answers only yield text extracts, not a PDF document with selectable text. — cregox, Commented Jun 28, 2013 at 16:05

eleven81 · Accepted Answer · 2010-02-16 16:54:29Z

1

I found a list of free OCR software for Windows.

However, these programs need an image input, not a PDF input. For this, try a PDF-to-JPG converter.

answered Feb 16, 2010 at 16:54

community wiki

eleven81


eleven81 · Accepted Answer · 2010-02-16 16:47:59Z

1

I found an interesting idea that lets Google do all the work of OCR'ing the PDF files for you.

answered Feb 16, 2010 at 16:47

community wiki

eleven81

Rather than what's at that link, it's now simpler to just use docs.google.com/viewer.
– ShreevatsaR
Commented Aug 29, 2010 at 2:37


Dennis · Accepted Answer · 2010-02-16 16:47:33Z

0

Personally, I would use Ghostview to convert each page to an image, then Tesseract to convert the images to text. This is a totally free, open-source, cross-platform solution that has given me very good results on plain text. I don't use it for complex documents with tables and such, but for plain text you can't beat the price.
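The two steps can be scripted. What follows is only a rough sketch, not necessarily how this answer's author does it: it assumes the Ghostscript command-line tool (the engine behind Ghostview) is installed as `gs` (on Windows the executable is usually named differently, e.g. `gswin64c`), that `tesseract` is on the PATH, and that the `eng` and `heb` language data files are installed. The function names, the `page-%03d.png` naming pattern and the 300 dpi default are illustrative choices, not part of either tool.

```python
import subprocess
from pathlib import Path


def ghostscript_command(pdf_path, out_dir, dpi=300):
    """Build the Ghostscript call that renders each PDF page to a PNG file."""
    pattern = str(Path(out_dir) / "page-%03d.png")  # hypothetical naming scheme
    return [
        "gs",                        # assumed executable name (Linux/macOS)
        "-dNOPAUSE", "-dBATCH",      # run non-interactively and exit when done
        "-sDEVICE=png16m",           # 24-bit colour PNG output
        f"-r{dpi}",                  # render resolution in dpi
        f"-sOutputFile={pattern}",   # %03d is replaced by the page number
        str(pdf_path),
    ]


def tesseract_command(image_path, languages=("eng",)):
    """Build the Tesseract call; the text goes to <image name without suffix>.txt."""
    image_path = Path(image_path)
    output_base = image_path.with_suffix("")
    return ["tesseract", str(image_path), str(output_base),
            "-l", "+".join(languages)]


def ocr_pdf(pdf_path, out_dir, languages=("eng",), dpi=300):
    """Render the PDF to page images, OCR each page, return the combined text."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    subprocess.run(ghostscript_command(pdf_path, out_dir, dpi), check=True)
    pages = []
    for image in sorted(out_dir.glob("page-*.png")):
        subprocess.run(tesseract_command(image, languages), check=True)
        pages.append(image.with_suffix(".txt").read_text(encoding="utf-8"))
    return "\n".join(pages)


# Example call (needs both tools installed; "scan.pdf" is a placeholder):
# text = ocr_pdf("scan.pdf", "ocr-pages", languages=("eng", "heb"))
```

This produces plain text only. Appending the `pdf` config name to the Tesseract command makes recent Tesseract versions write a searchable PDF per page instead, which may be closer to what the comment under the question asks for.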

answered Feb 16, 2010 at 16:47

community wiki

Dennis

