0

I have a a DJVU file where I can select, copy, highlight the text of it, i.e. the DJVU is text readable or character recognizable. I want to convert this file to character recognizable pdf file. Is there online charge-free system or free software to that?

Thanks.

3
  • superuser.com/q/185523/777184
    – Madhubala
    Commented Aug 12, 2020 at 10:02
  • @Madhubala Could u specifically tell which one converts to text recognizable pdf file plz?
    – Michael
    Commented Aug 12, 2020 at 11:23
  • Added it as an answer , check if it helps and feel free to accept if it answers your question
    – Madhubala
    Commented Aug 12, 2020 at 12:50

1 Answer 1

2

I use DjVuLibre : DjVu

In DjVuLibre

To convert it to pdf : File > Export as… > PDF format

To convert pdf file to character recognizable pdf file :

you can use Acrobat Pro/Standard DC 2020 (direct link Acrobat Pro/Standard DC 2020(it comes with trial version)

in Acrobat > Tools > Scan and OCR > Recognize Text > in this file

IMG1 enter image description here

Or

you can use tesseract-ocr : tesseract-ocr download page.I use it with wsl/linux.Here is the bash script (it uses ghostscript && tesseract)

#!/bin/bash

notify-send "started gs" && pdftoppm -r 660 *.pdf outputname -png && for i in *png; do b=`basename "$i" .png`; tesseract "$i" "$b" pdf; done && gs -dBATCH -dNOPAUSE -q -sDEVICE=pdfwrite -dAutoRotatePages=/None -sOutputFile=finished.pdf outputname-*.pdf && notify-send "done" && notify-send "deleting left overs" && rm *png && rm outputname-*.pdf && notify-send "cleaned"

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged .