9

My supervisor usually requests me to deliver some kind of written reports. I write them in LaTeX, deliver them as PDF, and get them back with lots of comments. She uses Acrobat Professional to create the comments.

I would like to extract a list of all comments from the file so I can in a second step import them into a task management tool. Some simple text format will suffice, I can write a script to add markup if needed. All I want is that single comments are separated somehow, e.g. by two blank rows (I think she sometimes adds newlines into comments).

I strongly prefer a solution which works under Linux, but in the worst case will accept something which requires Windows. I don't have a license for Acrobat professional or other paid PDF readers, so please recommend free software if possible.

3 Answers 3

4

Since you use LaTeX, the texlive-latex-extra package on Ubuntu contains the pdfannotextractor executable that can extract comments for you. This uses libpdfbox-java that you will also need to install.

Another option is to install python-poppler and use this Python script from a similar stackoverflow question (see answer by Enno Gröper): Parse annotations from a pdf

2
  • 2
    notes: a) do not install libpdfbox-java directly as this program relies on an old version of pdfbox rather run pdfannotextractor --install, b) the program doesn't really work (in my experience) to extract annotations to plain text
    – jaimedash
    Commented Apr 27, 2017 at 18:28
  • 2
    Yep, also not working for me: !!! Warning: Annotation on page 1 not recognized! java.lang.Exception: Unsupported annotation subtype: FreeText
    – naught101
    Commented Aug 29, 2017 at 3:07
2

Comments in PDFs can be exported/imported as FDF files. FDF files are usually only a few megabytes in size (unless the comments include images or other files). The comments list can also be exported to Word. Both options are found in the options menu in the comments pane as in the screenshot below (from Acrobat X Professional).

enter image description here

0

I think the terminology is Annotations, so that may help in your searches. ;)

Otherwise, perhaps an online solution such as https://www.sumnotes.net/ ?

Note: I found this particular link in an answer on another SU question: Print only the annotations of a pdf. I've never used it myself.

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged .