Questions tagged [pdf]
Portable Document Format (PDF) is an open standard for electronic document exchange maintained by the International Organization for Standardization (ISO). Questions can be about creating, reading, or editing PDFs in any programming language.
82 questions
6 votes, 2 answers, 702 views
Speed up search function for PDFs
This function takes the path to a PDF file and one or more search strings. It outputs a count of the number of times the string(s) appear in the PDF. Any ideas how I can make it faster?
It can be tested ...
1 vote, 1 answer, 63 views
Re-write of PDF invoices from user input, decoupling of Invoice from FPDF class
Following on from this post and taking on board some of the suggestions, what follows is a re-write of the original Python code. It's also just a rewrite of some of the functionality; this code only ...
1 vote, 2 answers, 182 views
PDF invoices from user input
Firstly, the goal of this code is a little more complex than in this post, and the other invoice questions are in C#.
Is there anything that just pops out as immediately unpythonic/wrong?
I have a few ...
0 votes, 0 answers, 79 views
Walk a folder of files and extract creation date, countries mentioned and a summary based on their text for each
I made a script that scans a folder of 723 documents (pdf, pptx, docx) to extract the 'Name', 'Type', 'NbPages', 'Creation Date', 'Period Date', 'Countries', 'Summary', 'Indicateurs', 'Path'. I think ...
4 votes, 2 answers, 183 views
Unstructured to Structured TOC
The following code tries to convert an unstructured TOC with bounding box layout data given by the output of pdftotext -bbox-layout -f 11 -l 13 new_book.pdf toc.html...
7 votes, 3 answers, 876 views
Brute force search for pdf password in python
I forgot the password to my PDF. I remembered a few characters, so I made a simple password-cracker program in Python. Is the document lost now, given the number of combinations? Out of curiosity, how ...
3 votes, 1 answer, 66 views
PDFcomp: An easy PDF compressor script
I developed this script in my downtime so I can compress the PDFs that I am sent at university and also send scans to the various institutions that require formats that do not ...
4 votes, 2 answers, 326 views
Attempting an OOP approach for PDF Paranoia - Automate the Boring Stuff CH 15
This exercise is called PDF Paranoia and it comes from CH 15 of the book Automate the Boring Stuff With Python - second edition. There are two parts to the exercise, which I've separated into two ...
3 votes, 1 answer, 2k views
Adding a PDF file using iText7
Can I improve this code and make it more beautiful? For example, I am using a switch statement and for each page but I only add edits to page 1 and page 2?
...
3 votes, 1 answer, 393 views
Resume Builder using Jinja Templates and HTML
I have attempted to hack together a CV/Resume builder and I would like a review. The aim is to output a styled PDF containing dynamic fields. I have taken a simple HTML file, an arguably bloated CSS ...
0 votes, 1 answer, 190 views
How to optimize the code of my pdf generation from DGV code using iText5 & c#?
I have two DGVs, one containing bill info and another containing credit note info. When at least one item from each DGV is selected, it should generate a PDF (result1) taking data from ...
4 votes, 1 answer, 130 views
Shell script to download Project Euler problems and combine to PDF
This is a script I created that downloads Project Euler webpages and combines them into a PDF. The script also downloads animated files.
...
3 votes, 1 answer, 229 views
Using fluent builder pattern for generating pdf
I'm trying out a fluent builder pattern for generating a PDF file using AbcPDF.
I've looked at several methods of accomplishing this and I'd be very glad to get some input on what I've written so ...
4 votes, 1 answer, 296 views
Copy data from a pdf into word for the purpose of starting a report
In order to write some reports, I have to manually copy the client's name, address, claim number, etc. from a PDF that I receive. I want to automate this part of the report writing by using VBA to copy ...
7 votes, 1 answer, 2k views
Speeding up Python program that converts DOCX to PDF in Windows
This is meant to be a performance-centric question as this type of conversion is obviously very common. I'm wondering about the possibilities for making this process faster.
I have a program that ...