Skip to main content

All Questions

0 votes
1 answer
783 views

Why does copying text between Notepad++ files create files with different bytes?

I've created a simple pdf [hi.pdf] with the word hi and when I open it in Notepad++, its encoding is ANSI, which I assume is Notepad++'s best guess, with it opening successfully when I Save as ...
David Klempfner's user avatar
5 votes
0 answers
384 views

Can I tinker with the encoding when using pdftotext to convert PDF to text?

Sometimes when I do pdftotext it results in perfect text. I assume this is because the actual unicode text data is embedded directly in the PDF itself, and simply read out. But other times (around ...
Lance's user avatar
  • 387
4 votes
1 answer
898 views

Why does this PDF appear to encode parentheses correctly but doesn't when using pdftotext or copying and pasting?

Here are links to some journal articles: https://doi.org/10.1149/1.2183927 https://doi.org/10.1149/1.2988135 https://doi.org/10.1149/1.3021012 https://doi.org/10.1149/1.2159298 They all encode ...
Nathaniel M. Beaver's user avatar