All Questions
3
questions
0
votes
1
answer
783
views
Why does copying text between Notepad++ files create files with different bytes?
I've created a simple pdf [hi.pdf] with the word hi and when I open it in Notepad++, its encoding is ANSI, which I assume is Notepad++'s best guess, with it opening successfully when I Save as ...
5
votes
0
answers
384
views
Can I tinker with the encoding when using pdftotext to convert PDF to text?
Sometimes when I do pdftotext it results in perfect text. I assume this is because the actual unicode text data is embedded directly in the PDF itself, and simply read out.
But other times (around ...
4
votes
1
answer
898
views
Why does this PDF appear to encode parentheses correctly but doesn't when using pdftotext or copying and pasting?
Here are links to some journal articles:
https://doi.org/10.1149/1.2183927
https://doi.org/10.1149/1.2988135
https://doi.org/10.1149/1.3021012
https://doi.org/10.1149/1.2159298
They all encode ...