
All Questions

22 questions with no upvoted or accepted answers
5 votes
0 answers
384 views

Can I tinker with the encoding when using pdftotext to convert PDF to text?

Sometimes when I run pdftotext, it produces perfect text. I assume this is because the actual Unicode text data is embedded directly in the PDF itself and simply read out. But other times (around ...
Lance
  • 387
2 votes
1 answer
299 views

How to send an e‑mail to an address with Latin9/iso‑8859‑15 characters inside the username part of the address?

As part of finding a job, I need to send an e‑mail to an address which contains Latin letters with accents inside the username. I know this is not standard, but they did it and there’s less than 1000 ...
user2284570
  • 1,883
2 votes
0 answers
446 views

Execute sh file with Russian or Chinese Chars - Saved as UTF-8 or Unicode

I have a file which has some Russian chars in it. Below is the content of my sh file. #!/bin/sh sed -i "s/\bVAR1\b/Привет, как ты/g" file1.txt When I save this file I had to save this as UTF-8 or ...
Venkatesan Sundar
2 votes
0 answers
1k views

File names garbled in rar archive, possibly double utf8 encoded? Can't figure out how to reverse

I'm trying to restore the uploads folder of a WordPress installation. The folder went through some combination of FTP or SFTP transfer and got compressed by RAR. (I don't know the exact process, it ...
proto-n
  • 121
2 votes
0 answers
436 views

How is web form content encoded in Safari's cache?

A friend of mine just accidentally navigated off a web page where she entered a rather lengthy text into a web form. When navigating back to the page, the text was lost. (Spoiler: The page had an ...
schluchc
  • 462
1 vote
0 answers
283 views

How to use ISO8859-9 encoding in terminal?

I created a file containing "ırmak" with a text editor using the ISO8859-9 encoding. Then I tried to print the content with the "cat" command in the terminal, but I could not. I use the ...
user
  • 11
1 vote
0 answers
93 views

How to fix accentuation encoding with cmd.exe running inside bash?

I installed https://www.msys2.org/ and set up an SSH server for it. With this I can connect to my machine and work remotely. The problem is that some applications, such as Visual Studio tools or windows ...
user
  • 468
1 vote
0 answers
1k views

WLMail displays messages correctly on screen, but prints Â where there is =C2=A0 for some messages

There are many questions online related to this, and I understand that =C2=A0 stands for a non-breaking space when it appears within a MIME part with type: Content-Type: text/html; charset=UTF-8 My ...
Doochz
  • 174
1 vote
1 answer
404 views

Automated script to change page encoding after page has been downloaded in Google Chrome

guys! My Chrome doesn't properly recognize the page encoding of our local TWIKI site. So every time I enter the site I have to set a suitable encoding manually (Settings -> Tools -> Encoding -> Cyrillic (...
theendrew
0 votes
2 answers
688 views

Restoring corrupted UTF-8 files

After my PC broke down I managed to make a backup of the relevant files before reinstalling Windows. Now that I'm restoring those files and setting the system up I noticed that some of the files got ...
wileecoyote
0 votes
1 answer
915 views

Printf in gawk with the correct encoding?

I'm wondering: can gawk printf in any format besides ASCII? Currently, I'm using gawk match() to search through some UTF-8 text. When I go ahead and print out the matches gawk finds, it ends up like ...
ixns
  • 1
0 votes
0 answers
847 views

What kind of file name encoding is this and how to fix it?

On my Linux machine I found old files (at least from 2004 if not older), so possibly from the Win9x days. Maybe they came from some old FAT drive onto my disk or from some old Samba share. Umlauts are very weirdly ...
divB
  • 589
0 votes
2 answers
2k views

Finding the encoding of a text file containing weird characters

I recently received a file of Turkish origin that has some English words, which I can easily read, and some weird characters. I wonder if this file is encoded, encrypted, or something else. I ...
Sean Goudarzi
0 votes
0 answers
1k views

How do you create custom zalgo text?

I know what zalgo text is and that there are a few websites that can make it for you. But I'm looking for how I can make it however I want. "HͥAͣQͫ" is an example; how can I make it so I can choose ...
MadMagic
0 votes
0 answers
1k views

Interpret text file with some hex codes?

I have a file with contents looking like PK\u0003\u0004\u0014\u0000\u0006\u0000\b\u0000\u0000\u0000!\u0000À¸<91><91>¢\u0001. However, I have a different version of the same file looking ...
3yakuya
  • 161
