Skip to main content

All Questions

0 votes
1 answer
236 views

UTF-8 Decoders fail to decode the encoded strings

I have some encoded values values which I believe is UTF-8. Now I dont really know if it is UTF-8 or not because other online tool and steps to decode UTF-8 is not working, BUT an open source tool ...
Solo's user avatar
  • 3
2 votes
1 answer
144 views

VIM uses wrong encoding - but only in status messages

I ran into a strange issue with my ArchLinux setup. Vim uses correct encoding for reading/displaying files but these status messages (which displays the current mode or reports back when the buffer is ...
Gabor Garami's user avatar
4 votes
1 answer
1k views

How to identify a file encoding?

I'm trying to figure out the encoding of a text file. I did try a lot of the common ones (with Notepad++), but I've failed so far. A few hints: The file was originally an Eudora mbx file, with mostly ...
jmr's user avatar
  • 536
0 votes
1 answer
1k views

Wrong character encoding in ssh session – but not for all connectios

I have an odd issue when connecting to my (Ubuntu) server via SSH. If I connect from my Gentoo box, all is fine. All Umlauts etc. work, I can type "ÄÖÜ" and so on. If I do the same from my ...
Tobias Leupold's user avatar
0 votes
1 answer
931 views

How to read Linux text files in Windows system?

For example, I run the top command and store it to a file in Linux, after that I open that file in Windows it contains some gibberish. Here is the file viewed in Notepad++: The option to convert to ...
TeaViris's user avatar
1 vote
0 answers
283 views

How to use ISO8859-9 encoding in terminal?

I maked a file containing "ırmak" with a text editor via encoding ISO8859-9. Then, I tried to print the content with "cat" command in the terminal. But I could not. I use the ...
user's user avatar
  • 11
1 vote
1 answer
805 views

Convert Korean files that are showing up incorrectly to utf-8 - character shows Çѱ¹Ÿî

I was just about to ask this after a long time of searching so decided to answer my own question... I downloaded Korean subtitles in an .smi file that was in zip archive. When I extracted it, the ...
iateadonut's user avatar
0 votes
2 answers
688 views

Restoring corrupted UTF-8 files

After my PC broke down I managed to make a backup of the relevant files before reinstalling Windows. Now that I'm restoring those files and setting the system up I noticed that some of the files got ...
wileecoyote's user avatar
0 votes
1 answer
915 views

Printf in gawk with the correct encoding?

I'm wondering: can gawk printf in any format besides ASCII? Currently, I'm using gawk match() to search through some UTF-8 text. When I go ahead and print out the matches gawk finds, it ends up like ...
ixns's user avatar
  • 1
1 vote
0 answers
93 views

How to fix accentuation encoding with cmd.exe running inside bash?

I installed https://www.msys2.org/ and setup an ssh server for it. With this I can connect to my machine and work remotely. The problem is that some application as Visual Studio tools or windows ...
user's user avatar
  • 468
0 votes
1 answer
783 views

Why does copying text between Notepad++ files create files with different bytes?

I've created a simple pdf [hi.pdf] with the word hi and when I open it in Notepad++, its encoding is ANSI, which I assume is Notepad++'s best guess, with it opening successfully when I Save as ...
David Klempfner's user avatar
0 votes
0 answers
847 views

What is this for a file name encoding and how to fix it?

On my Linux machine I found old files (at least from 2004 if not older), so possibly Win9x days. Maybe they came over some old FAT drive on my disk or some old Samba share. Umlaute are very weirdly ...
divB's user avatar
  • 589
0 votes
2 answers
2k views

Finding the encoding of a text file containing weird characters

I recently received a file, of Turkish origin, where the file has some English words which I can easily read, and some weird characters. I wonder if this file is encoded, encrypted or sth else. I ...
Sean Goudarzi's user avatar
0 votes
1 answer
184 views

BER decode SubjectAltName and CHOICE?

I'm having trouble working out the syntax when decoding a SubjectAltName in a TLS self-signed certificate. I believe the certificate is well formed. The trouble is, I don't understand how to decode ...
jww's user avatar
  • 12.3k
5 votes
0 answers
384 views

Can I tinker with the encoding when using pdftotext to convert PDF to text?

Sometimes when I do pdftotext it results in perfect text. I assume this is because the actual unicode text data is embedded directly in the PDF itself, and simply read out. But other times (around ...
Lance's user avatar
  • 387
2 votes
1 answer
299 views

How to send an e‑mail to an address with Latin9/iso‑8859‑15 characters inside the username part of the address?

As part of finding a job, I need to send an e‑mail to an address which contains latin letters with accents inside the username. I know this is not standard, but they did it and there’s less than 1000 ...
user2284570's user avatar
  • 1,883
0 votes
0 answers
1k views

How do you create custom zalgo text?

I know what zalgo text is and that there are a few websites that can make it for you. But I'm looking for how I can make it however I want." HͥAͣQͫ" is an example, how can I make it so I can choose ...
MadMagic's user avatar
0 votes
1 answer
1k views

Which type of character encoding consumes the most memory?

I will store a long (45132 character) string in a Postrgres database whilst preserving every character (including really rare ones). Postgres can store strings up to 1GB (see here). In terms of the ...
stevec's user avatar
  • 839
2 votes
1 answer
3k views

What text encoding is used in "Zip archive data, at least v?[0x314] to extract"?

I tried multiple search terms on the usual suspect search engines as well as on stack overflow and superuser, but can't find anything on this. Normally a Zip archive magic pattern is "at least v1.0" ...
Mike M's user avatar
  • 23
0 votes
2 answers
5k views

Exporting from Excel to CSV replaces Japanese characters with ??? even though Windows, Office locale is Japan/Japanese

I am exporting an excel file (Excel 2016) containing Japanese characters into CSV. (Note : I am not exporting to CSV UTF-8 provided). In the process, all Japanese characters are replaced with '?' My ...
user10989928's user avatar
0 votes
0 answers
1k views

Interpret text file with some hex codes?

I have a file with contents looking like PK\u0003\u0004\u0014\u0000\u0006\u0000\b\u0000\u0000\u0000!\u0000À¸<91><91>¢\u0001. However, I have a different version of the same file looking ...
3yakuya's user avatar
  • 161
2 votes
0 answers
446 views

Execute sh file with Russian or Chines Chars - Saved as UTF-8 or Unicode

I have a file which has some russian chars in. Below is the content of by sh file. #!/bin/sh sed -i "s/\bVAR1\b/Привет, как ты/g" file1.txt When i save this file i had to save this as UTF-8 or ...
Venkatesan Sundar's user avatar
1 vote
2 answers
48k views

¢tRÂà³Ab.Ÿân TXT files: how to switch from weird characters back to normal?

So, I have on a flash drive a txt file generated in Cyrillic (my own work, own pen drive), a few years old. Now I needed to open it, only to see this kind of mess: . I wonder why is this happening and ...
Alex's user avatar
  • 11
0 votes
2 answers
1k views

What is causing these two apparently identical files to have different hashes?

I am unable to figure out why the following two files are yielding different hashes (SHA1, CRC32, SHA384, whatever): https://cdn.jsdelivr.net/npm/[email protected]/dist/jsonify-error.js https://...
Pedro A's user avatar
  • 177
0 votes
0 answers
681 views

What encoding is needed to read this HTML file containing Spanish characters?

I downloaded this HTML file from archive.org, which is a digitilised book, written in Spanish. As such, it has plenty of accents and other unique symbols (e.g. ñ). Now, when I open the html file in a ...
luchonacho's user avatar
0 votes
1 answer
943 views

Symbols are lost after conversion from UTF-8 to ISO8859-1 and back to UTF-8

I have a file with properties in French. I would like to convert it to ISO8859-1. But after conversion, some symbols are lost. What is wrong? > cat fr.properties VAR2="élément n’a" > cat fr....
Volodymyr Bezuglyy's user avatar
2 votes
2 answers
2k views

How to fix encoding - curly apostrophe appears as ‰Ûª

I have a text-file in which all the ASCII characters appear correctly but some others do not. In particular there is this word: don‰Ûªt In hex the bytes are 64 6f 6e 89 db aa 74. Obviously, it is ...
user1310503's user avatar
0 votes
2 answers
2k views

Binary encoding formats

echo random text > text_file Saves text_file in text format with ASCII encoding. To check the encoding, I do chardetect text_file which tells me that the file is ASCII encoded. Now I have a jpg ...
GypsyCosmonaut's user avatar
-1 votes
1 answer
449 views

Opening a 7z file on mac shows strange encodings

I am opening this file: https://dataverse.harvard.edu/file.xhtml?fileId=2901965&version=1.0 On TextEdit. When I do, the file looks like: 7zºØ'· t(ù‚]%N ò„,¢‡ò] ¬CRJ∫√ıìaIÄÅ≤íhYTÔ1/ıG3À=ôN’(...
Dhruv Ghulati's user avatar
0 votes
0 answers
701 views

How can I make `file -i` command return ISO-8859-9?

Let's say I have a text file with content şşş and in vi/vim I have set fileencoding as :set fileencoding=ISO-8859-9. Later when I check the encoding with file -i <myFile> it gives text/...
destan's user avatar
  • 1,147
5 votes
2 answers
21k views

What is the difference between Windows-1252 and ANSI encoding?

I'm trying to convert UTF-8 to ANSI encoding through a tool. But it shows Western European (Windows)-1252 instead of ANSI. Are they both the same thing? Should I go ahead with this?
Abd's user avatar
  • 119
153 votes
2 answers
230k views

How do I change the character encoding for a webpage in Chrome?

Google Chrome, like almost every web browser in recent memory, used to have an option to change the character encoding for the webpage being viewed by going to Menu › Tools › Encoding (or some similar ...
Meshaal's user avatar
  • 1,632
0 votes
1 answer
10k views

Internet Explorer 11 not displaying UTF-8 encoded pages correctly

Some pages in IE 11, that seem to be UTF-8, are not displaying correctly. The same page displays correctly in Mozilla Firefox. I first noticed this in Amazon.com on the Orders page (pretty much any of ...
BillDOe's user avatar
  • 1,741
0 votes
1 answer
2k views

incorrect encoding of rsyslog output

I'm running a rsyslog service in a docker container. Its configuration(rsyslog.conf) looks like below, module(load="imuxsock") # local system logging support (e.g. via logger command) ...
Kane's user avatar
  • 505
5 votes
3 answers
13k views

Determine and change filename encoding on Windows

I have files on a Windows server that have certain accented characters in the name. On Windows Explorer files are displayed normally but running 'dir' at the command prompt with default settings ...
nixer's user avatar
  • 61
1 vote
1 answer
257 views

Notepad++ proper encoding settings

Notepad++ can't display characters like ⊰ http://graphemica.com/%E2%8A%B0 . What is the best settings to solve this problem?
user2132188's user avatar
27 votes
5 answers
27k views

remove <200b> character from text file

I have a huge text file containing this string/character <200b> that I want to delete. I tried with sed but it didn't work. sed 's/<200b>//g' file The character never shows when I open the ...
user2598997's user avatar
0 votes
3 answers
3k views

Accented character issue

I have a bunch of websites encoded in UTF-8. They contain accents and those weird n's with the squiggles over them and all kinds of fun stuff. When I try and open them in any text editor, even ...
Simon Kiely's user avatar
0 votes
0 answers
684 views

Improper Encoding in Chrome

I started to get this weird issue with chrome (latest) where specific characters would be displayed as other characters. Here is one example so instead of < and > it displays W (with the ...
Nick Ginanto's user avatar
  • 1,405
0 votes
2 answers
3k views

Determine The Encoding of a specific character in Word 2010

I'm having a problem with a generated document (coming from crystal reports engine). Initially hyphens are visible but if the text is copied and pasted with "keep text only" option or "remove ...
bumble_bee_tuna's user avatar
0 votes
1 answer
1k views

How can I tell what the character encoding of a string of text is?

Troubleshooting an irritating password issue. There's an application we are using that requires a login; I type in the password, it tells me it's wrong. If I open a password manager and copy and ...
Bart Silverstrim's user avatar
2 votes
0 answers
1k views

File names garbled in rar archive, possibly double utf8 encoded? Can't figure out how to reverse

I'm trying to restore the uploads folder of a wordpress installation. The folder went through some combination of ftp or sftp transfer and got compressed by rar. (I don't know the exact process, it ...
proto-n's user avatar
  • 121
4 votes
2 answers
4k views

Which default encoding for Vim: utf8 or latin1?

I'm writing text in European languages as French, German, Italian. All characters are compatible with latin1 encoding. Thats why I decided to set the default encoding in vim to latin1. Once in ...
Reman's user avatar
  • 231
2 votes
0 answers
436 views

How is web form content encoded in Safari's cache?

A friend of mine just accidentally navigated off a web page where she entered a rather lengthy text into a web form. When navigating back to the page, the text was lost. (Spoiler: The page had an ...
schluchc's user avatar
  • 462
1 vote
0 answers
1k views

WLMail displays messages correctly on screen, but prints  where there is =C2=A0 for some messages

There are many questions online related to this. And I understand that =C2=A0 stands for non-breaking space when it appears within a mime part with type: Content-Type: text/html; charset=UTF-8 My ...
Doochz's user avatar
  • 174
1 vote
1 answer
3k views

How to encode a text string into QP?

I would like to encode some basic text strings into QP. I have tested several online encoders on the web, and none of them worked. One of them can be found at Webatic. I can use it to decode QP into ...
Samir's user avatar
  • 20.9k
3 votes
2 answers
2k views

What are these strange characters? [duplicate]

I couldn't put them in the title because they actually seem to have many bytes: I took some screenshots:
Rafael's user avatar
  • 165
1 vote
3 answers
7k views

Displaying Korean characters properly on a computer with English Windows XP

I have Windows XP Professional Version 2002 SP 3 English version I have installed the required input methods in Regional and Language Settings and as far as I know all the other options provided there ...
user13267's user avatar
  • 1,721
1 vote
1 answer
8k views

Convert .txt. file from windows-1251 to utf-8

I'm trying to read the cyrillic text that is stored in .txt format. Once I open it with OpenOffice, it is OK. Once I open it with Notepad++, it shows unreadable symbols. Setting Windows-1251 prior ...
Haradzieniec's user avatar
0 votes
0 answers
300 views

Vim scripting with Tcl: encoding mismatch

I am trying to write a plugin for Vim using Tcl as the backend language. I have written and tested most of the code and it seems to work fine except for one hurdle I can't seem to get over: encoding. ...
user avatar

15 30 50 per page