All Questions
Tagged with character-encoding encoding
66
questions
27
votes
5
answers
27k
views
remove <200b> character from text file
I have a huge text file containing this string/character <200b> that I want to delete. I tried with sed but it didn't work.
sed 's/<200b>//g' file
The character never shows when I open the ...
0
votes
1
answer
10k
views
Internet Explorer 11 not displaying UTF-8 encoded pages correctly
Some pages in IE 11, that seem to be UTF-8, are not displaying correctly. The same page displays correctly in Mozilla Firefox. I first noticed this in Amazon.com on the Orders page (pretty much any of ...
32
votes
1
answer
37k
views
What is this '°͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌' strange character?
I saw this °͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌ strange character online. I also noted that:
It needs 26 backspaces to delete but behaves like one character when selecting.
It is drawn vertically covering ...
0
votes
1
answer
236
views
UTF-8 Decoders fail to decode the encoded strings
I have some encoded values values which I believe is UTF-8. Now I dont really know if it is UTF-8 or not because other online tool and steps to decode UTF-8 is not working, BUT an open source tool ...
2
votes
1
answer
144
views
VIM uses wrong encoding - but only in status messages
I ran into a strange issue with my ArchLinux setup. Vim uses correct encoding for reading/displaying files but these status messages (which displays the current mode or reports back when the buffer is ...
4
votes
1
answer
1k
views
How to identify a file encoding?
I'm trying to figure out the encoding of a text file. I did try a lot of the common ones (with Notepad++), but I've failed so far.
A few hints: The file was originally an Eudora mbx file, with mostly ...
0
votes
1
answer
1k
views
Wrong character encoding in ssh session – but not for all connectios
I have an odd issue when connecting to my (Ubuntu) server via SSH.
If I connect from my Gentoo box, all is fine. All Umlauts etc. work, I can type "ÄÖÜ" and so on.
If I do the same from my ...
0
votes
1
answer
931
views
How to read Linux text files in Windows system?
For example, I run the top command and store it to a file in Linux, after that I open that file in Windows it contains some gibberish. Here is the file viewed in Notepad++:
The option to convert to ...
1
vote
2
answers
48k
views
¢tRÂà³Ab.Ÿân TXT files: how to switch from weird characters back to normal?
So, I have on a flash drive a txt file generated in Cyrillic (my own work, own pen drive), a few years old. Now I needed to open it, only to see this kind of mess:
.
I wonder why is this happening and ...
5
votes
2
answers
21k
views
What is the difference between Windows-1252 and ANSI encoding?
I'm trying to convert UTF-8 to ANSI encoding through a tool.
But it shows Western European (Windows)-1252 instead of ANSI.
Are they both the same thing? Should I go ahead with this?
1
vote
0
answers
283
views
How to use ISO8859-9 encoding in terminal?
I maked a file containing "ırmak" with a text editor via encoding ISO8859-9.
Then, I tried to print the content with "cat" command in the terminal. But I could not.
I use the ...
1
vote
1
answer
805
views
Convert Korean files that are showing up incorrectly to utf-8 - character shows Çѱ¹Ÿî
I was just about to ask this after a long time of searching so decided to answer my own question...
I downloaded Korean subtitles in an .smi file that was in zip archive. When I extracted it, the ...
0
votes
2
answers
688
views
Restoring corrupted UTF-8 files
After my PC broke down I managed to make a backup of the relevant files before reinstalling Windows.
Now that I'm restoring those files and setting the system up I noticed that some of the files got ...
16
votes
1
answer
33k
views
Determining the encoding of a file on Mac OS X?
How can I determine the character encoding of plain text files (such as HTML) on Mac OS X?
2
votes
2
answers
2k
views
How to fix encoding - curly apostrophe appears as ‰Ûª
I have a text-file in which all the ASCII characters appear correctly but some others do not. In particular there is this word:
don‰Ûªt
In hex the bytes are 64 6f 6e 89 db aa 74. Obviously, it is ...