Skip to main content

All Questions

27 votes
5 answers
27k views

remove <200b> character from text file

I have a huge text file containing this string/character <200b> that I want to delete. I tried with sed but it didn't work. sed 's/<200b>//g' file The character never shows when I open the ...
0 votes
1 answer
10k views

Internet Explorer 11 not displaying UTF-8 encoded pages correctly

Some pages in IE 11, that seem to be UTF-8, are not displaying correctly. The same page displays correctly in Mozilla Firefox. I first noticed this in Amazon.com on the Orders page (pretty much any of ...
32 votes
1 answer
37k views

What is this '°͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌' strange character?

I saw this °͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌͌ strange character online. I also noted that: It needs 26 backspaces to delete but behaves like one character when selecting. It is drawn vertically covering ...
0 votes
1 answer
236 views

UTF-8 Decoders fail to decode the encoded strings

I have some encoded values values which I believe is UTF-8. Now I dont really know if it is UTF-8 or not because other online tool and steps to decode UTF-8 is not working, BUT an open source tool ...
2 votes
1 answer
144 views

VIM uses wrong encoding - but only in status messages

I ran into a strange issue with my ArchLinux setup. Vim uses correct encoding for reading/displaying files but these status messages (which displays the current mode or reports back when the buffer is ...
4 votes
1 answer
1k views

How to identify a file encoding?

I'm trying to figure out the encoding of a text file. I did try a lot of the common ones (with Notepad++), but I've failed so far. A few hints: The file was originally an Eudora mbx file, with mostly ...
0 votes
1 answer
1k views

Wrong character encoding in ssh session – but not for all connectios

I have an odd issue when connecting to my (Ubuntu) server via SSH. If I connect from my Gentoo box, all is fine. All Umlauts etc. work, I can type "ÄÖÜ" and so on. If I do the same from my ...
0 votes
1 answer
931 views

How to read Linux text files in Windows system?

For example, I run the top command and store it to a file in Linux, after that I open that file in Windows it contains some gibberish. Here is the file viewed in Notepad++: The option to convert to ...
1 vote
2 answers
48k views

¢tRÂà³Ab.Ÿân TXT files: how to switch from weird characters back to normal?

So, I have on a flash drive a txt file generated in Cyrillic (my own work, own pen drive), a few years old. Now I needed to open it, only to see this kind of mess: . I wonder why is this happening and ...
5 votes
2 answers
21k views

What is the difference between Windows-1252 and ANSI encoding?

I'm trying to convert UTF-8 to ANSI encoding through a tool. But it shows Western European (Windows)-1252 instead of ANSI. Are they both the same thing? Should I go ahead with this?
1 vote
0 answers
283 views

How to use ISO8859-9 encoding in terminal?

I maked a file containing "ırmak" with a text editor via encoding ISO8859-9. Then, I tried to print the content with "cat" command in the terminal. But I could not. I use the ...
1 vote
1 answer
805 views

Convert Korean files that are showing up incorrectly to utf-8 - character shows Çѱ¹Ÿî

I was just about to ask this after a long time of searching so decided to answer my own question... I downloaded Korean subtitles in an .smi file that was in zip archive. When I extracted it, the ...
0 votes
2 answers
688 views

Restoring corrupted UTF-8 files

After my PC broke down I managed to make a backup of the relevant files before reinstalling Windows. Now that I'm restoring those files and setting the system up I noticed that some of the files got ...
16 votes
1 answer
33k views

Determining the encoding of a file on Mac OS X?

How can I determine the character encoding of plain text files (such as HTML) on Mac OS X?
2 votes
2 answers
2k views

How to fix encoding - curly apostrophe appears as ‰Ûª

I have a text-file in which all the ASCII characters appear correctly but some others do not. In particular there is this word: don‰Ûªt In hex the bytes are 64 6f 6e 89 db aa 74. Obviously, it is ...

15 30 50 per page
1
2 3 4 5