Questions tagged [character-encoding]
A character encoding system consists of a code that pairs each character from a given repertoire with something else — such as a bit pattern, sequence of natural numbers, octets, or electrical pulses — in order to facilitate the transmission of data (generally numbers or text) through telecommunication networks or for data storage.
305
questions
0
votes
1
answer
220
views
UTF-8 Decoders fail to decode the encoded strings
I have some encoded values values which I believe is UTF-8. Now I dont really know if it is UTF-8 or not because other online tool and steps to decode UTF-8 is not working, BUT an open source tool ...
0
votes
1
answer
318
views
notepad is displaying txt file contents as weird symbols
I am writing integer values into file from kotlin (kotlin Int type) using something like this
var1BufferedWriter?.write(String.format("%d\n", intvar ) )
var2BufferedWriter?.write(String....
2
votes
1
answer
140
views
VIM uses wrong encoding - but only in status messages
I ran into a strange issue with my ArchLinux setup. Vim uses correct encoding for reading/displaying files but these status messages (which displays the current mode or reports back when the buffer is ...
5
votes
1
answer
1k
views
Grep search for text in an ISO-8859-1 encoded file
I'm attempting to use grep to search for text patterns from an ISO-8859-1 encoded file: wordsList
When I execute a search, all of the matches are returned, but the accented characters are stripped. ...
4
votes
1
answer
1k
views
How to identify a file encoding?
I'm trying to figure out the encoding of a text file. I did try a lot of the common ones (with Notepad++), but I've failed so far.
A few hints: The file was originally an Eudora mbx file, with mostly ...
2
votes
0
answers
277
views
ffmpeg printing unknown glyph between characters on utf-8 subtitle
FFmpeg is printing unknown glyphs between some two characters but the weird thing is it's not replacing a character that doesn't exist in the font, it's just printing a new unknown glygh and I can't ...
0
votes
0
answers
584
views
How to change Excel character set?
I have an automatically-generated CSV file which contains accented characters. These appear fine when the file is opened with Notepad++. However, accented characters appear mangled in Excel (e.g. é ...
0
votes
1
answer
1k
views
Wrong character encoding in ssh session – but not for all connectios
I have an odd issue when connecting to my (Ubuntu) server via SSH.
If I connect from my Gentoo box, all is fine. All Umlauts etc. work, I can type "ÄÖÜ" and so on.
If I do the same from my ...
0
votes
1
answer
911
views
How to read Linux text files in Windows system?
For example, I run the top command and store it to a file in Linux, after that I open that file in Windows it contains some gibberish. Here is the file viewed in Notepad++:
The option to convert to ...
1
vote
1
answer
281
views
does batch program support any ascii characters or is there's a way to create encoder and decoder program with another tools?
i wrote batch encoder and decoder from adrianvdh and customize some of the text string input, but the decoder one aren't working, because i put special symbols inside there.
here's the string of the ...
0
votes
1
answer
193
views
AWK: "invalid regexp: Invalid collation character" -- how do I make it valid?
I have an awk script that must process millions of records, but I need to remove any containing a multibyte character.
In one environment where I work, the following simplified shell sequence ...
1
vote
0
answers
267
views
How to use ISO8859-9 encoding in terminal?
I maked a file containing "ırmak" with a text editor via encoding ISO8859-9.
Then, I tried to print the content with "cat" command in the terminal. But I could not.
I use the ...
1
vote
1
answer
778
views
Convert Korean files that are showing up incorrectly to utf-8 - character shows Çѱ¹Ÿî
I was just about to ask this after a long time of searching so decided to answer my own question...
I downloaded Korean subtitles in an .smi file that was in zip archive. When I extracted it, the ...
0
votes
2
answers
672
views
Restoring corrupted UTF-8 files
After my PC broke down I managed to make a backup of the relevant files before reinstalling Windows.
Now that I'm restoring those files and setting the system up I noticed that some of the files got ...
0
votes
0
answers
984
views
Redirect the Windows console output of a Python script without raising UnicodeEncodeError
I have a Python script called scratch_1.py which can be simplified for the sake of the question to:
s = "∞"
print(s)
If I run the script in the Windows console, I get the expected output:
...