
Questions tagged [character-encoding]

A character encoding system consists of a code that pairs each character from a given repertoire with something else — such as a bit pattern, sequence of natural numbers, octets, or electrical pulses — in order to facilitate the transmission of data (generally numbers or text) through telecommunication networks or for data storage.
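
As a rough illustration of that pairing, the sketch below prints the octets that a few common encodings assign to a single character. The sample character and the three encodings are assumptions picked for the example, and `octets` is a hypothetical helper, not part of any library.

```python
# Illustrative sketch: the sample character and the encodings listed here
# are assumptions chosen for the example.
def octets(text: str, encoding: str) -> str:
    """Return the bytes that `encoding` assigns to `text`, as spaced hex."""
    return text.encode(encoding).hex(" ")

sample = "é"
table = {name: octets(sample, name) for name in ("utf-8", "iso-8859-1", "utf-16-le")}

for name, hex_bytes in table.items():
    print(f"{name:>10}: {hex_bytes}")

# Reading UTF-8 bytes with the wrong encoding gives the familiar mangled text.
mangled = sample.encode("utf-8").decode("iso-8859-1")
print(repr(mangled))
```

The same character maps to different byte sequences under each encoding, which is why many of the questions below come down to bytes written in one encoding being read in another.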

0 votes
1 answer
220 views

UTF-8 Decoders fail to decode the encoded strings

I have some encoded values which I believe are UTF-8. Now I don't really know whether they are UTF-8 or not, because other online tools and steps to decode UTF-8 are not working, BUT an open source tool ...
Solo
  • 3
0 votes
1 answer
318 views

notepad is displaying txt file contents as weird symbols

I am writing integer values into a file from Kotlin (Kotlin Int type) using something like this: var1BufferedWriter?.write(String.format("%d\n", intvar ) ) var2BufferedWriter?.write(String....
user13267
  • 1,711
2 votes
1 answer
140 views

VIM uses wrong encoding - but only in status messages

I ran into a strange issue with my ArchLinux setup. Vim uses the correct encoding for reading/displaying files, but these status messages (which display the current mode or report back when the buffer is ...
Gabor Garami
5 votes
1 answer
1k views

Grep search for text in an ISO-8859-1 encoded file

I'm attempting to use grep to search for text patterns from an ISO-8859-1 encoded file: wordsList When I execute a search, all of the matches are returned, but the accented characters are stripped. ...
Jeffrey Bauer
4 votes
1 answer
1k views

How to identify a file encoding?

I'm trying to figure out the encoding of a text file. I tried a lot of the common ones (with Notepad++), but I've failed so far. A few hints: The file was originally a Eudora mbx file, with mostly ...
jmr
  • 536
2 votes
0 answers
277 views

ffmpeg printing unknown glyph between characters on utf-8 subtitle

FFmpeg is printing unknown glyphs between some characters, but the weird thing is it's not replacing a character that doesn't exist in the font, it's just printing a new unknown glyph and I can't ...
Shayan
  • 1,594
0 votes
0 answers
584 views

How to change Excel character set?

I have an automatically-generated CSV file which contains accented characters. These appear fine when the file is opened with Notepad++. However, accented characters appear mangled in Excel (e.g. é ...
dr_
  • 4,528
0 votes
1 answer
1k views

Wrong character encoding in ssh session – but not for all connections

I have an odd issue when connecting to my (Ubuntu) server via SSH. If I connect from my Gentoo box, all is fine. All Umlauts etc. work, I can type "ÄÖÜ" and so on. If I do the same from my ...
Tobias Leupold
0 votes
1 answer
911 views

How to read Linux text files in Windows system?

For example, I run the top command and store its output in a file on Linux. When I then open that file in Windows, it contains some gibberish. Here is the file viewed in Notepad++: The option to convert to ...
TeaViris
1 vote
1 answer
281 views

Does a batch program support any ASCII characters, or is there a way to create an encoder and decoder program with other tools?

I wrote a batch encoder and decoder based on the one from adrianvdh and customized some of the text string input, but the decoder isn't working because I put special symbols in it. Here's the string of the ...
0 votes
1 answer
193 views

AWK: "invalid regexp: Invalid collation character" -- how do I make it valid?

I have an awk script that must process millions of records, but I need to remove any containing a multibyte character. In one environment where I work, the following simplified shell sequence ...
Kyle Banerjee
1 vote
0 answers
267 views

How to use ISO8859-9 encoding in terminal?

I made a file containing "ırmak" with a text editor using the ISO8859-9 encoding. Then I tried to print the content with the "cat" command in the terminal, but I could not. I use the ...
user
  • 11
1 vote
1 answer
778 views

Convert Korean files that are showing up incorrectly to utf-8 - character shows Çѱ¹Ÿî

I was just about to ask this after a long time of searching, so I decided to answer my own question... I downloaded Korean subtitles in an .smi file that was in a zip archive. When I extracted it, the ...
iateadonut
0 votes
2 answers
672 views

Restoring corrupted UTF-8 files

After my PC broke down I managed to make a backup of the relevant files before reinstalling Windows. Now that I'm restoring those files and setting the system up I noticed that some of the files got ...
wileecoyote
0 votes
0 answers
984 views

Redirect the Windows console output of a Python script without raising UnicodeEncodeError

I have a Python script called scratch_1.py which can be simplified for the sake of the question to: s = "∞" print(s) If I run the script in the Windows console, I get the expected output: ...
Wok
  • 187
