Re: [Pidgin] #5498: Umlaute like äöü get transformed to dv| from pidgin to trillian
Pidgin
trac at pidgin.im
Fri Apr 11 11:19:21 EDT 2008
#5498: Umlaute like äöü get transformed to dv| from pidgin to trillian
---------------------+------------------------------------------------------
Reporter: atlanx | Owner: MarkDoliner
Type: defect | Status: new
Priority: minor | Milestone:
Component: ICQ | Version: 2.4.1
Resolution: | Keywords: umlaute, coding
Pending: 0 |
---------------------+------------------------------------------------------
Comment (by jd49):
If äöü becomes dv|, the sending client is using CP1252 or a similar
encoding and the first bit of each byte is turned from 1 to 0.
ä (CP1252) = '''1'''110 0100 --> d (ASCII) ='''0'''110 0100[[BR]]
ö (CP1252) = '''1'''111 0110 --> v (ASCII) ='''0'''111 0110[[BR]]
ü (CP1252) = '''1'''111 1100 --> | (ASCII) ='''0'''111 1100
I think this happens because the receiving client is using an encoding
that doesn't care about the first bit, i. e. it contains all ascii
characters twice but no non-ascii characters.
deryni: Afaik, encoding boxes do accept multiple encodings, at least on
IRC. But it might be different on ICQ.
atlanx: CP1251 is an obsolete cyrillic encoding.[[BR]]
ANSI codepages for latin are:[[BR]]
- CP1252 (Western Europe)[[BR]]
- CP1250 (Eastern Europe)[[BR]]
- CP1254 (Turkish)[[BR]]
- CP1257 (Baltic Countries)[[BR]]
- CP1258 (Vietnamese)[[BR]]
What you need to use for German is CP1252.
--
Ticket URL: <http://developer.pidgin.im/ticket/5498#comment:3>
Pidgin <http://pidgin.im>
Pidgin
More information about the Tracker
mailing list