Re: [Pidgin] #5498: Umlaute like äöü get transformed to dv| from pidgin to trillian

Pidgin trac at pidgin.im
Fri Apr 11 11:19:21 EDT 2008


#5498: Umlaute like äöü get transformed to dv| from pidgin to trillian
---------------------+------------------------------------------------------
  Reporter:  atlanx  |       Owner:  MarkDoliner    
      Type:  defect  |      Status:  new            
  Priority:  minor   |   Milestone:                 
 Component:  ICQ     |     Version:  2.4.1          
Resolution:          |    Keywords:  umlaute, coding
   Pending:  0       |  
---------------------+------------------------------------------------------
Comment (by jd49):

 If äöü becomes dv|, the sending client is using CP1252 or a similar
 encoding, and the high (most significant) bit of each byte is being
 cleared from 1 to 0.

 ä (CP1252) = '''1'''110 0100 --> d (ASCII) = '''0'''110 0100[[BR]]
 ö (CP1252) = '''1'''111 0110 --> v (ASCII) = '''0'''111 0110[[BR]]
 ü (CP1252) = '''1'''111 1100 --> | (ASCII) = '''0'''111 1100
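
 The bit patterns above can be reproduced with a short Python sketch. It
 only illustrates the suspected effect (masking each byte with 0x7F); it
 is not taken from the Pidgin or Trillian code, and the variable names
 are made up for the example.

```python
# Sketch: clear the high bit of each CP1252 byte, as a 7-bit path would.
text = "äöü"
raw = text.encode("cp1252")               # bytes 0xE4, 0xF6, 0xFC
stripped = bytes(b & 0x7F for b in raw)   # mask off bit 7 of every byte
result = stripped.decode("ascii")
print(result)  # dv|
```

 The output matches the characters reported in the ticket, which is
 consistent with the high bit being lost somewhere between the two
 clients.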

 I think this happens because the receiving client is using an encoding
 that ignores the high bit, i.e. one that effectively contains every
 ASCII character twice but no non-ASCII characters.

 deryni: AFAIK, encoding boxes do accept multiple encodings, at least on
 IRC. But it might be different on ICQ.

 atlanx: CP1251 is an obsolete Cyrillic encoding.[[BR]]
 ANSI code pages for Latin scripts are:[[BR]]
 - CP1252 (Western Europe)[[BR]]
 - CP1250 (Eastern Europe)[[BR]]
 - CP1254 (Turkish)[[BR]]
 - CP1257 (Baltic Countries)[[BR]]
 - CP1258 (Vietnamese)[[BR]]
 What you need to use for German is CP1252.
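
 To see why CP1251 is the wrong choice for German, here is a small
 Python sketch using the standard codecs. It is only an illustration of
 the code page difference, not of what either client does internally;
 the variable names are hypothetical.

```python
# Sketch: how "äöü" behaves under CP1252 versus CP1251.
raw = "äöü".encode("cp1252")

# Reading CP1252 bytes as CP1251 yields Cyrillic letters, not umlauts.
as_cp1251 = raw.decode("cp1251")
print(as_cp1251)  # дць

# CP1251 has no code points for the umlauts, so encoding them fails.
try:
    "äöü".encode("cp1251")
    cp1251_can_encode = True
except UnicodeEncodeError:
    cp1251_can_encode = False
print(cp1251_can_encode)  # False
```

 Note that a plain CP1252/CP1251 mix-up would produce Cyrillic letters
 rather than dv|, so it would not by itself explain the symptom in this
 ticket.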

-- 
Ticket URL: <http://developer.pidgin.im/ticket/5498#comment:3>
Pidgin <http://pidgin.im>