[Pidgin] #1269: Stripping HTML to Plain Text Discards Content Between "<" and ">"

Pidgin trac at pidgin.im
Fri May 25 00:08:18 EDT 2007


#1269: Stripping HTML to Plain Text Discards Content Between "<" and ">"
--------------------------+-------------------------------------------------
 Reporter:  sstangl       |       Type:  defect
   Status:  new           |   Priority:  minor 
Component:  pidgin (gtk)  |    Version:  2.0   
 Keywords:                |    Pending:  0     
--------------------------+-------------------------------------------------
 I'm using logging in text-only format. When I paste six lines or so of
 conversation, the conversation text is sent as HTML, and there is no bug.
 However, when I paste eight or more lines of conversation, the message is
 stripped of formatting in the compose space, and it is thereafter that
 this bug occurs.

 This '''only''' happens when the pasted text is from an ongoing
 conversation. If, for example, I would paste the examples below into a
 conversation window, they would be sent in their entirety.

 Pasting the following text into a conversation window:
 {{{
 (11:46:12 PM) Meh: foo <
 (11:46:12 PM) Meh: foo <
 (11:46:13 PM) Meh: bar
 (11:46:13 PM) Meh: >
 (11:46:14 PM) Meh: >
 (11:46:14 PM) Meh: baz
 (11:46:14 PM) Meh: baz
 }}}
 ...produces the following output sent to the recipient:
 {{{
 (11:46:12 PM) Meh: foo
 (11:46:14 PM) Meh: >
 (11:46:14 PM) Meh: baz
 (11:46:14 PM) Meh: baz
 }}}
 ...where all the inside text is lost (presumably gobbled up when parsing
 for HTML).

 As far as I can tell, the error is that when the message is too large, the
 stripping of HTML to convert the HTML conversation to plain text comes
 after the conversion of entities, such that < and > are converted to
 their ascii equivalents and therefore everything between them is
 inclusively treated as HTML.

-- 
Ticket URL: <http://developer.pidgin.im/ticket/1269>
Pidgin <http://pidgin.im>
Pidgin


More information about the Tracker mailing list