Bug Tracker item #2796390, was opened at 2009-05-25 13:33
Message generated for change (Comment added) made by sbajic
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=1126467&aid=2796390&group_id=250683

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Vadim Zeitlin (vadz)
Assigned to: Nobody/Anonymous (nobody)
Summary: poor performance of tokenization functions

Initial Comment:
At least the functions _ds_decode_quoted() and _ds_degenerate_message() have 
O(N^2) asymptotic performance (where N is the length of the input message) as 
they iterate over a string and use memmove() inside the loop.

In practice this means that processing a big but not huge (~2MB) base64-encoded 
message with a lot of HTML fragments inside it takes ages (~an hour) on a 
reasonably fast modern machine which is unacceptable.

----------------------------------------------------------------------

>Comment By: Stevan Bajic (sbajic)
Date: 2009-05-25 21:32

Message:
Could you attach such an message (not huge but ~2MB) triggering the long
decode time?
Do you have any recommendation how to enhance the decoding? Or maybe you
have a patch for speeding up the decoding?
I have tested other code doing quoted printable decoding and most of them
are slower then the one found in DSPAM. If you have any reference how the
decoding can be done faster then let me know.

btw: _ds_decode_quoted() is dealing with quoted-printable and not with
base64 encoding.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=1126467&aid=2796390&group_id=250683

------------------------------------------------------------------------------
Register Now for Creativity and Technology (CaT), June 3rd, NYC. CaT
is a gathering of tech-side developers & brand creativity professionals. Meet
the minds behind Google Creative Lab, Visual Complexity, Processing, & 
iPhoneDevCamp asthey present alongside digital heavyweights like Barbarian
Group, R/GA, & Big Spaceship. http://www.creativitycat.com 
_______________________________________________
Dspam-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspam-devel

Reply via email to