[ 
https://issues.apache.org/jira/browse/MIME4J-283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16750987#comment-16750987
 ] 

ASF GitHub Bot commented on MIME4J-283:
---------------------------------------

Github user hirthwork closed the pull request at:

    https://github.com/apache/james-mime4j/pull/26


> DecoderUtil performance fix
> ---------------------------
>
>                 Key: MIME4J-283
>                 URL: https://issues.apache.org/jira/browse/MIME4J-283
>             Project: James Mime4j
>          Issue Type: Improvement
>          Components: parser (core)
>    Affects Versions: master, 0.8.2
>            Reporter: Dmitry Potapov
>            Priority: Minor
>         Attachments: patch
>
>
> DecoderUtil currently uses the following regex pattern to match RFC 2047
> encoded words:
> {code:java}
> "(.*?)=\\?(.+?)\\?(\\w)\\?(.*?)\\?="
> {code}
> The first capturing group, {{(.*?)}}, is very expensive: it forces the
> matcher to evaluate the next pattern node at every input character. Because
> of this, decoding a 4 KB input (a {{To:}} field with 40-80 recipients) takes
> up to 200 ms on modern CPUs.
> At the same time, this capturing group is used only to store the separator
> text between encoded words. The proposed patch reuses the existing
> {{tailIndex}} to extract the separator text, and decoding the same input now
> takes only 1-2 ms.
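
The idea in the description can be illustrated with a minimal sketch. This is not the attached patch: the class name SeparatorSketch and the method tokenize are hypothetical, and no decoding is performed. Only the pattern (with the leading group dropped) and the tailIndex name come from the issue text. The separator text is recovered from match offsets instead of a capturing group:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration class, not part of Mime4j.
class SeparatorSketch {
    // Pattern from the issue, minus the leading reluctant group (.*?).
    static final Pattern ENCODED_WORD =
            Pattern.compile("=\\?(.+?)\\?(\\w)\\?(.*?)\\?=");

    /** Splits input into alternating separator text and raw encoded words. */
    static List<String> tokenize(String input) {
        List<String> tokens = new ArrayList<>();
        int tailIndex = 0; // end offset of the previous match
        Matcher m = ENCODED_WORD.matcher(input);
        while (m.find()) {
            // Separator text lies between the previous match and this one.
            String separator = input.substring(tailIndex, m.start());
            if (!separator.isEmpty()) {
                tokens.add(separator);
            }
            tokens.add(m.group()); // the raw encoded word, left undecoded here
            tailIndex = m.end();
        }
        // Anything after the last encoded word is trailing plain text.
        if (tailIndex < input.length()) {
            tokens.add(input.substring(tailIndex));
        }
        return tokens;
    }

    public static void main(String[] args) {
        String header = "To: =?UTF-8?B?SGVsbG8h?= <a@example.org>, "
                + "=?UTF-8?Q?Bob?= <b@example.org>";
        for (String token : tokenize(header)) {
            System.out.println("[" + token + "]");
        }
    }
}
```

With Matcher.find() the regex engine only has to locate the next "=?" and match from there, so the text between encoded words is never captured by the pattern at all.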



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
