[ 
https://issues.apache.org/jira/browse/PDFBOX-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alfred updated PDFBOX-4877:
---------------------------
    Description: 
I am testing text extraction from PDF and profiling the execution.

I found that the third major time consumer is with matrix multiplicaitons.

The Matrix class spends large amounts of time copying results to new instances. 

Also, the if statements are slowing down execution as they kill performance in 
modern CPUs.

  was:
I am testing text extraction from PDF and profiling the execution.

I found that the third major time consumer is with matrix multiplicaitons.

The Matrix class spends large amounts of time copying results to new instances. 


> Matrix class performance improvements
> -------------------------------------
>
>                 Key: PDFBOX-4877
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4877
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing, Text extraction
>    Affects Versions: 2.0.20, 3.0.0 PDFBox
>            Reporter: Alfred
>            Priority: Major
>              Labels: Optimization
>         Attachments: PDFBOX-4877.patch
>
>   Original Estimate: 1m
>  Remaining Estimate: 1m
>
> I am testing text extraction from PDF and profiling the execution.
> I found that the third major time consumer is with matrix multiplicaitons.
> The Matrix class spends large amounts of time copying results to new 
> instances. 
> Also, the if statements are slowing down execution as they kill performance 
> in modern CPUs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to