[ https://issues.apache.org/jira/browse/PDFBOX-533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12757270#action_12757270 ]
Navendu Garg edited comment on PDFBOX-533 at 9/18/09 10:14 AM:
---------------------------------------------------------------
System details:
JDK: 1.5_20
RAM: 2GB
Processor: Intel(R) Core Duo (T7300, 2.00 GHz)
PDFBox Version: 0.7.4-dev (this was available briefly on the PDFBox site a
long time ago)
I ran the test with a 256 MB heap and the -server option.
Converting the PDF to text took about 27 s on average over 10 runs.
I have attached the code I used to run this test.
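
For readers without the attachment, the measurement described above (10 runs, averaged) can be sketched roughly as follows. This is not the attached TestPDFTextStripperPerf.java; the class name TimingHarness and the method averageMillis are hypothetical, the workload is a stand-in for the real PDF-to-text call, and the lambda syntax assumes a newer JDK than the 1.5 release listed above.

```java
import java.util.concurrent.Callable;

// Hypothetical helper for illustration; not part of PDFBox and not the attached test.
final class TimingHarness {

    /** Runs the task the given number of times and returns the mean wall-clock time in ms. */
    static double averageMillis(Callable<?> task, int runs) throws Exception {
        if (runs <= 0) {
            throw new IllegalArgumentException("runs must be positive");
        }
        long totalNanos = 0L;
        for (int i = 0; i < runs; i++) {
            long start = System.nanoTime();
            task.call();
            totalNanos += System.nanoTime() - start;
        }
        // Convert the summed nanoseconds to milliseconds, then average over the runs.
        return totalNanos / 1_000_000.0 / runs;
    }

    public static void main(String[] args) throws Exception {
        // Stand-in workload (assumption): replace the body with the actual
        // PDF-to-text conversion under test.
        Callable<String> task = () -> {
            StringBuilder sb = new StringBuilder();
            for (int i = 0; i < 100_000; i++) {
                sb.append(i);
            }
            return sb.toString();
        };
        double avg = averageMillis(task, 10);
        System.out.printf("average over 10 runs: %.2f ms%n", avg);
    }
}
```

A run matching the settings mentioned in the comment would look something like `java -server -Xmx256m TimingHarness` (the exact command line used for the original test is not stated). Note that this simple loop includes JIT warm-up in the average, so the first runs weigh on the result.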
> PDFTextStripper.writeCharacters is called no where in the class
> ---------------------------------------------------------------
>
> Key: PDFBOX-533
> URL: https://issues.apache.org/jira/browse/PDFBOX-533
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.8.0-incubator
> Reporter: Navendu Garg
> Attachments: TestPDFTextStripperPerf.java
>
>
> It seems the writeCharacters method is not called anywhere in the
> PDFTextStripper class. This makes it impossible to handle individual
> character TextPosition objects or line separators in a subclass, because
> the processLineSeparator method is no longer there and writeLineSeparator
> is called only when the actual writing happens.
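
The problem reported in the description can be illustrated with a toy model. The classes below (HookDemo, ToyStripper, CountingStripper) are hypothetical and do not reproduce PDFBox's real code; they only show why an overridable method that the base class never calls is useless as an extension point.

```java
import java.util.Arrays;
import java.util.List;

// Toy model only: hypothetical classes, not PDFBox's actual implementation.
class HookDemo {
    public static void main(String[] args) {
        CountingStripper stripper = new CountingStripper();
        String text = stripper.process(Arrays.asList("first line", "second line"));
        System.out.print(text);
        System.out.println("hook calls: " + stripper.hookCalls);
    }
}

class ToyStripper {
    protected final StringBuilder out = new StringBuilder();

    // Intended extension point: subclasses override this to see each piece of text.
    protected void writeCharacters(String text) {
        out.append(text);
    }

    protected void writeLineSeparator() {
        out.append('\n');
    }

    // The driver appends text directly and never calls writeCharacters,
    // so any override of that method is never reached.
    String process(List<String> lines) {
        for (String line : lines) {
            out.append(line); // bypasses the hook
            writeLineSeparator();
        }
        return out.toString();
    }
}

class CountingStripper extends ToyStripper {
    int hookCalls = 0;

    @Override
    protected void writeCharacters(String text) {
        hookCalls++;
        super.writeCharacters(text);
    }
}
```

In this sketch the override in CountingStripper is never invoked, because process writes to the buffer itself. Routing the append through writeCharacters(line) would restore the extension point.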