[
https://issues.apache.org/jira/browse/PDFBOX-335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783719#action_12783719
]
fpientka commented on PDFBOX-335:
---------------------------------
With Lucene 3.0 and
In PDFTextStripper(PDFStreamEngine).processOperator(PDFOperator, List) line:
520
method protected void processOperator( PDFOperator operator, List arguments
) i get
if (!unsupportedOperators.contains(operation))
log.info("unsupported/disabled operation: " + operation);
unsupportedOperators.add(operation);
INFO: unsupported/disabled operation: rg org.apache.pdfbox.util.PDFStreamEngine
processOperator
LucenePDFDocument.addContent(Document, InputStream, String) line: 413
Ao i've to use commons-logging, otherwise i get a Class not found Excpetion
> Version incompatibility with Lucene?
> ------------------------------------
>
> Key: PDFBOX-335
> URL: https://issues.apache.org/jira/browse/PDFBOX-335
> Project: PDFBox
> Issue Type: Bug
> Components: Lucene
> Priority: Minor
> Fix For: 0.8.0-incubator
>
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1887932
> Originally submitted by nobody on 2008-02-06 06:30.
> It may seem that the code that does text extraction for Lucene is not
> compatible with the later versions of Lucene.
> I had to revert all the way back to to Lucene 2.0.0 from 2006 in order to
> make PDF Box be able to parse PDF documents for Lucene.
> Manual recompilation of the PDFBox source is not really an option due to a
> large number of dependencies which we don't have the resources to keep track
> of.
> A recompilation and release compatible with Lucene 2.3.0 would be much
> appreciated!
> [email protected]
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.