[ https://issues.apache.org/jira/browse/PDFBOX-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938560#comment-13938560 ]
John Hewson commented on PDFBOX-1987: ------------------------------------- {quote} An are which I kept out is how to handle malformed tokens such as strings which have an unbalanced number of parenthesis. {quote} Do you have any sample PDF files with this problem? > Provide a PDF Lexer as a base for PDF parsing > --------------------------------------------- > > Key: PDFBOX-1987 > URL: https://issues.apache.org/jira/browse/PDFBOX-1987 > Project: PDFBox > Issue Type: Improvement > Components: Parsing > Reporter: Maruan Sahyoun > Priority: Minor > Fix For: 2.0.0 > > Attachments: src.zip > > > In order to enhance the parsing process and as a foundation for a combination > of the different parsers a PDF lexer should be provided. -- This message was sent by Atlassian JIRA (v6.2#6252)