I created the issue 1210 for that feature. On Tue, Jan 17, 2012 at 3:04 PM, Edson Alves Pereira <[email protected]>wrote:
> right! > > > On Tue, Jan 17, 2012 at 1:22 PM, Martinez, Mel - 1004 - MITLL < > [email protected]> wrote: > >> Three things:**** >> >> ** ** >> >> **1) **Great Idea!**** >> >> **2) **‘Probably should be proposed through a Jira ‘new feature’ >> issue. You can attach the files to the issue. >> https://issues.apache.org/jira/browse/PDFBOX **** >> >> **3) **I would recommend it be proposed and implemented as a >> subclass of PDFTextStripper (i.e., something like >> “ObservablePDFTextStripper” instead of wired into that class directly. >> Almost all of PDFTextStripper can be overridden in a subclass so you should >> be able to fully instrument this in a subclass. There is a tiny but real >> performance hit for broadcasting events (even if you have no listeners) and >> it would be my preference that we do not introduce that overhead into the >> main PDFTextStripper class. Our group uses PDFTextStripper to process a >> large amount of documents so performance is important to us.**** >> >> ** ** >> >> Cheers,**** >> >> ** ** >> >> Mel**** >> >> ** ** >> >> *From:* Edson Alves Pereira [mailto:[email protected]] >> *Sent:* Tuesday, January 17, 2012 8:48 AM >> *To:* [email protected] >> *Cc:* Raul Abreu Leite >> *Subject:* Adding support to listener while parsing pdf to text - >> PDFTextStripper.java**** >> >> ** ** >> >> To help users get more interaction over the parsing processes.**** >> > >
