On Wed, 25 Jun 2014, Andrew Skiba wrote:
Let me check I understand you right. WordExtractor will continue to create
<img src="embedded:filename.jpg"/>

Yes, as will (should..) the other parsers which find embedded resources

and call the ImageParser once for every file name.

No. It'll call your code, as you'll have registered your code as the EmbeddedDocumentExtractor to call for embedded resources like images.

(If there isn't one, then a ParsingEmbeddedDocumentExtractor is used, which calls the default parser, which is how it ends up in ImageParser if you're recursing)

Nick

Reply via email to