On Wed, 29 May 2013, Christian Reuschling wrote:
Nevertheless, in this case an Exception (like in all other parsers) or a tika body with length zero, which is indicated at least by handler.endDocument() would be the appropriate way, isn't it? - From the ContentHandlers point of view, there is nothing in between.

I'm not sure if we do have a properly documented policy on what a parser should do if it receives a file it can't handle. For ones that are invalid (eg corrupt), I believe an exception is the expected result. The case when the file seems valid, but can't be handled by the parser, not sure

Does anyone know if we have a policy on this, and/or where we should document it?

Nick

Reply via email to