I think the Parser interface Javadoc would make sense as a place to document, 
but I don't know if there is an existing policy.

We'll certainly need to consider things like DelegatingParsers which may be 
using other parsers to do portions of the work.

Not the principle comment you were looking for, but my 2 cents.

Ray

On Jun 7, 2013, at 7:30 AM, Christian Reuschling <reuschl...@dfki.uni-kl.de> 
wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> it would be very interesting if somebody has a principle comment on this 
> thread...
> 
> 
> On 29.05.2013 14:42, Nick Burch wrote:
>> On Wed, 29 May 2013, Christian Reuschling wrote:
>>> Nevertheless, in this case an Exception (like in all other parsers) or a 
>>> tika body with
>>> length zero, which is indicated at least by handler.endDocument() would be 
>>> the appropriate
>>> way, isn't it? - From the ContentHandlers point of view, there is nothing 
>>> in between.
>> 
>> I'm not sure if we do have a properly documented policy on what a parser 
>> should do if it
>> receives a file it can't handle. For ones that are invalid (eg corrupt), I 
>> believe an exception
>> is the expected result. The case when the file seems valid, but can't be 
>> handled by the parser,
>> not sure
>> 
>> Does anyone know if we have a policy on this, and/or where we should 
>> document it?
>> 
>> Nick
> 
> - -- 
> ______________________________________________________________________________
> Christian Reuschling, Dipl.-Ing.(BA)
> Software Engineer
> 
> Knowledge Management Department
> German Research Center for Artificial Intelligence DFKI GmbH
> Trippstadter Straße 122, D-67663 Kaiserslautern, Germany
> 
> Phone: +49.631.20575-1250
> mailto:reuschl...@dfki.de  http://www.dfki.uni-kl.de/~reuschling/
> 
> - ------------Legal Company Information Required by German 
> Law------------------
> Geschäftsführung: Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
>                  Dr. Walter Olthoff
> Vorsitzender des Aufsichtsrats: Prof. Dr. h.c. Hans A. Aukes
> Amtsgericht Kaiserslautern, HRB 2313=
> ______________________________________________________________________________
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v2.0.19 (GNU/Linux)
> Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/
> 
> iEYEARECAAYFAlGxxFkACgkQ6EqMXq+WZg91CgCffJoxohycTUP0F2ha9djqAQbp
> tRAAoIbAkUjqZujYM/BHINMmbhNswir9
> =a1xL
> -----END PGP SIGNATURE-----

Reply via email to