On 10/12/07, Keith R. Bennett <[EMAIL PROTECTED]> wrote:

> ...Regarding the properties that the parser can output, the parser will only
> see the InputStream, not the File or URL from which it came.  So we can have
> it output size and some other information, but not the name....

Unless the filename is included in the input metadata.

Besides possibly giving hints to the parsers, this input metadata
could also contain any useful information that the user wants to
include in the generated metadata.

The simplest thing to do might be to copy the input metadata to the
output, unless its value is overwritten by the Tika parsing.

-Bertrand

Reply via email to