+1 for me too.  That could be really useful.

BTW, I recently inspected the bytes of a Word document, and it still
contained text that I had already deleted, even though I hadn't used Fast
Save to save the changes.  Try this:

Create a new document containing:

____ is a dork.

Save it, then change "dork" to "genius".  Save the file again.  Inspect the
file using strings or another utility.  You still see "dork"! :)
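
For anyone who wants to repeat the experiment without strings(1), here is a
rough sketch of the same idea in Java.  It is not Tika code; the class and
method names (StringsSketch, extract) are made up for illustration.  It only
looks for runs of printable ASCII bytes, so text that Word stores as 16-bit
characters would be missed (GNU strings has "-e l" for that case).

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, loosely modelled on what strings(1) does by default.
public class StringsSketch {

    // Collects every run of at least minLength printable ASCII bytes (0x20..0x7e).
    public static List<String> extract(InputStream in, int minLength) throws IOException {
        List<String> runs = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int b;
        while ((b = in.read()) != -1) {
            if (b >= 0x20 && b <= 0x7e) {
                current.append((char) b);
            } else {
                // A non-printable byte ends the current run.
                if (current.length() >= minLength) {
                    runs.add(current.toString());
                }
                current.setLength(0);
            }
        }
        // Keep a run that reaches the end of the stream.
        if (current.length() >= minLength) {
            runs.add(current.toString());
        }
        return runs;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in bytes; a real test would read the saved .doc file instead.
        byte[] data = {0, 1, 'd', 'o', 'r', 'k', 0, 'g', 'e', 'n', 'i', 'u', 's', 0};
        for (String s : extract(new ByteArrayInputStream(data), 4)) {
            System.out.println(s);
        }
    }
}
```

A minimum length of 4 matches the strings(1) default; shorter runs are
mostly noise in binary files.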

Regarding the properties that the parser can output, the parser will only
see the InputStream, not the File or URL from which it came.  So we can have
it output size and some other information, but not the name.
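
To make that concrete, here is a minimal sketch of what can be derived from
the stream alone.  This is not the Tika parser interface; StreamProperties,
describe and the "size" key are names I made up for the example.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical example: properties that can be computed from an InputStream alone.
public class StreamProperties {

    public static Map<String, String> describe(InputStream in) throws IOException {
        long size = 0;
        byte[] buffer = new byte[8192];
        int n;
        // The size can be found simply by counting bytes as they are read.
        while ((n = in.read(buffer)) != -1) {
            size += n;
        }
        Map<String, String> props = new LinkedHashMap<>();
        props.put("size", Long.toString(size));
        // No "name" entry: an InputStream does not expose the File or URL
        // it was opened from, so the caller would have to supply it.
        return props;
    }
}
```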

- Keith



Bertrand Delacretaz wrote:
> 
> On 10/12/07, Jukka Zitting <[EMAIL PROTECTED]> wrote:
> 
>> ...We could perhaps do something like the Unix strings(1) command does,..
> 
> +1, good idea. I'm a big fan of strings(1).
> 
> -Bertrand
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Default-MIME-Type--tf4609978.html#a13177792
Sent from the Apache Tika - Development mailing list archive at Nabble.com.
