Jukka Zitting wrote:

> I'd rather go with:
> 
>     void parse(InputStream stream, ContentHandler handler, Metadata metadata)
>         throws IOException, SAXException, TikaException;
> 
> I.e. the parser invokes a series of callback methods on the given
> handler instance. This way the parse result never needs to be
> contained in a single object.

Does this mean Tika users need to implement "parser" (ContentHandler)
that can handle events fired by Tika Parser. One for each format? Or do
we plan to normalize events somehow?

Or is Tika going to provide those handlers for simple tasks like
extracting title + content.


-- 
 Sami Siren

Reply via email to