Am I understanding the DIH correctly in that it doesn't work with Blobs and or binary things? I'm basing this off of JdbcDataSource.getARow() which seems to be the place that populates the Map that is then passed to the Transformer.

One of the things that I think might be interesting is, as I'm integrating Tika, the notion of a Transformer that takes a blob and feeds it to Tika for parsing. In this way, people that store documents in databases (or download PDFs, etc.) can use the DIH to bring in more kinds of content.

Thoughts?

-Grant

Reply via email to