Re: Resume Solr indexing CSV after exception

2010-06-11 Thread Brad Greenlee
Why not just use the onError attribute on entity? The default is to abort, but you can also specify skip to skip the current document, or continue to continue as if the error never happened. See http://wiki.apache.org/solr/DataImportHandler#Schema_for_the_data_config Brad On Fri, Jun 11, 2010 at

Re: TikaEntityProcessor not working?

2010-06-04 Thread Brad Greenlee
You are my hero. I replaced the Tika 0.8 snapshots that were included with Solr with 0.6 and it works now. Thank you! Brad On Jun 3, 2010, at 6:22 AM, David George wrote: Which version of Tika do you have? There was a problem introduced somewhere between Tika 0.6 and Tika 0.7 whereby the

Re: TikaEntityProcessor not working?

2010-05-31 Thread Brad Greenlee
It is a file. Only the filename is stored in the database. Brad On May 31, 2010, at 2:59 AM, Noble Paul നോബിള്‍ नो ब्ळ् noble.p...@corp.aol.com wrote: BinFileDataSource will only work with file, Try FieldStreamDataSource On Mon, May 31, 2010 at 3:30 AM, Brad Greenlee b...@footle.org

TikaEntityProcessor not working?

2010-05-30 Thread Brad Greenlee
Hi. I'm trying to get Solr to index a database in which one column is a filename of a PDF document I'd like to index. My configuration looks like this: dataConfig dataSource name=ds-db driver=com.mysql.jdbc.Driver url=jdbc:mysql://localhost/document_db user=user password=password