Re: Solr Cell and Deduplication - Get ID of doc

2010-03-02 Thread Bill Engle
Thanks for the responses. This is exactly what I had to resort to. I will definitely put in a feature request to get the generated ID back from the extract request. I am doing this with PHP cURL for extraction and pecl php solr for querying. I am then saving the unique id and dupe hash in a MyS

Re: Solr Cell and Deduplication - Get ID of doc

2010-02-26 Thread Bill Engle
unreliable. My only option is to somehow return the id in the XML response. Any guidance is greatly appreciated. -Bill On Wed, Feb 24, 2010 at 12:06 PM, Bill Engle wrote: > Hi - > > New Solr user here. I am using Solr Cell to index files (PDF, doc, docx, > txt, htm, etc.) and the

Re: Solr Cell RTF Woes

2010-02-26 Thread Bill Engle
RTF is using Swing and AWT to analyze the rtf, these in turn will > attempt to use Graphics libraries, unless you use headless. > > > > -Original Message- > From: Bill Engle [mailto:billengle...@gmail.com] > Sent: 25 February 2010 19:09 > To: solr-user@lucene.apache.org

Solr Cell RTF Woes

2010-02-25 Thread Bill Engle
Any RTF file I tried to index in Solr 1.4 throws these errors out. I have no issues with doc, pdf. Any thoughts? Thanks. Apache Tomcat/6.0.18 - Error report