Solr fields for Microsoft files, image files, PDF, text files

2017-09-24 Thread Phillip Wu
Hi, I'm starting out with Solr on a Windows box. I want to index the following documents: doc;docx xls;xlsx ppt vsd pdf txt gif;jpeg;tiff I undersand that solr uses Apache Tika to read these file types and return an xml stream back to Solr. For Tika image processing, I've loaded Tesseract.

Re: Solr SQL: standalone mode

2017-09-24 Thread Joel Bernstein
At Alfresco we are working on a version of Solr's SQL that works in non-Solr Cloud mode. The plan is to contribute this back to 7x branch. There will also be improvements to the SQL coverage committed back from Alfresco. Joel Bernstein http://joelsolr.blogspot.com/ On Sun, Sep 24, 2017 at 6:04

RE: Solr update failing on remote server but works locally??

2017-09-24 Thread Phil Scadden
Finally got it. Version difference between local and remote version of solr meant different defaults on some fields. Field type Long get DocValues even if you leave it unchecked and also changed to multivalue=false. This results in "cannot change DocValues type from SORTED_SET to NUMERIC for

Solr SQL: standalone mode

2017-09-24 Thread Pavel Micka
Hi, I read in the documentation that executing Solr SQL is possible only in SolrCloud mode. The thing is that we have unfortunatelly some installations, which simply can't have multiple nodes (too small instances). Is it somehow possible to workaround this restriction or is there at least any

Re: overwrite the parameter query in DIH

2017-09-24 Thread solr2020
Thanks Erick. We are almost done with the full indexing part using DIH. So if we change the approach using Solrj now, we need to re-write the entire thing. So is there any way to overwrite the DIH data config file. -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html