Re: Restarting SolrCloud that is taking realtime updates

2016-11-25 Thread Jichi Guo
Thanks so much for the very quick and detailed explanation, Erick! According to the following page, it seems numRecordsToKeep cannot be too high that must fit in a singe POST. It seems your 1> or 3> approaches would be the best in pratical when the number of updated documents is high.

Re: Restarting SolrCloud that is taking realtime updates

2016-11-25 Thread Erick Erickson
First, get out of thinking about the replication API, things like DISABLEPOLL and the like when in SolrCloud mode. The "old style" replication is used under the control of the synching strategy. Unless you've configured master/slave sections of your solrconfig.xml files and somehow dealt with the

Restarting SolrCloud that is taking realtime updates

2016-11-25 Thread Jichi Guo
Hi, I am seeking for the best practice to restart a sharded SolrCloud that taking search traffic as well as realtime updates without downtime. When I deploy new customized Solr plugins,for example, it will require restarting the whole SolrCloud cluster. I am testing Solr 6.2.1 with 4

Re: Search opening hours

2016-11-25 Thread O. Klein
Thank you for your reply David. Yes, I ended up using a DateRangeField. Down side is that it needs frequent updates. Luckily not an issue for my use case. BTW how could I abuse DateRangeField for non-date data? david.w.smi...@gmail.com wrote > I just saw this conversation now. I didn't read

Re: Wildcard searches with space in TextField/StrField

2016-11-25 Thread Ahmet Arslan
Hi, You could try this: drop wildcard stuff altogether: 1) Employ edgengramfilter at index time. 2) Use plain searches at query time. Ahmet On Friday, November 25, 2016 4:59 PM, Sandeep Khanzode wrote: Hi All, Can someone please assist with this query?

Re: Query parser behavior with AND and negative clause

2016-11-25 Thread Sandeep Khanzode
WORKS: +{!field f=dateRange1 op=Intersects v='[2016-11-22T12:01:00Z TO  2016-11-22T13:59:00Z]'} +(*:* -{!field f=dateRange2 op=Contains  v='[2016-11-22T12:01:00Z TO 2016-11-22T13:59:00Z]'}) +ConstantScore(IntersectsPrefixTreeFilter(fieldName=dateRange1,queryShape=[2016-11-22T12:01 TO

Re: Data Import Request Handler isolated into its own project - any suggestions?

2016-11-25 Thread Marek Ščevlík
I forgot to mention I am creating a jar file beside of a running solr 6.3 instance to which I am hoping to attach with java via the SolrDispatchFilter to get at the cores and so then I could work with data in code. 2016-11-25 19:31 GMT+01:00 Marek Ščevlík : > Hi

Re: Data Import Request Handler isolated into its own project - any suggestions?

2016-11-25 Thread Marek Ščevlík
Hi Daniel. Thanks for a reply. I wonder is it now still possibly with release of Solr 6.3 to get hold of a running instance of the jetty server that is part of the solution? I found some code for previous versions where it was captured with this code and one could then obtain cores for a running

Re: AW: AW: Resync after restart

2016-11-25 Thread Pushkar Raste
Did you index any documents while node was being restarted? There was a issue introduced due to IndexFingerprint comparison. Check SOLR-9310. I am not sure if fix made it to Solr6.2 On Nov 25, 2016 3:51 AM, "Arkadi Colson" wrote: > I am using SolrCloud on version 6.2.1. I

Re: Solr 6 Performance Suggestions

2016-11-25 Thread Max Bridgewater
Thanks folks. It looks like the sweet spot where I get comparable results is at 30 concurrent threads. It progressively degrades from there as I increases the number of concurrent threads in the test script. This made me think that something is configured in Tomcat ((Solr4) that is not

Re: Import from S3

2016-11-25 Thread Tom Evans
On Fri, Nov 25, 2016 at 7:23 AM, Aniket Khare wrote: > You can use Solr DIH for indexing csv data into solr. > https://wiki.apache.org/solr/DataImportHandler > Seems overkill when you can simply post CSV data to the UpdateHandler, using either the post tool:

Using Solr CDCR with HdfsDirectoryFactory

2016-11-25 Thread ZHOU Ran (SAFRAN IDENTITY AND SECURITY)
Hello Hi All, I have followed the guide "Cross Data Center Replication (CDCR)" and get my source collection replicated to the target. And then I tried to use HDFS as storage for both Solr clusters, but failed with the following error message: ERROR: Failed to create collection 'collection11'

Re: Wildcard searches with space in TextField/StrField

2016-11-25 Thread Sandeep Khanzode
Hi All, Can someone please assist with this query? My data consists of: 1.] John Doe 2.] John V. Doe 3.] Johnson Doe 4.] Johnson V. Doe 5.] John Smith 6.] Johnson V. Smith 7.] Matt Doe 8.] Matt V. Doe 9.] Matt Doe 10.] Matthew V. Doe 11.] Matthew Smith 12.] Matthew V. Smith Querying ... (a)

Re: Zookeeper version

2016-11-25 Thread Novin Novin
Thanks guys. On Thu, 24 Nov 2016 at 17:03 Erick Erickson wrote: > Well, 3.4.6 gets the most testing, so if you want to upgrade it's at > your own risk. > > See: https://issues.apache.org/jira/browse/SOLR-8724, there are > problems with 3.4.8 in the Solr context for

Re: AW: AW: Resync after restart

2016-11-25 Thread Arkadi Colson
I am using SolrCloud on version 6.2.1. I will upgrade to 6.3.0 next week. This is the current config for numVersionBuckets: ${solr.ulog.dir:} name="numVersionBuckets">${solr.ulog.numVersionBuckets:65536} Are you saying that I should not use the config below on SolrCloud?