Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Ravi Solr
Erick I fixed the "missing content stream" issue as well. by making sure Iam not adding empty list. However, My very first issue of getting zero docs once in a while is still haunting me, even after using cursorMarkers, disabling auto commit and soft commit. I ran code two times and you can see

SetBoost for dismax query

2015-09-26 Thread Gajendra Dadheech
Hi, As i can see in solr code, DisJunctionMaxQuery extends Query, so by default is inherits the functionaltiy. So Query class has one method called SetBoost, this is behaving very odd when applied on a DisMax Query. Can someone guide me regarding how calling SetBoost on a dismax query would

Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Shawn Heisey
On 9/25/2015 10:10 PM, Ravi Solr wrote: > thank you for taking time to help me out. Yes I was not using cursorMark, I > will try that next. This is what I was doing, its a bit shabby coding but > what can I say my brain was fried :-) FYI this is a side process just to > correct a messed up string.

Re: start solr 5.3.1 under windows and admmin GUI show 5.2.1 is running

2015-09-26 Thread Shawn Heisey
On 9/26/2015 8:31 AM, Szűcs Roland wrote: > I downloaded the latest version of solr to my computer. When I started > solr as a standalone proccess on the default port two strange things > happened: > 1. I got an error message : Failed to parse command line arguments due > to: Unrecognized option:

Re: SetBoost for dismax query

2015-09-26 Thread Doug Turnbull
Hi Gajendra What do you mean by "behaving odd"? What odd behavior do you see? My colleague John Berryman and I have seen many ways dismax has "behaved odd" and we've tried to write about them (it quite often surprises me too)

Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Shawn Heisey
On 9/26/2015 10:41 AM, Shawn Heisey wrote: > 30 This needs to include openSearcher=false, as Erick mentioned. I'm sorry I screwed that up: 30 false Thanks, Shawn

Re: faceting is unusable slow since upgrade to 5.3.0

2015-09-26 Thread Mikhail Khludnev
Uwe, Would you mind to provide a few details about your case? I wonder about number of bigterms and other stats as well at 'author' field (ant other most expensive facets). It looks like log rows: Sep 13, 2011 2:51:53 PM org.apache.solr.request.UnInvertedField uninvert INFO: UnInverted

Re: SetBoost for dismax query

2015-09-26 Thread Gajendra Dadheech
Hey Doug, Thanks for reply. Details regarding my use case: I need to query on two fields in my indexes, and will consider score only for the one which has highest score. So i create two boolean queries [these queries are made up of other differet tokens, so chose boolean query] on those two

Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Erick Erickson
Well, let's forget the cursormark stuff for a bit. There's no reason you should be getting all 1.4 million rows. Presumably you've been running this program occasionally and blanking strings like "sun.org.mozilla.javascript.internal.NativeString:" in the uuid field. Then you turn around and run

Sleep function query a big security hole?

2015-09-26 Thread Doug Turnbull
I noticed a while back that "sleep" is a function query. Which I believe means I can make the current query thread sleep for as long as I like. I'm guessing an attacker could use this to starve Solr of threads, running a denial of service attack by running multiple queries with sleeps in them.

Re: faceting is unusable slow since upgrade to 5.3.0

2015-09-26 Thread Mikhail Khludnev
Uwe, As a workaround, can you add facet.threads=Ncores to count fields in parallel? Also, setting fcs method for single value fields runs per segment faceting in parallel. Of course, fields which has small number of terms are beneficial from enum method. Excuse me if it's obvious.

Re: Sleep function query a big security hole?

2015-09-26 Thread Doug Turnbull
Relevant code http://grepcode.com/file/repo1.maven.org/maven2/org.apache.solr/solr-core/5.2.0/org/apache/solr/search/ValueSourceParser.java#126 On Saturday, September 26, 2015, Doug Turnbull < dturnb...@opensourceconnections.com> wrote: > I noticed a while back that "sleep" is a function query.

Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Ravi Solr
Erick & Shawn I incrporated your suggestions. 0. Shut off all other indexing processes. 1. As Shawn mentioned set batch size to 1. 2. Loved Erick's suggestion about not using filter at all and sort by uniqueId and put last known uinqueId as next queries start while still using cursor marks

New Project setup too clunky

2015-09-26 Thread Mark Fenbers
Greetings, Being a Solr newbie, I've run the examples in the "Solr Quick Start" document and got a feel for Solr's capabilities. Now I want to move on and work with my own data and my own Solr server without using the example setup (i.e., "solr -e" options). This is where the documentation

start solr 5.3.1 under windows and admmin GUI show 5.2.1 is running

2015-09-26 Thread Szűcs Roland
Hi guys, I downloaded the latest version of solr to my computer. When I started solr as a standalone proccess on the default port two strange things happened: 1. I got an error message : Failed to parse command line arguments due to: Unrecognized option: -maxWaitSecs. I did not use any argumet

Re: SetBoost for dismax query

2015-09-26 Thread Doug Turnbull
Ah are you having an issue with Lucene code? You might do better at java-u...@lucene.apache.org That being said, do you have code you can share with the list that demonstrates your problem? -Doug On Sat, Sep 26, 2015 at 11:42 AM, Gajendra Dadheech wrote: > Hey Doug, > >

Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Erick Erickson
Oh, one more thing. _assuming_ you can't change the indexing process that gets the docs from the system of record, why not just add an update processor that does this at index time? See: https://cwiki.apache.org/confluence/display/solr/Update+Request+Processors, in particular the

Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Ravi Solr
Thank you Erick & Shawn for taking significant time off your weekends to debug and explain in great detail. I will try to address the main points from your emails to provide more situation context for better understanding of my situation 1. Erick, As part of our upgrade from 4.7.2 to 5.3.0 I

Re: query parsing

2015-09-26 Thread Erick Erickson
No need to re-install Solr, just create a new core, this time it'd probably be easiest to use the bin/solr create_core command. In the Solr directory just type bin/solr create_core -help to see the options. We're pretty much trying to migrate to using bin/solr for all the maintenance we can, but

Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Ravi Solr
Erick...There is only one type of String "sun.org.mozilla.javascript.internal.NativeString:" and no other variations of that in my index, so no question of missing it. Point taken regarding the CURSORMARK stuff, yes you are correct, my head so numb at this point after working 3 days on this, I

Re: bulk reindexing 5.3.0 issue

2015-09-26 Thread Erick Erickson
bq: 3. Erick, I wasnt getting all 1.4 mill in one shot. I was initially using 100 docs batch, which, I later increased to 500 docs per batch. Also it would not be a infinite loop if I commit for each batch, right !!?? That's not the point at all. Look at the basic logic here: You run for a while

Re: query parsing

2015-09-26 Thread Mark Fenbers
OK, a lot of dialog while I was gone for two days! I read the whole thread, but I'm a newbie to Solr, so some of the dialog was Greek to me. I understand the words, of course, but applying it so I know exactly what to do without screwing something else up is the problem. After all, that is