Re: Inconsistent number of hits returned by two solr instances (from the same index!)

2013-11-07 Thread Roman Chyla
Thanks Michael, haven't tried that yet.

Anybody has suggestions on what might be the problem there? SOLR cache?
DiskI/O? Something else..?

--roman


On Wed, Nov 6, 2013 at 9:41 PM, Michael Della Bitta 
michael.della.bi...@appinions.com wrote:

 Wow, that's pretty weird. Have you tried turning logging down to debug and
 seeing if anything interesting shakes out?

 Michael Della Bitta

 Applications Developer

 o: +1 646 532 3062  | c: +1 917 477 7906

 appinions inc.

 “The Science of Influence Marketing”

 18 East 41st Street

 New York, NY 10017

 t: @appinions https://twitter.com/Appinions | g+:
 plus.google.com/appinions
 https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
 
 w: appinions.com http://www.appinions.com/


 On Wed, Nov 6, 2013 at 6:40 PM, Roman Chyla roman.ch...@gmail.com wrote:

  No, and I should add that this query was not against shards, just a one
  single index (and we dont use timeouts).
 
  --roman
 
 
  On Wed, Nov 6, 2013 at 5:28 PM, Michael Della Bitta 
  michael.della.bi...@appinions.com wrote:
 
   Does the header in the response indicate you're getting partialResults?
  
  
 
 http://help.websolr.com/kb/common-problems/why-am-i-getting-partial-results
  
   Michael Della Bitta
  
   Applications Developer
  
   o: +1 646 532 3062  | c: +1 917 477 7906
  
   appinions inc.
  
   “The Science of Influence Marketing”
  
   18 East 41st Street
  
   New York, NY 10017
  
   t: @appinions https://twitter.com/Appinions | g+:
   plus.google.com/appinions
  
 
 https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
   
   w: appinions.com http://www.appinions.com/
  
  
   On Wed, Nov 6, 2013 at 4:23 PM, Roman Chyla roman.ch...@gmail.com
  wrote:
  
Hello,
   
We have two solr searchers/instances (read-only). They read the same
   index,
but they did not return the same #hits for a particular query
   
Log is below, but to summarize: first server always returns 576 hits,
  the
second server returns: 440, 440, 576, 576...
   
These are just few seconds apart. Load balancer directed requests to
  both
servers. Both servers report the same numHits for other queries. I
   checked
that nothing re-opened index, there was no errorthis is SOLR 4.0
  (we
should update, I know), running CentOS, the index lives on a RAID5
   mounted
volume, both instances just read it (the index wasn't updated while
  these
searches happened).
   
Anybody has a pointer, I can't really understand it. Can it be a bug?
   
Thanks,
   
  roman
   
   
   
If you look at the log below, you will see 9002 instance always
 returns
   576
hits, but 9003 instance is returning 440, 440, 65, 576
   
   
-bash-4.1$ grep -a -e 'Jones,+Christine+year:1990-2100'
./perpetuum/live-9002/solr-logging-0.log | grep -m 5 '2013-11-06 13:'
2013-11-06 13:10:47 INFO org.apache.solr.core.SolrCore execute
[collection1] webapp=/solr path=/select
   
   
  
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=400q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
hits=576 status=0 QTime=155
2013-11-06 13:17:05 INFO org.apache.solr.core.SolrCore execute
[collection1] webapp=/solr path=/select
   
   
  
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=trueq=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
hits=576 status=0 QTime=89
2013-11-06 13:17:06 INFO org.apache.solr.core.SolrCore execute
[collection1] webapp=/solr path=/select
   
   
  
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=200q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
hits=576 status=0 QTime=87
2013-11-06 13:21:50 INFO org.apache.solr.core.SolrCore execute
[collection1] webapp=/solr path=/select
   
   
  
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=200q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
hits=576 status=0 QTime=86
2013-11-06 13:21:51 

Re: Inconsistent number of hits returned by two solr instances (from the same index!)

2013-11-06 Thread Michael Della Bitta
Does the header in the response indicate you're getting partialResults?

http://help.websolr.com/kb/common-problems/why-am-i-getting-partial-results

Michael Della Bitta

Applications Developer

o: +1 646 532 3062  | c: +1 917 477 7906

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions https://twitter.com/Appinions | g+:
plus.google.com/appinionshttps://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
w: appinions.com http://www.appinions.com/


On Wed, Nov 6, 2013 at 4:23 PM, Roman Chyla roman.ch...@gmail.com wrote:

 Hello,

 We have two solr searchers/instances (read-only). They read the same index,
 but they did not return the same #hits for a particular query

 Log is below, but to summarize: first server always returns 576 hits, the
 second server returns: 440, 440, 576, 576...

 These are just few seconds apart. Load balancer directed requests to both
 servers. Both servers report the same numHits for other queries. I checked
 that nothing re-opened index, there was no errorthis is SOLR 4.0 (we
 should update, I know), running CentOS, the index lives on a RAID5 mounted
 volume, both instances just read it (the index wasn't updated while these
 searches happened).

 Anybody has a pointer, I can't really understand it. Can it be a bug?

 Thanks,

   roman



 If you look at the log below, you will see 9002 instance always returns 576
 hits, but 9003 instance is returning 440, 440, 65, 576


 -bash-4.1$ grep -a -e 'Jones,+Christine+year:1990-2100'
 ./perpetuum/live-9002/solr-logging-0.log | grep -m 5 '2013-11-06 13:'
 2013-11-06 13:10:47 INFO org.apache.solr.core.SolrCore execute
 [collection1] webapp=/solr path=/select

 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=400q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
 hits=576 status=0 QTime=155
 2013-11-06 13:17:05 INFO org.apache.solr.core.SolrCore execute
 [collection1] webapp=/solr path=/select

 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=trueq=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
 hits=576 status=0 QTime=89
 2013-11-06 13:17:06 INFO org.apache.solr.core.SolrCore execute
 [collection1] webapp=/solr path=/select

 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=200q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
 hits=576 status=0 QTime=87
 2013-11-06 13:21:50 INFO org.apache.solr.core.SolrCore execute
 [collection1] webapp=/solr path=/select

 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=200q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
 hits=576 status=0 QTime=86
 2013-11-06 13:21:51 INFO org.apache.solr.core.SolrCore execute
 [collection1] webapp=/solr path=/select

 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=400q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
 hits=576 status=0 QTime=87


 -bash-4.1$ grep -a -e 'Jones,+Christine+year:1990-2100'
 ./perpetuum/live-9003/solr-logging-0.log | grep -m 5 '2013-11-06 13:'
 2013-11-06 13:10:46 INFO org.apache.solr.core.SolrCore execute
 [collection1] webapp=/solr path=/select

 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=trueq=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
 hits=440 status=0 QTime=144
 2013-11-06 13:10:46 INFO org.apache.solr.core.SolrCore execute
 [collection1] webapp=/solr path=/select

 

Re: Inconsistent number of hits returned by two solr instances (from the same index!)

2013-11-06 Thread Roman Chyla
No, and I should add that this query was not against shards, just a one
single index (and we dont use timeouts).

--roman


On Wed, Nov 6, 2013 at 5:28 PM, Michael Della Bitta 
michael.della.bi...@appinions.com wrote:

 Does the header in the response indicate you're getting partialResults?

 http://help.websolr.com/kb/common-problems/why-am-i-getting-partial-results

 Michael Della Bitta

 Applications Developer

 o: +1 646 532 3062  | c: +1 917 477 7906

 appinions inc.

 “The Science of Influence Marketing”

 18 East 41st Street

 New York, NY 10017

 t: @appinions https://twitter.com/Appinions | g+:
 plus.google.com/appinions
 https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
 
 w: appinions.com http://www.appinions.com/


 On Wed, Nov 6, 2013 at 4:23 PM, Roman Chyla roman.ch...@gmail.com wrote:

  Hello,
 
  We have two solr searchers/instances (read-only). They read the same
 index,
  but they did not return the same #hits for a particular query
 
  Log is below, but to summarize: first server always returns 576 hits, the
  second server returns: 440, 440, 576, 576...
 
  These are just few seconds apart. Load balancer directed requests to both
  servers. Both servers report the same numHits for other queries. I
 checked
  that nothing re-opened index, there was no errorthis is SOLR 4.0 (we
  should update, I know), running CentOS, the index lives on a RAID5
 mounted
  volume, both instances just read it (the index wasn't updated while these
  searches happened).
 
  Anybody has a pointer, I can't really understand it. Can it be a bug?
 
  Thanks,
 
roman
 
 
 
  If you look at the log below, you will see 9002 instance always returns
 576
  hits, but 9003 instance is returning 440, 440, 65, 576
 
 
  -bash-4.1$ grep -a -e 'Jones,+Christine+year:1990-2100'
  ./perpetuum/live-9002/solr-logging-0.log | grep -m 5 '2013-11-06 13:'
  2013-11-06 13:10:47 INFO org.apache.solr.core.SolrCore execute
  [collection1] webapp=/solr path=/select
 
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=400q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
  hits=576 status=0 QTime=155
  2013-11-06 13:17:05 INFO org.apache.solr.core.SolrCore execute
  [collection1] webapp=/solr path=/select
 
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=trueq=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
  hits=576 status=0 QTime=89
  2013-11-06 13:17:06 INFO org.apache.solr.core.SolrCore execute
  [collection1] webapp=/solr path=/select
 
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=200q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
  hits=576 status=0 QTime=87
  2013-11-06 13:21:50 INFO org.apache.solr.core.SolrCore execute
  [collection1] webapp=/solr path=/select
 
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=200q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
  hits=576 status=0 QTime=86
  2013-11-06 13:21:51 INFO org.apache.solr.core.SolrCore execute
  [collection1] webapp=/solr path=/select
 
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=400q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
  hits=576 status=0 QTime=87
 
 
  -bash-4.1$ grep -a -e 'Jones,+Christine+year:1990-2100'
  ./perpetuum/live-9003/solr-logging-0.log | grep -m 5 '2013-11-06 13:'
  2013-11-06 13:10:46 INFO org.apache.solr.core.SolrCore execute
  [collection1] webapp=/solr path=/select
 
 
 

Re: Inconsistent number of hits returned by two solr instances (from the same index!)

2013-11-06 Thread Michael Della Bitta
Wow, that's pretty weird. Have you tried turning logging down to debug and
seeing if anything interesting shakes out?

Michael Della Bitta

Applications Developer

o: +1 646 532 3062  | c: +1 917 477 7906

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions https://twitter.com/Appinions | g+:
plus.google.com/appinionshttps://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
w: appinions.com http://www.appinions.com/


On Wed, Nov 6, 2013 at 6:40 PM, Roman Chyla roman.ch...@gmail.com wrote:

 No, and I should add that this query was not against shards, just a one
 single index (and we dont use timeouts).

 --roman


 On Wed, Nov 6, 2013 at 5:28 PM, Michael Della Bitta 
 michael.della.bi...@appinions.com wrote:

  Does the header in the response indicate you're getting partialResults?
 
 
 http://help.websolr.com/kb/common-problems/why-am-i-getting-partial-results
 
  Michael Della Bitta
 
  Applications Developer
 
  o: +1 646 532 3062  | c: +1 917 477 7906
 
  appinions inc.
 
  “The Science of Influence Marketing”
 
  18 East 41st Street
 
  New York, NY 10017
 
  t: @appinions https://twitter.com/Appinions | g+:
  plus.google.com/appinions
 
 https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
  
  w: appinions.com http://www.appinions.com/
 
 
  On Wed, Nov 6, 2013 at 4:23 PM, Roman Chyla roman.ch...@gmail.com
 wrote:
 
   Hello,
  
   We have two solr searchers/instances (read-only). They read the same
  index,
   but they did not return the same #hits for a particular query
  
   Log is below, but to summarize: first server always returns 576 hits,
 the
   second server returns: 440, 440, 576, 576...
  
   These are just few seconds apart. Load balancer directed requests to
 both
   servers. Both servers report the same numHits for other queries. I
  checked
   that nothing re-opened index, there was no errorthis is SOLR 4.0
 (we
   should update, I know), running CentOS, the index lives on a RAID5
  mounted
   volume, both instances just read it (the index wasn't updated while
 these
   searches happened).
  
   Anybody has a pointer, I can't really understand it. Can it be a bug?
  
   Thanks,
  
 roman
  
  
  
   If you look at the log below, you will see 9002 instance always returns
  576
   hits, but 9003 instance is returning 440, 440, 65, 576
  
  
   -bash-4.1$ grep -a -e 'Jones,+Christine+year:1990-2100'
   ./perpetuum/live-9002/solr-logging-0.log | grep -m 5 '2013-11-06 13:'
   2013-11-06 13:10:47 INFO org.apache.solr.core.SolrCore execute
   [collection1] webapp=/solr path=/select
  
  
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=400q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
   hits=576 status=0 QTime=155
   2013-11-06 13:17:05 INFO org.apache.solr.core.SolrCore execute
   [collection1] webapp=/solr path=/select
  
  
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=trueq=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
   hits=576 status=0 QTime=89
   2013-11-06 13:17:06 INFO org.apache.solr.core.SolrCore execute
   [collection1] webapp=/solr path=/select
  
  
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=200q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
   hits=576 status=0 QTime=87
   2013-11-06 13:21:50 INFO org.apache.solr.core.SolrCore execute
   [collection1] webapp=/solr path=/select
  
  
 
 params={hl.requireFieldMatch=truefl=bibcode,pubdate,keyword,author,property,abstract,bibstem,citation_count,pub,[citations],volume,database,aff,grants,year,id,title,identifier,issue,page,doisort=citation_count+descindent=truestart=200q=author:Jones,+Christine+year:1990-2100hl.usePhraseHighlighter=truehl.maxAnalyzedChars=15wt=jsonfq=database:astronomyrows=200}
   hits=576 status=0 QTime=86
   2013-11-06 13:21:51 INFO org.apache.solr.core.SolrCore execute
   [collection1] webapp=/solr path=/select