Re: Simple search low speed

2015-04-29 Thread Norgorn
In case s1 will face the same problem, the thing was that SOLR caches were
turned off, and I underestimated the meaning of caches in desire to save as
much RAM as possible.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Simple-search-low-speed-tp4202135p4202975.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Simple search low speed

2015-04-26 Thread Tomasz Borek
Perhaps belated and IANA Solr expert, but take a look at mergeFactor and
ramBufferSizeMB. I've had a situation where after Solr upgrade old values
were kept. Those are used for buffering during indexing AFAIR. If buffer
size is exceeded, merge happens. If merge factor is exceeded, coalesced
segments are (AFAIR) written to IO, thus you might see high CPU.

pozdrawiam,
LAFK

2015-04-24 14:39 GMT+02:00 Joel Bernstein joels...@gmail.com:

 Try breaking down the query to see which part of it is slow. If it turns
 out to be the range query you may want to look into using an frange
 postfilter.

 Joel Bernstein
 http://joelsolr.blogspot.com/

 On Fri, Apr 24, 2015 at 6:50 AM, Norgorn lsunnyd...@mail.ru wrote:

  Thanks for your reply.
 
  Yes, 100% CPU is used by SOLR (100% - I mean 1 core, not all cores), I'm
  totally sure.
 
  I have more than 80 GB RAM on test machine and about 50 is cached as disk
  cache, SOLR uses about 8, Xmx=40G.
 
  I use GC1, but it can't be the problem, cause memory usage is much lower
  than GC start limit (45% of heap).
 
  I think, the problem can be in fully optimized index, and search over one
  big segment is much slower than parallel search over lot of segments, but
  it
  sounds weird, so I'm not sure.
  Setups with big indexes which I know are all with optimized indexes.
 
  Index scheme:
  field name=text type=text_general indexed=true stored=false
  termVectors=true termPositions=true termOffsets=true /
 field name=title type=text_general indexed=true stored=false
  termVectors=true termPositions=true termOffsets=true /
 
  field name=cat type=tint indexed=true stored=true
  multiValued=false required=true omitNorms=true
  omitTermFreqAndPositions=true/
 
  field name=date type=date indexed=true stored=false
  omitNorms=true omitTermFreqAndPositions=true/
 
  field name=field1 type=int indexed=true stored=true
  required=false omitNorms=true omitTermFreqAndPositions=true/
  field name=field2 type=int indexed=true stored=true
  required=false omitNorms=true omitTermFreqAndPositions=true/
 
 
 
  --
  View this message in context:
 
 http://lucene.472066.n3.nabble.com/Simple-search-low-speed-tp4202135p4202157.html
  Sent from the Solr - User mailing list archive at Nabble.com.
 



Simple search low speed

2015-04-24 Thread Norgorn
We have simple search over 50 GB index.
And it's slow.
I can't even wonder why, whole index is in RAM (and a lot of free space is
available) and CPU is a bottleneck (100% load).

The query is simple (except tvrh):

q=(text:(word1+word2)++title:(word1+word2))tv=trueisShard=trueqt=/tvrhfq=cat:(10+11+12)fq=field1:(150)fq=field2:(0)fq=date:[2015-01-01T00:00:00Z+TO+2015-04-24T23:59:59Z]

text, title - text_general fields
cat, field1, field2 - tint fields
date - a date field (I know, it's deprecated, will be changed soon).
All fields are indexed, some of them are stored.

And search time is 15 seconds (for warmed searcher, it's not the first
query).

debug=true shows timings process={time=15382.0,query={time=15282.0}

What can I check?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Simple-search-low-speed-tp4202135.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Simple search low speed

2015-04-24 Thread Norgorn
Thanks for your reply.

Yes, 100% CPU is used by SOLR (100% - I mean 1 core, not all cores), I'm
totally sure.

I have more than 80 GB RAM on test machine and about 50 is cached as disk
cache, SOLR uses about 8, Xmx=40G.

I use GC1, but it can't be the problem, cause memory usage is much lower
than GC start limit (45% of heap).

I think, the problem can be in fully optimized index, and search over one
big segment is much slower than parallel search over lot of segments, but it
sounds weird, so I'm not sure.
Setups with big indexes which I know are all with optimized indexes.

Index scheme:
field name=text type=text_general indexed=true stored=false
termVectors=true termPositions=true termOffsets=true /
   field name=title type=text_general indexed=true stored=false
termVectors=true termPositions=true termOffsets=true /

field name=cat type=tint indexed=true stored=true
multiValued=false required=true omitNorms=true
omitTermFreqAndPositions=true/

field name=date type=date indexed=true stored=false
omitNorms=true omitTermFreqAndPositions=true/

field name=field1 type=int indexed=true stored=true
required=false omitNorms=true omitTermFreqAndPositions=true/
field name=field2 type=int indexed=true stored=true
required=false omitNorms=true omitTermFreqAndPositions=true/



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Simple-search-low-speed-tp4202135p4202157.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Simple search low speed

2015-04-24 Thread Norgorn
The number of documents in collection is about 100m.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Simple-search-low-speed-tp4202135p4202152.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Simple search low speed

2015-04-24 Thread Tomasz Borek
Java side:
- launch jvisualvm
- see how heap and CPU are occupied

What are your JVM settings (heap) and how much RAM do you have?

The CPU100% is used only by Solr? That is, are you 100% certain it's Solr
that drives CPU to it's limit?

pozdrawiam,
LAFK

2015-04-24 12:14 GMT+02:00 Norgorn lsunnyd...@mail.ru:

 The number of documents in collection is about 100m.



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Simple-search-low-speed-tp4202135p4202152.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: Simple search low speed

2015-04-24 Thread Joel Bernstein
Try breaking down the query to see which part of it is slow. If it turns
out to be the range query you may want to look into using an frange
postfilter.

Joel Bernstein
http://joelsolr.blogspot.com/

On Fri, Apr 24, 2015 at 6:50 AM, Norgorn lsunnyd...@mail.ru wrote:

 Thanks for your reply.

 Yes, 100% CPU is used by SOLR (100% - I mean 1 core, not all cores), I'm
 totally sure.

 I have more than 80 GB RAM on test machine and about 50 is cached as disk
 cache, SOLR uses about 8, Xmx=40G.

 I use GC1, but it can't be the problem, cause memory usage is much lower
 than GC start limit (45% of heap).

 I think, the problem can be in fully optimized index, and search over one
 big segment is much slower than parallel search over lot of segments, but
 it
 sounds weird, so I'm not sure.
 Setups with big indexes which I know are all with optimized indexes.

 Index scheme:
 field name=text type=text_general indexed=true stored=false
 termVectors=true termPositions=true termOffsets=true /
field name=title type=text_general indexed=true stored=false
 termVectors=true termPositions=true termOffsets=true /

 field name=cat type=tint indexed=true stored=true
 multiValued=false required=true omitNorms=true
 omitTermFreqAndPositions=true/

 field name=date type=date indexed=true stored=false
 omitNorms=true omitTermFreqAndPositions=true/

 field name=field1 type=int indexed=true stored=true
 required=false omitNorms=true omitTermFreqAndPositions=true/
 field name=field2 type=int indexed=true stored=true
 required=false omitNorms=true omitTermFreqAndPositions=true/



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Simple-search-low-speed-tp4202135p4202157.html
 Sent from the Solr - User mailing list archive at Nabble.com.