Re: Grouping performance problem

2014-05-28 Thread arres
Hello there, 
I am facing the same problem.
Did anyone find a solution yet?
Thank you,
arres



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Grouping-performance-problem-tp3995245p4138419.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Grouping performance problem

2013-10-30 Thread shamik
Bumping this thread, as I'm facing a similar issue. Any solution?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Grouping-performance-problem-tp3995245p4098566.html


Re: Grouping performance problem

2013-04-15 Thread davidduffett
Agnieszka,

Did you find a good solution to your performance problem with grouping?  I
have an index with 45m records and am using grouping and the performance is
atrocious.

Any advice would be very welcome!

Thanks in advance,
David



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Grouping-performance-problem-tp3995245p4056113.html


Grouping performance problem

2012-07-16 Thread Agnieszka Kukałowicz
Hi,

Is there any way to make grouping searches more efficient?

My queries look like:
/select?q=query&group=true&group.field=id&group.facet=true&group.ngroups=true&facet.field=category1&facet.missing=false&facet.mincount=1

For an index with 3 million documents, a query for all docs with
group=true takes almost 4000 ms. Because queryResultCache is not used,
subsequent queries also take a long time.

When I remove group=true and leave only faceting, the query for all docs
takes much less time: ~700 ms the first time and only 200 ms on later runs,
because queryResultCache is used.

So with group=true the query is about 20 times slower than without it.
Is this expected, or is there any way to improve performance with grouping?

My application needs the grouping feature and all of its queries use it,
but their performance is too low for production use.

I use Solr 4.x from trunk.

Agnieszka Kukalowicz
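
For readers who want to reproduce the request, here is a minimal sketch of
how the query above could be assembled in Python. The parameter names come
from the message; the host, port and path in SOLR_SELECT_URL are assumptions
for illustration, and the helper name grouped_query_url is hypothetical.

```python
from urllib.parse import urlencode

# Assumed endpoint for illustration; the message itself only shows "/select".
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def grouped_query_url(query, base_url=SOLR_SELECT_URL):
    """Build the grouped, faceted request described in the message."""
    params = [
        ("q", query),
        ("group", "true"),
        ("group.field", "id"),
        ("group.facet", "true"),
        ("group.ngroups", "true"),
        ("facet.field", "category1"),
        ("facet.missing", "false"),
        ("facet.mincount", "1"),
    ]
    # urlencode joins the pairs with "&" and escapes values where needed.
    return base_url + "?" + urlencode(params)


print(grouped_query_url("query"))
```

Keeping the parameters in a list makes it easy to drop or change one of them
(for example group or group.ngroups) when comparing timings.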


Re: Grouping performance problem

2012-07-16 Thread Pavel Goncharik
Hi Agnieszka,

If you don't need the number of groups, you can try leaving out the
group.ngroups=true param.
In that case Solr apparently skips calculating all groups and delivers
results much faster.
At least for our application, the difference in performance
with/without group.ngroups=true is significant (I have to say, we use
Solr 3.6).

WBR,
Pavel
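
Pavel's suggestion amounts to sending the same request without the
group.ngroups parameter. A minimal sketch, assuming the request parameters
are held in a plain Python dict; the dict contents and the helper name
without_ngroups are hypothetical and not part of Solr.

```python
from urllib.parse import urlencode


def without_ngroups(params):
    """Return a copy of params with group.ngroups left out."""
    return {k: v for k, v in params.items() if k != "group.ngroups"}


# Example parameters; only a subset of the original request is shown.
params = {
    "q": "query",
    "group": "true",
    "group.field": "id",
    "group.ngroups": "true",
}

print(urlencode(without_ngroups(params)))
```

The original dict is left unchanged, so both variants can be sent and their
timings compared.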


Re: Grouping performance problem

2012-07-16 Thread Agnieszka Kukałowicz
Hi Pavel,

I tried with group.ngroups=false but didn't notice a big improvement.
The times were still about 4000 ms. It doesn't solve my problem.
Maybe this is because of my index type. I have millions of documents but
only about 20 000 groups.

 Cheers
 Agnieszka
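
To compare timings like the ones in this thread, one option is to read the
QTime value that Solr reports in the responseHeader of a JSON response
(wt=json). The thread does not say how the times were measured, so this is
only a sketch. It parses response text and does not contact a server; the
sample values are the 4000 ms and 200 ms figures from the original message,
wrapped in an assumed minimal response shape.

```python
import json


def qtime_ms(response_text):
    """Extract the server-side query time in milliseconds from a Solr JSON response."""
    return json.loads(response_text)["responseHeader"]["QTime"]


def slowdown(slow_ms, fast_ms):
    """How many times slower the first timing is than the second."""
    return slow_ms / fast_ms


# Figures quoted in the thread, placed in a minimal response header.
grouped = '{"responseHeader": {"status": 0, "QTime": 4000}}'
ungrouped_cached = '{"responseHeader": {"status": 0, "QTime": 200}}'

print(slowdown(qtime_ms(grouped), qtime_ms(ungrouped_cached)))
```

QTime reflects server-side processing only, so end-to-end times seen by a
client may be higher.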


Re: Grouping performance problem

2012-07-16 Thread alxsss
What is the RAM of your server and the size of the data folder?



Re: Grouping performance problem

2012-07-16 Thread Agnieszka Kukałowicz
I have a server with 24GB RAM. There are 4 shards on it, each with 4GB of
RAM for Java:
JAVA_OPTIONS=-server -Xms4096M -Xmx4096M
The index size is about 15GB per shard (I use an SSD for the index data).

Agnieszka
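
The figures in this message can be put side by side with a little
arithmetic. This sketch assumes that all four shards are about the same size
as the 15GB one quoted and that nothing else significant runs on the server;
both are assumptions, not statements from the thread.

```python
total_ram_gb = 24
shards = 4
heap_per_shard_gb = 4     # from -Xms4096M -Xmx4096M
index_per_shard_gb = 15   # "about 15GB" for one shard; assumed equal for all shards

total_heap_gb = shards * heap_per_shard_gb
ram_left_for_os_gb = total_ram_gb - total_heap_gb
total_index_gb = shards * index_per_shard_gb

print(f"JVM heap in total: {total_heap_gb} GB")
print(f"RAM left for the OS and its file cache: {ram_left_for_os_gb} GB")
print(f"Index size in total: {total_index_gb} GB")
```

Under these assumptions, roughly 8 GB would remain for the operating
system's file cache against about 60 GB of index, so much of the index would
be read from the SSD rather than from memory.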


2012/7/16 alx...@aim.com

 What are the RAM of your server and size of the data folder?



Re: Grouping performance problem

2012-07-16 Thread alxsss
This is strange. Our data folder is 24GB and Java has 2GB of RAM. We query
with grouping, ngroups and highlighting, do not request all fields, and query
time is mostly under 1 sec; it rarely goes up to 2 sec. We use Solr 3.6 and
have turned off all kinds of caching.
Maybe your problem is with caching and returning all fields?

Hope this helps.

Alex.
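
A request along the lines Alex describes (grouping with ngroups and
highlighting, but returning only some fields) could look roughly like the
sketch below. fl, hl and hl.fl are standard Solr request parameters; the
field names site, title and id are hypothetical, since the thread does not
name the fields used.

```python
from urllib.parse import urlencode

params = [
    ("q", "query"),
    ("group", "true"),
    ("group.field", "site"),    # hypothetical grouping field
    ("group.ngroups", "true"),
    ("hl", "true"),             # enable highlighting
    ("hl.fl", "title"),         # hypothetical field to highlight
    ("fl", "id,title"),         # return only these fields, not every stored field
]

url = "/select?" + urlencode(params)
print(url)
```

Restricting fl reduces how much stored data Solr has to load and send back
for each document, which is the point behind the question about returning
all fields.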


