Hi,
I submitted a new JIRA for this:
https://issues.apache.org/jira/browse/SOLR-5092

A (very initial) patch is already attached. Reviews are very welcome.


On Sun, Jul 28, 2013 at 4:50 PM, Erick Erickson <erickerick...@gmail.com>wrote:

> You'd probably start in CloudSolrServer in SolrJ code,
> as far as I know that's where the request is sent out.
>
> I'd think that would be better than changing Solr itself
> since if you found that this was useful you wouldn't
> be patching your Solr release, just keeping your client
> up to date.
>
> Best
> Erick
>
> On Sat, Jul 27, 2013 at 7:28 PM, Isaac Hebsh <isaac.he...@gmail.com>
> wrote:
> > Shawn, thank you for the tips.
> > I know the significant cons of virtualization, but I don't want to move
> > this thread into a virtualization pros/cons in the Solr(Cloud) case.
> >
> > I've just asked what is the minimal code change should be made, in order
> to
> > examine whether this is a possible solution or not.. :)
> >
> >
> > On Sun, Jul 28, 2013 at 1:06 AM, Shawn Heisey <s...@elyograg.org> wrote:
> >
> >> On 7/27/2013 3:33 PM, Isaac Hebsh wrote:
> >> > I have about 40 shards. repFactor=2.
> >> > The cause of slower shards is very interesting, and this is the main
> >> > approach we took.
> >> > Note that in every query, it is another shard which is the slowest. In
> >> 20%
> >> > of the queries, the slowest shard takes about 4 times more than the
> >> average
> >> > shard qtime.
> >> > While continuing investigation, remember it might be the
> virtualization /
> >> > storage-access / network / gc /..., so I thought that reducing the
> effect
> >> > of the slow shards might be a good (temporary or permanent) solution.
> >>
> >> Virtualization is not the best approach for Solr.  Assuming you're
> >> dealing with your own hardware and not something based in the cloud like
> >> Amazon, you can get better results by running on bare metal and having
> >> multiple shards per host.
> >>
> >> Garbage collection is a very likely source of this problem.
> >>
> >> http://wiki.apache.org/solr/SolrPerformanceProblems#GC_pause_problems
> >>
> >> > I thought it should be an almost trivial code change (for proving the
> >> > concept). Isn't it?
> >>
> >> I have no idea what you're saying/asking here.  Can you clarify?
> >>
> >> It seems to me that sending requests to all replicas would just increase
> >> the overall load on the cluster, with no real benefit.
> >>
> >> Thanks,
> >> Shawn
> >>
> >>
>

Reply via email to