Re: any difference between using collection vs. shard in URL?

2014-11-06 Thread Ramkumar R. Aiyengar
Do keep one thing in mind though. If you are already doing the work of
figuring out the right shard leader (through SolrJ or otherwise), using
that location with just the collection name might be suboptimal if there
are multiple shard leaders present in the same instance -- the collection
name just goes to *some* shard leader and not necessarily to the one where
your document is destined. If it chooses the wrong one, it will lead to an
extra HTTP request from the instance to itself.
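
To make that concrete, here is a minimal Python sketch of choosing the update URL. It assumes you have already worked out the leader's node address and core name yourself (nothing here talks to Solr or ZooKeeper). The helper name `update_url` and the host `solr-node1` are made up for illustration; the core name follows the pattern from the original question.

```python
from typing import Optional


def update_url(base_url: str, collection: str,
               leader_core: Optional[str] = None) -> str:
    """Build the URL to post updates to.

    If the caller already knows the leader core for the target shard,
    address that core directly; otherwise fall back to the collection
    name and let Solr do the routing.
    """
    target = leader_core if leader_core else collection
    return f"{base_url.rstrip('/')}/{target}/update"


# Hypothetical node address, for illustration only.
node = "http://solr-node1:8983/solr"

# Leader already known: address the core, so the node does not have to
# pick a shard leader on our behalf.
print(update_url(node, "alpha", "alpha_shard4_replica1"))

# Leader not known: the collection URL works, at the cost of a
# possible forward.
print(update_url(node, "alpha"))
```

The only design choice here is the fallback: when the leader is unknown, the collection URL still works and Solr forwards the document where it needs to go.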
On 5 Nov 2014 15:33, "Shalin Shekhar Mangar"  wrote:

> There's no difference between the two. Even if you send updates to a shard
> URL, it will still be forwarded to the right shard leader according to the
> hash of the id (assuming you're using the default compositeId router). Of
> course, if you happen to hit the right shard leader then it is just an
> internal forward and not an extra network hop.
>
> The advantage with using the collection name is that you can hit any
> SolrCloud node (even the ones not hosting this collection) and it will
> still work. So for a non-Java client, a load balancer can be set up in front
> of the entire cluster and things will just work.
>
> On Wed, Nov 5, 2014 at 8:50 PM, Ian Rose  wrote:
>
> > If I add some documents to a SolrCloud shard in a collection "alpha", I can
> > post them to "/solr/alpha/update".  However I notice that you can also post
> > them using the shard name, e.g. "/solr/alpha_shard4_replica1/update" - in
> > fact this is what Solr seems to do internally (like if you send documents
> > to the wrong node so Solr needs to forward them over to the leader of the
> > correct shard).
> >
> > Assuming you *do* always post your documents to the correct shard, is there
> > any difference between these two, performance or otherwise?
> >
> > Thanks!
> > - Ian
> >
>
>
>
> --
> Regards,
> Shalin Shekhar Mangar.
>


Re: any difference between using collection vs. shard in URL?

2014-11-05 Thread Ian Rose
Awesome, thanks.  That's what I was hoping.

Cheers,
Ian




Re: any difference between using collection vs. shard in URL?

2014-11-05 Thread Shalin Shekhar Mangar
There's no difference between the two. Even if you send updates to a shard
URL, it will still be forwarded to the right shard leader according to the
hash of the id (assuming you're using the default compositeId router). Of
course, if you happen to hit the right shard leader then it is just an
internal forward and not an extra network hop.
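
As a rough illustration of what "forwarded according to the hash of the id" means, here is a toy Python sketch. It is not Solr's implementation: `zlib.crc32` is only a stand-in for Solr's own hash function, the even split of the hash space is an assumption, and the helper names `build_ranges` and `shard_for` are invented for this example. The idea it shows is that each shard owns a slice of the hash space and a document goes to whichever shard owns the hash of its id.

```python
import zlib


def build_ranges(num_shards):
    """Split the unsigned 32-bit hash space into contiguous ranges.

    Assumption for illustration: the ranges are equal in size, with the
    last shard absorbing any remainder.
    """
    size = 2**32 // num_shards
    ranges = []
    for i in range(num_shards):
        start = i * size
        end = 2**32 - 1 if i == num_shards - 1 else (i + 1) * size - 1
        ranges.append((f"shard{i + 1}", start, end))
    return ranges


def shard_for(doc_id, ranges):
    """Return the name of the shard whose range contains hash(doc_id)."""
    # Stand-in hash, NOT the one Solr uses; it just needs to be
    # deterministic and spread ids over 0 .. 2**32 - 1.
    h = zlib.crc32(doc_id.encode("utf-8"))
    for name, start, end in ranges:
        if start <= h <= end:
            return name
    raise ValueError("hash fell outside every range")


ranges = build_ranges(4)
for doc_id in ("doc1", "doc2", "doc3"):
    print(doc_id, "->", shard_for(doc_id, ranges))
```

Whichever node receives the update can do this lookup, which is why the entry point does not change where the document ends up.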

The advantage with using the collection name is that you can hit any
SolrCloud node (even the ones not hosting this collection) and it will
still work. So for a non-Java client, a load balancer can be set up in front
of the entire cluster and things will just work.
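
For example, a non-Java client behind a load balancer could look roughly like the Python sketch below. The load balancer address `solr-lb.example.com` and the helper name `build_update_request` are hypothetical. The request is only built, not sent, and commit handling is left out.

```python
import json
import urllib.request


def build_update_request(lb_base_url, collection, docs):
    """Build a POST request that sends docs to the collection's
    update handler as a JSON array of documents."""
    url = f"{lb_base_url.rstrip('/')}/{collection}/update"
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical load balancer sitting in front of every SolrCloud node.
req = build_update_request(
    "http://solr-lb.example.com:8983/solr",
    "alpha",
    [{"id": "doc1", "title": "hello"}],
)
print(req.get_method(), req.full_url)

# To actually send it (needs a reachable cluster):
# with urllib.request.urlopen(req) as resp:
#     print(resp.status)
```

Because the URL names only the collection, the client does not need to know which nodes host which shards.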




-- 
Regards,
Shalin Shekhar Mangar.


any difference between using collection vs. shard in URL?

2014-11-05 Thread Ian Rose
If I add some documents to a SolrCloud shard in a collection "alpha", I can
post them to "/solr/alpha/update".  However I notice that you can also post
them using the shard name, e.g. "/solr/alpha_shard4_replica1/update" - in
fact this is what Solr seems to do internally (like if you send documents
to the wrong node so Solr needs to forward them over to the leader of the
correct shard).

Assuming you *do* always post your documents to the correct shard, is there
any difference between these two, performance or otherwise?

Thanks!
- Ian