Susheel, thank you for asking. I am using joins of two cores (employee,
department, servicetickets), which isn't support by SolrCloud - last time I
check. Not sure if this (advanced distributed request option) was present
in 4.10. Do you think, I am missing something here?

Shahzad

On Tue, Feb 9, 2016 at 6:39 PM, Susheel Kumar <susheel2...@gmail.com> wrote:

> Shahzad - I am curious what features of distributed search stops you to run
> SolrCloud. Using DS, you would be able to search across cores or
> collections.
>
> https://cwiki.apache.org/confluence/display/solr/Advanced+Distributed+Request+Options
>
> Thanks,
> Susheel
>
> On Tue, Feb 9, 2016 at 12:10 AM, Shahzad Masud <
> shahzad.ma...@northbaysolutions.net> wrote:
>
> > Thank you Shawn for your response. I would be running some performance
> > tests lately on this structure (one JVM with multiple cores), and would
> > share feedback on this thread.
> >
> > >There IS a way to specify the solr home for a specific context, but keep
> > >in mind that I definitely DO NOT recommend doing this.  There is
> > >resource and administrative overhead to running multiple copies of Solr
> > >in one JVM.  Simply run one context and let it handle multiple shards,
> > >whether you choose SolrCloud or not.
> > Due to distributed search feature, I might not be able to run SolrCloud.
> I
> > would appreciate, if you please share that way of setting solr home for a
> > specific context in Jetty-Solr. Its good to seek more information for
> > comparison purposes. Do you think having multiple JVMs would increase or
> > decrease performance. My document base is around 20 million rows (in 24
> > shards), with document size ranging from 100KB - 400 MB.
> >
> > SM
> >
> > On Mon, Feb 8, 2016 at 8:09 PM, Shawn Heisey <apa...@elyograg.org>
> wrote:
> >
> > > On 2/8/2016 1:14 AM, Shahzad Masud wrote:
> > > > Thank you Shawn for your reply. Here is my structure of cores and
> > shards
> > > >
> > > > Shard 1 = localhost:8983/solr_2014 [3 Core  - Employee, Service
> > Tickets,
> > > > Departments]
> > > > Shard 2 = localhost:8983/solr_2015 [3 Core  - Employee, Service
> > Tickets,
> > > > Departments]
> > > > Shard 3 = localhost:8983/solr_2016 [3 Core  - Employee, Service
> > Tickets,
> > > > Departments]
> > > >
> > > > While searching, I use distributed search feature to search data from
> > all
> > > > three shards in respective cores e.g. If I want to search from
> Employee
> > > > data for all three years, I search from Employee core of three
> > contexts.
> > > > This is legacy design, do you think this is okay, or this require
> > > immediate
> > > > restructure / design? I am going to try this,
> > > >
> > > > Context = localhost:8982/solr (9 cores - Employee-2014,
> Employee-2015,
> > > > Employee-2016, ServiceTickets-2014, ServiceTickets-2015,
> > > > ServiceTickets-2016, Department-2014, Department-2015,
> Department-2016]
> > > > distributed search would be from all three cores of same data
> category
> > > > (i.e. For Employee search, it would be from Employee-2014,
> > Employee-2015,
> > > > Employee-2016).
> > >
> > > With SolrCloud, you can have multiple collections for each of these
> > > types and alias them together.  Or you can simply have one collection
> > > for employee, one for servicetickets, and one for department, with
> > > SolrCloud automatically handling splitting those documents into the
> > > number of shardsthat you specify when you create the collection.  You
> > > can also do manual sharding and split each collection on a time basis
> > > like you have been doing, but then you lose some of the automation that
> > > SolrCloud provides, so I do not recommend handling it that way.
> > >
> > > > Regarding one Solr context per jetty; I cannot run two solr contexts
> > > > pointing to different data in Jetty, as while starting jetty I have
> to
> > > > provide -Dsolr.solr.home variable - which ends up pointing to one
> data
> > > > folder (2014 data) only.
> > >
> > > You do not need multiple contexts to have multiple indexes.
> > >
> > > My dev Solr server has exactly one Solr JVM, with exactly one context
> --
> > > /solr.  That instance of Solr has 45 indexes (cores) on it.  These 45
> > > cores are various shards for three larger indexes.  I am not running
> > > SolrCloud, but I certainly could.
> > >
> > > You can see 25 of the 45 cores in my Solr instance in this screenshot
> of
> > > the admin UI for this server:
> > >
> > > https://www.dropbox.com/s/v87mxvkdejvd92h/solr-with-45-cores.png?dl=0
> > >
> > > There IS a way to specify the solr home for a specific context, but
> keep
> > > in mind that I definitely DO NOT recommend doing this.  There is
> > > resource and administrative overhead to running multiple copies of Solr
> > > in one JVM.  Simply run one context and let it handle multiple shards,
> > > whether you choose SolrCloud or not.
> > >
> > > Thanks,
> > > Shawn
> > >
> > >
> >
>

Reply via email to