To explain what a join does:

It goes over to the joined index, and executes a query. This results in
a list of "ids" that will be used to do a search on the main index. The
more of these ids there are, the worse performance will be. Thus, if you
have 100k documents that match in the join core, you will be doing a
100k term search, which will invariably be painful, because the more
terms you include in the search, the slower it will be.

How many matching docs do you have on the other side of your query?

Upayavira

On Tue, Sep 8, 2015, at 02:09 PM, Russell Taylor wrote:
> Hi,
>  I hope somebody can help.
> 
> We have two indexes, one which holds the descriptive data and the other
> one which holds lists of docs which are
> of a certain type (called universes in our world). They need to be joined
> together to show a list of data from indexA
> where a filtered indexB (by universe:value) has matching longs (The join
> field).
> 
> At the moment the query is taking 55 seconds we need to get it under a
> second, any help most appreciated.
> 
> INDEXES:
> 
> Index a (primary index)
> 31 million docs with a converted alphanumeric to a long value with a
> possible 10 million unique values.
> 
> Index B (the joined index)
> 250 million documents with a converted alphanumeric to a long value with
> a possible 10 million unique values.
> IndexB is filtered by universe which could be between 1 and 500,000 docs.
> 
> QUERY:
> http://127.0.0.1:8080/solr/indexA/select?q={!join+from=longValue+to=longValue+fromIndex=IndexB}universe:<http://127.0.0.1:8080/solr/indexA/select?q=%7b!join+from=longValue+to=longValue+fromIndex=IndexB%7duniverse:>universeValue
> 
> Qtime is 55 seconds for either a universe of 5 docs or 500,000 docs.
> 
> 
> 
> Thanks
> 
> 
> Russ.
> 
> 
> *******************************************************
> This message (including any files transmitted with it) may contain
> confidential and/or proprietary information, is the property of
> Interactive Data Corporation and/or its subsidiaries, and is directed
> only to the addressee(s). If you are not the designated recipient or have
> reason to believe you received this message in error, please delete this
> message from your system and notify the sender immediately. An unintended
> recipient's disclosure, copying, distribution, or use of this message or
> any attachments is prohibited and may be unlawful. 
> *******************************************************

Reply via email to