Collating results from multiple indexes

Aaron McKee Mon, 25 Jan 2010 13:01:41 -0800

Is there any reasonably convenient way to collate/integrate fields from separate indices during result writing, if the indices use the same unique keys? Basically, some sort of cross-index JOIN?

As a bit of background, I have a rather heavyweight dataset of every US business (~25m records, an on-disk index footprint of ~30 GB, and 5-10 hours to fully index on a decent box). Given the size and relative stability of the dataset, I generally only update this monthly. However, I have separate advertising-related datasets that need to be updated either hourly or daily (e.g. today's coupon, click revenue remaining, etc.). These advertiser feeds reference the same keyspace that I use in the main index, but are otherwise significantly lighter weight. Importing and indexing them separately only takes a couple of minutes. Given that Solr/Lucene doesn't support updating a field without dropping and re-adding the entire document, it doesn't seem practical to integrate this data into the main index (with document re-inserts the system would be in a constant state of churn, and the performance impact would probably be debilitating). It would be nice if this data could participate in filtering (e.g. only show advertisers), but it doesn't need to participate in scoring/ranking.

I'm guessing that someone else has had a similar need at some point? I can have our front-end query the smaller indices separately, using the keys returned by the primary index, but would prefer to avoid the extra sequential round trips. I'm also hoping to avoid a custom-code solution inside Solr, if only to avoid the maintenance overhead as we drop in new builds of Solr, but that's feasible too.
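For concreteness, here is a minimal sketch of the front-end fallback described above: take the keys from the primary results, fetch the matching documents from a smaller index in one batched query, and merge the fields by key. The field names (`id`, `name`, `coupon`) and both helper functions are hypothetical, invented for illustration; the HTTP call to Solr is deliberately left out, so only the query-building and merging steps are shown.

```python
from typing import Dict, List


def build_key_query(keys: List[str], key_field: str = "id") -> str:
    """Build one Lucene-syntax query matching any of the given keys.

    key_field is a hypothetical name for the shared unique-key field.
    """
    if not keys:
        raise ValueError("need at least one key")
    # Quote each key and escape backslashes and embedded quotes.
    quoted = [
        '"{}"'.format(k.replace("\\", "\\\\").replace('"', '\\"'))
        for k in keys
    ]
    return "{}:({})".format(key_field, " OR ".join(quoted))


def collate(primary_docs: List[Dict], secondary_docs: List[Dict],
            key_field: str = "id") -> List[Dict]:
    """Merge secondary fields into primary docs that share the same key.

    Primary order is preserved, and primary fields win on a name clash.
    """
    by_key = {doc[key_field]: doc for doc in secondary_docs}
    merged = []
    for doc in primary_docs:
        combined = dict(doc)
        for name, value in by_key.get(doc[key_field], {}).items():
            combined.setdefault(name, value)
        merged.append(combined)
    return merged


# Made-up sample data standing in for the two result sets.
primary = [{"id": "b1", "name": "Acme"}, {"id": "b2", "name": "Bolt"}]
secondary = [{"id": "b2", "coupon": "10% off"}]

query = build_key_query([d["id"] for d in primary])
results = collate(primary, secondary)
```

Batching the keys into a single query keeps this to one extra round trip per secondary index rather than one per result, but it is still sequential, and it does not let the advertiser data take part in filtering on the Solr side.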


Thank you for your insight,
Aaron
