Customizing levels in DateRangePrefixTree (or using number ranges directly?)
Hi,

I'm using DateRangePrefixTree and NumberRangePrefixTreeStrategy to show time histograms for my search results. This works perfectly. But is there a way to configure the way the levels are defined in the index?

In my case, documents only come with integer ranges (from year X to year Y), but span a wide interval (4000BC - present, since it's about archaeological data). So I'm always retrieving my facets at year granularity, and then usually end up with lots (1000s) of values, which I'm then downsampling again into ~25 buckets for UI display.

The downsampling eats up a good fraction of the overall response time. I assume there are still ways to optimize it. But if there was a way to configure the indexing levels upfront, so that they better match the granularity of my data, I could probably avoid much of the resampling altogether (e.g. set up the index so that it indexes at century and decade granularity, naively speaking).

Is there a way to do this? I'd be perfectly happy with a solution that only works for integer ranges rather than dates, but the plain NumberRangePrefixTree doesn't seem directly usable as it's an abstract base class. Or am I still thinking along the wrong lines?

Cheers,
Rainer

To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
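For reference, the downsampling step described in the message above (thousands of year-level facet values folded into ~25 display buckets) could look roughly like the following. This is only a sketch in plain Java, not the poster's actual code and not a Lucene API: the class name `YearBuckets`, the method `downsample`, and the assumption that facet counts arrive as a map from year (negative for BC) to count are all hypothetical. It also assumes that summing is the right aggregate; if a document is counted once for every year it spans, a sum counts it several times per bucket and another aggregate may fit better.

```java
import java.util.Arrays;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper: folds per-year counts into equally wide buckets.
final class YearBuckets {

    /**
     * Sums per-year counts into `buckets` equally wide buckets that together
     * cover [minYear, maxYear] inclusive. Years outside that range are ignored.
     */
    static long[] downsample(Map<Integer, Integer> yearCounts,
                             int minYear, int maxYear, int buckets) {
        if (buckets <= 0 || maxYear < minYear) {
            throw new IllegalArgumentException("need buckets > 0 and minYear <= maxYear");
        }
        long span = (long) maxYear - minYear + 1; // number of years covered
        long[] out = new long[buckets];
        for (Map.Entry<Integer, Integer> e : yearCounts.entrySet()) {
            int year = e.getKey();
            if (year < minYear || year > maxYear) {
                continue;
            }
            // Integer arithmetic keeps the index in [0, buckets - 1].
            int idx = (int) (((long) year - minYear) * buckets / span);
            out[idx] += e.getValue();
        }
        return out;
    }

    public static void main(String[] args) {
        Map<Integer, Integer> counts = new TreeMap<>();
        counts.put(-4000, 3);
        counts.put(-3999, 1);
        counts.put(0, 5);
        counts.put(2015, 2);
        System.out.println(Arrays.toString(downsample(counts, -4000, 2015, 25)));
    }
}
```

This is a single pass over the year-level values, so its cost grows with the number of distinct years returned rather than with the number of documents.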
Lucene and accumulo
Hi,

Can we use the Lucene API to store and search data in Accumulo? Can we have a system that combines the capabilities of both Accumulo and Lucene, so that data stored in Accumulo as key-value pairs can be searched through the Lucene API?

I have created Lucene indexes of the data stored in Accumulo, in HDFS. Lucene queries work fine against those indexes, but I want the indexes to be searchable via Accumulo, meaning the Lucene queries should run through Accumulo.

Madhvi Gupta
Re: Lucene and accumulo
Technically, yes, in much the same way that Cassandra data can be indexed by Solr and Lucene (e.g., Stratio, Stargate, and DataStax DSE Search), but it is a major engineering challenge, not a minor task.

There is an open Accumulo Jira issue for Lucene integration:
https://issues.apache.org/jira/browse/ACCUMULO-3698

The SQRRL commercial product has (or at least had before the company shifted its corporate strategy) Lucene indexing of Accumulo data, but that's a proprietary product:
http://sqrrl.com/product/search/

-- Jack Krupansky

On Thu, Apr 9, 2015 at 6:33 AM, madhvi madhvi.gu...@orkash.com wrote:
> Hi, Can we use Lucene API to store and search data in Accumulo? [...]