Customizing levels in DateRangePrefixTree (or using number ranges directly?)

2015-04-09 Thread Simon Rainer
Hi,

I'm using DateRangePrefixTree and NumberRangePrefixTreeStrategy to show time 
histograms for my search results. This works perfectly. But is there a way to 
configure the way the levels are defined in the index?

In my case, documents only come with integer ranges (from year X to year Y), 
but span a wide interval (4000BC - present, since it's about archaeological 
data). So I'm always retrieving my facets at year granularity, and then usually 
end up with lots (1000s) of values, which I'm then downsampling again into 
~25 buckets for UI display. The downsampling eats up a good fraction of the 
overall response time. I assume there are still ways to optimize it. But if 
there was a way to configure the indexing levels upfront, so that they better 
match the granularity of my data, I could probably avoid much of the resampling 
altogether (e.g. set up the index so that it indexes at century and decade 
granularity, naively speaking).

Is there a way to do this? I'd be perfectly happy with a solution that only 
works for integer ranges rather than dates, but the plain NumberRangePrefixTree 
doesn't seem directly usable as it's an abstract base class. Or am I still 
thinking along the wrong lines?

Cheers,
Rainer


-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



Lucene and accumulo

2015-04-09 Thread madhvi

Hi,

Can we use Lucene API to store and search data in Accumulo?can we have a 
system, which is combination of the capabilities of both Accumulo and 
Lucene so that data can be searched by lucene API which is actually 
stored in Accumulo as key value.I have created lucene indexes of data 
stored in accumulo, in HDFS.
Lucene queries are working fine over that but I want to use those 
indexes to be searched via accumulo means the lucene queries should run 
via accumulo.


Madhvi Gupta

-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



Re: Lucene and accumulo

2015-04-09 Thread Jack Krupansky
Technically, yes, in much the same way that Cassandra data can be indexed
by Solr and Lucene (e.g., Stratio, Stargate, and DataStax DSE Search), but
it is a major engineering challenge, not a minor task.

There is an open Accumulo Jira issue for Lucene integration:
https://issues.apache.org/jira/browse/ACCUMULO-3698

The SQRRL commercial product has (or at least had before the company
shifted its corporate strategy) Lucene indexing of Accumulo data, but
that's a proprietary product:
http://sqrrl.com/product/search/


-- Jack Krupansky

On Thu, Apr 9, 2015 at 6:33 AM, madhvi madhvi.gu...@orkash.com wrote:

 Hi,

 Can we use Lucene API to store and search data in Accumulo?can we have a
 system, which is combination of the capabilities of both Accumulo and
 Lucene so that data can be searched by lucene API which is actually stored
 in Accumulo as key value.I have created lucene indexes of data stored in
 accumulo, in HDFS.
 Lucene queries are working fine over that but I want to use those indexes
 to be searched via accumulo means the lucene queries should run via
 accumulo.

 Madhvi Gupta

 -
 To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
 For additional commands, e-mail: java-user-h...@lucene.apache.org