High frequency terms in results document....

2015-02-15 Thread Shouvik Bardhan
Apologies if I have missed it in discussions prior but I looked all over. I looked at the Luke code and it does find high frequency terms on the entire index. I am trying to get the top N high frequency terms in the documents returned from a search result. I came across something called FilterIndex

JTRES 2015 CFP

2015-02-15 Thread Lukasz Ziarek
Dear Real-time Java Community, Remi and I are pleased to announce the release of the JTRES 2015 call for papers (below) and the JTRES 2015 website: http://jtres2015.univ-mlv.fr. JTRES will be held in Paris on October 7th and 8th. We hope to see you there and look forward to your submissi

Re: Top 10 words

2015-02-15 Thread Denis Bazhenov
Either you have to index those words in a facet or calculate top 10 words on-the-fly. Last approach could be effective enough if you have ability to read those documents quickly. The calculation of Top 10 words could be done pretty easily in terms of memory and CPU, because there is no need to d

Re: Top 10 words

2015-02-15 Thread Maisnam Ns
Hi Denis, Looks good and thanks for the links. And one more help , once finding the top ten say 'Lucene' -1000 , 'search' -789 , I need to a quick span query on 'Lucene' say e.g 'Companies use Lucene for searching' , some phrases containing 'Lucene'. I tried using this http://sujitpal.blogspot.i

Re: Top 10 words

2015-02-15 Thread Maisnam Ns
Hi Jigar, The link you shared http://search.carrot2.org is really nice a lot of it's features actually has my requirements. Thanks for the share On Mon, Feb 16, 2015 at 9:20 AM, Maisnam Ns wrote: > Hi Denis, > > Looks good and thanks for the links. And one more help