Re: Nightly build link is broken

2009-08-05 Thread Adriano Crestani
The server is up again :D Thank you guys! On Wed, Aug 5, 2009 at 11:55 AM, Michael McCandless < luc...@mikemccandless.com> wrote: > Hmm... not working for me either. Looks like it's currently down: > >http://monitoring.apache.org/status/ > > I expect infra is working on fixing it... > > Mik

RE: DuplicateFilter.java

2009-08-05 Thread Steven A Rowe
Hi Paul, This class is under contrib/queries/: http://svn.apache.org/viewvc/lucene/java/branches/lucene_2_4/contrib/queries/src/java/org/apache/lucene/search/ Steve > -Original Message- > From: Paul [mailto:paul.fuehr...@gmail.com] > Sent: Wednesday, August 05, 2009 6:32 PM > To: java-u

DuplicateFilter.java

2009-08-05 Thread Paul
Hi, I'm looking for a way to filter out duplicates of a search result and found this class in the javadoc [1]. However, it is not included in the 2.4.1 source [2]. There seem to be some other classes missing or inconsistencies as well. [1] http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/

RE: Analysis Question

2009-08-05 Thread Christopher Condit
Perhaps a better question: let's say I have a few thousand terms or phrases. I want to prefer documents with these phrases in my search results over documents that do not have these terms or phrases. What's the best way to accomplish this? Thanks, -Chris > -Original Message- > From: Chri

Re: Efficient optimization of large indexes?

2009-08-05 Thread Michael McCandless
On Wed, Aug 5, 2009 at 12:08 PM, Nigel wrote: > We periodically optimize large indexes (100 - 200gb) by calling > IndexWriter.optimize(). It takes a heck of a long time, and I'm wondering > if a more efficient solution might be the following: > > - Create a new empty index on a different filesyste

Re: Nightly build link is broken

2009-08-05 Thread Michael McCandless
Hmm... not working for me either. Looks like it's currently down: http://monitoring.apache.org/status/ I expect infra is working on fixing it... Mike On Wed, Aug 5, 2009 at 2:10 PM, Adriano Crestani wrote: > The new link also does not work. Is it working for you? > > On Wed, Aug 5, 2009 at

Re: Nightly build link is broken

2009-08-05 Thread Adriano Crestani
The new link also does not work. Is it working for you? On Wed, Aug 5, 2009 at 11:06 AM, Adriano Crestani wrote: > Now I see it was in the news session on the main page: 3 January 2007 - > Nightly Source builds > available

Re: Nightly build link is broken

2009-08-05 Thread Adriano Crestani
Now I see it was in the news session on the main page: 3 January 2007 - Nightly Source builds available. It makes sense that the new is out of date. However, is there any other place the link to nightly builds can

Re: org.apache.lucene.index.MergePolicy$MergeException

2009-08-05 Thread tsuraan
On 05/08/2009, Michael McCandless wrote: > Switching to addIndexes instead, or using SerialMergeScheduler, or > upgrading to 2.4.1, should all work. Thanks! We'll be switching to 2.9 once it's ready. From past experience, lucene upgrades are simple and painless, but I don't think I can do a 2.4

Re: THIS WEEK: PNW Hadoop, HBase / Apache Cloud Stack Users' Meeting, Wed Jul 29th, Seattle

2009-08-05 Thread Bradford Stephens
A big "thanks" to everyone who came out despite the heat! Hope to see you again the last week of August, probably at UW. On Wed, Jul 29, 2009 at 4:52 PM, Bradford Stephens wrote: > Don't forget this is tonight! Excited to see everyone there. > > On Tue, Jul 28, 2009 at 11:25 AM, Bradford > Stephen

Re: org.apache.lucene.index.MergePolicy$MergeException

2009-08-05 Thread Michael McCandless
Switching to addIndexes instead, or using SerialMergeScheduler, or upgrading to 2.4.1, should all work. Mike On Wed, Aug 5, 2009 at 12:26 PM, tsuraan wrote: > I'm getting the exception > "org.apache.lucene.index.MergePolicy$MergeException: segment "_0 > exists in external directory yet the MergeS

Re: What happens after merging?

2009-08-05 Thread Simon Willnauer
Hi there, when you use addIndexes() the IndexWriter merges the provided indexes it as well as the index the IndexWriter was opened on into one single index. It seem like you are using compound file system with the IndexWriter you add your other indexes too. This instance takes all documents and the

org.apache.lucene.index.MergePolicy$MergeException

2009-08-05 Thread tsuraan
I'm getting the exception "org.apache.lucene.index.MergePolicy$MergeException: segment "_0 exists in external directory yet the MergeScheduler executed the merge in a separate thread". According to this: http://mail-archives.apache.org/mod_mbox/lucene-java-user/200809.mbox/ that only happens wit

What happens after merging?

2009-08-05 Thread prashant ullegaddi
Hi, I've some indexes. As you all know, each has these files: _0.fdt _0.fdx _hqy.fnm _hqy.frq _hqy.nrm _hqy.prx _hqy.tii _hqy.tis segments_2 segments.gen Once I merge those indexes into single index by (IndexWriter's addIndexes()), the merged index has only 3 files: _0.cfs segments_2 se

Efficient optimization of large indexes?

2009-08-05 Thread Nigel
We periodically optimize large indexes (100 - 200gb) by calling IndexWriter.optimize(). It takes a heck of a long time, and I'm wondering if a more efficient solution might be the following: - Create a new empty index on a different filesystem - Set a merge policy for the new index so it puts eve

Re: A Presentation on Building a Hadoop + Lucene System Architecture

2009-08-05 Thread Bradford Stephens
Yes, we do index generation with Hadoop, and search with Katta, which is distributed Lucene :) What are you trying to do? On Tue, Aug 4, 2009 at 9:36 PM, m.harig wrote: > > Hello > >              Do you've any idea about the integration of Lucene with Hadoop > > > BrickMcLargeHuge wrote: >> >> Hey

Re: ParallelMultiSearcher and idf

2009-08-05 Thread Simon Willnauer
Hey Christian, you might wanna look at distributed solr (http://wiki.apache.org/solr/DistributedSearch) or if you haven't done so have a look at the Katta project (http://katta.sourceforge.net/documentation/how-katta-works) maybe this can help you out. About distributed IDF and Scoring have a look