You can always combine index data into a single Lucene index - but there is no tool yet to actually merge segment data; in other words, you will end up with multiple segments, but with a single search index pointing to these multiple segments.

If it's not too much trouble, could you please elaborate on this? What command(s) should I use, and how should I configure things for this (sorry, I am a newbie :->)?

Thank you so much!

Olive


From: Andrzej Bialecki <[EMAIL PROTECTED]>
Reply-To: [email protected]
To: [email protected]
Subject: Re: more questions on this - please advice
Date: Mon, 03 Apr 2006 23:38:59 +0200


Olive g wrote:
Thank you for your reply. I have a few more questions:

- Is there any workaround that I can use for now for what I want to do (multiple crawls,
  then combine the data for search)?


You can always combine index data into a single Lucene index - but there is no tool yet to actually merge segment data; in other words, you will end up with multiple segments, but with a single search index pointing to these multiple segments.
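To illustrate the distinction drawn above between merging indexes and merging segments, here is a toy sketch in Python. It is not Nutch or Lucene code, and it does not show the actual commands; every name in it (`build_index`, `merge_indexes`, `search`, the segment names and documents) is hypothetical. It only models the idea that a single combined index can hold entries that still point back to several separate segments.

```python
from collections import defaultdict

def build_index(segment_name, docs):
    """Build a tiny inverted index for one segment: term -> {(segment, doc_id)}."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add((segment_name, doc_id))
    return index

def merge_indexes(indexes):
    """Combine per-segment indexes into one; postings still name their source segment."""
    merged = defaultdict(set)
    for index in indexes:
        for term, postings in index.items():
            merged[term] |= postings
    return merged

# Hypothetical data standing in for two separate crawls.
# The segment data itself is never merged; it stays in two places.
segments = {
    "segment_a": {1: "apache nutch crawler", 2: "lucene index"},
    "segment_b": {1: "nutch segment data"},
}

merged = merge_indexes(build_index(name, docs) for name, docs in segments.items())

def search(term):
    """Look up a term in the single merged index, then read text from the owning segment."""
    hits = sorted(merged.get(term.lower(), ()))
    return [(seg, doc_id, segments[seg][doc_id]) for seg, doc_id in hits]

print(search("nutch"))
```

A search for "nutch" consults only the one merged index, yet the hits it returns come from both segments, which is the "single search index pointing to these multiple segments" situation described above.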

- If I were to back down to 0.7, would the data from 0.8 crawls be compatible with 0.7
 (I use DFS)?

No, data created in 0.8 is not compatible with 0.7. With (significant) effort, suitable converters could be written, but it would be far less expensive to just bite the bullet and implement the missing functionality in 0.8.

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


