RE: DIH and rss feeds

2008-10-31 Thread Lance Norskog
The "clean" parameter is there in the 1.3 release. The full-import is by definition "full" so we delete all existing documents at the start. If you don't want to clean the index, you can pass clean=false and DIH will just add t

Re: DIH and rss feeds

2008-10-31 Thread Shalin Shekhar Mangar
The "clean" parameter is there in the 1.3 release. The full-import is by definition "full" so we delete all existing documents at the start. If you don't want to clean the index, you can pass clean=false and DIH will just add them. On Fri, Oct 31, 2008 at 8:58 PM, Jon Baer <[EMAIL PROTECTED]> wrote

Re: DIH and rss feeds

2008-10-31 Thread Jon Baer
Is that right? I find the wording of "clean" a little confusing. I would have thought this is what I needed earlier, but the topic came up regarding the fact that you cannot deleteByQuery for an entity you want to flush w/ delta-import. I just noticed that the original JIRA request sa

Re: DIH and rss feeds

2008-10-30 Thread Noble Paul നോബിള്‍ नोब्ळ्
Run full-import with clean=false. For full-import, clean is set to true by default; for delta-import, clean is false by default. On Fri, Oct 31, 2008 at 9:16 AM, Lance Norskog <[EMAIL PROTECTED]> wrote: > I have a DataImportHandler configured to index from an RSS feed. It is a > "latest stuff" fe
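
For readers who want to try this, here is a minimal sketch of how the request could be built. The `command` and `clean` request parameters are the ones named in this thread; the base URL (host, port, and the `/dataimport` handler path) and the helper name `dih_url` are assumptions for illustration and depend on how the handler is registered in your solrconfig.xml.

```python
from urllib.parse import urlencode

# Assumed base URL: host, port, and handler path are illustrative only.
DIH_URL = "http://localhost:8983/solr/dataimport"


def dih_url(command, clean=None, base=DIH_URL):
    """Build a DataImportHandler request URL (hypothetical helper).

    When clean is None the parameter is left off, so DIH applies its own
    default (true for full-import, false for delta-import, per this thread).
    """
    params = {"command": command}
    if clean is not None:
        params["clean"] = "true" if clean else "false"
    return base + "?" + urlencode(params)


# Full import that keeps the documents already in the index.
print(dih_url("full-import", clean=False))
```

Fetching the printed URL (in a browser or with any HTTP client) should start an import that adds the current feed items without deleting the earlier ones.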

Re: DIH and rss feeds

2008-10-30 Thread Norberto Meijome
On Thu, 30 Oct 2008 20:46:16 -0700 "Lance Norskog" <[EMAIL PROTECTED]> wrote: > Now: a few hours later there are a different 100 "latest" documents. How do > I add those to the index so I will have 200 documents? 'full-import' throws > away the first 100. 'delta-import' is not implemented. What

Re: DIH and rss feeds

2008-10-30 Thread Jon Baer
I'd like to say that deal is part of https://issues.apache.org/jira/browse/SOLR-783 but looking at it closely it might be different. I think the issue is that delta-import does not have anything to match its last_index_time against when doing feeds. I'm also interested in that type of merge

DIH and rss feeds

2008-10-30 Thread Lance Norskog
I have a DataImportHandler configured to index from an RSS feed. It is a "latest stuff" feed. It reads the feed and indexes the 100 documents harvested from the feed. So far, it works great. Now: a few hours later there are a different 100 "latest" documents. How do I add those to the index so I wi