Re: Removing old documents

2012-05-02 Thread Paul Libbrecht
With which client? paul Le 2 mai 2012 à 01:29, alx...@aim.com a écrit : all caching is disabled and I restarted jetty. The same results.

Re: Removing old documents

2012-05-02 Thread alxsss
Subject: Re: Removing old documents With which client? paul Le 2 mai 2012 à 01:29, alx...@aim.com a écrit : all caching is disabled and I restarted jetty. The same results.

Re: Removing old documents

2012-05-01 Thread mav.p...@holidaylettings.co.uk
Not sure if there is an automatic way but we do it via a delete query and where possible we update doc under same id to avoid deletes. On 01/05/2012 13:43, Bai Shen baishen.li...@gmail.com wrote: What is the best method to remove old documents? Things that no generate 404 errors, etc. Is

Re: Removing old documents

2012-05-01 Thread Bai Shen
I'm running Nutch, so it's updating the documents, but I'm wanting to remove ones that are no longer available. So in that case, there's no update possible. On Tue, May 1, 2012 at 8:47 AM, mav.p...@holidaylettings.co.uk mav.p...@holidaylettings.co.uk wrote: Not sure if there is an automatic

Re: Removing old documents

2012-05-01 Thread Markus Jelsma
Nutch 1.4 has a separate tool to remove 404 and redirects documents from your index based on your CrawlDB. Trunk's SolrIndexer can add and remove documents in one run based on segment data. On Tuesday 01 May 2012 16:31:47 Bai Shen wrote: I'm running Nutch, so it's updating the documents, but

Re: Removing old documents

2012-05-01 Thread mav.p...@holidaylettings.co.uk
Hi What I do is I put the date created for when the doc was inserted or updated and then I do a search/delete query based on that Mav On 01/05/2012 15:31, Bai Shen baishen.li...@gmail.com wrote: I'm running Nutch, so it's updating the documents, but I'm wanting to remove ones that are no

Re: Removing old documents

2012-05-01 Thread alxsss
. -Original Message- From: Markus Jelsma markus.jel...@openindex.io To: solr-user solr-user@lucene.apache.org Sent: Tue, May 1, 2012 7:41 am Subject: Re: Removing old documents Nutch 1.4 has a separate tool to remove 404 and redirects documents from your index based on your

Re: Removing old documents

2012-05-01 Thread Lance Norskog
Jelsma markus.jel...@openindex.io To: solr-user solr-user@lucene.apache.org Sent: Tue, May 1, 2012 7:41 am Subject: Re: Removing old documents Nutch 1.4 has a separate tool to remove 404 and redirects documents from your index based on your CrawlDB. Trunk's SolrIndexer can add and remove

Re: Removing old documents

2012-05-01 Thread Paul Libbrecht
, May 1, 2012 7:41 am Subject: Re: Removing old documents Nutch 1.4 has a separate tool to remove 404 and redirects documents from your index based on your CrawlDB. Trunk's SolrIndexer can add and remove documents in one run based on segment data. On Tuesday 01 May 2012 16:31:47 Bai Shen

Re: Removing old documents

2012-05-01 Thread alxsss
all caching is disabled and I restarted jetty. The same results. Thanks. Alex. -Original Message- From: Lance Norskog goks...@gmail.com To: solr-user solr-user@lucene.apache.org Sent: Tue, May 1, 2012 2:57 pm Subject: Re: Removing old documents Maybe this is the HTTP caching