Re: duplicated URL sent from Nutch to solr index

2012-12-03 Thread Xi Shen
Then the "URL" must be the same. On Mon, Dec 3, 2012 at 2:34 PM, Joe Zhang wrote: > Sorry I didn't make it perfectly clear. The "id" field is the URL. > > On Sun, Dec 2, 2012 at 11:33 PM, Joe Zhang wrote: > > > Thanks! > > > > > > On Sun, Dec 2, 2012 at 11:20 PM, Xi Shen wrote: > > > >> If the va

Re: duplicated URL sent from Nutch to solr index

2012-12-02 Thread Joe Zhang
Sorry I didn't make it perfectly clear. The "id" field is the URL. On Sun, Dec 2, 2012 at 11:33 PM, Joe Zhang wrote: > Thanks! > > > On Sun, Dec 2, 2012 at 11:20 PM, Xi Shen wrote: > >> If the value of the "id" field is the same, the old entry will be updated; if >> it is new, a new entry will be creat

Re: duplicated URL sent from Nutch to solr index

2012-12-02 Thread Joe Zhang
Thanks! On Sun, Dec 2, 2012 at 11:20 PM, Xi Shen wrote: > If the value of the "id" field is the same, the old entry will be updated; if > it is new, a new entry will be created & indexed. > > This is my experience. :) > > > On Mon, Dec 3, 2012 at 1:45 PM, Joe Zhang wrote: > > > Dear list, > > > > I

Re: duplicated URL sent from Nutch to solr index

2012-12-02 Thread Xi Shen
If the value of the "id" field is the same, the old entry will be updated; if it is new, a new entry will be created & indexed. This is my experience. :) On Mon, Dec 3, 2012 at 1:45 PM, Joe Zhang wrote: > Dear list, > > I just want to confirm an expected behavior of solr: > > Assuming we have " id
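
To make the behavior described above concrete, here is a minimal sketch in Python. It is only a toy model of an index keyed on a unique field, not Solr or Nutch code: the `ToyIndex` class, its `add` method, and the example.com URLs are all made up for illustration. It assumes, as in this thread, that "id" is the unique key and holds the URL.

```python
class ToyIndex:
    """Toy stand-in for an index with a unique key field (hypothetical, not Solr)."""

    def __init__(self, unique_key="id"):
        self.unique_key = unique_key
        self.docs = {}  # maps unique key value -> stored document

    def add(self, doc):
        """Store doc; report whether it replaced an entry or created a new one."""
        key = doc[self.unique_key]
        replaced = key in self.docs
        # The incoming document replaces any existing one with the same key.
        self.docs[key] = dict(doc)
        return "updated" if replaced else "created"


index = ToyIndex()

# Same URL sent twice: the second add overwrites the first entry.
first = index.add({"id": "http://example.com/a", "title": "old"})
second = index.add({"id": "http://example.com/a", "title": "new"})

# A different URL produces a second, separate entry.
third = index.add({"id": "http://example.com/b", "title": "other"})

print(first, second, third, len(index.docs))
```

In this model the same URL never yields two entries; the later document simply takes the place of the earlier one, which matches the experience reported above.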