Re: document id in nutch/solr

2013-06-24 Thread alxsss
Another way of overriding nutch fields is to modify solrindex-mapping.xml file.

hth
Alex.

 

 

 

-Original Message-
From: Jack Krupansky 
To: solr-user 
Sent: Sun, Jun 23, 2013 12:04 pm
Subject: Re: document id in nutch/solr


Add the "passthrough" dynamic field to your Solr schema, and then see what 
fields get passed through to Solr from Nutch. Then, add the missing fields 
to your Solr schema and remove the passthrough.



Or, add Solr  directives to place fields in existing named 
fields.

Or... talk to the nutch people about how to do field name mapping on the 
nutch side of the fence.

Hold off on UUIDs until you figure all of the above out and everything is 
working without them.

-- Jack Krupansky

-Original Message- 
From: Joe Zhang
Sent: Sunday, June 23, 2013 2:35 PM
To: solr-user@lucene.apache.org
Subject: Re: document id in nutch/solr

Can somebody help with this one, please?


On Fri, Jun 21, 2013 at 10:36 PM, Joe Zhang  wrote:

> A quite standard configuration of nutch seems to autoamtically map "url"
> to "id". Two questions:
>
> - Where is such mapping defined? I can't find it anywhere in
> nutch-site.xml or schema.xml. The latter does define the "id" field as 
> well
> as its uniqueness, but not the mapping.
>
> - Given that nutch nutch has already defined such an id, can i ask solr to
> redefine id as UUID?
> 
>
> - This leads to a related question: do solr and nutch have to have
> IDENTICAL schema.xml?
> 


 


Re: document id in nutch/solr

2013-06-23 Thread Jack Krupansky
Add the "passthrough" dynamic field to your Solr schema, and then see what 
fields get passed through to Solr from Nutch. Then, add the missing fields 
to your Solr schema and remove the passthrough.


multiValued="true" />


Or, add Solr  directives to place fields in existing named 
fields.


Or... talk to the nutch people about how to do field name mapping on the 
nutch side of the fence.


Hold off on UUIDs until you figure all of the above out and everything is 
working without them.


-- Jack Krupansky

-Original Message- 
From: Joe Zhang

Sent: Sunday, June 23, 2013 2:35 PM
To: solr-user@lucene.apache.org
Subject: Re: document id in nutch/solr

Can somebody help with this one, please?


On Fri, Jun 21, 2013 at 10:36 PM, Joe Zhang  wrote:


A quite standard configuration of nutch seems to autoamtically map "url"
to "id". Two questions:

- Where is such mapping defined? I can't find it anywhere in
nutch-site.xml or schema.xml. The latter does define the "id" field as 
well

as its uniqueness, but not the mapping.

- Given that nutch nutch has already defined such an id, can i ask solr to
redefine id as UUID?


- This leads to a related question: do solr and nutch have to have
IDENTICAL schema.xml?





Re: document id in nutch/solr

2013-06-23 Thread Joe Zhang
Can somebody help with this one, please?


On Fri, Jun 21, 2013 at 10:36 PM, Joe Zhang  wrote:

> A quite standard configuration of nutch seems to autoamtically map "url"
> to "id". Two questions:
>
> - Where is such mapping defined? I can't find it anywhere in
> nutch-site.xml or schema.xml. The latter does define the "id" field as well
> as its uniqueness, but not the mapping.
>
> - Given that nutch nutch has already defined such an id, can i ask solr to
> redefine id as UUID?
> 
>
> - This leads to a related question: do solr and nutch have to have
> IDENTICAL schema.xml?
>


document id in nutch/solr

2013-06-21 Thread Joe Zhang
A quite standard configuration of nutch seems to autoamtically map "url" to
"id". Two questions:

- Where is such mapping defined? I can't find it anywhere in nutch-site.xml
or schema.xml. The latter does define the "id" field as well as its
uniqueness, but not the mapping.

- Given that nutch nutch has already defined such an id, can i ask solr to
redefine id as UUID?


- This leads to a related question: do solr and nutch have to have
IDENTICAL schema.xml?