Any idea?
This is a snippet of my schema.xml now:
<?xml version="1.0" encoding="UTF-8" ?>
<!--
Licensed to the Apache Software Foundation (ASF) under one or more
...
<!-- fields for index-basic plugin -->
<field name="host" type="url" stored="false" indexed="true"/>
<field name="site" type="string" stored="false" indexed="true"/>
<field name="url" type="url" stored="true" indexed="true"
required="true"/>
<field name="content" type="text" stored="true" indexed="true"
omitNorms="true"/>
<field name="cache" type="string" stored="true" indexed="false"/>
<field name="tstamp" type="long" stored="true" indexed="false"/>
<!-- fields for index-anchor plugin -->
<field name="anchor" type="string" stored="true" indexed="true"
multiValued="true"/>
...
<!-- uncomment the following to ignore any fields that don't already
match an existing
field name or dynamic field, rather than reporting them as an
error.
alternately, change the type="ignored" to some other type e.g.
"text" if you want
unknown fields indexed and/or stored by default -->
<!--dynamicField name="*" type="ignored" multiValued="true" /-->
</fields>
<!-- Field to use to determine and enforce document uniqueness.
Unless this field is marked with required="false", it will be a
required field
-->
<uniqueKey>id</uniqueKey>
<!-- field for the QueryParser to use when an explicit fieldname is absent
...
</schema>
Remi
On Sun, Jan 22, 2012 at 6:31 PM, remi tassing <[email protected]> wrote:
> Hi,
>
> I got wrong in beginning but putting omitNorms in the query url.
>
> Now following your advice, I merged the schema.xml from Nutch and Solr and
> made sure omitNorms was set to "true" for the content, just as you said.
>
> Unfortunately the problem remains :-(
>
>
> On Thursday, January 19, 2012, Jan Høydahl <[email protected]> wrote:
> > Hi,
> >
> > The schema you pasted in your mail is NOT Solr3.5's default example
> schema. Did you get it from the Nutch project?
> >
> > And the "omitNorms" parameter is supposed to go in the <field> tag in
> schema.xml, and the "content" field in the example schema does not have
> omitNorms="true". Try to change
> >
> > <field name="content" type="text" stored="false" indexed="true"/>
> > to
> > <field name="content" type="text" stored="false" indexed="true"
> omitNorms="true"/>
> >
> > and try again. Please note that you SHOULD customize your schema, there
> is really no "default" schema in Solr (or Nutch), it's only an example or
> starting point. For your search application to work well you will have to
> invest some time in designing a schema, working with your queries, perhaps
> exploring DisMax query parser etc etc.
> >
> > --
> > Jan Høydahl, search solution architect
> > Cominvent AS - www.cominvent.com
> > Solr Training - www.solrtraining.com
> >
> > On 19. jan. 2012, at 13:01, remi tassing wrote:
> >
> >> Hello Jan,
> >>
> >> My schema wasn't changed from the release 3.5.0. The content can be seen
> >> below:
> >>
> >> <schema name="nutch" version="1.1">
> >> <types>
> >> <fieldType name="string" class="solr.StrField"
> >> sortMissingLast="true" omitNorms="true"/>
> >> <fieldType name="long" class="solr.LongField"
> >> omitNorms="true"/>
> >> <fieldType name="float" class="solr.FloatField"
> >> omitNorms="true"/>
> >> <fieldType name="text" class="solr.TextField"
> >> positionIncrementGap="100">
> >> <analyzer>
> >> <tokenizer class="solr.WhitespaceTokenizerFactory"/>
> >> <filter class="solr.StopFilterFactory"
> >> ignoreCase="true" words="stopwords.txt"/>
> >> <filter class="solr.WordDelimiterFilterFactory"
> >> generateWordParts="1" generateNumberParts="1"
> >> catenateWords="1" catenateNumbers="1" catenateAll="0"
> >> splitOnCaseChange="1"/>
> >> <filter class="solr.LowerCaseFilterFactory"/>
> >> <filter class="solr.EnglishPorterFilterFactory"
> >> protected="protwords.txt"/>
> >> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
> >> </analyzer>
> >> </fieldType>
> >> <fieldType name="url" class="solr.TextField"
> >> positionIncrementGap="100">
> >> <analyzer>
> >> <tokenizer class="solr.StandardTokenizerFactory"/>
> >> <filter class="solr.LowerCaseFilterFactory"/>
> >> <filter class="solr.WordDelimiterFilterFactory"
> >> generateWordParts="1" generateNumberParts="1"/>
> >> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
> >> </analyzer>
> >> </fieldType>
> >> </types>
> >> <fields>
> >> <field name="id" type="string" stored="true" indexed="true"/>
> >>
> >> <!-- core fields -->
> >> <field name="segment" type="string" stored="true"
> indexed="false"/>
> >> <field name="digest" type="string" stored="true"
> indexed="false"/>
> >> <field name="boost" type="float" stored="true" indexed="false"/>
> >>
> >> <!-- fields for index-basic plugin -->
> >> <field name="host" type="url" stored="false" indexed="true"/>
> >> <field name="site" type="string" stored="false" indexed="true"/>
> >> <f
>