Taking a look into the documentation I see this inconsistent orderings in
my opinion :

*Example:*

Concatenate word parts and number parts, but not word and number parts that
occur in the same token.
<analyzer>
  <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  <filter class="solr.WordDelimiterFilterFactory" catenateWords="1"
catenateNumbers="1"/>
</analyzer>

*In:* "hot-spot 100+42 XL40"

*Tokenizer to Filter:* "hot-spot"(1), "100+42"(2), "XL40"(3)

*Out:* "hot"(1), "spot"(2), "hotspot"(2) *(1?)*, "100"(3), "42"(4),
"10042"(4) *(2?)*, "XL"(5)*(3?)*, "40"(6)*(4?)*

*Example:*

Concatenate all. Word and/or number parts are joined together.
<analyzer>
  <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  <filter class="solr.WordDelimiterFilterFactory" catenateAll="1"/>
</analyzer>

*In:* "XL-4000/ES"

*Tokenizer to Filter:* "XL-4000/ES"(1)

*Out:* "XL"(1), "4000"(2), "ES"(3), "XL4000ES"(3)*(1?)*


I have not clear why a token generated by a catenation should not occupy
the same position of the original one.


In your example , I am a little bit surprised of the first results as well :

"RRR-COLECCION: COLECCIÓN: Gracita Morales foobar

Here are the final positions and terms that 4.7.2 yields for this on
query analysis:

1 rrr-coleccion
1 rrr
2 coleccion
2 rrrcoleccion *(1) ?*
3 coleccion
4 gracita
5 morales
6 foobar


It is not so clear, if the tokens must simply inherit their position from
the "parent" token, or if they must arrange it based on the final list of
tokens .

2015-07-08 16:03 GMT+01:00 Shawn Heisey <apa...@elyograg.org>:

> On 7/8/2015 8:44 AM, Shawn Heisey wrote:
> > This is what 4.9.1 does with it:
> >
> > 1 rrr-coleccion
> > 2 rrr
> > 2 coleccion
> > 2 rrrcoleccion
> > 3 coleccion
> > 4 gracita
> > 5 morales
> > 6 foobar
>
> Followup:  This is what Solr 5.2.1 does for query analysis, which also
> seems wrong, and doesn't match the phrase query:
>
> 1 rrr-coleccion
> 2 coleccion
> 2 rrr
> 2 rrrcoleccion
> 3 coleccion
> 4 gracita
> 5 morales
> 6 bleh
>
> The index analysis on 5.2.1 is the same as the other two versions.
>
> Thanks,
> Shawn
>
>


-- 
--------------------------

Benedetti Alessandro
Visiting card : http://about.me/alessandro_benedetti

"Tyger, tyger burning bright
In the forests of the night,
What immortal hand or eye
Could frame thy fearful symmetry?"

William Blake - Songs of Experience -1794 England

Reply via email to