I am running some experiments with MoreLikeThis (MLT) and the results seem rather
odd. I am probably doing something wrong but just cannot figure out what.
Basically, the similarity results are decent, but not great.

*Issue 1: Quality*
For Toyota Camry, MLT finds the Altima (good), but the next result is the Camry Hybrid,
whereas it should have found the Accord.
I have normalized the data into a simi field which contains only the attributes
that I care about.
Without the simi field, I could not get the mlt.qf boosts to work well enough
to return results.

*Issue 2*
Some fields do not work at all. For instance, text+simi (in mlt.fl) works,
whereas simi alone does not.
So there is some weirdness here that I am just not understanding.

I would be grateful for your guidance!


Here is the setup:
*1. SOLR Version*
solr-spec 4.2.0.2013.03.06.22.32.13
solr-impl 4.2.0 1453694   rmuir - 2013-03-06 22:32:13
lucene-spec 4.2.0
lucene-impl 4.2.0 1453694 -  rmuir - 2013-03-06 22:25:29

*2. Machine Information*
Sun Microsystems Inc. Java HotSpot(TM) 64-Bit Server VM (1.6.0_23 19.0-b09)
Windows 7 Home 64 Bit with 4 GB RAM

*3. Sample Data *
I created this 'dummy' data set of cars, the idea being that it would be
small and simple enough to generate similarity results and to understand how
MLT works.
There are 181 rows in the data set (I have attached it for reference in CSV
format).

[image: Inline image 1]

*4. SCHEMA*
*Field Definitions*
   <field name="id" type="string" indexed="true" stored="true"
termVectors="true" multiValued="false"/>
   <field name="make" type="string" indexed="true" stored="true"
termVectors="true" multiValued="false"/>
   <field name="model" type="string" indexed="true" stored="true"
termVectors="true" multiValued="false"/>
   <field name="class" type="string" indexed="true" stored="true"
termVectors="true" multiValued="false"/>
   <field name="type" type="string" indexed="true" stored="true"
termVectors="true" multiValued="false"/>
   <field name="drive" type="string" indexed="true" stored="true"
termVectors="true" multiValued="false"/>
   <field name="comment" type="text_general" indexed="true" stored="true"
termVectors="true" multiValued="true"/>
   <field name="size" type="string" indexed="true" stored="true"
termVectors="true" multiValued="false"/>
*Copy Fields*
<copyField   source="make"     dest="make_en"   />  <!-- Search  -->
<copyField   source="model"     dest="model_en"   />  <!-- Search  -->
<copyField   source="class"     dest="class_en"   />  <!-- Search  -->
<copyField   source="type"     dest="type_en"   />  <!-- Search  -->
<copyField   source="drive"     dest="drive_en"   />  <!-- Search  -->
<copyField   source="comment"     dest="comment_en"   />  <!-- Search  -->
<copyField   source="size"     dest="size_en"   />  <!-- Search  -->
<copyField   source="id"     dest="text"   />  <!-- Glob  -->
<copyField   source="make"     dest="text"   />  <!-- Glob  -->
<copyField   source="model"     dest="text"   />  <!-- Glob  -->
<copyField   source="class"     dest="text"   />  <!-- Glob  -->
<copyField   source="type"     dest="text"   />  <!-- Glob  -->
<copyField   source="drive"     dest="text"   />  <!-- Glob  -->
<copyField   source="comment"     dest="text"   />  <!-- Glob  -->
<copyField   source="size"     dest="text"   />  <!-- Glob  -->
<copyField   source="size"     dest="text"   />  <!-- Glob  -->
<copyField   source="class"     dest="simi_en"   />  <!-- similarity  -->
<copyField   source="type"     dest="simi_en"   />  <!-- similarity  -->
<copyField   source="drive"     dest="simi_en"   />  <!-- similarity  -->
<copyField   source="size"     dest="simi_en"   />  <!-- similarity  -->

Note that the "simi" field ends up with values made of class, type, drive and
size:
- Luxury SUV 4WD Large
- Standard Sedan Front Family
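
To make the composition of that field concrete, here is a minimal Python sketch of the terms the four similarity copyField rules would be expected to collect for a row of the sample data. The helper name `simi_value` and the joining with spaces are assumptions for illustration only; the sketch mimics the resulting set of terms, not how Solr stores or analyzes them.

```python
import csv
import io

# Two rows copied from the sample data set (ids 2 and 161).
SAMPLE = """id,make,model,class,type,drive,comment,size,size_i
2,Acura ,MDX,Luxury,SUV,4wd,,Large,5
161,Toyota ,Camry LE (4-cyl.),Standard,Sedan,Front,,Family,4
"""

# Source fields of the four similarity copyField rules, in schema order.
SIMI_SOURCES = ("class", "type", "drive", "size")


def simi_value(row):
    """Join the non-empty similarity attributes of one row (hypothetical helper)."""
    values = (row[name].strip() for name in SIMI_SOURCES)
    return " ".join(value for value in values if value)


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
for row in rows:
    print(row["id"], "->", simi_value(row))
```

Running the same helper over the full CSV is a quick way to see which cars share identical similarity attributes.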


*5. MLT Setup*
a. mlt.fl = *text*, mlt.qf = *text*: works, but the results are obviously not good
(make is not a good similarity indicator)
http://localhost:8983/solr/cars/select/?q=id:2&mlt=true&fl=text&mlt.fl=text&mlt.qf=text

b. mlt.fl = *simi*, mlt.qf = *simi*: does not work at all (0 results)
http://localhost:8983/solr/cars/select/?q=id:2&mlt=true&fl=text&mlt.fl=simi&mlt.qf=simi

c. mlt.fl = *simi,text*, mlt.qf = *simi^10 text^.1*: works, with decent results
in most cases
http://localhost:8983/solr/cars/select/?q=id:2&mlt=true&fl=text&mlt.fl=simi,text&mlt.qf=simi^10%20text^.01
This works for getting cars similar to the Acura MDX (Luxury SUV 4WD Large),
but for the Toyota Camry it finds hybrid family cars (Prius) ahead of Honda.
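
For anyone who wants to reproduce the three requests, the following is a small Python sketch that builds them with properly encoded parameters. The helper name `mlt_url` is hypothetical; the host, core name, and parameter values are taken from the URLs above (the boosts in c use the values from the URL, `simi^10 text^.01`). It only constructs the URLs and does not contact Solr.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

BASE = "http://localhost:8983/solr/cars/select/"


def mlt_url(doc_id, mlt_fl, mlt_qf):
    """Build a MoreLikeThis request URL for one document (hypothetical helper)."""
    params = [
        ("q", f"id:{doc_id}"),
        ("mlt", "true"),
        ("fl", "text"),
        ("mlt.fl", mlt_fl),
        ("mlt.qf", mlt_qf),
    ]
    # urlencode percent-encodes reserved characters and writes spaces as "+".
    return BASE + "?" + urlencode(params)


urls = {
    "a": mlt_url(2, "text", "text"),
    "b": mlt_url(2, "simi", "simi"),
    "c": mlt_url(2, "simi,text", "simi^10 text^.01"),
}
for label, url in urls.items():
    print(label, url)

# Round-trip check: decoding the query string recovers the original boosts.
decoded = parse_qs(urlsplit(urls["c"]).query)
print(decoded["mlt.qf"])
```

Building the query string this way avoids the line-wrapping and manual `%20` escaping problems that creep in when the URLs are pasted by hand.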



id,make,model,class,type,drive,comment,size,size_i
1,Acura ,ILX 2.0L,Luxury,Sedan,Front,,Mini,2
2,Acura ,MDX,Luxury,SUV,4wd,,Large,5
3,Acura ,RDX,Luxury,SUV,4wd,,Small,3
4,Acura ,RLX,Luxury,Sedan,AWD,,Large,5
5,Acura ,TL,Luxury,Sedan,Front,,Family,4
6,Acura ,TSX,Luxury,Sedan,Front,,Small,3
7,Acura ,ZDX,Luxury,SUV,4wd,,Large,5
8,Audi ,A3 2.0T,Luxury,Sedan,AWD,,Mini,2
9,Audi ,A4,Luxury,Sedan,AWD,,Small,3
10,Audi ,A5 2.0T,Luxury,Sedan,AWD,,Family,4
11,Audi ,A6 3.0T,Luxury,Sedan,AWD,,Family,4
12,Audi ,A7,Luxury,Sedan,AWD,,Large,5
13,Audi ,A8,Luxury,Sedan,AWD,,Largest,7
14,Audi ,Allroad,Luxury,Wagon,AWD,,Large,5
15,Audi ,Q5 2.0T,Luxury,SUV,4wd,,Large,5
16,Audi ,Q7,Luxury,SUV,4wd,,Largest,7
17,Audi ,R8,Luxury,Sports,RWD,,Largest,7
18,Audi ,S4,Luxury,Sports,AWD,,Small,3
19,Audi ,TT,Luxury,Coupe,Front,,Mini,2
20,BMW ,135i,Luxury,Sedan,RWD,,Mini,2
21,BMW ,328i,Luxury,Sedan,RWD,,Small,3
22,BMW ,4 Series,Luxury,Sedan,RWD,,Family,4
23,BMW ,535i,Luxury,Sedan,RWD,,Large,5
24,BMW ,6 Series,Luxury,Sedan,RWD,,Very Large,6
25,BMW ,750Li,Luxury,Sedan,RWD,,Largest,7
26,BMW ,X1 xDrive28i (2.0T),Luxury,SUV,4wd,,Mini,2
27,BMW ,X3 xDrive28i (2.0T),Luxury,SUV,4wd,,Small,3
28,BMW ,X5 35i,Luxury,SUV,4wd,,Large,5
29,BMW ,X6,Luxury,SUV,4wd,,Very Large,6
30,BMW ,Z4 sDrive28i,Luxury,Sports,RWD,,Mini,2
31,Buick ,Enclave,High,SUV,4wd,,Large,5
32,Cadillac ,ATS (turbo),Luxury,Sedan,RWD,,Mini,2
33,Cadillac ,CTS (V6),Luxury,Sedan,RWD,,Family,4
34,Cadillac ,Escalade,Luxury,SUV,4wd,,Largest,7
35,Cadillac ,SRX,Luxury,SUV,4wd,,Large,5
36,Cadillac ,XTS,Luxury,Sedan,RWD,,Small,3
37,Chevrolet ,Camaro 2LT (V6),Standard,Sports,RWD,,Small,3
38,Chevrolet ,Colorado,Standard,Pickup,4wd,,Small,3
39,Chevrolet ,Corvette Z06,Standard,Sports,RWD,,Small,3
40,Chevrolet ,Cruze 1LT (1.4T),Standard,Sedan,Front,,Mini,2
41,Chevrolet ,Cruze Eco,Standard,Sedan,Front,,Mini,2
42,Chevrolet ,Cruze LS (1.8),Standard,Sedan,Front,,Mini,2
43,Chevrolet ,Equinox (4-cyl.),Standard,SUV,4wd,,Mini,2
44,Chevrolet ,Express,Standard,Commercial,RWD,,Largest,7
45,Chevrolet ,Impala,Standard,Sedan,Front,,Large,5
46,Chevrolet ,Malibu 1LT (2.5),Standard,Sedan,Front,,Small,3
47,Chevrolet ,Malibu Eco,Standard,Sedan,Front,Hybrid,Small,3
48,Chevrolet ,Silverado 1500 5.3 V8,Standard,Pickup,4wd,,Large,5
49,Chevrolet ,Sonic LTZ (1.4T),Standard,Sedan,Front,,Mini,2
50,Chevrolet ,Suburban,Standard,SUV,4wd,,Largest,7
51,Chevrolet ,Volt,Standard,Sedan,Front,Plugin Hybrid,Small,3
52,Chrysler ,200 (V6),Standard,Sedan,Front,,Small,3
53,Chrysler ,300 C,Standard,Sedan,RWD,,Large,5
54,Chrysler ,Town & Country,Standard,Minivan,Front,,Largest,7
55,Coda ,EV,Standard,Sedan,Front,EV Hybrid,Small,3
56,Ford ,C-MAX Hybrid SE,Standard,Sedan,Front,Hybrid,Mini,2
57,Ford ,Econoline,Standard,Commercial,Front,,Largest,7
58,Ford ,"Edge (FWD, 2.0 EcoBoost)",Standard,SUV,4wd,,Family,4
59,Ford ,Escape SE (1.6T),Standard,Sedan,4WD,,Small,3
60,Ford ,Explorer,Standard,SUV,4wd,,Large,5
61,Ford ,F-150 EcoBoost (V6),Standard,Pickup,4wd,,Largest,7
62,Ford ,Fiesta sedan SE,Standard,Sedan,Front,,Micro,1
63,Ford ,Flex SEL (V6),Standard,SUV,4wd,,Large,5
64,Ford ,Focus ST,Standard,Sedan,Front,,Mini,2
65,Ford ,Fusion SE (1.6T),Standard,Sedan,Front,,Small,3
66,Ford ,Fusion SE Hybrid,Standard,Sedan,Front,,Small,3
67,Ford ,Mustang GT Convertible,Standard,Convertible,RWD,,Small,3
68,Ford ,Mustang GT Coupe,Standard,Sports,RWD,,Small,3
69,Ford ,Taurus Limited (V6),Standard,Sedan,Front,,Large,5
70,Ford ,Transit Connect,Standard,Commercial,Front,,Largest,7
71,GMC ,Sierra 1500 (V8),Standard,Pickup,4wd,,Largest,7
72,GMC ,Yukon SLE,Standard,SUV,4wd,,Very Large,6
73,Honda ,Accord EX-L (V6),Standard,Sedan,Front,,Family,4
74,Honda ,CR-V EX,Standard,SUV,4wd,,Small,3
75,Honda ,Civic EX,Standard,Sedan,Front,,Mini,2
76,Honda ,Civic Hybrid,Standard,Sedan,Front,Hybrid,Mini,2
77,Honda ,Crosstour (V6),Standard,Wagon,AWD,,Large,5
78,Honda ,Fit Base,Standard,Sedan,Front,,Micro,1
79,Honda ,Insight EX,Standard,Sedan,Front,Hybrid,Small,3
80,Honda ,Odyssey,Standard,Minivan,Front,,Largest,7
81,Honda ,Pilot,Standard,SUV,4wd,,Large,5
82,Honda ,Ridgeline,Standard,Pickup,4wd,,Large,5
83,Infiniti ,EX,Luxury,Wagon,RWD,,Small,3
84,Infiniti ,FX (V6),Luxury,SUV,4wd,,Large,5
85,Infiniti ,G37 Journey,Luxury,Sedan,RWD,,Small,3
86,Infiniti ,JX 35,Luxury,SUV,4wd,,Large,5
87,Infiniti ,M35h,Luxury,Sedan,RWD,Hybrid,Large,5
88,Infiniti ,M37,Luxury,Sedan,RWD,,Large,5
89,Infiniti ,QX56,Luxury,SUV,4wd,,Largest,7
90,Jeep ,Cherokee,Standard,SUV,4wd,,Small,3
91,Jeep ,Compass,Standard,SUV,4wd,,Mini,2
92,Jeep ,Grand Cherokee Laredo (V6),High,SUV,4wd,,Family,4
93,Land ,Rover LR2,High,SUV,4wd,,Small,3
94,Land ,Rover LR4,High,SUV,4wd,,Family,4
95,Land ,Rover Range Rover,Luxury,SUV,4wd,,Large,5
96,Lexus ,CT 200h,Luxury,Sedan,Front,Hybrid,Mini,2
97,Lexus ,ES 300h,Luxury,Sedan,Front,Hybrid,Family,4
98,Lexus ,ES 350,Luxury,Sedan,Front,,Family,4
99,Lexus ,GS 350,Luxury,Sedan,RWD,,Family,4
100,Lexus ,GX 460,Luxury,SUV,4wd,,Largest,7
101,Lexus ,HS Hybrid,Luxury,Sedan,Front,Hybrid,Small,3
102,Lexus ,IS 250,Luxury,Sedan,Front,Hybrid,Small,3
103,Lexus ,LS 460L,Luxury,Sedan,RWD,Hybrid EV,Largest,7
104,Lexus ,LX,Luxury,SUV,4wd,Hybrid EV,Largest,7
105,Lexus ,RX 350,Luxury,SUV,4wd,Hybrid,Large,5
106,Lexus ,RX 450h,Luxury,SUV,AWD,Hybrid,Large,5
107,Lotus ,Evora,Luxury,Sports,RWD,Hybrid,Mini,2
108,Mazda,Mazda2 ,Standard,Sedan,Front,Hybrid,Micro,1
109,Mazda,Mazda 3 Mazdaspeed,Standard,Sedan,Front,Hybrid,Mini,2
110,Mazda ,CX-5 Sport (2.0L),Standard,SUV,4wd,,Small,3
111,Mazda ,CX-7,Standard,SUV,4wd,,Large,5
112,Mazda ,CX-9,Standard,SUV,4wd,,Very Large,6
113,Mazda ,MX-5 Miata,Standard,Convertible,RWD,,Micro,1
114,Mercedes-Benz ,C250,Luxury,Sedan,RWD,,Small,3
115,Mercedes-Benz ,CL,Luxury,Sedan,RWD,,Largest,7
116,Mercedes-Benz ,CLA,Luxury,Sedan,RWD,,Small,3
117,Mercedes-Benz ,CLS,Luxury,Sedan,RWD,,Family,4
118,Mercedes-Benz ,E350 BlueTec,Luxury,Sedan,RWD,,Large,5
119,Mercedes-Benz ,GLK350,Luxury,SUV,4wd,,Small,3
120,Mercedes-Benz ,ML350,Luxury,SUV,4wd,,Large,5
121,Mercedes-Benz ,R-Class,Luxury,Wagon,RWD,,Very Large,6
122,Mercedes-Benz ,S550,Luxury,SUV,4wd,,Largest,7
123,Mercedes-Benz ,SL,Luxury,SUV,4wd,,Mini,2
124,Mercedes-Benz ,SLK250,Luxury,SUV,4wd,,Mini,2
125,Mercedes-Benz ,Sprinter,Commercial,Commercial,,,Largest,7
126,Mini ,Cooper Countryman S,High,Sedan,,,Micro,1
127,Nissan ,Altima 2.5 S (4-cyl.),Standard,Sedan,Front,,Family,4
128,Nissan ,Armada,Standard,SUV,4wd,,Largest,7
129,Nissan ,Frontier (V6),Standard,Pickup,4wd,,Small,3
130,Nissan ,GT-R,Standard,Super Sport,RWD,,Mini,2
131,Nissan ,Juke,Standard,SUV,4wd,,Mini,2
132,Nissan ,Leaf,Standard,Sedan,Front,Hybrid EV,Small,3
133,Nissan ,Maxima,Standard,Sedan,Front,Hybrid EV,Large,5
134,Nissan ,Murano,Standard,SUV,4wd,Hybrid,Family,4
135,Nissan ,NV,Standard,Commercial,RWD,Hybrid,Largest,7
136,Nissan ,Pathfinder SL,Standard,SUV,4wd,Hybrid,Family,4
137,Nissan ,Quest,Standard,Minivan,Front,Hybrid,Largest,7
138,Nissan ,Sentra SV,Standard,Sedan,Front,Hybrid,Mini,2
139,Nissan ,Titan SV,Standard,SUV,4wd,,Largest,7
140,Nissan ,Versa Sedan SV (1.6),Standard,Sedan,Front,,Micro,1
141,Nissan ,370Z,Standard,Sports,RWD,,Mini,2
142,Porsche ,911,Luxury,Super Sport,RWD,,Mini,2
143,Porsche ,Boxster (Base),Luxury,Sports,RWD,,Mini,2
144,Porsche ,Cayenne (V6),Luxury,SUV,4wd,,Large,5
145,Porsche ,Cayman,Luxury,Sports,RWD,,Mini,2
146,Porsche ,Panamera S,Luxury,Sedan,RWD,,Largest,7
147,Smart ,ForTwo,Standard,Mini,Front,,Micro,1
148,Subaru ,BRZ,Standard,Sports,RWD,,Mini,2
149,Subaru ,Forester 2.5 X Premium,Standard,SUV,4wd,,Small,3
150,Subaru ,Impreza WRX,Standard,Sports,AWD,,Small,3
151,Subaru ,Impreza STi,Standard,Sports,AWD,,Small,3
152,Subaru ,Impreza Premium,Standard,Sedan,AWD,,Small,3
153,Subaru ,Legacy 2.5i Premium,Standard,Sedan,AWD,,Family,4
154,Subaru ,Outback 2.5i Limited,Standard,Wagon,AWD,,Family,4
155,Subaru ,Tribeca,Standard,SUV,4wd,,Large,5
156,Subaru ,XV Crosstrek,Standard,Wagon,AWD,,Mini,2
157,Tesla ,Model S,Standard,Sedan,Front,Hybrid EV,Small,3
158,Toyota ,4Runner SR5,Standard,SUV,4wd,,Family,4
159,Toyota ,Avalon Limited (V6),High,Sedan,,,Large,5
160,Toyota ,Camry Hybrid XLE,Standard,Sedan,Front,Hybrid,Family,4
161,Toyota ,Camry LE (4-cyl.),Standard,Sedan,Front,,Family,4
162,Toyota ,Corolla LE,Standard,Sedan,Front,,Mini,2
163,Toyota ,FJ Cruiser,Standard,SUV,4wd,,Small,3
164,Toyota ,Highlander Hybrid,Standard,SUV,AWD,Hybrid,Family,4
165,Toyota ,Highlander (V6),Standard,SUV,4wd,,Family,4
166,Toyota ,Land Cruiser,Luxury,SUV,4wd,,Largest,7
167,Toyota ,Prius,Standard,Sedan,Front,Hybrid,Small,3
168,Toyota ,Prius C Two,Standard,Sedan,Front,Hybrid,Mini,2
169,Toyota ,Prius V Three,Standard,Sedan,Front,Hybrid,Family,4
170,Toyota ,RAV4 XLE,Standard,SUV,4wd,,Small,3
171,Toyota ,Sequoia 5.7 (V8),Standard,Pickup,4wd,,Largest,7
172,Toyota ,Sienna XLE (FWD),Standard,Minivan,Front,,Largest,7
173,Toyota ,Tacoma (V6),Standard,Pickup,4wd,,Family,4
174,Toyota ,Tundra 5.7 (V8),Standard,Pickup,4wd,,Largest,7
175,Volkswagen ,CC (2.0T),High,Sedan,,,Large,5
176,Volkswagen ,Eos,Standard,Convertible,Front,,Mini,2
177,Volkswagen ,GTI,Standard,Sports,Front,,Small,3
178,Volkswagen ,Golf 2.5,Standard,Sedan,Front,,Small,3
179,Volkswagen ,Jetta GLI,Standard,Sedan,Front,,Small,3
180,Volkswagen ,Passat SE (5-cyl.),Standard,Sedan,Front,,Family,4
181,Volkswagen ,Touareg TDI,High,SUV,4wd,,Large,5
