Facet Results Strange - Help

2007-04-27 Thread realw5

Hello,
I'm running into some strange results for some facets of mine. Below you'll
see the XML returned from solr. I did a query using the standard request
handler. Notice the duplicated values returned (american standard, delta,
etc). There is actually quite a few of them. At first I though it may be
because of case sensitivity, but I since lower case everything going to
solr. 

Hopefully someone can chime in with some tips, thanks!

Dan

 
- 
- 
  0 
  4 
  
   
- 
   
- 
- 
  1560 
  197 
  181 
  83 
  56 
  45 
  40 
  18 
  18 
  15 
  14 
  13 
  12 
  10 
  8 
  8 
  7 
  7 
  6 
  5 
  4 
  3 
  3 
  2 
  2 
  2 
  1 
  1 
  1 
  1 
  1 
  1 
  1 
  1 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  0 
  
  
  
  
-- 
View this message in context: 
http://www.nabble.com/Facet-Results-Strange---Help-tf3658597.html#a1084
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Facet Results Strange - Help

2007-04-27 Thread Jennifer Seaman



Hopefully someone can chime in with some tips, thanks!


It's likely you have the facet category added more than once for one 
or more docs. Like this;


american standard
american standard

Are you adding the facet values on-the-fly? This happened to me and I 
solved it by removing the duplicate facet fields.


Regards,
Jennifer Seaman 



Re: Facet Results Strange - Help

2007-04-27 Thread Yonik Seeley

On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote:

Hello,
I'm running into some strange results for some facets of mine. Below you'll
see the XML returned from solr. I did a query using the standard request
handler. Notice the duplicated values returned (american standard, delta,
etc). There is actually quite a few of them. At first I though it may be
because of case sensitivity, but I since lower case everything going to
solr.

Hopefully someone can chime in with some tips, thanks!


What's the field definition for manufacturer_facet in your schema?  Is
it multi-valued or not?

Also, can you try the python response format (wt=python) as it outputs
only ASCII and escapes everything else... there is an off chance the
strings look the same but aren't.

-Yonik


Re: Facet Results Strange - Help

2007-04-27 Thread realw5

I have a dynamic field setup for facets. It looks like this:

 

I do this, because we add facets quite often, so having to modify the schema
every time would be unfeasible.

I'm currently reindexing from scratch, so I cannot try wt=python for little
bit longer. Once it's done indexing I'll give that a go and see if I notice
anything.

Dan


Yonik Seeley wrote:
> 
> On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote:
>> Hello,
>> I'm running into some strange results for some facets of mine. Below
>> you'll
>> see the XML returned from solr. I did a query using the standard request
>> handler. Notice the duplicated values returned (american standard, delta,
>> etc). There is actually quite a few of them. At first I though it may be
>> because of case sensitivity, but I since lower case everything going to
>> solr.
>>
>> Hopefully someone can chime in with some tips, thanks!
> 
> What's the field definition for manufacturer_facet in your schema?  Is
> it multi-valued or not?
> 
> Also, can you try the python response format (wt=python) as it outputs
> only ASCII and escapes everything else... there is an off chance the
> strings look the same but aren't.
> 
> -Yonik
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Facet-Results-Strange---Help-tf3658597.html#a10226359
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Facet Results Strange - Help

2007-04-27 Thread Yonik Seeley

On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote:

I have a dynamic field setup for facets. It looks like this:



I do this, because we add facets quite often, so having to modify the schema
every time would be unfeasible.

I'm currently reindexing from scratch, so I cannot try wt=python for little
bit longer. Once it's done indexing I'll give that a go and see if I notice
anything.


If it's really the same field value repeated, you've hit a bug.
If so, it would be helpful if you could open a JIRA bug, and anything
you can do to help us reproduce the problem would be appreciated.

-Yonik


Re: Facet Results Strange - Help

2007-04-27 Thread realw5

Ok, I just finished indexing about 20k in documents. I took a look at so far
the problem has not appearred again. What I'm thinking caused it was I was
not adding overwritePending & overwriteCommited in the add process. Therefor
over time as data was being cleaned up, it was just appending to the
existing data.

I did have once cause of repeated values, but after looking at the python
writer, I notice a space at the end. I can fix this issue by triming all my
values before sening them to solr :-) 

I'm going to continue indexing, and if the problem popups up once fully
indexed I'll post back again. Otherwise thanks for the quick replies!

Dan


Yonik Seeley wrote:
> 
> On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote:
>> I have a dynamic field setup for facets. It looks like this:
>>
>> > multiValued="true" />
>>
>> I do this, because we add facets quite often, so having to modify the
>> schema
>> every time would be unfeasible.
>>
>> I'm currently reindexing from scratch, so I cannot try wt=python for
>> little
>> bit longer. Once it's done indexing I'll give that a go and see if I
>> notice
>> anything.
> 
> If it's really the same field value repeated, you've hit a bug.
> If so, it would be helpful if you could open a JIRA bug, and anything
> you can do to help us reproduce the problem would be appreciated.
> 
> -Yonik
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Facet-Results-Strange---Help-tf3658597.html#a10226731
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Facet Results Strange - Help

2007-04-27 Thread Yonik Seeley

On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote:

Ok, I just finished indexing about 20k in documents. I took a look at so far
the problem has not appearred again. What I'm thinking caused it was I was
not adding overwritePending & overwriteCommited in the add process. Therefor
over time as data was being cleaned up, it was just appending to the
existing data.


That is the default anyway.  Even if duplicate documents were somehow
added, that should not cause duplicates in facet results.  It should
be impossible to get duplicate values from facet.field, regardless of
what the index looks like.


I did have once cause of repeated values, but after looking at the python
writer, I notice a space at the end. I can fix this issue by triming all my
values before sening them to solr :-)


Hopefully you should have also seen the space in the XML response...
if it's not there, that would be a bug.

-Yonik


Re: Facet Results Strange - Help

2007-04-27 Thread Chris Hostetter

: It's likely you have the facet category added more than once for one
: or more docs. Like this;
:
: american standard
: american standard
:
: Are you adding the facet values on-the-fly? This happened to me and I
: solved it by removing the duplicate facet fields.

that's really odd ... i can't think of any way that exactly duplicate
field values would be counted twice in the current facet.field code.

I just tested this using the exampledocs by adding "electronics" to the
cat field of some docs multiple times, and i couldn't reproduce this
behavior.

can you elaborate more on how to trigger it?


-Hoss



Re: Facet Results Strange - Help

2007-04-27 Thread Chris Hostetter
: writer, I notice a space at the end. I can fix this issue by triming all my
: values before sening them to solr :-)

The built in Field Faceting works on the indexed values, so Solr can solve
this for you if you use something like this for your facet field type...

   
 
  
  
  
  
  
  
 
   



-Hoss