[
https://issues.apache.org/jira/browse/SOLR-1103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748044#action_12748044
]
Hoss Man commented on SOLR-1103:
--------------------------------
There were three possible fixes depending on what people thought the correct
behavior should be.
i don't have the code in front of me, but as i recall they were all trivial...
1) add a comment
2) change a for loop to start at 1 instead of 2
3) change getPowerOfTwoBucket to have something like...
{code}
return result < 2 ? 2 : result;
{code}
I think Ryan wrote this code originally: Ryan, do you have any recollection as
to what the orriginal intent was with the first bucket?
> LukeRequestHandler histogram excludes freq of 1
> -----------------------------------------------
>
> Key: SOLR-1103
> URL: https://issues.apache.org/jira/browse/SOLR-1103
> Project: Solr
> Issue Type: Bug
> Reporter: Hoss Man
> Priority: Minor
> Fix For: 1.4
>
>
> the TermHistogram class in the LukeRequestHandler seems to properly count the
> occurances of terms with a freq of "1", but then when converting to a
> NamedLIst begins iterating at bucket "2" so the counts for freq of "1" don't
> appear in the result.
> this may have been a conscious choice to eliminate superfluously high values
> for terms with a freq of one ... or it may have been a mistake assuming freq
> values of 1 would fall in the "2" bucket.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.