[ 
https://issues.apache.org/jira/browse/SOLR-877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12651598#action_12651598
 ] 

[EMAIL PROTECTED] edited comment on SOLR-877 at 11/28/08 8:45 AM:
-------------------------------------------------------------

Looks useful, esp for distributed idf when we get around to it.
A quick review:
- "terms.fl", fl normally stands for field list.  Would this make more sense as 
"terms.f"?
- strings are returned instead of integers for the term count (it's even more 
obvious in JSON output)
- how does one ask for "all terms after foo?"  the docs suggest that upper or 
rows *must* be set... is the only way to set a really high rows value? If so, 
allowing terms.rows=-1 for "unlimited" might be nicer.

Actually, this is very much like faceting.... perhaps we should use the same 
parameters:
terms.field
terms.offset
terms.limit
terms.mincount (future)
terms.sort (future)

If this is to be useful for distributed search, it needs to be able to handle 
the direct specifications of terms in multiple fields.  We necessarily need to 
implement this now, but we should think about having an output format that 
doesn't need to be deprecated when it is added.  At a minimum it seems like 
there should be an extra level.... something like
{code}
"terms" = {
  "myfield" = { "foo"=10, "bar"=5, ...}
}
{code}
Or if we want to get even more like faceting with the output:
{code}
"terms" = {
  "fields" = {
    "myfield" = {"foo"=10,...}
  }
}
{code}

- "terms.upr.incl" and "terms.lwr.incl" hurt the eyes a little since we already 
have "lower" and "upper" in the names of other terms - seems error prone (but 
this is a purely aesthetic thing).
- This is supposedly useful for auto suggest - how do I go about asking for all 
terms starting with "abc"?  Shouldn't there be a "terms.prefix" (just like 
faceting)?


      was (Author: [EMAIL PROTECTED]):
    Looks useful, esp for distributed idf when we get around to it.
A quick review:
- "terms.fl", fl normally stands for field list.  Would this make more sense as 
"terms.f"?
- strings are returned instead of integers for the term count (it's even more 
obvious in JSON output)
- how does one ask for "all terms after foo?"  the docs suggest that upper or 
rows *must* be set... is the only way to set a really high rows value? If so, 
allowing terms.rows=-1 for "unlimited" might be nicer.

Actually, this is very much like faceting.... perhaps we should use the same 
parameters:
terms.field
terms.offset
terms.limit
terms.mincount (future)
terms.sort (future)

If this is to be useful for distributed search, it needs to be able to handle 
the direct specifications of terms in multiple fields.  We necessarily need to 
implement this now, but we should think about having an output format that 
doesn't need to be deprecated when it is added.  At a minimum it seems like 
there should be an extra level.... something like
{code}
"terms" = {
  "myfield" = { "foo"=10, "bar"=5, ...}
}
{code}
Or if we want to get even more like faceting with the output:
{code}
"terms" = {
  "fields" = {
    "myfield" = {"foo"=10,...}
  }
}
{code}


  
> Access to Lucene's TermEnum capabilities
> ----------------------------------------
>
>                 Key: SOLR-877
>                 URL: https://issues.apache.org/jira/browse/SOLR-877
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 1.4
>
>         Attachments: SOLR-877.patch, SOLR-877.patch
>
>
> I wrote a simple SearchComponent on the plane the other day that gives access 
> to Lucene's TermEnum capabilities.  I think this will be useful for doing 
> auto-suggest and other term based operations.  My first draft is not 
> distributed, but it probably should be made to do so eventually. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to