[ 
https://issues.apache.org/jira/browse/SOLR-9193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joel Bernstein updated SOLR-9193:
---------------------------------
    Description: 
The nodeScore Streaming Expression is another GraphExpression. It will decorate 
a gatherNodes expression and use a tf-idf scoring algorithm to score the nodes.

The gatherNodes expression only gathers nodes and aggregations. This is similar 
in nature to tf in search ranking, where the number of times a node appears in 
the traversal represents the tf. But this skews recommendations towards nodes 
that appear frequently in the index.

Using the idf for each node we can score each node as a function of tf and idf. 
This will provide a boost to nodes that appear less frequently in the index. 

The nodeScore expression will gather the idf values from the shards for each 
node emitted by the underlying gatherNodes expression. It will then assign a 
score to each node. The score will be added to each node in the 
"nscore" field. The underlying gatherNodes expression will perform the 
aggregation providing the tf.

proposed syntax:
{code}
top(n="5",
      sort="nscore desc",
      nodeScore(gatherNodes(...))) 
{code}
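
To illustrate how tf and idf could combine into "nscore", here is a minimal Python sketch. It is not the Solr implementation. The issue does not specify the scoring formula, so the classic tf * log(numDocs / (docFreq + 1)) form is an assumption, as are the function names, the "node" and "count(*)" field names, and the sample numbers.

```python
import math


def node_scores(nodes, doc_freqs, num_docs):
    """Return copies of the nodes with an added 'nscore' field.

    nodes     -- dicts with 'node' (the id) and 'count(*)' (the tf, i.e. how
                 often the node appeared in the traversal); names are assumed
    doc_freqs -- hypothetical map of node id -> document frequency in the index
    num_docs  -- total number of documents in the index
    """
    scored = []
    for node in nodes:
        tf = node["count(*)"]
        df = doc_freqs[node["node"]]
        # Assumed idf form: nodes that are rare in the index get a larger idf.
        idf = math.log(num_docs / (df + 1))
        scored.append({**node, "nscore": tf * idf})
    return scored


def top(n, nodes):
    """Mimic top(n=..., sort="nscore desc", ...): the n best-scored nodes."""
    return sorted(nodes, key=lambda x: x["nscore"], reverse=True)[:n]


# Made-up sample data: 'popular' has the higher tf but is common in the index.
gathered = [
    {"node": "popular", "count(*)": 10},
    {"node": "rare", "count(*)": 6},
]
doc_freqs = {"popular": 5000, "rare": 10}

ranked = top(5, node_scores(gathered, doc_freqs, num_docs=10000))
```

With these sample numbers, "rare" ranks ahead of "popular" even though its tf is lower, which is the boost for infrequent nodes described above.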

  was:
The nodeRank Streaming Expression is another GraphExpression. It will decorate 
a gatherNodes expression and use a tf-idf ranking algorithm to rank the nodes to 
support recommendations.

The gatherNodes expression only gathers nodes and aggregations. This is similar 
in nature to tf in search ranking, where the number of times a node appears in 
the traversal represents the tf. But this skews recommendations towards nodes 
that appear frequently in the index.

Using the idf for each node we can score each node as a function of tf and idf. 
This will provide a boost to nodes that appear less frequently in the index. 

The nodeRank expression will gather the idf values from the shards for each node 
emitted by the underlying gatherNodes expression. It will then perform the 
ranking. The underlying gatherNodes expression will perform the aggregation 
providing the tf.


> Add the nodeRank Streaming Expression
> -------------------------------------
>
>                 Key: SOLR-9193
>                 URL: https://issues.apache.org/jira/browse/SOLR-9193
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Joel Bernstein
>            Assignee: Joel Bernstein
>
> The nodeScore Streaming Expression is another GraphExpression. It will 
> decorate a gatherNodes expression and use a tf-idf scoring algorithm to score 
> the nodes.
> The gatherNodes expression only gathers nodes and aggregations. This is 
> similar in nature to tf in search ranking, where the number of times a node 
> appears in the traversal represents the tf. But this skews recommendations 
> towards nodes that appear frequently in the index.
> Using the idf for each node we can score each node as a function of tf and 
> idf. This will provide a boost to nodes that appear less frequently in the 
> index. 
> The nodeScore expression will gather the idf values from the shards for each 
> node emitted by the underlying gatherNodes expression. It will then assign a 
> score to each node. The score will be added to each node in the 
> "nscore" field. The underlying gatherNodes expression will perform the 
> aggregation providing the tf.
> proposed syntax:
> {code}
> top(n="5",
>       sort="nscore desc",
>       nodeScore(gatherNodes(...))) 
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
