[ https://issues.apache.org/jira/browse/SOLR-9193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joel Bernstein updated SOLR-9193: --------------------------------- Summary: Add the scoreNodes Streaming Expression (was: Add the nodeRank Streaming Expression) > Add the scoreNodes Streaming Expression > --------------------------------------- > > Key: SOLR-9193 > URL: https://issues.apache.org/jira/browse/SOLR-9193 > Project: Solr > Issue Type: New Feature > Reporter: Joel Bernstein > Assignee: Joel Bernstein > > The scoreNodes Streaming Expression is another *GraphExpression*. It will > decorate a gatherNodes expression and us a tf-idf scoring algorithm to score > the nodes. > The gatherNodes expression only gathers nodes and aggregations. This is > similar in nature to tf in search ranking, where the number of times a node > appears in the traversal represents the tf. But this skews recommendations > towards nodes that appear frequently in the index. > Using the idf for each node we can score each node as a function of tf and > idf. This will provide a boost to nodes that appear less frequently in the > index. > The scoreNodes expression will gather the idf's from the shards for each node > emitted by the underlying gatherNodes expression. It will then assign the > score to each node. The score will be added to each node in the "nscore" > field. The underlying gatherNodes expression will perform the aggregation > providing the tf. > proposed syntax: > {code} > top(n="5", > sort="nscore desc", > scoreNodes(gatherNodes(...))) > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org