[ https://issues.apache.org/jira/browse/NUTCH-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel updated NUTCH-2249:
-----------------------------------
    Fix Version/s:     (was: 1.14)
                   1.15

> WordNet Integration for Cosine Similarity
> -----------------------------------------
>
>                 Key: NUTCH-2249
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2249
>             Project: Nutch
>          Issue Type: New Feature
>          Components: plugin, scoring
>            Reporter: Bhavya Sanghavi
>            Assignee: Sujen Shah
>            Priority: Minor
>              Labels: memex
>             Fix For: 1.15
>
>
> Integrated WordNet database to enhance the cosine similarity plugin.
> This helps in reducing the size of the vectors for calculating the cosine
> similarity by mapping the synonymous words to the same entry in the vector.
> Consequently, it would increase the accuracy of the scores given to the
> webpages to be crawled.
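
The idea in the description (synonymous words sharing one vector entry) can be illustrated with a minimal sketch. This is not the plugin's code: the synonym table below is a small hand-written stand-in for a WordNet lookup, and the names `SYNONYM_TO_CANONICAL`, `canonical`, `term_vector`, and `cosine_similarity` are hypothetical, chosen only for this example.

```python
import math
from collections import Counter

# ASSUMPTION: a tiny hand-written table standing in for a WordNet lookup.
# Each word maps to one canonical representative of its synonym set.
SYNONYM_TO_CANONICAL = {
    "car": "car",
    "automobile": "car",
    "auto": "car",
    "fast": "fast",
    "quick": "fast",
    "rapid": "fast",
}


def canonical(word):
    """Return the shared vector entry for a word (the word itself if unknown)."""
    lowered = word.lower()
    return SYNONYM_TO_CANONICAL.get(lowered, lowered)


def term_vector(text, use_synonyms=True):
    """Build a sparse term-frequency vector, optionally merging synonyms."""
    words = text.split()
    if use_synonyms:
        return Counter(canonical(w) for w in words)
    return Counter(w.lower() for w in words)


def cosine_similarity(a, b):
    """Cosine similarity of two sparse term-frequency vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


page = "fast car"
goal = "rapid automobile"

# Without synonym mapping the two texts share no terms.
plain_page, plain_goal = term_vector(page, False), term_vector(goal, False)
plain_score = cosine_similarity(plain_page, plain_goal)
plain_dimensions = len(set(plain_page) | set(plain_goal))

# With synonym mapping both texts collapse onto the same two entries.
merged_page, merged_goal = term_vector(page), term_vector(goal)
merged_score = cosine_similarity(merged_page, merged_goal)
merged_dimensions = len(set(merged_page) | set(merged_goal))
```

In this toy case the combined vocabulary shrinks from four entries to two, and the similarity rises from zero to approximately one, because the two texts differ only in their choice of synonyms. Whether scores improve on real crawl data depends on the quality of the synonym mapping, for example on how ambiguous words are handled.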