[ 
http://issues.apache.org/jira/browse/NUTCH-294?page=comments#action_12415094 ] 

Dawid Weiss commented on NUTCH-294:
-----------------------------------

Well, you certainly have something wrong in your configuration then. I just 
tried
with the head revision. My nutch-site looks like this:

[...]
<property>
  <name>plugin.includes</name>
  
<value>clustering-carrot2|protocol-http|urlfilter-regex|parse-(text|html|js)|index-basic|query-(basic|site|url)|summary-basic|scoring-opic</value>
  <description>Regular expression naming plugin directory names to
  [...]
  </description>
</property>
[...]

Start Tomcat and issue any query that returns results. Look in the log files 
for:

2006-06-07 09:29:35 org.apache.nutch.plugin.PluginRepository displayStatus
INFO:   Online Search Results Clustering using Carrot2's Lingo component 
(clustering-carrot2)

2006-06-07 09:29:35 org.apache.nutch.clustering.OnlineClustererFactory 
getOnlineClusterer
INFO: Using the first clustering extension found: Carrot2-Lingo

Ok, the results page should show a "clustering" option next to "Search" button 
(it does
on my installation). Select it and rerun the query. On the right side you'll 
have clusters
(titles and three sample documents from each cluster are shown).

As for your idea, I still don't think Lingo is what you need... Of course you 
can try feeding it with unrelated keywords and then see what comes out, but I 
don't think it's the right approach.


> Topic-maps of related searchwords
> ---------------------------------
>
>          Key: NUTCH-294
>          URL: http://issues.apache.org/jira/browse/NUTCH-294
>      Project: Nutch
>         Type: New Feature

>   Components: searcher
>     Reporter: Stefan Neufeind

>
> Would it be possible to offer a user  "topic-maps"? It's when you search for 
> something and get topic-related words that might also be of interest for you. 
> I wonder if that's somehow possible with the ngram-index for "did you mean" 
> (see separate feature-enhancement-bug for this), but we'd need to have a 
> relation between words (in what context do they occur).
> For the webfrontend usually trees are used  - which for some users offer 
> quite impressive eye-candy :-) E.g. see this advertisement by Novell where 
> I've just seen a similar "topic-map" as well:
> http://www.novell.com/de-de/company/advertising/defineyouropen.html

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira



_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to