[jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12536102 ] Hudson commented on NUTCH-488: -- Integrated in Nutch-Nightly #241 (See [http://lucene.zones.apache.org:8080/hudson/job/Nu

Re: JIRA, Resolving and Closing Issues

2007-10-18 Thread Sami Siren
I think, for example, Lucene and Hadoop follow a different process, where issues are resolved after successful commits and (bulk) closed on release. Manual addition of the revision number is unnecessary when you add the issue number to the commit message; in that case JIRA will show this information (and some mor

Re: JIRA, Resolving and Closing Issues

2007-10-18 Thread Chris Mattmann
Dennis, My practice has been to do the following: 1. Resolve the issue, and describe (at a high level), the changes made to the code, e.g., *Introduced new classes A, B, C *Refactored method Y out of class D and into new class E *made internal method F of class G use member variable as a

JIRA, Resolving and Closing Issues

2007-10-18 Thread Dennis Kubes
Quick question about JIRA. When we commit, are we supposed to first resolve and then close the issue? What is the process on this? Dennis Kubes

[jira] Closed: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes closed NUTCH-488. -- Closed > Avoid parsing uneccessary links and get a more relevant outlink list > ---

[jira] Resolved: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-488. Resolution: Fixed Fix Version/s: 1.0.0 Assignee: Emmanuel Joke This has been committ

Re: Scoring API issues (LONG)

2007-10-18 Thread Andrzej Bialecki
Sami Siren wrote: Andrzej Bialecki wrote: Hi all, I've been working recently on a custom scoring plugin, and I found out some issues with the scoring API that severely limit the way we can calculate static page scores. I'd like to restart the discussion about this API, and propose some changes.

Re: Scoring API issues (LONG)

2007-10-18 Thread Sami Siren
Andrzej Bialecki wrote: > Hi all, > > I've been working recently on a custom scoring plugin, and I found out > some issues with the scoring API that severely limit the way we can > calculate static page scores. I'd like to restart the discussion about > this API, and propose some changes. Any comm

[jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter

2007-10-18 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535960 ] Dennis Kubes commented on NUTCH-565: Yeah, I didn't really know where to put this as it is a tool but it is also a

[jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter

2007-10-18 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535943 ] Andrzej Bialecki commented on NUTCH-565: - +1 overall. One question: shouldn't we put this under org.apache.n

Re: writing a new parse-exe plugin [NullPointerException]

2007-10-18 Thread eyal edri
hello again, I've added a printStackTrace to where the fetcher throws the exception:
java.lang.NullPointerException
    at org.apache.hadoop.io.Text.encode(Text.java:375)
    at org.apache.hadoop.io.Text.encode(Text.java:356)
    at org.apache.hadoop.io.Text.writeString(Text.java:396)

[jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535875 ] Andrzej Bialecki commented on NUTCH-488: - +1. The nutch-default description has a typo, instead of "coma" it

Re: writing a new parse-exe plugin

2007-10-18 Thread eyal edri
Found how to associate multiple contentTypes to a certain plugin: just add the content type to the conf/parse-plugins.xml file: (the plugin can take more than one type) On 10/18/07, eyal edri <[EMAIL PROTECTED]> wrote: > > Excellent !! :) > > that did the trick! > > Any cha

Re: writing a new parse-exe plugin

2007-10-18 Thread eyal edri
Excellent !! :) that did the trick! Any chance to create a new page on the plugin central for writing a nutch-0.9 plugin, stating the checklist (written below)? (I would have uploaded it, but I don't have the rights to open a new page.) The checklist: (relevant for a parse plugin, implementing the Pars

[jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Doğacan Güney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535855 ] Doğacan Güney commented on NUTCH-488: - I haven't tested it but the code looks good and clean. +1 from me. > Avoi

[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup.

2007-10-18 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535853 ] Dawid Weiss commented on NUTCH-567: --- Don't mention it. Happy birthday and I hope it'll work for you. If you take a