[jira] [Commented] (NUTCH-809) Parse-metatags plugin

2011-07-07 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061177#comment-13061177 ] Markus Jelsma commented on NUTCH-809: - Why don't we include this plugin?

[jira] [Updated] (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a ?

2011-07-07 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-797: Fix Version/s: 2.0 1.4 Back on radar: has this ever been committed at all?

[jira] [Updated] (NUTCH-925) plugins stored in weakhashmap lead memory leak

2011-07-07 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-925: Fix Version/s: 1.4 This has been fixed for 2.0 in NUTCH-844 but not in 1.x. plugins stored in

[jira] [Commented] (NUTCH-783) IndexerChecker Utilty

2011-07-07 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061222#comment-13061222 ] Markus Jelsma commented on NUTCH-783: - Hey, this code is not compatible with Nutch API

[jira] [Issue Comment Edited] (NUTCH-783) IndexerChecker Utilty

2011-07-07 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061222#comment-13061222 ] Markus Jelsma edited comment on NUTCH-783 at 7/7/11 11:43 AM: --

Upgrade libs to support Hadoop 0.20.203 and 0.21

2011-07-07 Thread Markus Jelsma
Hi, To support Hadoop 0.20 in Nutch we should upgrade our Ivy configuration for Hadoop. Newer versions need Jackson and Avro. We can include Avro and Jackson, as both are available under the ASL 2.0. Thoughts? Cheers, -- Markus Jelsma - CTO - Openindex

[jira] [Commented] (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a ?

2011-07-07 Thread Robert Hohman (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061280#comment-13061280 ] Robert Hohman commented on NUTCH-797: - Hi markus - I am not sure if the committers

[jira] [Commented] (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a ?

2011-07-07 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061283#comment-13061283 ] Markus Jelsma commented on NUTCH-797: - We'll look into it. Thanks for reporting.

[jira] [Commented] (NUTCH-925) plugins stored in weakhashmap lead memory leak

2011-07-07 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061291#comment-13061291 ] Julien Nioche commented on NUTCH-925: - This should have been part of the batch of

[jira] [Resolved] (NUTCH-925) plugins stored in weakhashmap lead memory leak

2011-07-07 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-925. - Resolution: Duplicate I've checked the PluginRepository diff of NUTCH-844 and compared with 1.4.

[Nutch Wiki] Trivial Update of Archive and Legacy by LewisJohnMcgibbney

2011-07-07 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The Archive and Legacy page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/Archive%20and%20Legacy?action=diffrev1=12rev2=13 == Archive and Legacy == This

Rebuilding site

2011-07-07 Thread lewis john mcgibbney
Hi, As I am back home I propose to rebuild the site to link the current tutorial link to the new 1.3 tutorial on the wiki. I would also like to formally make my first commit by adding my name to the list of committers before I progress with other bits and pieces. Julien, I managed to pick out

Re: Rebuilding site

2011-07-07 Thread Julien Nioche
Hi Lewis, As I am back home I propose to rebuild the site to link the current tutorial link to the new 1.3 tutorial on the wiki. I would also like to formally make my first commit by adding my name to the list of committers before I progress with other bits and pieces. Good idea! See

Re: Rebuilding site

2011-07-07 Thread lewis john mcgibbney
Thanks Julien, I didn't even see this ticket. I'm on it. One further question, it would be interesting to unearth why people are subscribing to the nutch-user@ list. I am aware that this was the old list when Nutch was a subpart of Lucene. There is a heavily weighted tendency for people to cross

Build failed in Jenkins: Nutch-trunk #1539

2011-07-07 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-trunk/1539/ -- [...truncated 985 lines...] A src/plugin/subcollection/src/java/org/apache/nutch/collection A src/plugin/subcollection/src/java/org/apache/nutch/collection/Subcollection.java A