[jira] [Commented] (NUTCH-251) Administration GUI

2011-09-16 Thread hadi (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13105882#comment-13105882 ] hadi commented on NUTCH-251: Sorry, I don't understand. Would you mind giving me a little more det

[jira] [Closed] (NUTCH-1112) off-by-one error in protocol-httpclient; truncates up to HttpBase.BUFFER_SIZE content

2011-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche closed NUTCH-1112. Resolution: Duplicate https://issues.apache.org/jira/browse/NUTCH-1089 already fixed this. Thanks f

[jira] [Commented] (NUTCH-251) Administration GUI

2011-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13105971#comment-13105971 ] Lewis John McGibbney commented on NUTCH-251: Hadi, this issue has been closed a

[jira] [Commented] (NUTCH-1113) Merging segments causes URLs to vanish from crawldb/index?

2011-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13105972#comment-13105972 ] Lewis John McGibbney commented on NUTCH-1113: - We have a pretty meaty JUnit te

Build failed in Jenkins: Nutch-branch-1.4 #5

2011-09-16 Thread Apache Jenkins Server
See -- [...truncated 5625 lines...] compile: [echo] Compiling plugin: lib-regex-filter [javac] :117: warning: 'inclu

[jira] [Commented] (NUTCH-1005) Index headings plugin

2011-09-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13105994#comment-13105994 ] Markus Jelsma commented on NUTCH-1005: -- {quote} you are right. I'd read your comments

Build failed in Jenkins: Nutch-branch-1.4 #6

2011-09-16 Thread Apache Jenkins Server
See -- [...truncated 1056 lines...] A src/plugin/urlnormalizer-pass/src/java/org/apache A src/plugin/urlnormalizer-pass/src/java/org/apache/nutch A src/plugin/urlnormalizer-pass/src/

Jenkins build is back to normal : Nutch-branch-1.4 #7

2011-09-16 Thread Apache Jenkins Server
See

[jira] [Resolved] (NUTCH-1067) Configure minimum throughput for fetcher

2011-09-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1067. -- Resolution: Fixed Fixed again. > Configure minimum throughput for fetcher > --

Re: Jenkins build is back to normal : Nutch-branch-1.4 #7

2011-09-16 Thread lewis john mcgibbney
Branch 1.4 build set up and 'should' be running successfully from now on. This will also auto update any JIRA issues which have been committed with some Jenkins commentary. At least we can keep an open eye on a nightly build from now on. Thanks On Fri, Sep 16, 2011 at 12:11 PM, Apache Jenkins Server <

Re: Jenkins build is back to normal : Nutch-branch-1.4 #7

2011-09-16 Thread Markus Jelsma
Thanks! On Friday 16 September 2011 13:20:09 lewis john mcgibbney wrote: > Branch 1.4 build set up and 'should' be running successfully from now on. > This will also auto update any JIRA issues which have been committed with > some Jenkins commentary. > > At least we can keep an open eye on a nightly b

[jira] [Updated] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-09-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1052: - Priority: Major (was: Minor) Patch Info: [Patch Available] Assignee: Julien Nioche (

[jira] [Updated] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1078: Affects Version/s: (was: 2.0) Fix Version/s: (was: 2.0) any comment

[jira] [Commented] (NUTCH-623) Change plugin source directory "languageidentifier" to "language-identifier"

2011-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13106056#comment-13106056 ] Lewis John McGibbney commented on NUTCH-623: any comments on testing or the sou

[jira] [Commented] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13106060#comment-13106060 ] Markus Jelsma commented on NUTCH-1078: -- Ah, the org.apache.nutch.tools.Benchmark fail

[jira] [Commented] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13106068#comment-13106068 ] Lewis John McGibbney commented on NUTCH-1078: - thank you Markus. I noticed tha

Re: Jenkins build is back to normal : Nutch-branch-1.4 #7

2011-09-16 Thread Mattmann, Chris A (388J)
Yep, hear, hear. Great work Lewis! Cheers, Chris On Sep 16, 2011, at 4:24 AM, Markus Jelsma wrote: > thanks! > > On Friday 16 September 2011 13:20:09 lewis john mcgibbney wrote: >> Branch 1.4 build set up and 'should' be running successfully from now on. >> This will also auto update any JIRA iss

Re: Jenkins build is back to normal : Nutch-branch-1.4 #7

2011-09-16 Thread Julien Nioche
Thanks Lewis, that's great! On 16 September 2011 12:20, lewis john mcgibbney wrote: > Branch 1.4 build set up and 'should' be running successfully from now on. > This will also auto update any JIRA issues which have been committed with > some Jenkins commentary. > > At least we can keep an open eye on

[jira] [Updated] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1078: Attachment: NUTCH-1078-branch-1.4-20110916-v4.patch patch fixes the error logging

[jira] [Issue Comment Edited] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-16 Thread Lewis John McGibbney (JIRA)
Priority: Minor > Fix For: 1.4 > > Attachments: NUTCH-1078-branch-1.4-20110816.patch, > NUTCH-1078-branch-1.4-20110824-v2.patch, > NUTCH-1078-branch-1.4-20110911-v3.patch, > NUTCH-1078-branch-1.4-20110916-v4.patch > > > Whilst working on another issu

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-16 Thread Julien Nioche
Am happy to call for a vote on the future of Nutch 2.0 if you want. Shall we reduce the various options described before to a single one? Julien On 15 September 2011 19:55, Markus Jelsma wrote: > > > Hi Guys, > > > > I thought I'd chime in on this thread. My comments below: > > > I understand an

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-16 Thread lewis john mcgibbney
Hi Julien, I didn't want to jump ship with this one, but it seems that the binding community has already spoken their mind, and I for one shadow your suggestion. It's clear that trunk as it currently exists is not bleeding edge; there have been too many broken fronts to launch a concentrated code

[Nutch Wiki] Trivial Update of "Website_Update_HOWTO" by LewisJohnMcgibbney

2011-09-16 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "Website_Update_HOWTO" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/Website_Update_HOWTO?action=diff&rev1=10&rev2=11 1. Go to {{{forrest}}}. It i

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-16 Thread Markus Jelsma
Option B) Shelve trunk in a branch and promote 1.4 to trunk. We can always choose to hardwire HBASE (option D) later. Markus > Am happy to call for a vote on the future of Nutch 2.0 if you want. Shall > we reduce the various options described before to a single one? > > Julien > > On 15 Septem

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-16 Thread Mattmann, Chris A (388J)
Why don't we just collect VOTEs for each of the options a-e, and then figure out based on that whether there is a majority? If there's no majority, we can whittle it down to, say, the top 2-3, and then VOTE on those, looking for a majority again. Cheers, Chris On Sep 16, 2011, at 11:44 AM, Markus Jelsma

Build failed in Jenkins: Nutch-trunk #1606

2011-09-16 Thread Apache Jenkins Server
See -- Started by timer Building remotely on solaris1 hudson.util.IOException2: remote file operation failed: at hudson.remoting.Channel@6d3bcf40:solaris1