[jira] [Commented] (NUTCH-1283) Ridically update all Solr configuration in Nutchgora

2012-02-20 Thread Markus Jelsma (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211732#comment-13211732 ] Markus Jelsma commented on NUTCH-1283: -- 1.4 is the schema version of Solr 3.5. It is

[jira] [Resolved] (NUTCH-1280) language-identifier should have option to use detected value by Tika even when uncertain

2012-02-20 Thread Ferdy Galema (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema resolved NUTCH-1280. - Resolution: Fixed committed language-identifier should have option to use

[jira] [Commented] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory

2012-02-20 Thread Gabriele Kahlout (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211751#comment-13211751 ] Gabriele Kahlout commented on NUTCH-1001: - Hi Lewis, am quite disconnected from it

[jira] [Commented] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory

2012-02-20 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211779#comment-13211779 ] Lewis John McGibbney commented on NUTCH-1001: - Great :0)

[jira] [Updated] (NUTCH-1053) Parsing of RSS feeds fails

2012-02-20 Thread Michael Kazekin (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Kazekin updated NUTCH-1053: --- Attachment: nutch-1053.patch The problem is that the Feed's plugin.xml doesn't support

[jira] [Issue Comment Edited] (NUTCH-1053) Parsing of RSS feeds fails

2012-02-20 Thread Michael Kazekin (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211786#comment-13211786 ] Michael Kazekin edited comment on NUTCH-1053 at 2/20/12 11:42 AM:

[jira] [Updated] (NUTCH-1283) Radically update all Solr configuration in Nutchgora

2012-02-20 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1283: Summary: Radically update all Solr configuration in Nutchgora (was: Ridically

[jira] [Updated] (NUTCH-1285) Debian Packaging for Nutch

2012-02-20 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1285: Fix Version/s: 1.6 nutchgora Debian Packaging for Nutch

[jira] [Resolved] (NUTCH-1277) Fix [fallthrough] javac warnings

2012-02-20 Thread Lewis John McGibbney (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1277. - Resolution: Fixed Committed @ revision 1291278 in trunk Committed @ revision

[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents

2012-02-20 Thread Ferdy Galema (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211875#comment-13211875 ] Ferdy Galema commented on NUTCH-965: Hi Lewis, FYI: I'm currently looking into this

[jira] [Closed] (NUTCH-1277) Fix [fallthrough] javac warnings

2012-02-20 Thread Lewis John McGibbney (Closed) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1277. --- Fix [fallthrough] javac warnings

[jira] [Created] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)

2012-02-20 Thread Ferdy Galema (Created) (JIRA)
Refactoring/reimplementing crawling API (NutchApp) -- Key: NUTCH-1286 URL: https://issues.apache.org/jira/browse/NUTCH-1286 Project: Nutch Issue Type: Improvement Components:

Re: [DISCUSS] Nutchgora 2.0 release

2012-02-20 Thread Ferdy Galema
Hi, Aside from the licensing issue, the only thing I really see as a blocker or as something we need to deal with first is Nutch-1205 (upgrade Gora libs). What are we going to do with that one? About the Nutch API (webapp), my colleague and I have some ideas about how to improve it, in such as

Build failed in Jenkins: nutch-trunk-maven #158

2012-02-20 Thread Apache Jenkins Server
See https://builds.apache.org/job/nutch-trunk-maven/158/changes Changes: [lewismc] trivial commit to properly annotate fallthrough scenarios in break statements, hence supressing the warnings. -- [...truncated 1038 lines...] A

Re: Build failed in Jenkins: nutch-trunk-maven #158

2012-02-20 Thread Markus Jelsma
Lewis, Can you fix your latest commits on trunk? Thanks On Monday 20 February 2012 16:02:37 Apache Jenkins Server wrote: See https://builds.apache.org/job/nutch-trunk-maven/158/changes Changes: [lewismc] trivial commit to properly annotate fallthrough scenarios in break statements,

Re: Build failed in Jenkins: nutch-trunk-maven #158

2012-02-20 Thread Lewis John Mcgibbney
Done. Apologies guys.

[jira] [Reopened] (NUTCH-1277) Fix [fallthrough] javac warnings

2012-02-20 Thread Lewis John McGibbney (Reopened) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1277: - Frustratingly not fixed. Reopening Fix [fallthrough] javac

[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2-SNAPSHOT in ivy/ivy.xml

2012-02-20 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1205: Priority: Blocker (was: Minor) Upgrade gora modules to 0.2-SNAPSHOT in

[jira] [Created] (NUTCH-1287) Upgrade to hsqldb 2.2.8

2012-02-20 Thread Ferdy Galema (Created) (JIRA)
Upgrade to hsqldb 2.2.8 --- Key: NUTCH-1287 URL: https://issues.apache.org/jira/browse/NUTCH-1287 Project: Nutch Issue Type: Improvement Reporter: Ferdy Galema Priority: Trivial Fix

Re: [DISCUSS] Nutchgora 2.0 release

2012-02-20 Thread Lewis John Mcgibbney
Hi, Not ignoring Chris' comments, but addressing the points below first, please see comments. On Mon, Feb 20, 2012 at 2:57 PM, Ferdy Galema ferdy.gal...@kalooga.comwrote: Aside from the licensing issue, the only thing I really see as a blocker or as something we need to deal with first is

[jira] [Closed] (NUTCH-1287) Upgrade to hsqldb 2.2.8

2012-02-20 Thread Ferdy Galema (Closed) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema closed NUTCH-1287. --- Resolution: Fixed Committed. Tested with running ant builds/tests, it works fine.

Re: [DISCUSS] Nutchgora 2.0 release

2012-02-20 Thread Ferdy Galema
Thanks Lewis, that's a real useful link. Updated the jira. On Mon, Feb 20, 2012 at 5:01 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi, Not ignoring Chris' comments, but addressing the points below first, please see comments. On Mon, Feb 20, 2012 at 2:57 PM, Ferdy Galema

[jira] [Updated] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)

2012-02-20 Thread Ferdy Galema (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema updated NUTCH-1286: Useful wiki http://wiki.apache.org/nutch/NutchAdministrationUserInterface

Re: [DISCUSS] Nutchgora 2.0 release

2012-02-20 Thread Mattmann, Chris A (388J)
+1 guys. Just let me know when you are ready and I can RM it. Cheers, Chris On Feb 20, 2012, at 8:01 AM, Lewis John Mcgibbney wrote: Hi, Not ignoring Chris' comments, but addressing the points below first, please see comments. On Mon, Feb 20, 2012 at 2:57 PM, Ferdy Galema

[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2-SNAPSHOT in ivy/ivy.xml

2012-02-20 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1205: Attachment: NUTCH-1205-v5.patch NUTCH-1205-v5.patch This is

[jira] [Commented] (NUTCH-1287) Upgrade to hsqldb 2.2.8

2012-02-20 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13212351#comment-13212351 ] Hudson commented on NUTCH-1287: --- Integrated in Nutch-nutchgora #168 (See

[jira] [Commented] (NUTCH-1280) language-identifier should have option to use detected value by Tika even when uncertain

2012-02-20 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13212350#comment-13212350 ] Hudson commented on NUTCH-1280: --- Integrated in Nutch-nutchgora #168 (See

Build failed in Jenkins: nutch-trunk-maven #160

2012-02-20 Thread Apache Jenkins Server
See https://builds.apache.org/job/nutch-trunk-maven/160/ -- Started by timer Building remotely on ubuntu3 in workspace https://builds.apache.org/job/nutch-trunk-maven/ws/ hudson.util.IOException2: remote file operation failed: