Re: [DISCUSS] Nutch 1.7 ready for release?

2013-06-10 Thread Julien Nioche
+1 to release now, but it would have been nice to do https://issues.apache.org/jira/browse/NUTCH-1527 as part of the same release. The main change introduced in this version is the pluggable indexer, and having a first working version for ES would be a good illustration of how useful this feature is.

[jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed

2013-06-10 Thread Iwan Luijks (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13679386#comment-13679386 ] Iwan Luijks commented on NUTCH-585: --- Hi [~veggen], Did you succeed making a patch for fi

[jira] [Updated] (NUTCH-1522) Upgrade to Tika 1.3

2013-06-10 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1522: - Fix Version/s: 1.7 > Upgrade to Tika 1.3 > --- > > Key: NUTCH

Re: [DISCUSS] Nutch 1.7 ready for release?

2013-06-10 Thread Julien Nioche
Have added the upgrade to Tika 1.3 to v1.7 https://issues.apache.org/jira/browse/NUTCH-1522. It should be quite straightforward to include and would be a shame not to do it for this release. Thoughts? On 10 June 2013 08:48, Julien Nioche wrote: > +1 to release now but it would have been nice to

[jira] [Resolved] (NUTCH-1522) Upgrade to Tika 1.3

2013-06-10 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-1522. -- Resolution: Fixed Trunk : Committed revision 1491420. 2-x : Committed revision 1491421.

Re: [DISCUSS] Nutch 1.7 ready for release?

2013-06-10 Thread Julien Nioche
Have just committed NUTCH-1522 for both 2-x and trunk On 10 June 2013 12:07, Julien Nioche wrote: > Have added the upgrade to Tika 1.3 to v1.7 > https://issues.apache.org/jira/browse/NUTCH-1522. It should be quite > straightforward to include and would be a shame not to do it for this > release

[jira] [Commented] (NUTCH-1522) Upgrade to Tika 1.3

2013-06-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13679458#comment-13679458 ] Hudson commented on NUTCH-1522: --- Integrated in Nutch-nutchgora #640 (See [https://builds.ap

[jira] [Commented] (NUTCH-1522) Upgrade to Tika 1.3

2013-06-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13679457#comment-13679457 ] Hudson commented on NUTCH-1522: --- Integrated in Nutch-trunk #2234 (See [https://builds.apach

[jira] [Updated] (NUTCH-911) recrawls file protocol causes Errors/Exceptions when actually not modified or gone

2013-06-10 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-911: -- Component/s: protocol > recrawls file protocol causes Errors/Exceptions when actually not mo

[jira] [Updated] (NUTCH-911) recrawls file protocol causes Errors/Exceptions when actually not modified or gone

2013-06-10 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-911: -- Attachment: NUTCH-911-trunk.patch Good catch! Cf.[[1|http://mail-archives.apache.org/mod_mbox/n

[jira] [Updated] (NUTCH-911) recrawls file protocol causes Errors/Exceptions when actually not modified or gone

2013-06-10 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-911: -- Fix Version/s: 2.3 Patch Info: Patch Available > recrawls file protocol causes Errors

Fwd: Nutch Compilation Error with Eclipse

2013-06-10 Thread Tejas Patil
Hi @nutch-dev, I want to publish this tutorial [0] on the Nutch wiki. 1. Do you see anything wrong in it, or any possible improvements? 2. Where do I upload the images? The wiki only lets me specify a URL. [0] : https://docs.google.com/document/d/1qvJwrZ9Sc0NAF9p3ie4uV7JsfCHxnrh9QF19HINw48c/edit?us