Re: [ANNOUNCE] Apache Nutch v1.7 Released

2013-06-27 Thread Julien Nioche
Thanks Lewis for taking care of the release. Great stuff! Julien On 27 June 2013 00:38, Lewis John Mcgibbney lewis.mcgibb...@gmail.comwrote: N.B. Previous message doesn't seem to have been mod'd through under my @ apache.org address so resending ;) It has however been distributed to

[jira] [Commented] (NUTCH-1592) XPath works on documents parsed with parse-html but not parse-tika

2013-06-27 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13694564#comment-13694564 ] Julien Nioche commented on NUTCH-1592: -- Hi Seb That's a very plausible explanation.

[jira] [Commented] (NUTCH-1580) index-metadata returns object instead of value for index.static

2013-06-27 Thread Antoinette (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13694655#comment-13694655 ] Antoinette commented on NUTCH-1580: --- My System Administrator recompiled the patch and

[jira] [Updated] (NUTCH-1314) Impose a limit on the length of outlink target urls

2013-06-27 Thread Canan Girgin (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Canan Girgin updated NUTCH-1314: Attachment: NUTCH-1314-v3.patch Impose a limit on the length of outlink target urls

[jira] [Commented] (NUTCH-1314) Impose a limit on the length of outlink target urls

2013-06-27 Thread Canan Girgin (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13694784#comment-13694784 ] Canan Girgin commented on NUTCH-1314: - I tried to test NUTCH-1314-v2.patch. But it

[jira] [Updated] (NUTCH-1591) Incorrect conversion of ByteBuffer to String

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1591: Fix Version/s: (was: 2.3) 2.2.1 Incorrect conversion

[jira] [Resolved] (NUTCH-1591) Incorrect conversion of ByteBuffer to String

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1591. - Resolution: Fixed Committed @revision 1497447 in 2.x head Thank you v much for

[jira] [Resolved] (NUTCH-1578) Upgrade to Hadoop 1.2.0

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1578. - Resolution: Fixed Upgrade to Hadoop 1.2.0 ---

[jira] [Updated] (NUTCH-1578) Upgrade to Hadoop 1.2.0

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1578: Fix Version/s: (was: 2.3) 2.2.1 Upgrade to Hadoop

[jira] [Updated] (NUTCH-1522) Upgrade to Tika 1.3

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1522: Fix Version/s: (was: 2.4) 2.2.1 Upgrade to Tika 1.3

[jira] [Reopened] (NUTCH-1522) Upgrade to Tika 1.3

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1522: - Upgrade to Tika 1.3 --- Key: NUTCH-1522

[jira] [Reopened] (NUTCH-1420) Get rid of the dreaded �

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1420: - Get rid of the dreaded � Key:

[jira] [Reopened] (NUTCH-1578) Upgrade to Hadoop 1.2.0

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1578: - Upgrade to Hadoop 1.2.0 --- Key:

[jira] [Resolved] (NUTCH-1522) Upgrade to Tika 1.3

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1522. - Resolution: Fixed Upgrade to Tika 1.3 ---

[jira] [Updated] (NUTCH-1420) Get rid of the dreaded �

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1420: Fix Version/s: (was: 2.2) 2.2.1 Get rid of the dreaded

[jira] [Reopened] (NUTCH-1585) Ensure duplicate tags do not exist in microformat-reltag tag set.

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1585: - Ensure duplicate tags do not exist in microformat-reltag tag set.

[jira] [Updated] (NUTCH-1475) Index-More Plugin -- A better fall back value for date field

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1475: Fix Version/s: 2.2.1 Index-More Plugin -- A better fall back value for date

[jira] [Updated] (NUTCH-1585) Ensure duplicate tags do not exist in microformat-reltag tag set.

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1585: Fix Version/s: (was: 2.3) 2.2.1 Ensure duplicate tags

[jira] [Updated] (NUTCH-1126) JUnit test for urlfilter-prefix

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1126: Fix Version/s: (was: 2.3) 2.2.1 JUnit test for

[jira] [Updated] (NUTCH-1571) SolrInputSplit doesn't implement Writable and crawl script doesn't pass crawlId to generate and updatedb tasks

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1571: Fix Version/s: (was: 2.3) 2.2.1 SolrInputSplit doesn't

[jira] [Resolved] (NUTCH-1585) Ensure duplicate tags do not exist in microformat-reltag tag set.

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1585. - Resolution: Fixed Ensure duplicate tags do not exist in microformat-reltag

[jira] [Closed] (NUTCH-1585) Ensure duplicate tags do not exist in microformat-reltag tag set.

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1585. --- Ensure duplicate tags do not exist in microformat-reltag tag set.

[jira] [Closed] (NUTCH-1571) SolrInputSplit doesn't implement Writable and crawl script doesn't pass crawlId to generate and updatedb tasks

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1571. --- SolrInputSplit doesn't implement Writable and crawl script doesn't pass crawlId

[VOTE] Apache Nutch 2.2.1 RC#1

2013-06-27 Thread Lewis John Mcgibbney
Hi, It would be greatly appreciated if you could take some time to VOTE on the release candidate for the Apache Nutch 2.2.1 artifacts. This candidate is (amongst other things) a bug fix for NUTCH-1591 - Incorrect conversion of ByteBuffer to String. The big fix solved 8 issues:

[jira] [Commented] (NUTCH-1591) Incorrect conversion of ByteBuffer to String

2013-06-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13694932#comment-13694932 ] Hudson commented on NUTCH-1591: --- Integrated in Nutch-nutchgora #664 (See

RE: [VOTE] Apache Nutch 2.2.1 RC#1

2013-06-27 Thread Markus Jelsma
Looks fine Lewis! +1 -Original message- From: Lewis John Mcgibbneylewis.mcgibb...@gmail.com Sent: Thursday 27th June 2013 20:00 To: dev@nutch.apache.org; u...@nutch.apache.org Subject: [VOTE] Apache Nutch 2.2.1 RC#1 Hi, It would be greatly appreciated if you could take some time to VOTE

RE: [ANNOUNCE] Apache Nutch v1.7 Released

2013-06-27 Thread Markus Jelsma
Thanks again Lewis for the properly managing the release! Looking forward already to 1.8! Cheers -Original message- From: Lewis John Mcgibbneylewis.mcgibb...@gmail.com Sent: Thursday 27th June 2013 1:39 To: u...@nutch.apache.org; dev@nutch.apache.org Subject: [ANNOUNCE] Apache Nutch

Re: [VOTE] Apache Nutch 2.2.1 RC#1

2013-06-27 Thread Tejas Patil
+1 from me too On Thu, Jun 27, 2013 at 12:00 PM, Markus Jelsma markus.jel...@openindex.iowrote: Looks fine Lewis! +1 -Original message- From: Lewis John Mcgibbneylewis.mcgibb...@gmail.com Sent: Thursday 27th June 2013 20:00 To: dev@nutch.apache.org; u...@nutch.apache.org Subject:

[jira] [Updated] (NUTCH-1580) index-static returns object instead of value for index.static

2013-06-27 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1580: --- Summary: index-static returns object instead of value for index.static (was: index-metadata

[jira] [Resolved] (NUTCH-1580) index-static returns object instead of value for index.static

2013-06-27 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-1580. Resolution: Fixed Fix Version/s: (was: 1.9) 1.8 Committed to

[jira] [Updated] (NUTCH-1464) index-static plugin doesn't allow the colon within the field value

2013-06-27 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1464: --- Attachment: NUTCH-1464v3.patch patch v3: - added test for value containing colon - patch v2

[jira] [Commented] (NUTCH-1580) index-static returns object instead of value for index.static

2013-06-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13695063#comment-13695063 ] Hudson commented on NUTCH-1580: --- Integrated in Nutch-trunk #2258 (See

[jira] [Commented] (NUTCH-1464) index-static plugin doesn't allow the colon within the field value

2013-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13695087#comment-13695087 ] Lewis John McGibbney commented on NUTCH-1464: - Is there a chance that the