Re: 1.5.1 release

2012-06-21 Thread Lewis John Mcgibbney
Hi Markus, On Thu, Jun 21, 2012 at 10:02 PM, Markus Jelsma wrote: > It's still not clear to me what 1.5.1 is going to look like. Will it be > current trunk incl. the script bugfix or just 1.5 plus the bugfix? I would > vote for the latter as it makes more sense for a bugfix release. I am easy

re: 1.5.1 release

2012-06-21 Thread Markus Jelsma
Hi, It's still not clear to me what 1.5.1 is going to look like. Will it be current trunk incl. the script bugfix or just 1.5 plus the bugfix? I would vote for the latter as it makes more sense for a bugfix release. There is another debate behind this, in my opinion, about freezing trunk prior

[jira] [Commented] (NUTCH-1407) BasicIndexingFilter to optionally add domain field

2012-06-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398841#comment-13398841 ] Lewis John McGibbney commented on NUTCH-1407: - Big +1. I've recently been look

Nutch 2.0 Press Announcement

2012-06-21 Thread Lewis John Mcgibbney
Good Evening Sally, First and foremost I hope you are keeping well and that the beginning of the summer has been kind to you... all the good weather still to come not to worry :0) The reason I contact you is that we (the Apache Nutch community) are nearly ready to release Nutch 2.0 which represen

[jira] [Commented] (NUTCH-1407) BasicIndexingFilter to optionally add domain field

2012-06-21 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398836#comment-13398836 ] Markus Jelsma commented on NUTCH-1407: -- We usually filter subscribers by host or a sm

Re: [jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified

2012-06-21 Thread Lewis John Mcgibbney
OK if we agree on this then I will certainly do this on Saturday. I'll also roll 2.0 #RC2 Are we agreed? Lewis On Thu, Jun 21, 2012 at 1:49 PM, Markus Jelsma wrote: > Hi, > > I would tend to think that creating a new tag or branch from 1.5 and manually > patching 1.5.1 with the bugfix. This m

[jira] [Commented] (NUTCH-1407) BasicIndexingFilter to optionally add domain field

2012-06-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398808#comment-13398808 ] Lewis John McGibbney commented on NUTCH-1407: - Markus this looks fine to me ho

[jira] [Commented] (NUTCH-1406) Metatags-index/-parse plugin: conversion to Solr date format and prevents parsing/indexing of empty tags

2012-06-21 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398420#comment-13398420 ] Julien Nioche commented on NUTCH-1406: -- bq. index-metatags plugin (sometimes also ref

[jira] [Created] (NUTCH-1407) BasicIndexingFilter to optionally add domain field

2012-06-21 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-1407: Summary: BasicIndexingFilter to optionally add domain field Key: NUTCH-1407 URL: https://issues.apache.org/jira/browse/NUTCH-1407 Project: Nutch Issue Type:

[jira] [Updated] (NUTCH-1407) BasicIndexingFilter to optionally add domain field

2012-06-21 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1407: - Attachment: NUTCH-1407-1.6-1.patch Patch for 1.6. Any comments? > BasicIndexingF

RE: [jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified

2012-06-21 Thread Markus Jelsma
Hi, I would tend to think that creating a new tag or branch from 1.5 and manually patching 1.5.1 with the bugfix. This means no merging is required and 1.5.1 will only contain the neccesary fixes and no changes/features or new bugs. Thanks -Original message- > From:Lewis John Mcgib

Re: [jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified

2012-06-21 Thread Lewis John Mcgibbney
Mmmm. I suppose I could merge the changes from trunk to branch-1.5, then tag it and produce the RC from that? wdyt? On Thu, Jun 21, 2012 at 12:59 PM, Markus Jelsma wrote: > Hi guys, > > You're about to release 1.5.1 from trunk? That doesn't make sense as it > already contains quite some change

[jira] [Updated] (NUTCH-1406) Metatags-index/-parse plugin: conversion to Solr date format and prevents parsing/indexing of empty tags

2012-06-21 Thread Kristof (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kristof updated NUTCH-1406: Description: This improvement to the index-metatags plugin (sometimes also refered to parse-metatags plugi

[jira] [Commented] (NUTCH-1406) Metatags-index/-parse plugin: conversion to Solr date format and prevents parsing/indexing of empty tags

2012-06-21 Thread Kristof (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398367#comment-13398367 ] Kristof commented on NUTCH-1406: - I found a way to, but it involved replacing MetadataInd

[jira] [Updated] (NUTCH-1406) Metatags-index/-parse plugin: conversion to Solr date format and prevents parsing/indexing of empty tags

2012-06-21 Thread Kristof (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kristof updated NUTCH-1406: Description: This improvement to the index-metatags plugin (sometimes also refered to parse-metatags plugi

RE: [jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified

2012-06-21 Thread Markus Jelsma
Hi guys, You're about to release 1.5.1 from trunk? That doesn't make sense as it already contains quite some changes. It would, in my opinion, be best to bugfix 1.5 with only the issues that matter such as the broken nutch script and not use trunk for it but the 1.5 tag or branch in svn. It bot

[jira] [Updated] (NUTCH-1406) Metatags-index/-parse plugin: conversion to Solr date format and prevents parsing/indexing of empty tags

2012-06-21 Thread Kristof (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kristof updated NUTCH-1406: Attachment: index-metadata-plugin.patch > Metatags-index/-parse plugin: conversion to Solr date format

[jira] [Updated] (NUTCH-1406) Metatags-index/-parse plugin: conversion to Solr date format and prevents parsing/indexing of empty tags

2012-06-21 Thread Kristof (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kristof updated NUTCH-1406: Description: This improvement to the index-metatags plugin (sometimes also refered to parse-metatags plugi

[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified

2012-06-21 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398362#comment-13398362 ] Julien Nioche commented on NUTCH-1341: -- Let's release 1.5.1 first then add new -bugs-

[jira] [Commented] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule

2012-06-21 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398363#comment-13398363 ] Julien Nioche commented on NUTCH-1388: -- Let's release 1.5.1 first >

[jira] [Commented] (NUTCH-1342) Read time out protocol-http

2012-06-21 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398354#comment-13398354 ] Markus Jelsma commented on NUTCH-1342: -- Hi Ferdy, No, i have no clue as to why httpc

[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified

2012-06-21 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398349#comment-13398349 ] Markus Jelsma commented on NUTCH-1341: -- I'll commit this one shortly unless there are

[jira] [Commented] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule

2012-06-21 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398346#comment-13398346 ] Markus Jelsma commented on NUTCH-1388: -- I'll commit shortly unless there are objectio

[jira] [Closed] (NUTCH-1008) Switch to crawler-commons version of robots.txt parsing code

2012-06-21 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-1008. Resolution: Duplicate Closed in favor of NUTCH-1031. > Switch to crawler-commons v

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2012-06-21 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398340#comment-13398340 ] Julien Nioche commented on NUTCH-1031: -- crawler-commons is not super active and I hav

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2012-06-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398338#comment-13398338 ] Lewis John McGibbney commented on NUTCH-1031: - crawler-commons is available wi

[jira] [Commented] (NUTCH-1406) Metatags-index/-parse plugin: conversion to Solr date format and prevents parsing/indexing of empty tags

2012-06-21 Thread Kristof (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398296#comment-13398296 ] Kristof commented on NUTCH-1406: - Markus, I will provide the patch against trunk. But sin

[jira] [Commented] (NUTCH-1406) Metatags-index/-parse plugin: conversion to Solr date format and prevents parsing/indexing of empty tags

2012-06-21 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398247#comment-13398247 ] Julien Nioche commented on NUTCH-1406: -- See http://wiki.apache.org/nutch/HowToContrib