[jira] [Commented] (NUTCH-1174) Outlinks are not properly normalized

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149000#comment-13149000 ] Hudson commented on NUTCH-1174: --- Integrated in Nutch-trunk #1660 (See [https://builds.apach

[jira] [Commented] (NUTCH-1155) Host/domain limit in generator is generate.max.count+1

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148998#comment-13148998 ] Hudson commented on NUTCH-1155: --- Integrated in Nutch-trunk #1660 (See [https://builds.apach

[jira] [Commented] (NUTCH-1185) Decrease solr.commit.size

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148999#comment-13148999 ] Hudson commented on NUTCH-1185: --- Integrated in Nutch-trunk #1660 (See [https://builds.apach

[jira] [Commented] (NUTCH-1203) ParseSegment to list ms per record

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149001#comment-13149001 ] Hudson commented on NUTCH-1203: --- Integrated in Nutch-trunk #1660 (See [https://builds.apach

[jira] [Commented] (NUTCH-1180) UpdateDB to backup previous CrawlDB

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149002#comment-13149002 ] Hudson commented on NUTCH-1180: --- Integrated in Nutch-trunk #1660 (See [https://builds.apach

[jira] [Created] (NUTCH-1204) not all of pages parsed

2011-11-11 Thread behnam nikbakht (Created) (JIRA)
not all of pages parsed --- Key: NUTCH-1204 URL: https://issues.apache.org/jira/browse/NUTCH-1204 Project: Nutch Issue Type: Bug Components: parser Affects Versions: 1.3 Reporter: behnam nikbakht

[Nutch Wiki] Trivial Update of "RunNutchInEclipse" by LewisJohnMcgibbney

2011-11-11 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "RunNutchInEclipse" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/RunNutchInEclipse?action=diff&rev1=28&rev2=29 * The only Source folder will be trunk/

Re: Persistent problems with Ivy dependencies in Eclipse

2011-11-11 Thread Kirby Bohling
Lewis, https://issues.apache.org/jira/browse/NUTCH-1068 That is the issue I filed about the patch (it isn't directly related to this, but it is related to some potential fixes). http://www.mail-archive.com/dev%40nutch.apache.org/msg03419.html That's the e-mail thread where I originally mentione

[jira] [Updated] (NUTCH-1200) Resolving Ivy dependencies in several plugins

2011-11-11 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1200: Attachment: NUTCH-1200-v2-trunk.patch Patch locates missing dependencies. As sugges

Re: Persistent problems with Ivy dependencies in Eclipse

2011-11-11 Thread Lewis John Mcgibbney
Excellent Kirby, thanks for this. The obvious question I guess... where does this leave us with regard to the urlfilter-automaton libraries? For the record as well, can you please provide the Jira you filed? It would be good to know where I can begin with this one. Thanks On Thu, Nov 10, 2011

[jira] [Commented] (NUTCH-1174) Outlinks are not properly normalized

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148548#comment-13148548 ] Hudson commented on NUTCH-1174: --- Integrated in nutch-trunk-maven #21 (See [https://builds.a

[jira] [Commented] (NUTCH-1203) ParseSegment to list ms per record

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148549#comment-13148549 ] Hudson commented on NUTCH-1203: --- Integrated in nutch-trunk-maven #21 (See [https://builds.a

[jira] [Updated] (NUTCH-1184) Fetcher to parse and follow Nth degree outlinks

2011-11-11 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1184: - Attachment: NUTCH-1184-1.5-5-ParseData.patch Patch for ParseData was missing. This now has a setO

[jira] [Resolved] (NUTCH-1174) Outlinks are not properly normalized

2011-11-11 Thread Markus Jelsma (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1174. -- Resolution: Fixed Committed for 1.5 in rev. 1200917. > Outlinks are not proper

[jira] [Resolved] (NUTCH-1203) ParseSegment to list ms per record

2011-11-11 Thread Markus Jelsma (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1203. -- Resolution: Fixed Committed for 1.5 in rev. 1200915. > ParseSegment to list ms

[jira] [Created] (NUTCH-1203) ParseSegment to list ms per record

2011-11-11 Thread Markus Jelsma (Created) (JIRA)
ParseSegment to list ms per record -- Key: NUTCH-1203 URL: https://issues.apache.org/jira/browse/NUTCH-1203 Project: Nutch Issue Type: Improvement Components: parser Reporter: Markus Jels

[jira] [Updated] (NUTCH-1203) ParseSegment to list ms per record

2011-11-11 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1203: - Attachment: NUTCH-1203-1.5-1.patch > ParseSegment to list ms per record > ---

[jira] [Commented] (NUTCH-1155) Host/domain limit in generator is generate.max.count+1

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148522#comment-13148522 ] Hudson commented on NUTCH-1155: --- Integrated in nutch-trunk-maven #20 (See [https://builds.a

[jira] [Resolved] (NUTCH-1155) Host/domain limit in generator is generate.max.count+1

2011-11-11 Thread Markus Jelsma (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1155. -- Resolution: Fixed Fix committed for 1.5 in rev. 1200912. > Host/domain limit i

[jira] [Reopened] (NUTCH-1155) Host/domain limit in generator is generate.max.count+1

2011-11-11 Thread Markus Jelsma (Reopened) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma reopened NUTCH-1155: -- For some reason, yet unknown, this minor change causes the tests to fail. I could have expected a

[jira] [Created] (NUTCH-1202) Fetcher timebomb kills long waiting fetch jobs

2011-11-11 Thread Markus Jelsma (Created) (JIRA)
Fetcher timebomb kills long waiting fetch jobs -- Key: NUTCH-1202 URL: https://issues.apache.org/jira/browse/NUTCH-1202 Project: Nutch Issue Type: Bug Components: fetcher Repo

[jira] [Created] (NUTCH-1201) Allow for different FetcherThread impls

2011-11-11 Thread Markus Jelsma (Created) (JIRA)
Allow for different FetcherThread impls --- Key: NUTCH-1201 URL: https://issues.apache.org/jira/browse/NUTCH-1201 Project: Nutch Issue Type: New Feature Components: fetcher Reporter:

[jira] [Resolved] (NUTCH-1180) UpdateDB to backup previous CrawlDB

2011-11-11 Thread Markus Jelsma (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1180. -- Resolution: Fixed Committed for 1.5 in rev. 1200830. > UpdateDB to backup prev

[jira] [Commented] (NUTCH-1180) UpdateDB to backup previous CrawlDB

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148444#comment-13148444 ] Hudson commented on NUTCH-1180: --- Integrated in nutch-trunk-maven #19 (See [https://builds.a

[jira] [Commented] (NUTCH-1185) Decrease solr.commit.size

2011-11-11 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148443#comment-13148443 ] Hudson commented on NUTCH-1185: --- Integrated in nutch-trunk-maven #19 (See [https://builds.a

[jira] [Resolved] (NUTCH-1185) Decrease solr.commit.size

2011-11-11 Thread Markus Jelsma (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1185. -- Resolution: Fixed Committed for nutchgora in rev. 1200834 and for trunk in rev. 1200833.

[jira] [Commented] (NUTCH-1200) Resolving Ivy dependencies in several plugins

2011-11-11 Thread Julien Nioche (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148393#comment-13148393 ] Julien Nioche commented on NUTCH-1200: -- I'm definitely against the idea of putting al