[jira] [Updated] (NUTCH-1733) parse-html to support HTML5 charset definitions

2014-03-13 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1733: --- Attachment: NUTCH-1733-trunk.patch patch for trunk including unit test > parse-html to suppo

[jira] [Commented] (NUTCH-1662) Indexer Plugin for Solr Cloud

2014-03-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933421#comment-13933421 ] Lewis John McGibbney commented on NUTCH-1662: - I tried this out today. It look

[jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-03-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933251#comment-13933251 ] Hudson commented on NUTCH-1478: --- SUCCESS: Integrated in Nutch-nutchgora #951 (See [https://

[jira] [Closed] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-03-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1478. --- > Parse-metatags and index-metadata plugin for Nutch 2.x series > --

[jira] [Resolved] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-03-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1478. - Resolution: Fixed v6 patch committed @revision 1577143 in 2.x HEAD Thank you to e

Re: How do I customize Nutch to cater to existing SOLR schema

2014-03-13 Thread Lajos
Done: https://issues.apache.org/jira/browse/NUTCH-1734 Will get to the patch shortly. Thanks, L On 13/03/2014 12:29, Lewis John Mcgibbney wrote: Hi Lajos, On Thu, Mar 13, 2014 at 11:22 AM, <dev-digest-h...@nutch.apache.org> wrote: I had raised this issue a few months ago, as I

[jira] [Created] (NUTCH-1734) Make SolrIndexWriter more intelligent

2014-03-13 Thread Lajos Moczar (JIRA)
Lajos Moczar created NUTCH-1734: --- Summary: Make SolrIndexWriter more intelligent Key: NUTCH-1734 URL: https://issues.apache.org/jira/browse/NUTCH-1734 Project: Nutch Issue Type: Improvement

[jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-03-13 Thread Vangelis Karvounis (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933136#comment-13933136 ] Vangelis Karvounis commented on NUTCH-1478: --- +1. Very nice work > Parse-metatag

Re: How do I customize Nutch to cater to existing SOLR schema

2014-03-13 Thread Lajos
Hi Lewis, Great and thanks for your reply. I'll do that then. If it helps, I'm an Apache committer from another project ages ago. I would love to be involved in this particular area of Nutch going forward, especially if this becomes a well-used feature. Let me know how we could go about this

Re: Bandwidth Limit

2014-03-13 Thread Lewis John Mcgibbney
Hi Talat, On Thu, Mar 13, 2014 at 11:22 AM, wrote: > I wonder, can we limit bandwidth usage? We can control connection > size with fetch thread * reduce count. But how do we control download rate? > > > I really don't think we can/do control download rate to be honest. Is there any part

Re: How do I customize Nutch to cater to existing SOLR schema

2014-03-13 Thread Lewis John Mcgibbney
Hi Lajos, On Thu, Mar 13, 2014 at 11:22 AM, wrote: > > I had raised this issue a few months ago, as I had the exact same problem. > It cannot be solved by configuration, because of the way that the > MappingReader puts fields in Maps. > > I solved this by implementing a custom plugin that furthe

[jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-03-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933120#comment-13933120 ] Lewis John McGibbney commented on NUTCH-1478: - I am a big fat +1 for v6 patch

Re: Who is moderating Nutch lists?

2014-03-13 Thread Julien Nioche
I don't think these lists are moderated. Don't think they should be either. J On Thursday, 13 March 2014, Markus Jelsma wrote: > Well, that's not me, perhaps Chris? > > -Original message- > From: Lewis John Mcgibbney > Sent: Wednesday 12th March 2014 15:56 > To: dev@nutch.apache.org > S

RE: [VOTE] Release Apache Nutch 1.8RC#2

2014-03-13 Thread Markus Jelsma
Looks fine to me, stuff works. +1 -Original message- > From:Lewis John Mcgibbney > Sent: Wednesday 12th March 2014 15:54 > To: u...@nutch.apache.org; dev@nutch.apache.org > Subject: [VOTE] Release Apache Nutch 1.8RC#2 > > Hi user@ & dev@, > > This thread is a VOTE for releasing Apac

RE: Who is moderating Nutch lists?

2014-03-13 Thread Markus Jelsma
Well, that's not me, perhaps Chris? -Original message- From: Lewis John Mcgibbney Sent: Wednesday 12th March 2014 15:56 To: dev@nutch.apache.org Subject: Who is moderating Nutch lists? Hi Folks, Is anyone moderating these lists? I understand that it is a bit of a pain in the neck as both user