Re: Choosing an efficient family configuration for GORA HBase

2011-10-03 Thread Ferdy Galema
Ok thanks. I was just wondering whether there were any developments on this. I'm not sure yet what would be the fastest in the case of Nutch, all I know from our own experience is that it is best practice to group frequently-accessed columns together, but nevertheless store large columns in a

[jira] [Resolved] (NUTCH-1137) LinkDb / invertlinks: command line arguments ignored

2011-10-03 Thread Markus Jelsma (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1137. -- Resolution: Fixed Committed for 1.4 in rev. 1178376. Reused crawldb code instead. Thanks for

Re: Providing a list of FAQ's with every new subscribe request

2011-10-03 Thread lewis john mcgibbney
Hi Sami, At the moment I am not in a position to take on the role of mailing list moderator. But I've found out that the list moderators should be able to configure the nature of documentation on a per-list basis by emailing ${list}-help@ from their moderator address and following the

[jira] [Updated] (NUTCH-1144) Filtering optional in WebGraph

2011-10-03 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1144: - Fix Version/s: (was: 1.5) Filtering optional in WebGraph

[jira] [Resolved] (NUTCH-1144) Filtering optional in WebGraph

2011-10-03 Thread Markus Jelsma (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1144. -- Resolution: Won't Fix Decided to do filtering and normalizing in one issue.

[jira] [Updated] (NUTCH-1142) Normalization and filtering in WebGraph

2011-10-03 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1142: - Description: The WebGraph programs performs URL normalization. Since normalization of outlinks

[jira] [Commented] (NUTCH-1143) Omit anchor in webgraph's LinkDatum

2011-10-03 Thread Markus Jelsma (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119281#comment-13119281 ] Markus Jelsma commented on NUTCH-1143: -- It seems the anchor field was once used for

[jira] [Resolved] (NUTCH-1058) Upgrade Solr schema to version 1.4

2011-10-03 Thread Markus Jelsma (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1058. -- Resolution: Fixed Assignee: Markus Jelsma Committed for 1.4 in rev. 1178409 and for

[jira] [Updated] (NUTCH-717) Make Nutch Solr integration easier

2011-10-03 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-717: Fix Version/s: (was: 1.4) 1.5 Make Nutch Solr integration easier

[jira] [Commented] (NUTCH-1136) Ant pmd target is broken

2011-10-03 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119365#comment-13119365 ] Lewis John McGibbney commented on NUTCH-1136: - Would like to commit before RC

[jira] [Commented] (NUTCH-1109) Add Sonar targets to Ant build.xml

2011-10-03 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119367#comment-13119367 ] Lewis John McGibbney commented on NUTCH-1109: - Would like to commit before RC

Re: Providing a list of FAQ's with every new subscribe request

2011-10-03 Thread Sami Siren
On Mon, Oct 3, 2011 at 3:48 PM, lewis john mcgibbney lewis.mcgibb...@gmail.com wrote: Would it be possible to send out a list of our official FAQ's when a new user confirms their subscription to both user@ and dev@ lists. It seems this is possible. Can you craft a piece of text you would

Build failed in Jenkins: Nutch-trunk #1623

2011-10-03 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-trunk/1623/changes Changes: [markus] NUTCH-1058 Upgrade Solr schema to version 1.4 [markus] NUTCH-1137 LinkDB other options ignored with -dir -- [...truncated 937 lines...] A

[jira] [Commented] (NUTCH-1058) Upgrade Solr schema to version 1.4

2011-10-03 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119888#comment-13119888 ] Hudson commented on NUTCH-1058: --- Integrated in Nutch-trunk #1623 (See

[jira] [Commented] (NUTCH-1137) LinkDb / invertlinks: command line arguments ignored

2011-10-03 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119889#comment-13119889 ] Hudson commented on NUTCH-1137: --- Integrated in Nutch-trunk #1623 (See

[jira] [Commented] (NUTCH-1058) Upgrade Solr schema to version 1.4

2011-10-03 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119890#comment-13119890 ] Hudson commented on NUTCH-1058: --- Integrated in Nutch-nutchgora #25 (See

Build failed in Jenkins: Nutch-nutchgora #25

2011-10-03 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-nutchgora/25/changes Changes: [markus] NUTCH-1058 Upgrade Solr schema to version 1.4 -- [...truncated 2491 lines...] [ivy:resolve] :: loading settings :: file =