Build failed in Jenkins: Mahout-Examples-Cluster-Reuters-II #1117

2015-03-04 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Examples-Cluster-Reuters-II/1117/ -- [...truncated 1701 lines...] A mrlegacy/src/test/java/org/apache/mahout/classifier/df/data/Utils.java A

Build failed in Jenkins: Mahout-Quality #2980

2015-03-04 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/2980/ -- Started by timer Building remotely on ubuntu3 (Ubuntu ubuntu legacy-ubuntu) in workspace https://builds.apache.org/job/Mahout-Quality/ws/ git rev-parse --is-inside-work-tree # timeout=10 Fetching

Re: PMML

2015-03-04 Thread Dmitriy Lyubimov
I am willing to +1 any contribution at this point. My previous company used PMML to serialize simple stuff, but I don't have first-hand experience. Its flexibility is ultimately pretty limited, isn't it? And XML is ultimately a medium which is too ugly and too verbose at the same time to represent

Re: PMML

2015-03-04 Thread Andrew Musselman
Yes, the limitations are often an issue for people doing things that aren't in the PMML spec yet; there could be room for suggesting new features in the spec by building them though, I suppose. Also agree that XML is a lousy/bloated way of representing stuff like this, but in the end it's just a

Spark 1.1.1 and 1.2.1

2015-03-04 Thread Pat Ferrel
Spark 1.2.1 has a bug that blocks any JavaSerializer without a workaround. It requires the SparkConf to get a path to a jar that exists on all nodes. So I’ve updated to 1.1.1, and if you want to run on 1.2+ just change the parent pom, but the item similarity stuff won’t run without the work

Re: Spark 1.1.1 and 1.2.1

2015-03-04 Thread Dmitriy Lyubimov
Is the bug also present in 1.2.0? CDH 5.3 is 1.2.0, not 1.2.1. On Wed, Mar 4, 2015 at 1:58 PM, Pat Ferrel p...@occamsmachete.com wrote: Spark 1.2.1 has a bug that blocks any JavaSerializer without a workaround. It requires the SparkConf to get a path to a jar that exists on all nodes. So I’ve

Question with contributing first steps

2015-03-04 Thread Олег Зотов
Hi, I want to contribute to Mahout and I have two questions: 1) What about Mahout and Google Summer of Code this year? 2) To take the first step, I fixed one not so difficult bug and sent a pull request more than 10 days ago, but I still have not seen any response. Did I do something wrong?

Re: Question with contributing first steps

2015-03-04 Thread Dmitriy Lyubimov
(1) No mentors this year. (2) What was the PR #? On Wed, Mar 4, 2015 at 2:35 PM, Олег Зотов olegzoto...@gmail.com wrote: Hi, I want to contribute to Mahout and I have two questions: 1) What about Mahout and Google Summer of Code this year? 2) To take the first step, I fixed one not so

Re: Spark 1.1.1 and 1.2.1

2015-03-04 Thread Pat Ferrel
I think so. A related bug is fixed in 1.3. I’ll try both when I get a chance. On Mar 4, 2015, at 2:21 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: Is the bug also present in 1.2.0? CDH 5.3 is 1.2.0, not 1.2.1. On Wed, Mar 4, 2015 at 1:58 PM, Pat Ferrel p...@occamsmachete.com wrote: Spark 1.2.1 has

Re: Question with contributing first steps

2015-03-04 Thread Олег Зотов
Thanks for your answer. So, I will look for some other issues more related to the new Scala modules, because I still want to take part in development. 2015-03-05 1:55 GMT+03:00 Dmitriy Lyubimov dlie...@gmail.com: OK. (2) Part of the problem is that this is a fix to the Taste recommender. There are two

Re: PMML

2015-03-04 Thread Andrew Musselman
Sure, those would be options. On Wed, Mar 4, 2015 at 3:41 PM, Saikat Kanjilal sxk1...@hotmail.com wrote: Question: is there a way to introduce PMML using a more lightweight format like YAML or JSON? Date: Wed, 4 Mar 2015 13:25:29 -0800 Subject: Re: PMML From:

[jira] [Commented] (MAHOUT-1636) Class dependencies for the spark module are put in a job.jar, which is very inefficient

2015-03-04 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14347724#comment-14347724 ] ASF GitHub Bot commented on MAHOUT-1636: Github user pferrel closed the pull

[jira] [Commented] (MAHOUT-1636) Class dependencies for the spark module are put in a job.jar, which is very inefficient

2015-03-04 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14347723#comment-14347723 ] ASF GitHub Bot commented on MAHOUT-1636: Github user pferrel commented on the

Refactor

2015-03-04 Thread Pat Ferrel
I wonder if it’s possible to use the long-awaited refactoring of mrlegacy to split Mahout into two builds: one Scala-only (with a little math Java), perhaps even using SBT for Scala, leaving a pared-down mvn build for mrlegacy? This would greatly speed up and simplify builds and is a very natural divide.

Re: Refactor

2015-03-04 Thread Dmitriy Lyubimov
AFAIK Spark is no longer built by sbt by default, but rather by Maven. At least the self-build instructions are for Maven only (and the sbt build changed enough that I can't effectively use it any longer). But when it was, it was always sbt-for-all. On Wed, Mar 4, 2015 at 3:45 PM, Pat Ferrel

Re: PMML

2015-03-04 Thread Saikat Kanjilal
Next question: Is the audience for PMML programmers, or could it be folks that can script? I'm wondering how this intersects with a simple Spark-like DSL; could Mahout implement an intersection between the two? If there's interest I can go into examples. Sent from my iPhone On Mar 4, 2015,

Re: Question with contributing first steps

2015-03-04 Thread Suneel Marthi
Yes, and we look forward to your contributions. Please stick around these mailing lists. On Wed, Mar 4, 2015 at 7:06 PM, Олег Зотов olegzoto...@gmail.com wrote: For a start, I can close my own pull request, as I see it is not useful. Should I do it? 2015-03-05 2:55 GMT+03:00 Andrew Musselman

RE: PMML

2015-03-04 Thread Saikat Kanjilal
Question: is there a way to introduce PMML using a more lightweight format like YAML or JSON? Date: Wed, 4 Mar 2015 13:25:29 -0800 Subject: Re: PMML From: andrew.mussel...@gmail.com To: dev@mahout.apache.org Yes, the limitations are often an issue for people doing things that aren't

Re: Question with contributing first steps

2015-03-04 Thread Andrew Musselman
Excellent, thank you! We should have a bug bash some time soon to help weed out old/irrelevant tickets. On Wed, Mar 4, 2015 at 3:20 PM, Олег Зотов olegzoto...@gmail.com wrote: Thanks for your answer. So, I will look for some other issues more related to the new Scala modules, because I still want

Re: Refactor

2015-03-04 Thread Andrew Musselman
Sounds reasonable to me; would you like to put this in the next release? On Wed, Mar 4, 2015 at 4:00 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: AFAIK Spark is no longer built by sbt by default, but rather by Maven. At least the self-build instructions are for Maven only (and the sbt build

Re: Question with contributing first steps

2015-03-04 Thread Suneel Marthi
Thanks, we'll close the issue. On Wed, Mar 4, 2015 at 7:31 PM, Олег Зотов olegzoto...@gmail.com wrote: I closed my pull request, but I can't do anything with the Jira issue MAHOUT-1642, as it was not opened by me. But if we talk about weeding out old/irrelevant tickets, I think it can be the

Re: Question with contributing first steps

2015-03-04 Thread Олег Зотов
I closed my pull request, but I can't do anything with the Jira issue MAHOUT-1642, as it was not opened by me. But if we talk about weeding out old/irrelevant tickets, I think it can be the first candidate for it; as Dmitriy said, nobody these days cares much about maintaining this mrlegacy component.

[jira] [Commented] (MAHOUT-1642) Iterator class within SimilarItems class always misses the first element

2015-03-04 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14347820#comment-14347820 ] ASF GitHub Bot commented on MAHOUT-1642: Github user OlegZotov closed the pull
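
For readers unfamiliar with the class of bug named in the MAHOUT-1642 title, here is a minimal Java sketch of how an array-backed iterator can skip its first element, along with one way to correct it. It is a hypothetical illustration only: the names `SkipFirstDemo`, `BuggyIterator`, `FixedIterator`, and `collect` are invented here, and this is neither the Mahout `SimilarItems` code nor the fix from the closed pull request.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical demo class; not taken from Mahout.
final class SkipFirstDemo {

  /** Buggy variant: the index is advanced before the element is read. */
  static final class BuggyIterator implements Iterator<Long> {
    private final long[] ids;
    private int index = 0;

    BuggyIterator(long[] ids) {
      this.ids = ids;
    }

    @Override
    public boolean hasNext() {
      return index < ids.length - 1;
    }

    @Override
    public Long next() {
      if (!hasNext()) {
        throw new NoSuchElementException();
      }
      index++;            // bug: moves past ids[0] before it is ever returned
      return ids[index];
    }
  }

  /** Corrected variant: read the current element, then advance. */
  static final class FixedIterator implements Iterator<Long> {
    private final long[] ids;
    private int index = 0;

    FixedIterator(long[] ids) {
      this.ids = ids;
    }

    @Override
    public boolean hasNext() {
      return index < ids.length;
    }

    @Override
    public Long next() {
      if (!hasNext()) {
        throw new NoSuchElementException();
      }
      return ids[index++];
    }
  }

  /** Drains an iterator into a list so the two variants can be compared. */
  static List<Long> collect(Iterator<Long> it) {
    List<Long> out = new ArrayList<>();
    while (it.hasNext()) {
      out.add(it.next());
    }
    return out;
  }

  public static void main(String[] args) {
    long[] ids = {10L, 20L, 30L};
    System.out.println(collect(new BuggyIterator(ids)));  // [20, 30]
    System.out.println(collect(new FixedIterator(ids)));  // [10, 20, 30]
  }
}
```

The two variants differ only in whether the index is incremented before or after the read, which is why an off-by-one of this kind can pass casual inspection and is best caught by a test that checks the first returned element.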

Re: Question with contributing first steps

2015-03-04 Thread Dmitriy Lyubimov
OK. (2) Part of the problem is that this is a fix to the Taste recommender. There are two problems with this: first, it is an mrlegacy component, and nobody these days cares much about maintaining these. Second, the author of this code, who would be qualified to assess the validity of the fix, has not come

Build failed in Jenkins: Mahout-Quality #2981

2015-03-04 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/2981/changes Changes: [pat] simplified driver and made required changes to all, note: left job assembly untouched [pat] removed o.a.m.Pair, cleaned up comments and style issues, simplified driver API, merged most of 1.2.1 changes but left on

Re: PMML

2015-03-04 Thread Andrew Musselman
I think keeping it simple is best; try implementing one or two models in XML and then get fancy if it makes sense. On Wednesday, March 4, 2015, Saikat Kanjilal sxk1...@hotmail.com wrote: Next question: Is the audience for PMML programmers, or could it be folks that can script? I'm wondering

Re: Question with contributing first steps

2015-03-04 Thread Олег Зотов
For a start, I can close my own pull request, as I see it is not useful. Should I do it? 2015-03-05 2:55 GMT+03:00 Andrew Musselman andrew.mussel...@gmail.com: Excellent, thank you! We should have a bug bash some time soon to help weed out old/irrelevant tickets. On Wed, Mar 4, 2015 at 3:20

Re: Question with contributing first steps

2015-03-04 Thread Andrew Musselman
Feel free to sign in and comment on that ticket so we can remember for later :) Thanks, Oleg. On Wed, Mar 4, 2015 at 4:31 PM, Олег Зотов olegzoto...@gmail.com wrote: I closed my pull request, but I can't do anything with the Jira issue MAHOUT-1642, as it was not opened by me. But if we talk about