Re: Contributing an algorithm for samsara

2017-04-13 Thread Jim Jagielski
Apologies for letting this slide... way too much life got in the way :) > On Mar 3, 2017, at 3:36 PM, Dmitriy Lyubimov wrote: > > And by formula yes i mean R syntax. > > possible use case would be to take Spark DataFrame and formula (say, `age ~ > . -1`) and produce outputs of DrmLike[Int] (a d

Re: Contributing an algorithm for samsara

2017-03-03 Thread Dmitriy Lyubimov
And by formula yes i mean R syntax. possible use case would be to take Spark DataFrame and formula (say, `age ~ . -1`) and produce outputs of DrmLike[Int] (a distributed matrix type) that converts into predictors and target. In this particular case, this formula means that the predictor matrix (X

Re: Contributing an algorithm for samsara

2017-03-03 Thread Dmitriy Lyubimov
On Fri, Mar 3, 2017 at 4:09 AM, Jim Jagielski wrote: > >> >> > >> > >> > 3) On the feature extraction per R like formula can you elaborate more >> here, are you talking about feature extraction using R like dataframes and >> operators? >> > > Yes. I would start doing generic formula parser and the

Re: Contributing an algorithm for samsara

2017-03-03 Thread Dmitriy Lyubimov
I am getting a liittle bit lost who asked what here, inline. On Fri, Mar 3, 2017 at 4:09 AM, Jim Jagielski wrote: > > > Would it make sense to keep them as-is, and "pull them out", as > it were, should they prove to be wanted/needed by the other algo users? > I would hope it is of some help (es

Re: Contributing an algorithm for samsara

2017-03-03 Thread Jim Jagielski
hrough the papers. > > > > From: Dmitriy Lyubimov > Sent: Friday, February 17, 2017 1:45 PM > To: dev@mahout.apache.org > Subject: Re: Contributing an algorithm for samsara > > in particular, this is the samsara implementation of do

Re: Contributing an algorithm for samsara

2017-02-25 Thread Saikat Kanjilal
m: Dmitriy Lyubimov Sent: Friday, February 17, 2017 1:45 PM To: dev@mahout.apache.org Subject: Re: Contributing an algorithm for samsara in particular, this is the samsara implementation of double-weighed als : https://github.com/apache/mahout/pull/14/files#diff-0fbeb8b848ed0c5e3f782c72569cf626 M

Re: Contributing an algorithm for samsara

2017-02-17 Thread Dmitriy Lyubimov
extraction per (preferrably R-like) formula. > > > -d > > > On Fri, Feb 17, 2017 at 10:11 AM, Andrew Palumbo > wrote: > >> +1 to glms >> >> >> >> Sent from my Verizon Wireless 4G LTE smartphone >> >> >>

Re: Contributing an algorithm for samsara

2017-02-17 Thread Dmitriy Lyubimov
l message > From: Trevor Grant > Date: 02/17/2017 6:56 AM (GMT-08:00) > To: dev@mahout.apache.org > Subject: Re: Contributing an algorithm for samsara > > Jim is right, and I would take it one further and say, it would be best to > implement GLMs https://en.wikipedi

RE: Contributing an algorithm for samsara

2017-02-17 Thread Andrew Palumbo
+1 to glms Sent from my Verizon Wireless 4G LTE smartphone Original message From: Trevor Grant Date: 02/17/2017 6:56 AM (GMT-08:00) To: dev@mahout.apache.org Subject: Re: Contributing an algorithm for samsara Jim is right, and I would take it one further and say, it would

Re: Contributing an algorithm for samsara

2017-02-17 Thread Saikat Kanjilal
lgorithms, what do you think? Regards From: Jim Jagielski Sent: Friday, February 17, 2017 8:18 AM To: dev@mahout.apache.org Subject: Re: Contributing an algorithm for samsara Sounds good to me. +1 > On Feb 17, 2017, at 11:15 AM, Saikat Kanjilal wrote: > > Jim, > What do you say we s

Re: Contributing an algorithm for samsara

2017-02-17 Thread Jim Jagielski
Sounds good to me. +1 > On Feb 17, 2017, at 11:15 AM, Saikat Kanjilal wrote: > > Jim, > What do you say we start with ALS and then tackle glm? > > > Sent from my iPhone > >> On Feb 17, 2017, at 6:56 AM, Trevor Grant wrote: >> >> Jim is right, and I would take it one further and say, it woul

Re: Contributing an algorithm for samsara

2017-02-17 Thread Saikat Kanjilal
Jim, What do you say we start with ALS and then tackle glm? Sent from my iPhone > On Feb 17, 2017, at 6:56 AM, Trevor Grant wrote: > > Jim is right, and I would take it one further and say, it would be best to > implement GLMs https://en.wikipedia.org/wiki/Generalized_linear_model , > from the

Re: Contributing an algorithm for samsara

2017-02-17 Thread Trevor Grant
Jim is right, and I would take it one further and say, it would be best to implement GLMs https://en.wikipedia.org/wiki/Generalized_linear_model , from there a Logistic regression is a trivial extension. Buyer beware- GLMs will be a bit of work- doable, but that would be jumping in neck first for

Re: Contributing an algorithm for samsara

2017-02-17 Thread Jim Jagielski
My own thoughts are that logistic regression seems a more "generalized" and hence more useful algo to be factored in... At least in the use cases that I've been toying with. So I'd like to help out with that if wanted... > On Feb 9, 2017, at 3:59 PM, Saikat Kanjilal wrote: > > Trevor et al, >

Contributing an algorithm for samsara

2017-02-09 Thread Saikat Kanjilal
Trevor et al, I'd like to contribute an algorithm or two in samsara using spark as I would like to do a compare and contrast with mahout with R server for a data science pipeline, machine learning repo that I'm working on, in looking at the list of algorithms (https://mahout.apache.org/users/ba