any way you like. Fork will make things easier, but since we are talking
about a fairly isolated experiment at this point, it is not required.


On Mon, May 5, 2014 at 9:40 PM, Saikat Kanjilal <[email protected]> wrote:

> How do you want to proceed on this, if it's expedient I was going to clone
> mahout into my own fork and do this work as before.
>
> Sent from my iPad
>
> > On May 5, 2014, at 9:36 PM, "Dmitriy Lyubimov" <[email protected]>
> wrote:
> >
> > Ok lets go one by one. Can you try and put mutate into scala dsl skeleton
> > of in-core dataframe that does nothing?
> >> On May 5, 2014 8:22 PM, "Saikat Kanjilal (JIRA)" <[email protected]>
> wrote:
> >>
> >>
> >>    [
> >>
> https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13990236#comment-13990236
> ]
> >>
> >> Saikat Kanjilal commented on MAHOUT-1490:
> >> -----------------------------------------
> >>
> >> d.mutate( let("gain") equal { col("ArrDelay") - col("DepDelay") } )
> seems
> >> rather verbose
> >>
> >> how about something like the following:
> >>
> >> d.mutate((v= gain) equal { col("ArrDelay") - col("DepDelay") } )
> >>
> >> in the above expression v= would essentially create a defined identifier
> >> called v
> >>
> >> I will put some more examples in the blog around select with this
> thinking
> >>
> >> Also what about the other functions I am proposing like running a
> function
> >> around a dataframe (like map) or the slicing functionality around R,
> should
> >> we keep those as part of this proposal?
> >>
> >>> Data frame R-like bindings
> >>> --------------------------
> >>>
> >>>                Key: MAHOUT-1490
> >>>                URL: https://issues.apache.org/jira/browse/MAHOUT-1490
> >>>            Project: Mahout
> >>>         Issue Type: New Feature
> >>>           Reporter: Saikat Kanjilal
> >>>           Assignee: Dmitriy Lyubimov
> >>>            Fix For: 1.0
> >>>
> >>>  Original Estimate: 20h
> >>> Remaining Estimate: 20h
> >>>
> >>> Create Data frame R-like bindings for spark
> >>
> >>
> >>
> >> --
> >> This message was sent by Atlassian JIRA
> >> (v6.2#6252)
> >>
>

Reply via email to