Re: DataFrame RDDs

2013-11-19 Thread andy petrella
indeed the scala version could be blocking (I'm not sure what it needs 2.11, maybe Miles uses quasiquotes...) Andy On Tue, Nov 19, 2013 at 8:48 AM, Anwar Rizal anriza...@gmail.com wrote: I had that in mind too when Miles Sabin presented Shapeless at Scala.IO Paris last month. If anybody

Re: DataFrame RDDs

2013-11-18 Thread andy petrella
Maybe I'm wrong, but this use case could be a good fit for Shapelesshttps://github.com/milessabin/shapeless' records. Shapeless' records are like, so to say, lisp's record but typed! In that sense, they're more closer to Haskell's record notation, but imho less powerful, since the access will be

Re: DataFrame RDDs

2013-11-18 Thread Matei Zaharia
Interesting idea — in Scala you can also use the Dynamic type (http://hacking-scala.org/post/49051516694/introduction-to-type-dynamic) to allow dynamic properties. It has the same potential pitfalls as string names, but with nicer syntax. Matei On Nov 18, 2013, at 3:45 PM, andy petrella

DataFrame RDDs

2013-11-15 Thread Shay Seng
Hi, Is there some way to get R-style Data.Frame data structures into RDDs? I've been using RDD[Seq[]] but this is getting quite error-prone and the code gets pretty hard to read especially after a few joins, maps etc. Rather than access columns by index, I would prefer to access them by name.

Re: DataFrame RDDs

2013-11-15 Thread Christopher Nguyen
Shay, we've done this at Adatao, specifically a big data frame in RDD representation and subsetting/projections/data mining/machine learning algorithms on that in-memory table structure. We're planning to harmonize that with the MLBase work in the near future. Just a matter of prioritization on

Re: DataFrame RDDs

2013-11-15 Thread Shay Seng
Nice, any possibility of sharing this code in advance? On Fri, Nov 15, 2013 at 11:22 AM, Christopher Nguyen c...@adatao.com wrote: Shay, we've done this at Adatao, specifically a big data frame in RDD representation and subsetting/projections/data mining/machine learning algorithms on that