On 23/02/16 18:03, Jeff Steinmetz wrote:
Hello zeppelin dev group,

Regarding the R Interpreter pull requests 208 and 702: I am trying to figure 
out whether their functionality overlaps, or whether one supports 
something the other does not. Is 702 a superset of 208 (is 702 a fork 
of 208)?


702 is not a fork of 208. It is code that had been sitting in a public repo for a long time, and I finally decided to open a PR to address the points raised in [1].

[1] https://github.com/apache/incubator-zeppelin/pull/208#issuecomment-170337289

Can you pass a reference to a distributed (parallelized) DataFrame built in %spark 
(Scala) to the R interpreter, similar to z.put("myDF", myDF)?


I expect passing a DataFrame via the Zeppelin context to fail, but since the interpreters run on the same Spark REPL, DataFrames are accessible from both R and Scala (see the screenshots linked below).

https://raw.githubusercontent.com/datalayer/datalayer-zeppelin/rscala/_Rimg/r-scala-dataframe-binding.png

https://raw.githubusercontent.com/datalayer/datalayer-zeppelin/rscala/_Rimg/scala-r-dataframe-binding.png


Similarly, since R doesn’t support serialization of functions (unless you use 
something from the SparkR library), is there an example of collecting the 
parallel DF to a local DF (which I realize means the dataset needs to fit in 
local memory on the Zeppelin server)?


That's something I have been thinking about for a long time, especially to allow visualizing small datasets or subsets with R visualizations. I will try to show an example.


I can dig into this a bit and help out where appropriate; however, it's 
unclear which PR to focus my efforts on.


I guess you can check out both and play with them to get a better feel for what they offer.

Best,
Jeff Steinmetz
Principal Architect
Akili Interactive Labs







On 2/23/16, 8:01 AM, "elbamos" <g...@git.apache.org> wrote:

Github user elbamos commented on the pull request:

    https://github.com/apache/incubator-zeppelin/pull/702#issuecomment-187764059

    @btiernay support for that has been in 208 all along...

    > On Feb 23, 2016, at 9:27 AM, Bob Tiernay <notificati...@github.com> wrote:
    >
    > @echarles This is great! Thanks for all your hard work. Very much appreciated!


