Re: Any info on R+Hadoop

2012-01-29 Thread Prashant Sharma
Praveenesh,
Well, it gives you more convenience :). If you have worked with R, you
might notice that with rmr you can write a mapper much like an lapply call.
They have already abstracted a lot away for you, so you have less control
over things. But as far as convenience is concerned, it's damn cool. For
example, you can process data inside R using Hadoop (no doubt it uses Hadoop
streaming behind the scenes) and have the processed data easily loaded back
into the R command line from HDFS (using rhdfs). Generally, R developers do
not like being bogged down by the hassles that Hadoop streaming can bring.

-P

P.S. I am not endorsing anyone. It's just my view.
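
For readers unfamiliar with what rmr is hiding, here is a minimal sketch of the Hadoop Streaming contract: the mapper reads lines and emits tab-separated key/value records, the framework sorts them by key, and the reducer has to group consecutive keys by hand. It is written in Python rather than R only so it can be run without a cluster; the word-count task and the names map_lines, reduce_lines and run_locally are hypothetical and do not come from this thread or from rmr.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' record per word in each input line."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(sorted_lines):
    """Reducer: records arrive sorted by key, so sum each run of equal keys."""
    parsed = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for key, group in groupby(parsed, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{key}\t{total}"


def run_locally(lines):
    """Simulate map -> sort -> reduce in one process (no Hadoop involved)."""
    return list(reduce_lines(sorted(map_lines(lines))))
```

In a real streaming job each function would sit in its own script that reads sys.stdin and prints records, and Hadoop would do the sort between the two steps. The manual parsing and grouping is the kind of bookkeeping that an lapply-style mapper in rmr takes care of for you.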

On Sun, Jan 29, 2012 at 12:54 PM, praveenesh kumar praveen...@gmail.com wrote:

 Has anyone done any work with R + Hadoop?

 I know there are some flavors of R + Hadoop available, such as rmr, rhdfs,
 RHIPE, and R-hive.

 But as far as I know, submitting jobs using Hadoop Streaming is the best way
 available right now. Am I right?


 Any info on R on Hadoop?

 Thanks,
 Praveenesh



Re: Any info on R+Hadoop

2012-01-29 Thread praveenesh kumar
Yeah, but I am facing a weird situation in which my RHadoop job (using
rmr) is taking much more time than my Hadoop streaming job in R. So I wanted
to see whether others have faced the same problem, or whether anyone has done
a performance evaluation of Revolution's rmr?

Thanks,
Praveenesh

On Mon, Jan 30, 2012 at 11:01 AM, Prashant Sharma
prashan...@imaginea.com wrote:

 Praveenesh,
 Well, it gives you more convenience :). If you have worked with R, you
 might notice that with rmr you can write a mapper much like an lapply call.
 They have already abstracted a lot away for you, so you have less control
 over things. But as far as convenience is concerned, it's damn cool. For
 example, you can process data inside R using Hadoop (no doubt it uses Hadoop
 streaming behind the scenes) and have the processed data easily loaded back
 into the R command line from HDFS (using rhdfs). Generally, R developers do
 not like being bogged down by the hassles that Hadoop streaming can bring.

 -P

 P.S. I am not endorsing anyone. It's just my view.




Any info on R+Hadoop

2012-01-28 Thread praveenesh kumar
Has anyone done any work with R + Hadoop?

I know there are some flavors of R + Hadoop available, such as rmr, rhdfs,
RHIPE, and R-hive.

But as far as I know, submitting jobs using Hadoop Streaming is the best way
available right now. Am I right?


Any info on R on Hadoop?

Thanks,
Praveenesh