from:"andy petrella"

[incubating-0.9.0] Too Many Open Files on Workers

2014-02-21 Thread andy petrella

Hey guyz, I've got this issue (see bottom) with Spark, deployed in Standalone mode on a local docker environment. I know that I need to raise the ulimit (only 1024 now) but in the meantime I was just wondering how this could happen. My gut feeling is because I'm mounting a lot in memory and Spark

Re: [incubating-0.9.0] Too Many Open Files on Workers

2014-02-21 Thread andy petrella

, Sourav On Fri, Feb 21, 2014 at 3:02 PM, andy petrella andy.petre...@gmail.comwrote: Hey guyz, I've got this issue (see bottom) with Spark, deployed in Standalone mode on a local docker environment. I know that I need to raise the ulimit (only 1024 now) but in the meantime I was just

Re: DataFrame RDDs

2013-11-19 Thread andy petrella

19, 2013 12:46 AM, andy petrella andy.petre...@gmail.com wrote: Maybe I'm wrong, but this use case could be a good fit for Shapelesshttps://github.com/milessabin/shapeless' records. Shapeless' records are like, so to say, lisp's record but typed! In that sense, they're more closer

Re: DataFrame RDDs

2013-11-18 Thread andy petrella

Maybe I'm wrong, but this use case could be a good fit for Shapelesshttps://github.com/milessabin/shapeless' records. Shapeless' records are like, so to say, lisp's record but typed! In that sense, they're more closer to Haskell's record notation, but imho less powerful, since the access will be

Re: Spark and geospatial data

2013-11-07 Thread andy petrella

Hello Rob, As you may know I have a long experience in Geospatial data, and I'm now investigating Spark... So I'll be very interested further answers but also to participate to going forward on this great idea! For instance, I'd say that implementing classical geospatial algorithms like

Re: Spark Streaming Server

2013-09-18 Thread andy petrella

Hello there, I hope I get the whole thing correctly... but I think I did something quite similar that polls the yahoo api for data, which I push into a DStream: https://github.com/andypetrella/spark-bd/blob/master/src/main/scala/yahoo.scala#L41 Any comments or even concerns (if they are

[incubating-0.9.0] Too Many Open Files on Workers

Re: [incubating-0.9.0] Too Many Open Files on Workers

Re: DataFrame RDDs

Re: DataFrame RDDs

Re: Spark and geospatial data

Re: Spark Streaming Server

6 matches

Site Navigation

Mail list logo

Footer information