Re: cutting 1.6.2 rc and 2.0.0 rc this week?

2016-06-16 Thread andy petrella
+1 both too (for tomorrow lunchtime? ^^) On Thu, Jun 16, 2016 at 5:06 AM Raymond Honderdors < raymond.honderd...@sizmek.com> wrote: > +1 for both > > Get Outlook for Android > > > > On Wed, Jun 15, 2016 at 10:23 PM +0300, "Michael Armbrust" < > mich...@databricks.com>

[Build] repo1.maven.org: spark libs 1.5.0 for scala 2.10 poms are broken (404)

2015-10-02 Thread andy petrella
Yup folks, I've been reported by someone building the Spark-Notebook that repo1 is apparently broken for scala 2.10 and spark 1.5.0. Check this https://repo1.maven.org/maven2/org/apache/spark/spark-streaming_2.10/1.5.0/spark-streaming_2.10-1.5.0.pom The URL is correct since

Re: [Build] repo1.maven.org: spark libs 1.5.0 for scala 2.10 poms are broken (404)

2015-10-02 Thread andy petrella
gmail.com> wrote: >>> > I tried to access >>> > >>> https://repo1.maven.org/maven2/org/apache/spark/spark-streaming_2.10/1.5.0/spark-streaming_2.10-1.5.0.pom >>> > on Chrome and Firefox (on Mac) >>> > I got 404 >>> > &

Re: [Build] repo1.maven.org: spark libs 1.5.0 for scala 2.10 poms are broken (404)

2015-10-02 Thread andy petrella
at 11:05 AM, Ted Yu <yuzhih...@gmail.com> wrote: >> > I tried to access >> > >> https://repo1.maven.org/maven2/org/apache/spark/spark-streaming_2.10/1.5.0/spark-streaming_2.10-1.5.0.pom >> > on Chrome and Firefox (on Mac) >> > I got 404 >> >

Re: [Build] repo1.maven.org: spark libs 1.5.0 for scala 2.10 poms are broken (404)

2015-10-02 Thread andy petrella
r using 1.5.1 > > Cheers > > On Fri, Oct 2, 2015 at 11:19 AM, andy petrella <andy.petre...@gmail.com> > wrote: > >> it's an option but not a solution, indeed >> >> Le ven. 2 oct. 2015 20:08, Ted Yu <yuzhih...@gmail.com> a écrit : >> >>> A

Re: [ANNOUNCE] Announcing Spark 1.5.0

2015-09-09 Thread andy petrella
You can try it out really quickly by "building" a Spark Notebook from http://spark-notebook.io/. Just choose the master branch and 1.5.0, a correct hadoop version (default to 2.2.0 though) and there you go :-) On Wed, Sep 9, 2015 at 6:39 PM Ted Yu wrote: > Jerry: > I just

Re: Notes on writing complex spark applications

2014-11-23 Thread andy petrella
Cool! On Sun Nov 23 2014 at 5:58:03 PM Evan R. Sparks evan.spa...@gmail.com wrote: Hi all, Shivaram Venkataraman, Joseph Gonzalez, Tomer Kaftan, and I have been working on a short document about writing high performance Spark applications based on our experience developing MLlib, GraphX,

Re: Spark Streaming Metrics

2014-11-21 Thread andy petrella
Yo, I've discussed with some guyz from cloudera that are working (only oO) on spark-core and streaming. The streaming was telling me the same thing about the scheduling part. Do you have some nice screenshots and info about stages running, task time, akka health and things like these -- I said

Re: Implementing TinkerPop on top of GraphX

2014-11-06 Thread andy petrella
Great stuffs! I've got some thoughts about that, and I was wondering if it would be first interesting to have something like for spark-core (let's say): 0/ Core API offering basic (or advanced → HeLP) primitives 1/ catalyst optimizer for a text base system (SPARQL, Cypher, custom SQL3, whatnot) 2/

Re: best IDE for scala + spark development?

2014-10-27 Thread andy petrella
I second the S[B]T combo! I tried ATOM → lack of features and stability (atm) aℕdy ℙetrella about.me/noootsab [image: aℕdy ℙetrella on about.me] http://about.me/noootsab On Mon, Oct 27, 2014 at 2:15 PM, Dean Wampler deanwamp...@gmail.com wrote: For what it's worth, I use Sublime Text + the

Re: [brainsotrming] Generalization of DStream, a ContinuousRDD ?

2014-08-01 Thread andy petrella
multiple pass magic) 2. An even more fundamental question is how do you ensure ordering with delayed records. If you want to process in order of application time, and records are delayed how do you deal with them. Any ideas? ;) TD On Wed, Jul 16, 2014 at 2:37 AM, andy petrella andy.petre

Re: [brainsotrming] Generalization of DStream, a ContinuousRDD ?

2014-08-01 Thread andy petrella
Actually for click stream, the users space wouldn't be a continuum, unless the order of users is important or the fact that they are coming in a kind of order can be used by the algo. The purpose of the break or binning function is to package things in a cluster for which we know the properties,

Re: [brainsotrming] Generalization of DStream, a ContinuousRDD ?

2014-07-16 Thread andy petrella
windows, its much harder. Interesting ideas nonetheless. I am curious to see how far we can push using the RDD model underneath, without losing parallelism and performance. TD On Tue, Jul 15, 2014 at 10:11 AM, andy petrella andy.petre...@gmail.com wrote: Dear Sparkers, *[sorry

Re: [brainsotrming] Generalization of DStream, a ContinuousRDD ?

2014-07-16 Thread andy petrella
you ensure ordering with delayed records. If you want to process in order of application time, and records are delayed how do you deal with them. Any ideas? ;) TD On Wed, Jul 16, 2014 at 2:37 AM, andy petrella andy.petre...@gmail.com wrote: Heya TD, Thanks for the detailed answer

[brainsotrming] Generalization of DStream, a ContinuousRDD ?

2014-07-15 Thread andy petrella
Dear Sparkers, *[sorry for the lengthy email... = head to the gist https://gist.github.com/andypetrella/12228eb24eea6b3e1389 for a preview :-p**]* I would like to share some thinking I had due to a use case I faced. Basically, as the subject announced it, it's a generalization of the DStream

Re: encounter jvm problem when integreation spark with mesos

2014-06-17 Thread andy petrella
Yep but no real resolution nor advances on this topic, since finally we've chosen to stick with a compatible version of Mesos (0.14.1 ftm). But I'm still convince it has to do with native libs clash :-s aℕdy ℙetrella about.me/noootsab [image: aℕdy ℙetrella on about.me] http://about.me/noootsab

Re: Announcing the official Spark Job Server repo

2014-03-24 Thread andy petrella
duplicated work with what Mesos or Yarn are doing (that is resources management) right? Anyways we can discuss offline if needed. Definitively, let's stop polluting the list !!! C ya andy On Thu, Mar 20, 2014 at 1:35 AM, andy petrella andy.petre...@gmail.com wrote: Heya, That's cool you've

Re: Making RDDs Covariant

2014-03-22 Thread andy petrella
Dear, I'm pretty much following the Pascal's advices, since I've myseelf encoutered some problems with implicits (when playing the same kind of game with my Neo4J Scala API). Nevertheless, one remark regarding the serialization, the lost of data shouldn't arrive in the case whenimplicit

Re: Announcing the official Spark Job Server repo

2014-03-18 Thread andy petrella
tad! That's awesome. A quick question, does someone has insights regarding having such JobServers deployed using Marathon on Mesos? I'm thinking about an arch where Marathon would deploy and keep the Job Servers running along with part of the whole set of apps deployed on it regarding the

Re: [re-cont] map and flatMap

2014-03-15 Thread andy petrella
Dev pascal.voitot@gmail.com wrote: On Wed, Mar 12, 2014 at 3:06 PM, andy petrella andy.petre...@gmail.com wrote: Folks, I want just to pint something out... I didn't had time yet to sort it out and to think enough to give valuable strict explanation of -- event though, intuitively

[re-cont] map and flatMap

2014-03-12 Thread andy petrella
Folks, I want just to pint something out... I didn't had time yet to sort it out and to think enough to give valuable strict explanation of -- event though, intuitively I feel they are a lot === need spark people or time to move forward. But here is the thing regarding *flatMap*. Actually, it