Re: PySpark / scikit-learn integration sprint at Cloudera - Strata Conference Friday 14th Feb 2014

2013-12-04 Thread Olivier Grisel
2013/12/3 Horia ho...@alum.berkeley.edu: I am very interested in this and will most definitely participate! Please share the event sign-up list and location details when all the organizational hurdles have been resolved :-) Great! I just created a new entry for this sprint on the scikit-learn

Re: PySpark / scikit-learn integration sprint at Cloudera - Strata Conference Friday 14th Feb 2014

2013-12-04 Thread Olivier Grisel
That should be fixed but only if nobody clicks on the previous URL... Use the following instead: https://github.com/scikit-learn/scikit-learn/wiki/Upcoming-events That's a weird github bug... -- Olivier

Sorry about business lately and general unavailability

2013-12-04 Thread Chris Mattmann
Hey Guys, Just wanted to apologize for the general lack of my availability lately. I thought moving from Rancho Cucamonga to Pasadena, CA (over 50 miles) wouldn't affect my productivity, and with that and the holidays, and all the house work and moving stuff I've had to do, coupled with $dayjob

Re: PySpark / scikit-learn integration sprint at Cloudera - Strata Conference Friday 14th Feb 2014

2013-12-04 Thread Josh Rosen
Thanks for organizing this! I'll definitely be attending. - Josh On Wed, Dec 4, 2013 at 6:07 AM, Olivier Grisel olivier.gri...@ensta.org wrote: 2013/12/4 Olivier Grisel olivier.gri...@ensta.org: That should be fixed but only if nobody clicks on the previous URL... Use the following

Re: Sorry about business lately and general unavailability

2013-12-04 Thread Reynold Xin
Thanks for the update, Chris. We do need to graduate soon. People have been asking me whether incubating means the project is very immature. :( One thing we need to do is to import the JIRA tickets from AMPLab's JIRA. That INFRA ticket hasn't moved along much. Can you help push that? On Wed, Dec

Re: Sorry about business lately and general unavailability

2013-12-04 Thread Matei Zaharia
No worries Chris! Apart from the JIRA thing, we also plan another release or two soon. Matei On Dec 4, 2013, at 3:36 PM, Reynold Xin r...@apache.org wrote: Thanks for the update, Chris. We do need to graduate soon. People have been asking me whether incubating means the project is very

Re: Sorry about business lately and general unavailability

2013-12-04 Thread Roman Shaposhnik
On Wed, Dec 4, 2013 at 3:36 PM, Reynold Xin r...@apache.org wrote: Thanks for the update Chris. We do need to graduate soon. I think the important thing to realize is that there's no rush and the project should graduate whenever the community has demonstrated its capacity for functioning under the

Re: Spark streaming quantile?

2013-12-04 Thread Ryan Weald
Hi Sandy, You could take a look at using the Q-Tree data structure that is provided by Twitter's Algebird (https://github.com/twitter/algebird/blob/develop/algebird-core/src/main/scala/com/twitter/algebird/QTree.scala). Due to the associative properties of Algebird's SemiGroup it is ideally suited