Re: spark and plot data

2016-07-23 Thread Gourav Sengupta
a...@santacruzintegration.com> >>>> ha scritto: >>>> >>>>> Hi Pseudo >>>>> >>>>> Plotting, graphing, data visualization, report generation are common >>>>> needs in scientific and enterprise computing. >>&

Re: spark and plot data

2016-07-23 Thread Andrew Ehrlich
assume you mean plotting and graphing) into spark. >> >> >> In my personal work all the graphing is done in the driver on summary stats >> calculated using spark. So for me using standard python libs has not been a >> problem. >> >> Andy >> >>

Re: spark and plot data

2016-07-23 Thread andy petrella
o: >>>> >>>>> Hi Pseudo >>>>> >>>>> Plotting, graphing, data visualization, report generation are common >>>>> needs in scientific and enterprise computing. >>>>> >>>>> Can you tell me more about your u

Re: spark and plot data

2016-07-23 Thread Gourav Sengupta
;> Can you tell me more about your use case? What is it about the current >>>> process / workflow do you think could be improved by pushing plotting (I >>>> assume you mean plotting and graphing) into spark. >>>> >>>> >>>> In my personal w

Re: spark and plot data

2016-07-23 Thread Gourav Sengupta
Hi Taotao, that is the way its usually used to visualize data from SPARK. But I do see that people transfer the data to list to feed to Matplot (as in the SPARK course currently running in EDX). Please try using blaze and bokeh and you will be in a new world altogether. Regards, Gourav On

Re: spark and plot data

2016-07-22 Thread Taotao.Li
hi, pesudo, I've posted a blog before spark-dataframe-introduction , and for me, I use spark dataframe [ or RDD ] to do the logic calculation on all the datasets, and then transform the result into pandas dataframe, and make

Re: spark and plot data

2016-07-22 Thread Pedro Rodriguez
;>> >>> >>> In my personal work all the graphing is done in the driver on summary >>> stats calculated using spark. So for me using standard python libs has not >>> been a problem. >>> >>> Andy >>> >>> From: pseudo oduesp

Re: spark and plot data

2016-07-22 Thread Gourav Sengupta
So for me using standard python libs has not >> been a problem. >> >> Andy >> >> From: pseudo oduesp <pseudo20...@gmail.com> >> Date: Thursday, July 21, 2016 at 8:30 AM >> To: "user @spark" <user@spark.apache.org> >> Subject: sp

Re: running jupyter notebook server Re: spark and plot data

2016-07-22 Thread Inam Ur Rehman
s of being released. The new Juypter lab > alpha was just announced it looks really sweet. > > > > From: pseudo oduesp <pseudo20...@gmail.com> > Date: Friday, July 22, 2016 at 2:08 AM > To: Andrew Davidson <a...@santacruzintegration.com> > Subject: Re: s

running jupyter notebook server Re: spark and plot data

2016-07-22 Thread Andy Davidson
e new Juypter lab alpha was just announced it looks really sweet. From: pseudo oduesp <pseudo20...@gmail.com> Date: Friday, July 22, 2016 at 2:08 AM To: Andrew Davidson <a...@santacruzintegration.com> Subject: Re: spark and plot data > HI andy , > thanks for reply , >

Re: spark and plot data

2016-07-22 Thread Pedro Rodriguez
n libs has not been a problem. Andy From: pseudo oduesp <pseudo20...@gmail.com> Date: Thursday, July 21, 2016 at 8:30 AM To: "user @spark" <user@spark.apache.org> Subject: spark and plot data Hi ,  i know spark  it s engine  to compute large data set but for me i work

Re: spark and plot data

2016-07-22 Thread Marco Colombo
To: "user @spark" <user@spark.apache.org > <javascript:_e(%7B%7D,'cvml','user@spark.apache.org');>> > Subject: spark and plot data > > Hi , > i know spark it s engine to compute large data set but for me i work > with pyspark and it s very wonderful machine >

Re: spark and plot data

2016-07-21 Thread Andy Davidson
user @spark" <user@spark.apache.org> Subject: spark and plot data > Hi , > i know spark it s engine to compute large data set but for me i work with > pyspark and it s very wonderful machine > > my question we don't have tools for ploting data each time we have to swi

spark and plot data

2016-07-21 Thread pseudo oduesp
Hi , i know spark it s engine to compute large data set but for me i work with pyspark and it s very wonderful machine my question we don't have tools for ploting data each time we have to switch and go back to python for using plot. but when you have large result scatter plot or roc curve