[GSoC 2016][Notebooks] Link for GSoC submission

2016-08-20 Thread Paul Bustios Belizario
Hi community, This is the link to my GSoC work submission. Suggestions are welcome. [0] https://github.com/bustios/zeppelin-notebooks Regards, Paul

Re: Python SQL over Pandas Dataframe, was - Re: [GSoC 2016] Notebooks

2016-07-30 Thread Paul Bustios Belizario
Hi Alexander, I created the JIRA issue https://issues.apache.org/jira/browse/ZEPPELIN-1255 and submitted the PR https://github.com/apache/zeppelin/pull/1249 to fix it. Also, I updated the notebook at [1] with this feature. [1]

Re: Python SQL over Pandas Dataframe, was - Re: [GSoC 2016] Notebooks

2016-07-29 Thread Alexander Bezzubov
Hi Paul, it definitely looks like a bug and the right fix to me! Could you please create a JIRA issue and submit a PR with the fix? I think it is a very valuable contribution, thank you! -- Alex On Fri, Jul 29, 2016 at 3:45 PM, Paul Bustios Belizario wrote: > Hi Alex, > >

Re: [GSoC 2016] Notebooks

2016-07-27 Thread Paul Bustios Belizario
Hi Alexander, Yes, I'm using the latest version of the code in the master branch and I have installed pandas and pandasql. By the way, I searched the repository. Below are the 2 screenshots of the search results for: PythonPandasSQL*Interpreter*

Re: [GSoC 2016] Notebooks

2016-07-25 Thread Alexander Bezzubov
Hi Paul, this sounds very strange indeed. Please make sure you are using the latest master. To get the correct interpreter class names, it should be enough to delete /conf/interpreter-settings.json and restart Zeppelin; the file will be re-created. Regarding the dependencies needed to run %python.sql (it's

Re: [GSoC 2016] Notebooks

2016-07-25 Thread Paul Bustios Belizario
Hi Alexander, Yes, I knew that, but for some reason that I'm still investigating, z.show() doesn't display the dataframe in my notebook. That's why I decided not to incorporate z.show() yet. As soon as I find the problem I will add it. Regarding pandasql, there is an error creating the

Re: [GSoC 2016] Notebooks

2016-07-24 Thread Alexander Bezzubov
Thanks for sharing your progress, Paul, the notebook looks great! By the way, did you know that in the latest Apache Zeppelin, instead of ``` print(titanic.head()) ``` one can use ``` z.show(titanic) ``` ? It would be a good opportunity to showcase this [1] and other features of the Python
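
For readers outside Zeppelin, where `z` is not defined, the difference between the two calls can be sketched roughly as follows. This is a minimal illustration, assuming the Zeppelin display system's `%table` text format (tab-separated columns, newline-separated rows). The helper name `to_zeppelin_table` and the sample rows are hypothetical, and whether `z.show()` emits exactly this text for a pandas DataFrame is an assumption.

```python
def to_zeppelin_table(columns, rows):
    """Render tabular data as text in Zeppelin's %table display format.

    Hypothetical helper for illustration only; it is not part of Zeppelin.
    """
    # Header cells are separated by tabs.
    header = "\t".join(str(c) for c in columns)
    # Each row goes on its own line, with cells separated by tabs.
    body = "\n".join("\t".join(str(v) for v in row) for row in rows)
    return "%table " + header + "\n" + body


# Made-up sample rows, not taken from the Titanic dataset.
columns = ["name", "size"]
rows = [("sun", 100), ("moon", 10)]

# Printing text in this form from a Zeppelin paragraph is rendered as a
# table; plain print(df.head()) only produces preformatted text.
print(to_zeppelin_table(columns, rows))
```

Inside a notebook, `z.show(titanic)` would normally be the simpler choice, since the rendered table can then be switched to the built-in charts.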

Re: [GSoC 2016] Notebooks

2016-07-22 Thread Paul Bustios Belizario
Thanks Moon, Here is my third notebook using the Titanic dataset: https://www.zeppelinhub.com/viewer/notebooks/bm90ZTovL2J1c3Rpb3MvbG9jYWwvYmI0Y2EwNjVkMTI1NDY2Y2EzNTIzNThiZjViYzIxOWQvbm90ZS5qc29u Now, I'm working on the fourth notebook and updating my first notebook to use z.show() Regards,

Re: [GSoC 2016] Notebooks

2016-07-16 Thread moon soo Lee
Hi Paul, That would be very interesting! And like you mentioned, these are datasets for starters. I think it's super reasonable to have notebooks with those data. Thanks, moon On Sat, Jul 9, 2016 at 11:09 AM Paul Bustios Belizario wrote: > Hi community, > > I was

[GSoC 2016] Notebooks

2016-07-08 Thread Paul Bustios Belizario
Hi community, I was looking for datasets and chose [1,2] for the next notebooks. These datasets are not big, but they are classic and educational for people who are starting out in data science. Additionally, working through the machine learning process on these datasets can produce many plots.