Re: [DISCUSS] ZEPPELIN-2619. Save note in [Title].zpln instead of [NOTEID]/note.json

2018-08-30 Thread andreas . weise
Another reason for keeping noteId is uniqueness in multi-user
environments. There, users have separate Zeppelin workspaces, which is
something we use in production: see ZEPPELIN_NOTEBOOK_PUBLIC=false in the
doc [1]. In that setup users might be very confused when they cannot create
a notebook because its name is already taken by a note they most likely
cannot see (yet).

So I like the proposal {note_name}_{note_id}.zpln, where note_name could
contain folders, e.g. folder_1/mynote_abcd.zpln. That said, I like
{note_name}.{note_id}.zpln (a dot between note_name and note_id) even better
:-)
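
To make the naming scheme concrete, here is a minimal Python sketch of how a
{note_name}_{note_id}.zpln path could be built and then parsed back without
opening the file (the point made further down the thread about reading file
names being cheaper than reading note files). The function names and the
`sep` parameter are hypothetical and not part of Zeppelin; the sketch also
assumes the note id never contains the separator or a slash.

```python
from typing import Tuple

SUFFIX = ".zpln"


def build_note_path(note_name: str, note_id: str, sep: str = "_") -> str:
    """Join a (possibly foldered) note name and its id into a relative path."""
    return f"{note_name}{sep}{note_id}{SUFFIX}"


def parse_note_path(path: str, sep: str = "_") -> Tuple[str, str]:
    """Recover (note_name, note_id) from the file name alone."""
    if not path.endswith(SUFFIX):
        raise ValueError(f"not a {SUFFIX} file: {path!r}")
    stem = path[: -len(SUFFIX)]
    # Split on the LAST separator, so note names that themselves contain
    # the separator (e.g. "folder_1/mynote") are still handled.
    note_name, found, note_id = stem.rpartition(sep)
    if not found or not note_name or not note_id or "/" in note_id:
        raise ValueError(f"no note id found in {path!r}")
    return note_name, note_id
```

With the underscore variant, parse_note_path("folder_1/mynote_abcd.zpln")
gives ("folder_1/mynote", "abcd"); passing sep="." handles the dot variant
in the same way.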

Regards
Andreas


[1] 
http://zeppelin.apache.org/docs/0.8.0/setup/security/notebook_authorization.html#separate-notebook-workspaces-public-vs-private

On 2018/08/18 08:42:44, Jeff Zhang  wrote: 
> BTW, I would also prefer to use the note name as the identifier of a note,
> if the issue I mentioned before is acceptable to most users.
> 
> 
> 
> On Sat, Aug 18, 2018 at 4:40 PM, Jeff Zhang wrote:
> 
> >
> > I am afraid we cannot remove noteId: it is the unique, immutable
> > identifier of a note and is used in a lot of places, such as paragraph
> > sharing and the REST API.
> > If we used the note name as the note id, renaming a note could break
> > users' apps.
> >
> >
> > On Sat, Aug 18, 2018 at 2:33 PM, Jongyoul Lee wrote:
> >
> >> Hi, thanks for this kind of discussion.
> >>
> >> About noteId: how about changing the note id to the note name? AFAIK,
> >> the note id is just an identifier and we can set any value for it.
> >>
> >> There are two potential problems. First, we would have to be more
> >> careful in handling note ids, as they could contain a wide variety of
> >> characters. Second, when someone changes a note name, those who are
> >> viewing and updating the same note would no longer be able to access it.
> >> We could handle that with websockets.
> >>
> >> WDYT?
> >>
> >> On Tue, 14 Aug 2018 at 6:14 PM Jeff Zhang  wrote:
> >>
> >>> >>> But I’m still not comfortable with note ids in the name of the
> >>> notebook itself.  Those names would look ugly if you shared your notebooks
> >>> on github for example.  You don’t see Jupyter notebooks with names like
> >>> that. If you have to keep the note ids with the notebooks could you not
> >>> simply put the note id at the top of the notebook as Ruslan suggested?
> >>> Then you’d only have to read the first line of each notebook.
> >>>
> >>> I know putting note_id in the note file name is not so elegant, but this
> >>> is the compromise we have to make to keep compatibility, as we use noteId
> >>> to uniquely identify notes right now. And I don't think putting noteId on
> >>> the first line of the note would help much. We would still have to read
> >>> the note files, which takes much more time than just reading the file
> >>> names via the file system.
> >>>
> >>> Regarding the readability of the note file name, I think it won't be
> >>> affected much. E.g. the note file name would look like: *My Project/My
> >>> Spark Tutorial Note_2A94M5J1Z.zpln*
> >>> What the user sees in the notebook menu is still *My Project/My Spark
> >>> Tutorial Note*, which is no different from what we see now.
> >>>
> >>> And thanks again for the feedback and comments, I am so glad to see so
> >>> much discussion in the community.
> >>>
> >>>
> >>>
> >>> On Tue, Aug 14, 2018 at 4:29 PM, Partridge, Lucas (GE Aviation) wrote:
> >>>
>  I agree you’re inviting consistency issues if you maintained a separate
>  note id-to-note name mapping file.
> 
> 
> 
>  But I’m still not comfortable with note ids in the name of the notebook
>  itself.  Those names would look ugly if you shared your notebooks on github
>  for example.  You don’t see Jupyter notebooks with names like that.  If you
>  have to keep the note ids with the notebooks could you not simply put the
>  note id at the top of the notebook as Ruslan suggested? Then you’d only
>  have to read the first line of each notebook.
> 
> 
> 
>  Presumably if you copied the notebooks to another Zeppelin server they
>  would be restored with the same note ids there too? And hopefully there
>  would be no id clash with notebooks already on that server…
> 
> 
> 
>  *From:* Jeff Zhang 
>  *Sent:* 14 August 2018 03:49
>  *To:* users@zeppelin.apache.org
> 
> 
>  *Subject:* EXT: Re: [DISCUSS] ZEPPELIN-2619. Save note in [Title].zpln
>  instead of [NOTEID]/note.json
> 
> 
> 
> 
> 
>  Thanks for the discussion.
> 
>  >>> I'm afraid about non-latin symbols in folder and note name. And
>  what about hieroglyphs?
> 
>  AFAIK, Linux allows any character in a file name except `\0` and
>  `/`.  I can create file names with Chinese characters on Linux, so I guess
>  you can use Russian as well.
> 
> 
> 
>  >>> If I understand correctly, this is being done solely to speed up
>  loading list of notebooks? What if a list of notebook names, their ids,
>  folder 

livy2.sql or spark2.sql does not show result

2018-08-30 Thread Lian Jiang
I am using HDP 3.0 (which ships Zeppelin 0.8.0).

The queries below do not show any result, even though downloading the CSV
shows the data correctly (e.g. if there are no tables, it shows the header).

%livy2.sql
show tables

%spark2.sql
show tables

Occasionally I saw the table show up for a short time and then disappear. I
guess it is an HTML rendering issue.

Any ideas? Is any setting related to this? The SQL interpreters worked fine
in HDP 2.6 with Zeppelin 0.7.3.

Appreciate any clue.


Re: livy2.sql or spark2.sql does not show result

2018-08-30 Thread Lian Jiang
Here is the javascript error if it helps:

Error: [ngRepeat:iidexp] '_item_' in '_item_ in _collection_' should be an
identifier or '(_key_, _value_)' expression, but got
'/gateway/ui/zeppelin/app'.
http://errors.angularjs.org/1.5.7/ngRepeat/iidexp?p0=%2Fgateway%2Fui%2Fzeppelin%2Fapp
b/<@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:36:376
Kghttps://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:40:30580
Z@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:37:4755
S@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:36:30480
S@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:36:30611
S@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:36:30611
N@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:36:29413
X/<@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:37:440
d@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:36:30833
m@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:37:909
mghttps://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:40:17582
xc/this.$gethttps://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:38:10966
a/n.prototype.safeDigest@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:76:1460
b@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:76:3748
a/n.prototype._onMessageHandler@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:76:3960
R/<@
https://mydomain.com/gateway/ui/zeppelin/scripts/vendor.49d751b0c72342f6.js:36:5633
" 



Re: [DISCUSS] ZEPPELIN-2619. Save note in [Title].zpln instead of [NOTEID]/note.json

2018-08-30 Thread Jongyoul Lee
Hi,

I have slightly different thoughts about name conflicts for newly created
notes. In a multi-user environment, AFAIK, most teams and companies use a
prefix internally as a group policy. In my case it is
user/{user_id}/{notebook_name_they_want}.zpln. With that, naming conflicts
rarely happen, and each note is stored under a specific folder. If
someone needs two different notes with the same name in the same directory,
that might not be appropriate anyway. WDYT?
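
As a rough illustration of the prefix convention, here is a small Python
sketch. The helper names are hypothetical, and the user/{user_id}/ layout is
only the in-house convention described in this message, not something
Zeppelin enforces. It shows that two users can pick the same note name
without colliding, while the same user cannot reuse a name in one directory.

```python
from typing import Set


def user_note_path(user_id: str, note_name: str) -> str:
    """Build user/{user_id}/{note_name}.zpln for a per-user workspace."""
    if "/" in user_id or not user_id:
        raise ValueError("user_id must be a single, non-empty path segment")
    return f"user/{user_id}/{note_name}.zpln"


def name_taken(existing_paths: Set[str], user_id: str, note_name: str) -> bool:
    """True if this user already has a note stored under that name."""
    return user_note_path(user_id, note_name) in existing_paths


existing = {user_note_path("alice", "report")}
# A different user can reuse the name; the same user cannot.
bob_conflict = name_taken(existing, "bob", "report")
alice_conflict = name_taken(existing, "alice", "report")
```

Here bob_conflict is False and alice_conflict is True, because the user id is
part of the path.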

JL

On Fri, Aug 31, 2018 at 4:44 AM, andreas.we...@gmail.com <
andreas.we...@gmail.com> wrote:

> Another reason for keeping noteId is uniqueness in multi-user
> environments. There, users have separate Zeppelin workspaces, which is
> something we use in production: see ZEPPELIN_NOTEBOOK_PUBLIC=false
> in the doc [1]. In that setup users might be very confused when they cannot
> create a notebook because its name is already taken by a note they most
> likely cannot see (yet).
>
> So I like the proposal {note_name}_{note_id}.zpln, where note_name could
> contain folders, e.g. folder_1/mynote_abcd.zpln. That said, I like
> {note_name}.{note_id}.zpln (a dot between note_name and note_id) even
> better :-)
>
> Regards
> Andreas
>
>
> [1] http://zeppelin.apache.org/docs/0.8.0/setup/security/notebook_authorization.html#separate-notebook-workspaces-public-vs-private

Re: "Read replica" for Zeppelin

2018-08-30 Thread Anthony Corbacho
Hi,
You can also take a look at zepl.com.
We provide a way to periodically sync notebooks from S3, and you can
share them as read-only.
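
For anyone rolling their own periodic sync, the core of each polling round is
deciding which notes changed since the last round. Below is a minimal Python
sketch of that step only. It is not how ZEPL or Zeppelin implements it; the
function name is hypothetical, and it assumes you already have two listings
that map each note's storage key to a version tag (for S3, the object ETag
would be a natural choice).

```python
from typing import Dict, List, Tuple


def plan_sync(
    local: Dict[str, str], remote: Dict[str, str]
) -> Tuple[List[str], List[str]]:
    """Compare key -> version-tag listings and return (to_fetch, to_delete)."""
    # Fetch notes that are new or whose version tag differs from the local copy.
    to_fetch = sorted(key for key, tag in remote.items() if local.get(key) != tag)
    # Drop local copies of notes that no longer exist remotely.
    to_delete = sorted(key for key in local if key not in remote)
    return to_fetch, to_delete


local_listing = {"a/note.json": "v1", "b/note.json": "v1", "c/note.json": "v1"}
remote_listing = {"a/note.json": "v1", "b/note.json": "v2", "d/note.json": "v1"}
to_fetch, to_delete = plan_sync(local_listing, remote_listing)
```

In this example to_fetch is ["b/note.json", "d/note.json"] and to_delete is
["c/note.json"]. A scheduler would run this on a fixed interval and then
download or remove the listed notes on the read-only side.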

Regards,
Anthony

On Wed, 29 Aug 2018 at 10:22 PM Jongyoul Lee  wrote:

> I like this idea, totally.
>
> I have exactly the same issue and have thought about providing a read-only
> mode. BTW, the Zeppelin server doesn't look suitable for this at the moment.
> Personally, I have an idea to make a simple web server that acts only as a
> viewer.
>
> JL
>
> On Wed, Aug 29, 2018 at 9:55 PM, peay  wrote:
>
>> Hello,
>>
>> I have a Zeppelin server with an S3 repo backend for notebooks.
>>
>> I would like to have a second Zeppelin instance access the notebooks
>> in read-only mode, mostly for security reasons, but also as a weak form
>> of HA in general.
>>
>> Any suggestion on how to achieve this? I've set it up naively, but the
>> missing piece is getting the read-only Zeppelin to reload notebooks from S3
>> periodically (or when getting web server requests, for instance). Any help
>> would be appreciated.
>>
>> Thanks!
>>
>>
>
>
> --
> 이종열, Jongyoul Lee, 李宗烈
> http://madeng.net
>