Thanks Eran. So 3, seems to be something external to Zeppelin, and
hopefully 1 only means running "zeppelin-daemon.sh start" on a slave
machine, when master become inaccessible. Is that correct?

My main concern still remains on the storage front. And I don't really have
high availability disks or even hdfs in my setup. I have been using elastic
search cluster for data high availability, but was hoping that zeppelin can
save notebooks to a Elastic Search (like kibana) or maybe a document store.

Any idea if anything is planned in that direction. Don't want to fallback
to 'rsync' like options.

Regards,
Ashish



On Tue, Apr 5, 2016 at 11:17 PM, Eran Witkon <eranwit...@gmail.com> wrote:

> For 1 you need to have both zeppelin web HA and zeppelin deamon HA
> For 2 I guess you can use HDFS if you implement the storage interface for
> HDFS. But i am not sure.
> For 3 I mean that if you connect to an external cluster for example a
> spark cluster you need to make sure your spark cluster is HA. Otherwise you
> will have zeppelin running but your notebook will fail as no spark cluster
> available.
> HTH
> Eran
>
>
> On Tue, 5 Apr 2016 at 20:20 ashish rawat <dceash...@gmail.com> wrote:
>
>> Thanks Eran for your reply.
>> For 1) I am assuming that it would similar to HA of any other web
>> application, i.e. running multiple instances and switching to the backup
>> server when master is down, is it not the case?
>> For 2) is it also possible to save it on hdfs?
>> Can you please explain 3, are you referring to interpreter config? If I
>> am using Spark interpreter and submitting jobs to it, and if zeppelin
>> master node goes down, then what could be the problem in slave node
>> pointing to the same cluster and submitting jobs?
>>
>> Regards,
>> Ashish
>>
>> On Tue, Apr 5, 2016 at 10:08 PM, Eran Witkon <eranwit...@gmail.com>
>> wrote:
>>
>>> I would say you need to account for these things
>>> 1) availability of the zeppelin deamon
>>> 2) availability of the notebookd files
>>> 3) availability of the interpreters used.
>>>
>>> For 1 i don't know of out-of-box solution
>>> For 2 any ha storage will do, s3 or any ha external mounted disk
>>> For 3 it is up to the interpreter and your big data ha solution
>>>
>>> On Tue, 5 Apr 2016 at 19:29 ashish rawat <dceash...@gmail.com> wrote:
>>>
>>>> Hi,
>>>>
>>>> Is there a suggested architecture to run Zeppelin in high availability
>>>> mode. The only option I could find was by saving notebooks to S3. Are there
>>>> any options if one is not using AWS?
>>>>
>>>> Regards,
>>>> Ashish
>>>>
>>>
>>

Reply via email to