pyspark and elasticsearch - accessing ES fileds

2016-04-11 Thread Oren Shani
Hi All, I connected pyspark under Zeppelin to my Elasticsearch DB and I am able to do this: %pyspark es_rdd = sc.newAPIHadoopRDD( inputFormatClass="org.elasticsearch.hadoop.mr.EsInputFormat", keyClass="org.apache.hadoop.io.NullWritable", valueClass="org.elasticsearch.hadoop.mr.Linked

RE: It works, COOL, but now what?

2016-04-07 Thread Oren Shani
things like Apache drill for that. Zeppelin is THE tool for REPL type analysis against different data sources. Select your interpreter, issue a query, analyze the results using the different visual tools and continue the next question. Eran On Thu, Apr 7, 2016 at 5:00 PM Oren Shani mailto:osh

RE: It works, COOL, but now what?

2016-04-07 Thread Oren Shani
against ES. From their you are on your own :-) Eran On Thu, Apr 7, 2016 at 4:23 PM Oren Shani mailto:osh...@iucc.ac.il>> wrote: Hi All, So I got Zeppelin up and running, and even connected to Elasticsearch, which is cool, but now what? How do I use Zeppelin to analyze my Elasticsearch da

It works, COOL, but now what?

2016-04-07 Thread Oren Shani
Hi All, So I got Zeppelin up and running, and even connected to Elasticsearch, which is cool, but now what? How do I use Zeppelin to analyze my Elasticsearch data? I know it's a big, vague question, but I'm kind of hoping that some of you can point me in the right direction ( Documentation, ex

RE: Zeppelin 0.5.6 Disconnected

2016-04-04 Thread Oren Shani
zeppelin log files that is under the "~/$ZEPPELIN_HOME/logs" ? Thanks. 2016-04-05 0:05 GMT+09:00 Oren Shani mailto:osh...@iucc.ac.il>>: Hi, Nginx should be configured OK now, but still disconnected. Could it be something else? For example, did I have to install Spark and/o

RE: Zeppelin 0.5.6 Disconnected

2016-04-04 Thread Oren Shani
@zeppelin.incubator.apache.org Subject: Re: Zeppelin 0.5.6 Disconnected hi. You could refer to https://github.com/apache/incubator-zeppelin/pull/814. I just tested on Ubuntu 14.04.3 and it's working well. Thanks. 2016-04-04 21:08 GMT+09:00 Oren Shani mailto:osh...@iucc.ac.il>>: Sung Shim, I

RE: Zeppelin 0.5.6 Disconnected

2016-04-04 Thread Oren Shani
roxy_http_version 1.1; proxy_set_header Upgrade $http_upgrade; proxy_set_header Connection "upgrade"; proxy_set_header X-Real-IP $remote_addr; proxy_pass http://backendWS<http://backendws/>; } } I am fixing the documentation. Thanks. 2016-04-04 17:24 GMT+09:00 Oren Shani

Zeppelin 0.5.6 Disconnected

2016-04-04 Thread Oren Shani
Hello, I installed Zeppelin 0.5.6 on Ubuntu server 14.04, behind Nginx. Zeppelin comes up but shows "Disconnected" in the top right corner. I found several references to a similar problem on the web and most of them suggest that the problem has to do with access to Zeppelin's winsock port (80