Newbie question about Spark and Elasticsearch

2014-12-08 Thread Mohamed Lrhazi
I am trying to understand how Spark and ES work together. Could someone
please help me answer this question?

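import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.io.{MapWritable, Text}
import org.elasticsearch.hadoop.mr.EsInputFormat

// sc is an existing SparkContext
val conf = new Configuration()
conf.set("es.resource", "radio/artists")  // index/type to read from
conf.set("es.query", "?q=me*")            // URI query selecting the documents
val esRDD = sc.newAPIHadoopRDD(conf,
  classOf[EsInputFormat[Text, MapWritable]],
  classOf[Text], classOf[MapWritable])
val docCount = esRDD.count()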


When and where is data being transferred from ES? Is it all collected on
the Spark master node, then partitioned and sent to the worker nodes? Or
does each worker node talk to ES directly to fetch a partition of the data?

How does this actually work?

Thanks a lot,
Mohamed.

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAEU_gmf9Nt0xn_0NbzDn_moRWUT96uWYf4cicJdZik3r0Zz8XA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: Newbie question about Spark and Elasticsearch

2014-12-08 Thread Mohamed Lrhazi
Great, thanks a lot Costin.

Are people supposed to deploy the Spark workers on the same nodes as the ES
cluster? I guess it would make sense for data to remain local and avoid
network transfers altogether?

Thanks a lot,
Mohamed.


On Monday, December 8, 2014 10:19:12 AM UTC-5, Costin Leau wrote:

 Hi, 

 First off, I recommend using the native integration (aka the Java/Scala 
 APIs) instead of MapReduce. The latter works, but the former performs 
 better and is more flexible. 

 ES works in a similar fashion to the HDFS store: the data doesn't go 
 through the master; rather, each task has its own partition and works 
 on its own set of data. Behind the scenes we map each worker to an index 
 shard (if there aren't enough workers, then some will work across 
 multiple shards). 
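
A minimal sketch of the native integration recommended above, assuming an elasticsearch-hadoop release with Spark support is on the classpath and that an Elasticsearch node is reachable at localhost:9200. The address, the object name and the application name are assumptions, not something stated in this thread. It reads the same radio/artists resource with the same query as the original snippet; printing the partition count is one way to see how the read is split up, which should reflect the per-shard split described above.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.elasticsearch.spark._ // adds esRDD(...) to SparkContext

// Hypothetical driver program; the master URL is expected to be supplied
// by spark-submit rather than hard-coded here.
object ArtistCount {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("es-artist-count")
      .set("es.nodes", "localhost:9200") // assumption: address of one ES node

    val sc = new SparkContext(conf)
    try {
      // Same resource (index/type) and URI query as in the question.
      val artists = sc.esRDD("radio/artists", "?q=me*")

      // Each partition is read by a task running on a worker, not by the driver.
      println(s"partitions: ${artists.partitions.length}")
      println(s"documents:  ${artists.count()}")
    } finally {
      sc.stop()
    }
  }
}
```

Only the final count travels back to the driver; the documents themselves are pulled by the tasks that own each partition.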



 -- 
 Costin 




Re: Failed to start shard

2014-10-27 Thread Mohamed Lrhazi
Sorry, I don't recall! But I found this in an email I sent on the same thread,
I think:

 For me the change that seemed to have worked, I keep my fingers crossed,
is to add node.master=False to all nodes except one... black magic!

Other things we did: adding more nodes and deleting indexes with
zero documents.

HTH,
Mohamed.


On Mon, Oct 27, 2014 at 8:12 AM, Anirudh Perugu anir...@unbxd.com wrote:

 Hi Mohamed,

 Did you find a solution to this problem?




Re: Unstable cluster - suspect illegal state: trying to move shard from primary mode to replica mode

2014-07-28 Thread Mohamed Lrhazi
Thanks Robin. For me the change that seemed to have worked, I keep my
fingers crossed, is to add node.master=False to all nodes except one...
black magic!

Thanks,
Mohamed.
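
The two changes reported in this thread (leaving a single master-eligible node via node.master, and raising the required number of master nodes to N-1) can be compared with a little arithmetic. The sketch below assumes that "required master nodes" refers to the discovery.zen.minimum_master_nodes setting, and that the goal is to stop two halves of a partitioned cluster from each electing their own master; the thread confirms neither. MasterQuorum, quorum and splitBrainPossible are hypothetical names used only for illustration.

```scala
object MasterQuorum {
  // Majority of the master-eligible nodes: floor(n / 2) + 1.
  def quorum(masterEligible: Int): Int = {
    require(masterEligible > 0, "need at least one master-eligible node")
    masterEligible / 2 + 1
  }

  // Two disjoint groups of nodes can each reach the required count
  // only if twice that count still fits into the master-eligible nodes.
  def splitBrainPossible(masterEligible: Int, minimumMasterNodes: Int): Boolean =
    2 * minimumMasterNodes <= masterEligible

  def main(args: Array[String]): Unit = {
    // Compare the majority value with the N-1 choice for small clusters.
    for (n <- 1 to 5) {
      println(s"master-eligible=$n majority=${quorum(n)} n-1=${n - 1}")
    }
  }
}
```

Under these assumptions, a cluster with one master-eligible node cannot elect a second master, at the cost of a single point of failure, and for N of 3 or more the N-1 value is at least the majority.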


On Sun, Jul 27, 2014 at 7:35 AM, Robin Clarke ro...@robinclarke.net wrote:

 I adjusted the required master nodes to N-1, where N was the total number
 of master nodes I have.






 --
 Best winds,
 -Robin-
 ~:)



Re: Unstable cluster - suspect illegal state: trying to move shard from primary mode to replica mode

2014-07-23 Thread Mohamed Lrhazi
I think I am running into this same issue, even after upgrading to 1.2.2.

Did you stabilize your cluster?

Thanks,
Mohamed.

On Saturday, May 24, 2014 5:05:55 AM UTC-4, Robin Clarke wrote:

 And found this error too in one of the nodes which left the cluster:

 java.lang.NullPointerException
     at org.elasticsearch.gateway.local.state.meta.LocalGatewayMetaState.clusterChanged(LocalGatewayMetaState.java:185)
     at org.elasticsearch.gateway.local.LocalGateway.clusterChanged(LocalGateway.java:207)
     at org.elasticsearch.cluster.service.InternalClusterService$UpdateTask.run(InternalClusterService.java:431)
     at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.run(PrioritizedEsThreadPoolExecutor.java:134)
     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
     at java.lang.Thread.run(Thread.java:744)

 -Robin-




Failed to start shard

2014-07-22 Thread Mohamed Lrhazi


Hello,

Our master is constantly logging this error, about twice per second.

I saw a similar error mentioned in the release notes for a previous version;
this cluster is running 1.0.1. Is it the same issue? What workarounds do we have?


[2014-07-22 23:50:28,533][WARN ][cluster.action.shard ] [rap-es2] [de_2007][0] 
received shard failed for [de_2007][0], node[EtDIFKT7Qi-nAuvG47tGYQ], [P], 
s[INITIALIZING], indexUUID [yLrHy5k7RqO6rLnl_b4kww], reason [Failed to start 
shard, message [IndexShardGatewayRecoveryException[[de_2007][0] failed to fetch 
index version after copying it over]; nested: 
IndexShardGatewayRecoveryException[[de_2007][0] shard allocated for local 
recovery (post api), should exist, but doesn't, current files: []]; nested: 
IndexNotFoundException[no segments* file found in 
store(least_used[rate_limited(mmapfs(/data/elasticsearch/data/Odine2/nodes/0/indices/de_2007/0/index),
 type=MERGE, rate=20.0)]): files: []]; ]]



Re: Failed to start shard

2014-07-22 Thread Mohamed Lrhazi
I went ahead and upgraded to elasticsearch-1.2.2-1.noarch, but the same
errors continue after cluster start.

I notice that the node in question is the master, and is supposed to be a
non-data node. It should not be starting any shards, right?

Thanks a lot,
Mohamed.





Re: There were no results because no indices were found that match your selected time span

2014-01-29 Thread Mohamed Lrhazi
I have the same issue. I upgraded to 0.9.10, but Marvel still seems to find, 
and display, no data.

I have the head plugin installed and it finds the indexes fine, but Marvel 
does not. There is no configuration step, right? Where do I check for the 
plugin's logs?

Thanks,
Mohamed.

On Tuesday, January 28, 2014 2:01:21 PM UTC-5, Thomas Ardal wrote:

 When trying out Marvel on my Elasticsearch installation, I get the error 
 "There were no results because no indices were found that match your 
 selected time span" at the top of the page.

 If I understand the documentation, Marvel automatically collects 
 statistics from all indexes on the node. What am I doing wrong?




Re: There were no results because no indices were found that match your selected time span

2014-01-29 Thread Mohamed Lrhazi
I found out that some of my nodes had not been upgraded, and that I did not 
have the Marvel plugin installed on all nodes. Data is now appearing in the 
Marvel dashboard.

Thanks,
Mohamed.



