About the time out error I found error log below,I don't know what that mean
[2015-02-27 21:59:41,575][DEBUG][action.search.type ] [Selene] [487030] Failed to execute fetch phase org.elasticsearch.transport.RemoteTransportException: [Red Wolf][inet[/10.1.33.77:9300]][search/phase/fetch/id] Caused by: org.elasticsearch.search.SearchContextMissingException: No search context found for id [487030] at org.elasticsearch.search.SearchService.findContext(SearchService.java:460) at org.elasticsearch.search.SearchService.executeFetchPhase(SearchService.java:433) at org.elasticsearch.search.action.SearchServiceTransportAction$SearchFetchByIdTransportHandler.messageReceived(SearchServiceTransportAction.java:728) at org.elasticsearch.search.action.SearchServiceTransportAction$SearchFetchByIdTransportHandler.messageReceived(SearchServiceTransportAction.java:717) at org.elasticsearch.transport.netty.MessageChannelHandler$RequestHandler.run(MessageChannelHandler.java:270) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) [2015-02-27 21:59:41,576][DEBUG][action.search.type ] [Selene] [475699] Failed to execute fetch phase org.elasticsearch.transport.RemoteTransportException: [S'byll][inet[/10.1.33.94:9300]][search/phase/fetch/id] Caused by: org.elasticsearch.search.SearchContextMissingException: No search context found for id [475699] at org.elasticsearch.search.SearchService.findContext(SearchService.java:460) at org.elasticsearch.search.SearchService.executeFetchPhase(SearchService.java:433) at org.elasticsearch.search.action.SearchServiceTransportAction$SearchFetchByIdTransportHandler.messageReceived(SearchServiceTransportAction.java:728) at org.elasticsearch.search.action.SearchServiceTransportAction$SearchFetchByIdTransportHandler.messageReceived(SearchServiceTransportAction.java:717) at org.elasticsearch.transport.netty.MessageChannelHandler$RequestHandler.run(MessageChannelHandler.java:270) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) [2015-02-27 21:59:41,575][DEBUG][action.search.type ] [Selene] [487026] Failed to execute fetch phase org.elasticsearch.transport.RemoteTransportException: [Red Wolf][inet[/10.1.33.77:9300]][search/phase/fetch/id] Caused by: org.elasticsearch.search.SearchContextMissingException: No search context found for id [487026] at org.elasticsearch.search.SearchService.findContext(SearchService.java:460) at org.elasticsearch.search.SearchService.executeFetchPhase(SearchService.java:433) at org.elasticsearch.search.action.SearchServiceTransportAction$SearchFetchByIdTransportHandler.messageReceived(SearchServiceTransportAction.java:728) at org.elasticsearch.search.action.SearchServiceTransportAction$SearchFetchByIdTransportHandler.messageReceived(SearchServiceTransportAction.java:717) at org.elasticsearch.transport.netty.MessageChannelHandler$RequestHandler.run(MessageChannelHandler.java:270) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) [2015-02-27 21:59:41,575][DEBUG][action.search.type ] [Selene] [460892] Failed to execute fetch 在 2015年3月2日星期一 UTC+8下午4:01:48,xiaoliang tian写道: > > The Version is 0.90.13 > > There is Out of memory error occurred when flushing shard > > How could I resolve this error? > > Any suggestion to prevent it. > > Below is the cluster situation: > 5 data node,1 master and 1 search node > > Dozens of index and each index has more than 100GB data > > Another problem is When someone try to query data there is connect > time-out problem,what could cause time out? I think concurrency is ort of > considered,Maybe due to the huge data? > > Plz help > > Below is OOM error > [2015-03-01 07:23:24,023][WARN ][index.translog ] [Outlaw] > [16494][4] failed to flush shard on translog threshold > org.elasticsearch.index.engine.FlushFailedEngineException: [16494][4] > Flush failed > at > org.elasticsearch.index.engine.robin.RobinEngine.flush(RobinEngine.java:907) > at > org.elasticsearch.index.shard.service.InternalIndexShard.flush(InternalIndexShard.java:563) > at > org.elasticsearch.index.translog.TranslogService$TranslogBasedFlush$1.run(TranslogService.java:194) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.IllegalStateException: this writer hit an > OutOfMemoryError; cannot commit > at > org.apache.lucene.index.IndexWriter.startCommit(IndexWriter.java:4354) > at > org.apache.lucene.index.IndexWriter.prepareCommitInternal(IndexWriter.java:2891) > at > org.apache.lucene.index.IndexWriter.commitInternal(IndexWriter.java:2984) > at > org.apache.lucene.index.IndexWriter.commit(IndexWriter.java:2954) > at > org.elasticsearch.index.engine.robin.RobinEngine.flush(RobinEngine.java:893) > ... 5 more > [2015-03-01 07:23:27,078][WARN ][cluster.action.shard ] [Outlaw] > [16494][4] sending failed shard for [16494][4], > node[z-YGubBGRe2afo5G8MBPkQ], [P], s[STARTED], indexUUID > [9MjfwirySmWIbqT8clWDwQ], reason [engine failure, message > [OutOfMemoryError[Java heap space]]] > [2015-03-01 07:23:24,030][DEBUG][action.bulk ] [Outlaw] > [16494][4] failed to execute bulk item (index) index > {[16494][cs-us-east-1-logging-swc-rel][c22f6222-c146-4608-9dcc-c8846191c21a], > source[{"version":"0.2","role":"es-data","from":"cs-us-east-1-logging-swc-rel","host":"ip-10-1-33-94-us-east-1-compute-internal","type":"log","time":1425092107803,"level":"system","text":" > > 27 disks \n 2 partitions \n 47584725 total reads\n > 80364 merged reads\n 3159251537 read sectors\n 586297450 milli > reading\n 634621170 writes\n 17059531 merged writes\n 49307983928 > written sectors\n 2768108439 milli writing\n 0 inprogress IO\n > 426354 milli spent > IO\n","state":"info","service":"snapshot","process":"VMstat","uid":"c22f6222-c146-4608-9dcc-c8846191c21a"}]} > org.elasticsearch.index.engine.IndexFailedEngineException: [16494][4] > Index failed for > [cs-us-east-1-logging-swc-rel#c22f6222-c146-4608-9dcc-c8846191c21a] > at > org.elasticsearch.index.engine.robin.RobinEngine.index(RobinEngine.java:501) > at > org.elasticsearch.index.shard.service.InternalIndexShard.index(InternalIndexShard.java:386) > at > org.elasticsearch.action.bulk.TransportShardBulkAction.shardIndexOperation(TransportShardBulkAction.java:398) > at > org.elasticsearch.action.bulk.TransportShardBulkAction.shardOperationOnPrimary(TransportShardBulkAction.java:156) > at > org.elasticsearch.action.support.replication.TransportShardReplicationOperationAction$AsyncShardOperationAction.performOnPrimary(TransportShardReplicationOperationAction.java:556) > at > org.elasticsearch.action.support.replication.TransportShardReplicationOperationAction$AsyncShardOperationAction$1.run(TransportShardReplicationOperationAction.java:426) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.OutOfMemoryError: Java heap space > [2015-03-01 07:23:30,699][DEBUG][action.bulk ] [Outlaw] > [16494][4], node[z-YGubBGRe2afo5G8MBPkQ], [P], s[STARTED]: Failed to > execute [org.elasticsearch.action.bulk.BulkShardRequest@7fca9100] > java.lang.NullPointerException > at > org.elasticsearch.action.bulk.TransportShardBulkAction.applyVersion(TransportShardBulkAction.java:640) > at > org.elasticsearch.action.bulk.TransportShardBulkAction.shardOperationOnPrimary(TransportShardBulkAction.java:178) > at > org.elasticsearch.action.support.replication.TransportShardReplicationOperationAction$AsyncShardOperationAction.performOnPrimary(TransportShardReplicationOperationAction.java:556) > at > org.elasticsearch.action.support.replication.TransportShardReplicationOperationAction$AsyncShardOperationAction$1.run(TransportShardReplicationOperationAction.java:426) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > [2015-03-01 07:23:30,719][WARN ][indices.cluster ] [Outlaw] > [16494][4] master > [[Tyrak][gJq80HQDTSKtZO56q81pIg][inet[/10.1.33.53:9300]]{data=false, > rack=rack_tone, max_local_storage_nodes=1, master=true}] marked shard as > started, but shard has not been created, mark shard as failed > [2015-03-01 07:23:30,728][WARN ][cluster.action.shard ] [Outlaw] > [16494][4] sending failed shard for [16494][4], > node[z-YGubBGRe2afo5G8MBPkQ], [P], s[STARTED], indexUUID > [9MjfwirySmWIbqT8clWDwQ], reason [master > [Tyrak][gJq80HQDTSKtZO56q81pIg][inet[/10.1.33.53:9300]]{data=false, > rack=rack_tone, max_local_storage_nodes=1, master=true} marked shard as > started, but shard has not been created, mark shard as failed] > > -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d867c194-24a6-49ef-a08d-f579e9b5e457%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.