Hey, Guys,
I am loading a hive table of around 10million records into ES regularly. 
Each document is small with 5-6 attributes. My Es cluster has 7 nodes, each 
has 4 core and 128G. ES was allocated with 60% of the memory, and I am 
bulking insert (use python client) every 200 entries.  My cluster is in 
Green status, running version  1.2.1. The index "number_of_shards" : 7, 
"number_of_replicas" : 1
But I keep getting read time out exception:

Traceback (most recent call last):
  File "reduce_dotcom_browse.test.py", line 95, in <module>
    helpers.bulk(es, actions)
  File "/usr/lib/python2.6/site-packages/elasticsearch/helpers.py", line 
148, in bulk
    for ok, item in streaming_bulk(client, actions, **kwargs):
  File "/usr/lib/python2.6/site-packages/elasticsearch/helpers.py", line 
107, in streaming_bulk
    resp = client.bulk(bulk_actions, **kwargs)
  File "/usr/lib/python2.6/site-packages/elasticsearch/client/utils.py", 
line 70, in _wrapped
    return func(*args, params=params, **kwargs)
  File "/usr/lib/python2.6/site-packages/elasticsearch/client/__init__.py", 
line 568, in bulk
    params=params, body=self._bulk_body(body))
  File "/usr/lib/python2.6/site-packages/elasticsearch/transport.py", line 
274, in perform_request
    status, headers, data = connection.perform_request(method, url, params, 
body, ignore=ignore)
  File 
"/usr/lib/python2.6/site-packages/elasticsearch/connection/http_urllib3.py", 
line 51, in perform_request
    raise ConnectionError('N/A', str(e), e)
elasticsearch.exceptions.ConnectionError: 
ConnectionError(HTTPConnectionPool(host=u'10.93.80.216', port=9200): Read 
timed out. (read timeout=10)) caused by: 
ReadTimeoutError(HTTPConnectionPool(host=u'10.93.80.216', port=9200): Read 
timed out. (read timeout=10))

How can I trouble shoot this? In my opinion, bulk insert 200 entries should 
be fairly easy..
Thanks for any pointers.
Chen


-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/450ae411-586a-431b-b3a9-3767230eaf92%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to