Hi Guys,
I have ES installed on 16 nodes in my Cloudera Hadoop cluster. All looks 
good from an ES point of view.

I now want to test a very simple data load from hdfs to ES but am 
struggling. I am using PIG and have elasticsearch-hadoop installed.

I want to load a single file, it is a pipe-delimited text file and is on 
the hdfs:

$ hdfs dfs -ls /logfiles/20140820
Found 1 items
-rw-r--r--   3 bob supergroup 2015426946 2014-08-20 06:45 /logfiles/20140820

Can someone help me with a really simple test using PIG? When I try the 
following I get:

grunt> DEFINE EsStorage org.elasticsearch.hadoop.pig.EsStorage();
grunt> data = load '/logfiles/20140820' using PigStorage('\n')
grunt> B = foreach data generate $0 as id;
grunt> STORE B INTO 'esTEST' using EsStorage('es.http.timeout = 5m');

Failed!

Failed Jobs:
JobId    Alias    Feature    Message    Outputs
job_201408191741_0008    B,data    MAP_ONLY    Message: Job failed!    
esTEST,

Input(s):
Failed to read data from "/logfiles/20140820"

Output(s):
Failed to produce result in "esTEST"


Can someone help a noob out with some simple PIG just to check I have it 
working?
Thanks
Paul


-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/11125bff-de59-4cb2-a42c-3f1f37a39f70%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to