Re: listdir() python function is not working on hadoop

2013-12-05 Thread Haider
I am trying to read from HDFS, not from the local file system, so would that be possible through listdir? Or is there any way to read HDFS files one by one and pass each to a function? On Fri, Dec 6, 2013 at 4:20 AM, Yigitbasi, Nezih wrote: > I can call listdir to read from local filesystem in a pytho

embedded Pig using PigServer

2013-12-05 Thread ey-chih chow
Hi, I have some issues with embedded Pig using PigServer. I used AvroStorage() to store data on the local disk and Eclipse for debugging. When I debugged the program in local mode with some breakpoints set, I found that, initially, there was a directory, _temporary, created in t

Example analyze Tomcat logs using PIG

2013-12-05 Thread Watrous, Daniel
Hi, I've finally accomplished my task of analyzing Apache Tomcat files (catalina.out) in PIG. Here are the results: http://software.danielwatrous.com/analyze-tomcat-logs-using-pig-hadoop/ My biggest struggles were a mismatch of Hadoop versions (local vs. PIG default) and getting regular express

RE: listdir() python function is not working on hadoop

2013-12-05 Thread Yigitbasi, Nezih
I can call listdir to read from local filesystem in a python UDF. Did you implement your function as a proper UDF? From: Haider [haider.n...@gmail.com] Sent: Monday, December 02, 2013 5:22 AM To: user@pig.apache.org Subject: listdir() python function is not

Where and when are StoreFuncInterface functions called

2013-12-05 Thread Patrick Thompson
It's not clear from the docs where the various StoreFuncInterface functions get called. There are some hints in the API docs, but I am left wondering: does Pig guarantee that, for example, putNext and cleanUpOnSuccess will be called in the same execution con