I need open and save image using hadoop and python, i'm tried two way to do this:
1. Using WholeFileInputFormat.class for infile in sys.stdin: data = str(infile) data = StringIO.StringIO(data).getvalue() image = Image.frombuffer("RGB", (128,128), str(data), "raw","RGB",0,1) image = image.convert('L') print '%s\t' % (data) ERROR File "/usr/lib/python2.5/site-packages/PIL/Image.py", line 576, in fromstring raise ValueError("not enough image data") ValueError: not enough image data 2. for infile in sys.stdin: pathIn = os.getenv('map_input_file') image = Image.open(pathIn) ERROR IOError: [Errno 2] No such file or directory: 'hdfs://localhost:8022/user/training/input/img10.jpg' This dir exist....! train...@training-vm:~$ hadoop fs -lsr /user/training/input/ -rw-r--r-- 1 training supergroup 1556016 2010-01-13 08:22 /user/training/input/img10.jpg Can you help me, what i do wrong??, I'm using hadoop 0.20.1, python and PIL