Re: PySpark: saveAsTextFile and unicode

2013-11-29 Thread Andrei
toString(), but I should have used unicode() instead. I'll submit a patch to fix this (I think it should be as simple as replacing str() with unicode()). On Thu, Nov 28, 2013 at 12:14 AM, Andrei faithlessfri...@gmail.comwrote: Hi, I have a very simple script that just reads file from HDFS

Re: PySpark script works itself, but fails when called from other script

2013-11-18 Thread Andrei
. Hope this helps. Thanks, Andrei On Sat, Nov 16, 2013 at 2:12 PM, Andrei faithlessfri...@gmail.com wrote: Hi, thanks for your replies. I'm out of office now, so I will check it out on Monday morning, but guess about serialization/deserialization looks plausible. Thanks, Andrei On Sat

Re: PySpark script works itself, but fails when called from other script

2013-11-16 Thread Andrei
Hi, thanks for your replies. I'm out of office now, so I will check it out on Monday morning, but guess about serialization/deserialization looks plausible. Thanks, Andrei On Sat, Nov 16, 2013 at 11:11 AM, Jey Kottalam j...@cs.berkeley.edu wrote: Hi Andrei, Could you please post the stderr

PySpark script works itself, but fails when called from other script

2013-11-15 Thread Andrei
') --- So, what's the difference between calling PySpark-enabled script directly and as Python module? What are good rules for writing multi-module Python programs with Spark? Thanks, Andrei