Re: PySpark problem with textblob from NLTK used in map

2014-10-27 Thread jan.zikes
So the problem was that Spark has internaly set home to /home. Hack to make this work with Python is to add before call of textblob line: os.environ['HOME'] = '/home/hadoop'  __ Maybe I'll add one more question. I think that the

Re: PySpark problem with textblob from NLTK used in map

2014-10-24 Thread jan.zikes
Maybe I'll add one more question. I think that the problem is with user, so I would like to ask under which user are run Spark jobs on slaves? __ Hi, I am trying to implement function for text preprocessing in PySpark. I have amazon