cen yuhai created SPARK-21796:
---------------------------------

             Summary: pyspark count failed in python3.5.2
                 Key: SPARK-21796
                 URL: https://issues.apache.org/jira/browse/SPARK-21796
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 2.1.1
         Environment: spark 2.1.1
                      Python 3.5.2
                      anaconda3 4.2.0
            Reporter: cen yuhai

Steps:
{code}
pyspark
user_data = sc.textFile("/data/external_table/ods/table/dt=2017-08-17/hour=01/*.txt")
user_data.count()
{code}

{code}
Caused by: org.apache.spark.api.python.PythonException: Traceback (most recent call last):
  File "/home/master/platform/spark/python/pyspark/worker.py", line 98, in main
    command = pickleSer._read_with_length(infile)
  File "/home/master/platform/spark/python/pyspark/serializers.py", line 164, in _read_with_length
    return self.loads(obj)
  File "/home/master/platform/spark/python/pyspark/serializers.py", line 419, in loads
    return pickle.loads(obj, encoding=encoding)
EOFError: Ran out of input

	at org.apache.spark.api.python.PythonRunner$$anon$1.read(PythonRDD.scala:193)
{code}
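
For reference, the last Python frame in the traceback is standard-library behavior: {{pickle.loads}} raises {{EOFError: Ran out of input}} when it is handed an empty byte string. The snippet below is only a minimal sketch of that behavior outside Spark. The helper name {{try_load}} and the sample payloads are hypothetical, and it does not show why the worker received an empty payload in this job.

```python
import pickle


def try_load(payload):
    """Hypothetical helper: unpickle `payload` and report the outcome.

    Returns ("ok", value) on success, or ("EOFError", message) when
    pickle runs out of bytes before reading a complete object.
    """
    try:
        return ("ok", pickle.loads(payload))
    except EOFError as exc:
        return ("EOFError", str(exc))


# An empty payload reproduces the message seen in the traceback.
empty_result = try_load(b"")

# A complete pickle (made-up sample data) loads normally.
valid_result = try_load(pickle.dumps({"op": "count"}))

print(empty_result)
print(valid_result)
```

If the same error appears here for an empty byte string, that at least suggests the worker read a zero-length (or otherwise incomplete) command from the JVM side rather than hitting a problem in the input text files themselves, though that is an assumption to verify against the driver and executor logs.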