Tamas Jambor created SPARK-7192:
-----------------------------------

             Summary: Pyspark casts hive bigint to int
                 Key: SPARK-7192
                 URL: https://issues.apache.org/jira/browse/SPARK-7192
             Project: Spark
          Issue Type: Bug
    Affects Versions: 1.3.0
            Reporter: Tamas Jambor

It seems that pyspark reads bigint from hive and stores it as an int:

>> hive_ctx = HiveContext(sc)
>> data = hive_ctx.sql("select col1, col2 from dataset1")
>> data
DataFrame[col1: int, col2: bigint]
>> c_t = [type(v) for v in data.collect()[0]]
>> c_t
[int, int]
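
The type check in the report can be tried without a cluster. What follows is a minimal sketch, not part of the original report, of one way to test whether seeing `int` for a bigint column implies any loss of range. It assumes Python 3, where `int` is arbitrary precision. A plain tuple stands in for `data.collect()[0]`, the sample values are made up, and the helper names `python_types` and `fits_bigint` are hypothetical.

```python
# Signed 64-bit bounds, i.e. the range of a Hive bigint.
BIGINT_MIN = -2**63
BIGINT_MAX = 2**63 - 1


def python_types(row):
    """Return the Python type name of each value in a collected row."""
    return [type(v).__name__ for v in row]


def fits_bigint(value):
    """True if value is a Python int inside the signed 64-bit range."""
    return isinstance(value, int) and BIGINT_MIN <= value <= BIGINT_MAX


# Hypothetical stand-in for data.collect()[0]: (col1: int, col2: bigint).
sample_row = (42, BIGINT_MAX)

print(python_types(sample_row))
print(all(fits_bigint(v) for v in sample_row))
```

Under that assumption, the type name alone does not show truncation. Comparing the collected values against the source table would be the more direct check.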