[jira] [Commented] (SPARK-12981) Dataframe distinct() followed by a filter(udf) in pyspark throws a casting error

2016-03-19 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198280#comment-15198280 ] Xiu (Joe) Guo commented on SPARK-12981: --- Yes [~fabboe], my PR will fix your scenario too. >

[jira] [Updated] (SPARK-13366) Support Cartesian join for Datasets

2016-02-17 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiu (Joe) Guo updated SPARK-13366: -- Description: Saw a comment from [~marmbrus] regarding Cartesian join for Datasets: "You will

[jira] [Created] (SPARK-13366) Support Cartesian join for Datasets

2016-02-17 Thread Xiu (Joe) Guo (JIRA)
Xiu (Joe) Guo created SPARK-13366: - Summary: Support Cartesian join for Datasets Key: SPARK-13366 URL: https://issues.apache.org/jira/browse/SPARK-13366 Project: Spark Issue Type:

[jira] [Commented] (SPARK-13283) Spark doesn't escape column names when creating table on JDBC

2016-02-16 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15149387#comment-15149387 ] Xiu (Joe) Guo commented on SPARK-13283: --- Yes, it is a different problem from

[jira] [Commented] (SPARK-13301) PySpark Dataframe return wrong results with custom UDF

2016-02-12 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15145173#comment-15145173 ] Xiu (Joe) Guo commented on SPARK-13301: --- Hi Simone: How long is the string length for each row in

[jira] [Commented] (SPARK-13297) [SQL] Backticks cannot be escaped in column names

2016-02-12 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15145566#comment-15145566 ] Xiu (Joe) Guo commented on SPARK-13297: --- Looks like in the current [master

[jira] [Commented] (SPARK-9414) HiveContext:saveAsTable creates wrong partition for existing hive table(append mode)

2016-02-03 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131054#comment-15131054 ] Xiu (Joe) Guo commented on SPARK-9414: -- With the current master

[jira] [Commented] (SPARK-12262) describe extended doesn't return table on detail info tabled stored as PARQUET format

2016-01-14 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15099007#comment-15099007 ] Xiu (Joe) Guo commented on SPARK-12262: --- You might want to check out this JIRA:

[jira] [Commented] (SPARK-12521) DataFrame Partitions in java does not work

2015-12-25 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071703#comment-15071703 ] Xiu (Joe) Guo commented on SPARK-12521: --- Thanks [~hvanhovell] for clarifying this. Maybe it is a

[jira] [Commented] (SPARK-12521) DataFrame Partitions in java does not work

2015-12-24 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071347#comment-15071347 ] Xiu (Joe) Guo commented on SPARK-12521: --- In 1.5.2 {code}sqlContext.load(){code} is deprecated, but

[jira] [Commented] (SPARK-12262) describe extended doesn't return table on detail info tabled stored as PARQUET format

2015-12-23 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15070503#comment-15070503 ] Xiu (Joe) Guo commented on SPARK-12262: --- The property

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread Xiu(Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15030615#comment-15030615 ] Xiu(Joe) Guo commented on SPARK-12030: -- I tried your scenario with some TPCDS table last night,

[jira] [Commented] (SPARK-9701) allow not automatically using HiveContext with spark-shell when hive support built in

2015-11-28 Thread Xiu(Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15030645#comment-15030645 ] Xiu(Joe) Guo commented on SPARK-9701: - [~yhuai][~lian cheng] Would you mind reviewing my PR for

[jira] [Comment Edited] (SPARK-9701) allow not automatically using HiveContext with spark-shell when hive support built in

2015-11-28 Thread Xiu(Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15030645#comment-15030645 ] Xiu(Joe) Guo edited comment on SPARK-9701 at 11/28/15 7:44 PM: --- [~yhuai],

[jira] [Commented] (SPARK-9701) allow not automatically using HiveContext with spark-shell when hive support built in

2015-11-28 Thread Xiu(Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15030641#comment-15030641 ] Xiu(Joe) Guo commented on SPARK-9701: - I think this is the same issue as SPARK-11562. > allow not

[jira] [Commented] (SPARK-6644) After adding new columns to a partitioned table and inserting data to an old partition, data of newly added columns are all NULL

2015-11-26 Thread Xiu(Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15029226#comment-15029226 ] Xiu(Joe) Guo commented on SPARK-6644: - With the current master branch code line (1.6.0-snapshot), this

[jira] [Commented] (SPARK-11631) DAGScheduler prints "Stopping DAGScheduler" at INFO to the logs with no corresponding "Starting"

2015-11-10 Thread Xiu(Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14999101#comment-14999101 ] Xiu(Joe) Guo commented on SPARK-11631: -- I am looking at it, will submit a PR shortly. >

[jira] [Commented] (SPARK-11628) spark-sql do not support for column datatype of CHAR

2015-11-10 Thread Xiu(Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14999742#comment-14999742 ] Xiu(Joe) Guo commented on SPARK-11628: -- Hi Shunyu: I think you are right about the parser part, but