[jira] [Reopened] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dayou Zhou reopened SPARK-21101: > Error running Hive temporary UDTF on latest Spark 2.2 >

[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050031#comment-16050031 ] Dayou Zhou commented on SPARK-21101: Hi [~maropu], >> I'll close this because this seems to be a

[jira] [Assigned] (SPARK-18016) Code Generation: Constant Pool Past Limit for Wide/Nested Dataset

2017-06-14 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-18016: --- Assignee: Aleksander Eskilson > Code Generation: Constant Pool Past Limit for Wide/Nested

[jira] [Commented] (SPARK-20851) Drop spark table failed if a column name is a numeric string

2017-06-14 Thread Chen Gong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050028#comment-16050028 ] Chen Gong commented on SPARK-20851: --- [~benyuel] [~maropu] Thanks for your comments. Root cause for this

[jira] [Resolved] (SPARK-18016) Code Generation: Constant Pool Past Limit for Wide/Nested Dataset

2017-06-14 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18016. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18075

[jira] [Commented] (SPARK-20980) Rename the option `wholeFile` to `multiLine` for JSON and CSV

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050023#comment-16050023 ] Apache Spark commented on SPARK-20980: -- User 'felixcheung' has created a pull request for this

[jira] [Commented] (SPARK-21082) Consider Executor's memory usage when scheduling task

2017-06-14 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050016#comment-16050016 ] Saisai Shao commented on SPARK-21082: - That's fine if the storage memory is not enough to cache all

[jira] [Resolved] (SPARK-20980) Rename the option `wholeFile` to `multiLine` for JSON and CSV

2017-06-14 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-20980. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 18202

[jira] [Commented] (SPARK-21074) Parquet files are read fully even though only count() is requested

2017-06-14 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050005#comment-16050005 ] Takeshi Yamamuro commented on SPARK-21074: -- Since this is an expected behaviour and I think this

[jira] [Updated] (SPARK-21074) Parquet files are read fully even though only count() is requested

2017-06-14 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-21074: - Issue Type: Improvement (was: Bug) > Parquet files are read fully even though only

[jira] [Resolved] (SPARK-21092) Wire SQLConf in logical plan and expressions

2017-06-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-21092. - Resolution: Fixed Fix Version/s: 2.3.0 > Wire SQLConf in logical plan and expressions >

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2017-06-14 Thread remoteServer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049979#comment-16049979 ] remoteServer commented on SPARK-15905: -- I faced the same issue. Increasing driver memory helped. You

[jira] [Closed] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro closed SPARK-21101. Resolution: Not A Problem > Error running Hive temporary UDTF on latest Spark 2.2 >

[jira] [Comment Edited] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049977#comment-16049977 ] Takeshi Yamamuro edited comment on SPARK-21101 at 6/15/17 4:16 AM: ---

[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049977#comment-16049977 ] Takeshi Yamamuro commented on SPARK-21101: -- Since JIRA is not a place for questions, you better

[jira] [Updated] (SPARK-21082) Consider Executor's memory usage when scheduling task

2017-06-14 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DjvuLee updated SPARK-21082: Affects Version/s: (was: 2.2.1) 2.3.0 > Consider Executor's memory usage when

[jira] [Commented] (SPARK-21082) Consider Executor's memory usage when scheduling task

2017-06-14 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049965#comment-16049965 ] DjvuLee commented on SPARK-21082: - Data locality, input size for task, scheduling order affect a lot,

[jira] [Comment Edited] (SPARK-19900) [Standalone] Master registers application again when driver relaunched

2017-06-14 Thread Li Yichao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049942#comment-16049942 ] Li Yichao edited comment on SPARK-19900 at 6/15/17 3:29 AM: My user name (and

[jira] [Comment Edited] (SPARK-19900) [Standalone] Master registers application again when driver relaunched

2017-06-14 Thread Li Yichao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049942#comment-16049942 ] Li Yichao edited comment on SPARK-19900 at 6/15/17 3:26 AM: My user name (and

[jira] [Updated] (SPARK-20869) Master should clear failed apps when worker down

2017-06-14 Thread lyc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lyc updated SPARK-20869: Description: In `Master.removeWorker`, master clears executor and driver state, but does not clear app state. App

[jira] [Updated] (SPARK-20869) Master should clear failed apps when worker down

2017-06-14 Thread lyc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lyc updated SPARK-20869: Description: In `Master.removeWorker`, master clears executor and driver state, but does not clear app state. App

[jira] [Commented] (SPARK-21093) Multiple gapply execution occasionally failed in SparkR

2017-06-14 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049944#comment-16049944 ] Hyukjin Kwon commented on SPARK-21093: -- I am taking a look here gdb with bt: {code} GNU gdb (GDB)

[jira] [Commented] (SPARK-19900) [Standalone] Master registers application again when driver relaunched

2017-06-14 Thread lyc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049942#comment-16049942 ] lyc commented on SPARK-19900: - Hi, what's the meaning of JIRA id? I only know that my user name (and JIRA

[jira] [Commented] (SPARK-21082) Consider Executor's memory usage when scheduling task

2017-06-14 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049940#comment-16049940 ] Saisai Shao commented on SPARK-21082: - Fast node actually equals to idle node, since fast node

[jira] [Comment Edited] (SPARK-21082) Consider Executor's memory usage when scheduling task

2017-06-14 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049933#comment-16049933 ] DjvuLee edited comment on SPARK-21082 at 6/15/17 2:47 AM: -- Not a really fast

[jira] [Commented] (SPARK-21082) Consider Executor's memory usage when scheduling task

2017-06-14 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049933#comment-16049933 ] DjvuLee commented on SPARK-21082: - Not a really fast node and slow node problem. Even all the nodes have

[jira] [Resolved] (SPARK-20912) map function with columns as strings

2017-06-14 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20912. -- Resolution: Won't Fix I am resolving this per the discussion in the PR. I guess we are fine

[jira] [Commented] (SPARK-21080) Workaround for HDFS delegation token expiry broken with some Hadoop versions

2017-06-14 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049920#comment-16049920 ] Saisai Shao commented on SPARK-21080: - Are you getting this issue in HDFS HA mode? If yes, then

[jira] [Assigned] (SPARK-21103) QueryPlanConstraints should be part of LogicalPlan

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21103: Assignee: Apache Spark (was: Reynold Xin) > QueryPlanConstraints should be part of

[jira] [Assigned] (SPARK-21103) QueryPlanConstraints should be part of LogicalPlan

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21103: Assignee: Reynold Xin (was: Apache Spark) > QueryPlanConstraints should be part of

[jira] [Commented] (SPARK-21103) QueryPlanConstraints should be part of LogicalPlan

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049919#comment-16049919 ] Apache Spark commented on SPARK-21103: -- User 'rxin' has created a pull request for this issue:

[jira] [Created] (SPARK-21103) QueryPlanConstraints should be part of LogicalPlan

2017-06-14 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-21103: --- Summary: QueryPlanConstraints should be part of LogicalPlan Key: SPARK-21103 URL: https://issues.apache.org/jira/browse/SPARK-21103 Project: Spark Issue Type:

[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049915#comment-16049915 ] Dayou Zhou commented on SPARK-21101: Hi [~maropu], yes I saw this one, but in my case, I'm using JDBC

[jira] [Commented] (SPARK-21082) Consider Executor's memory usage when scheduling task

2017-06-14 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049906#comment-16049906 ] Saisai Shao commented on SPARK-21082: - Is it due to fast node and slow node problem? Ideally if all

[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049892#comment-16049892 ] Takeshi Yamamuro commented on SPARK-21101: -- See

[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049891#comment-16049891 ] Dayou Zhou commented on SPARK-21101: Hi [~maropu] >>You just don't pass your uber-jar into spark?

[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049883#comment-16049883 ] Takeshi Yamamuro commented on SPARK-21101: -- You just don't pass your uber-jar into spark? Or,

[jira] [Created] (SPARK-21102) Refresh command is too aggressive in parsing

2017-06-14 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-21102: --- Summary: Refresh command is too aggressive in parsing Key: SPARK-21102 URL: https://issues.apache.org/jira/browse/SPARK-21102 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-21102) Refresh command is too aggressive in parsing

2017-06-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-21102: Labels: starter (was: ) > Refresh command is too aggressive in parsing >

[jira] [Assigned] (SPARK-21079) ANALYZE TABLE fails to calculate totalSize for a partitioned table

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21079: Assignee: (was: Apache Spark) > ANALYZE TABLE fails to calculate totalSize for a

[jira] [Assigned] (SPARK-21079) ANALYZE TABLE fails to calculate totalSize for a partitioned table

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21079: Assignee: Apache Spark > ANALYZE TABLE fails to calculate totalSize for a partitioned

[jira] [Issue Comment Deleted] (SPARK-21079) ANALYZE TABLE fails to calculate totalSize for a partitioned table

2017-06-14 Thread Maria (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maria updated SPARK-21079: -- Comment: was deleted (was: [~ZenWzh], here is a PR: https://github.com/apache/spark/pull/18309) > ANALYZE

[jira] [Commented] (SPARK-21079) ANALYZE TABLE fails to calculate totalSize for a partitioned table

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049877#comment-16049877 ] Apache Spark commented on SPARK-21079: -- User 'mbasmanova' has created a pull request for this issue:

[jira] [Commented] (SPARK-21079) ANALYZE TABLE fails to calculate totalSize for a partitioned table

2017-06-14 Thread Maria (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049879#comment-16049879 ] Maria commented on SPARK-21079: --- [~ZenWzh], here is a PR: https://github.com/apache/spark/pull/18309 >

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049862#comment-16049862 ] Dongjoon Hyun commented on SPARK-20954: --- FYI, Apache Spark STS uses 1 as a port number. >

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049861#comment-16049861 ] Dongjoon Hyun commented on SPARK-20954: --- This is the same result over beeline on branch-2.2. I

[jira] [Resolved] (SPARK-21096) Pickle error when passing a member variable to Spark executors

2017-06-14 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21096. -- Resolution: Not A Problem I am resolving this. Please reopen this if I misunderstood. >

[jira] [Commented] (SPARK-21096) Pickle error when passing a member variable to Spark executors

2017-06-14 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049852#comment-16049852 ] Hyukjin Kwon commented on SPARK-21096: -- This is because you are passing {{self}} which has

[jira] [Updated] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dayou Zhou updated SPARK-21101: --- Description: I'm using temporary UDTFs on Spark 2.2, e.g. {noformat} CREATE TEMPORARY FUNCTION

[jira] [Updated] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dayou Zhou updated SPARK-21101: --- Description: I'm using temporary UDTFs on Spark 2.2, e.g. CREATE TEMPORARY FUNCTION myudtf AS

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Garros Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049844#comment-16049844 ] Garros Chan commented on SPARK-20954: - Hi [~dongjoon] Thanks for your confirmation! Do you know when

[jira] [Commented] (SPARK-19900) [Standalone] Master registers application again when driver relaunched

2017-06-14 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049842#comment-16049842 ] Wenchen Fan commented on SPARK-19900: - liyichao can you provide your JIRA id? thanks! > [Standalone]

[jira] [Resolved] (SPARK-19900) [Standalone] Master registers application again when driver relaunched

2017-06-14 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19900. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18084

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049837#comment-16049837 ] Dongjoon Hyun commented on SPARK-20954: --- Yep. 2.2.0-RC5. It also includes SPARK-12868, too. >

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Garros Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049833#comment-16049833 ] Garros Chan commented on SPARK-20954: - Hi [~dongjoon] I see. Do you mean spark-2.2.0-rc5 :) ? Also,

[jira] [Created] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2

2017-06-14 Thread Dayou Zhou (JIRA)
Dayou Zhou created SPARK-21101: -- Summary: Error running Hive temporary UDTF on latest Spark 2.2 Key: SPARK-21101 URL: https://issues.apache.org/jira/browse/SPARK-21101 Project: Spark Issue

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049829#comment-16049829 ] Dongjoon Hyun commented on SPARK-20954: --- RC5 is coming very soon with this fix. :) > DESCRIBE

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Garros Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049827#comment-16049827 ] Garros Chan commented on SPARK-20954: - Hi [~dongjoon] I see. Would you be able to tell me where I

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049826#comment-16049826 ] Dongjoon Hyun commented on SPARK-20954: --- FYI, the following is the result on `branch-2.2`. I'm not

[jira] [Commented] (SPARK-18294) Implement commit protocol to support `mapred` package's committer

2017-06-14 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049816#comment-16049816 ] Dayou Zhou commented on SPARK-18294: Hi [~jiangxb1987] does this answer your question? Any help

[jira] [Commented] (SPARK-12868) ADD JAR via sparkSQL JDBC will fail when using a HDFS URL

2017-06-14 Thread Dayou Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049814#comment-16049814 ] Dayou Zhou commented on SPARK-12868: Hi [~tleftwich] and all, I was using Spark 2.0 and when I tried

[jira] [Assigned] (SPARK-21099) INFO Log Message Using Incorrect Executor Idle Timeout Value

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21099: Assignee: Apache Spark > INFO Log Message Using Incorrect Executor Idle Timeout Value >

[jira] [Commented] (SPARK-21099) INFO Log Message Using Incorrect Executor Idle Timeout Value

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049790#comment-16049790 ] Apache Spark commented on SPARK-21099: -- User 'ihazem' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21099) INFO Log Message Using Incorrect Executor Idle Timeout Value

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21099: Assignee: (was: Apache Spark) > INFO Log Message Using Incorrect Executor Idle

[jira] [Commented] (SPARK-20954) DESCRIBE showing 1 extra row of "| # col_name | data_type | comment |"

2017-06-14 Thread Garros Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049785#comment-16049785 ] Garros Chan commented on SPARK-20954: - Hi [~dongjoon] I downloaded

[jira] [Updated] (SPARK-21084) Improvements to dynamic allocation for notebook use cases

2017-06-14 Thread Frederick Reiss (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frederick Reiss updated SPARK-21084: Description: One important application of Spark is to support many notebook users with a

[jira] [Assigned] (SPARK-21100) describe should give quartiles similar to Pandas

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21100: Assignee: (was: Apache Spark) > describe should give quartiles similar to Pandas >

[jira] [Commented] (SPARK-21100) describe should give quartiles similar to Pandas

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049701#comment-16049701 ] Apache Spark commented on SPARK-21100: -- User 'aray' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21100) describe should give quartiles similar to Pandas

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21100: Assignee: Apache Spark > describe should give quartiles similar to Pandas >

[jira] [Created] (SPARK-21100) describe should give quartiles similar to Pandas

2017-06-14 Thread Andrew Ray (JIRA)
Andrew Ray created SPARK-21100: -- Summary: describe should give quartiles similar to Pandas Key: SPARK-21100 URL: https://issues.apache.org/jira/browse/SPARK-21100 Project: Spark Issue Type:

[jira] [Updated] (SPARK-21084) Improvements to dynamic allocation for notebook use cases

2017-06-14 Thread Frederick Reiss (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frederick Reiss updated SPARK-21084: Description: One important application of Spark is to support many notebook users with a

[jira] [Resolved] (SPARK-21091) Move constraint code into QueryPlanConstraints

2017-06-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-21091. - Resolution: Fixed Fix Version/s: 2.3.0 > Move constraint code into QueryPlanConstraints >

[jira] [Commented] (SPARK-21084) Improvements to dynamic allocation for notebook use cases

2017-06-14 Thread Frederick Reiss (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049669#comment-16049669 ] Frederick Reiss commented on SPARK-21084: - [~sowen] thanks for having a look at this JIRA and

[jira] [Updated] (SPARK-21099) INFO Log Message Using Incorrect Executor Idle Timeout Value

2017-06-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-21099: -- Priority: Trivial (was: Major) > INFO Log Message Using Incorrect Executor Idle Timeout Value >

[jira] [Created] (SPARK-21099) INFO Log Message Using Incorrect Executor Idle Timeout Value

2017-06-14 Thread Hazem Mahmoud (JIRA)
Hazem Mahmoud created SPARK-21099: - Summary: INFO Log Message Using Incorrect Executor Idle Timeout Value Key: SPARK-21099 URL: https://issues.apache.org/jira/browse/SPARK-21099 Project: Spark

[jira] [Assigned] (SPARK-21029) All StreamingQuery should be stopped when the SparkSession is stopped

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21029: Assignee: Apache Spark > All StreamingQuery should be stopped when the SparkSession is

[jira] [Commented] (SPARK-21029) All StreamingQuery should be stopped when the SparkSession is stopped

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049626#comment-16049626 ] Apache Spark commented on SPARK-21029: -- User 'aray' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21029) All StreamingQuery should be stopped when the SparkSession is stopped

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21029: Assignee: (was: Apache Spark) > All StreamingQuery should be stopped when the

[jira] [Assigned] (SPARK-20988) Convert logistic regression to new aggregator framework

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20988: Assignee: (was: Apache Spark) > Convert logistic regression to new aggregator

[jira] [Commented] (SPARK-20988) Convert logistic regression to new aggregator framework

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049519#comment-16049519 ] Apache Spark commented on SPARK-20988: -- User 'sethah' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20988) Convert logistic regression to new aggregator framework

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20988: Assignee: Apache Spark > Convert logistic regression to new aggregator framework >

[jira] [Assigned] (SPARK-21098) Add line separator option to csv read/write

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21098: Assignee: (was: Apache Spark) > Add line separator option to csv read/write >

[jira] [Commented] (SPARK-21098) Add line separator option to csv read/write

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049509#comment-16049509 ] Apache Spark commented on SPARK-21098: -- User 'danielvdende' has created a pull request for this

[jira] [Assigned] (SPARK-21098) Add line separator option to csv read/write

2017-06-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21098: Assignee: Apache Spark > Add line separator option to csv read/write >

[jira] [Closed] (SPARK-16669) Partition pruning for metastore relation size estimates for better join selection.

2017-06-14 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang closed SPARK-16669. Resolution: Duplicate > Partition pruning for metastore relation size estimates for better join >

[jira] [Updated] (SPARK-21098) Add line separator option to csv read/write

2017-06-14 Thread Daniel van der Ende (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel van der Ende updated SPARK-21098: Summary: Add line separator option to csv read/write (was: Add line separator

[jira] [Created] (SPARK-21098) Add line separator option to csv

2017-06-14 Thread Daniel van der Ende (JIRA)
Daniel van der Ende created SPARK-21098: --- Summary: Add line separator option to csv Key: SPARK-21098 URL: https://issues.apache.org/jira/browse/SPARK-21098 Project: Spark Issue Type:

[jira] [Updated] (SPARK-21067) Thrift Server - CTAS fail with Unable to move source

2017-06-14 Thread Dominic Ricard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dominic Ricard updated SPARK-21067: --- Description: After upgrading our Thrift cluster to 2.1.1, we ran into an issue where CTAS

[jira] [Comment Edited] (SPARK-21097) Dynamic allocation will preserve cached data

2017-06-14 Thread Brad (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049485#comment-16049485 ] Brad edited comment on SPARK-21097 at 6/14/17 6:29 PM: --- Hey [~srowen], thanks for

[jira] [Commented] (SPARK-21097) Dynamic allocation will preserve cached data

2017-06-14 Thread Brad (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049485#comment-16049485 ] Brad commented on SPARK-21097: -- Hey Sean, thanks for your input. I would definitely like to do some

[jira] [Commented] (SPARK-21088) CrossValidator, TrainValidationSplit should preserve all models after fitting: Python

2017-06-14 Thread Ajay Saini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049447#comment-16049447 ] Ajay Saini commented on SPARK-21088: I'll work on this one. > CrossValidator, TrainValidationSplit

[jira] [Commented] (SPARK-21097) Dynamic allocation will preserve cached data

2017-06-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049442#comment-16049442 ] Sean Owen commented on SPARK-21097: --- This seems to add a fair bit of complexity when Spark is already

[jira] [Commented] (SPARK-21097) Dynamic allocation will preserve cached data

2017-06-14 Thread Brad (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049443#comment-16049443 ] Brad commented on SPARK-21097: -- I am working on this now and will be posting a more detailed design document

[jira] [Updated] (SPARK-21096) Pickle error when passing a member variable to Spark executors

2017-06-14 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Irina Truong updated SPARK-21096: - Description: There is a pickle error when submitting a spark job that references a member

[jira] [Updated] (SPARK-21096) Pickle error when passing a member variable to Spark executors

2017-06-14 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Irina Truong updated SPARK-21096: - Description: There is a pickle error when submitting a spark job that references a member

[jira] [Updated] (SPARK-21096) Pickle error when passing a member variable to Spark executors

2017-06-14 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Irina Truong updated SPARK-21096: - Description: There is a pickle error when submitting a spark job that references a member

[jira] [Updated] (SPARK-21096) Pickle error when passing a member variable to Spark executors

2017-06-14 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Irina Truong updated SPARK-21096: - Description: There is a pickle error when submitting a spark job that references a member

[jira] [Updated] (SPARK-21096) Pickle error when passing a member variable to Spark executors

2017-06-14 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Irina Truong updated SPARK-21096: - Description: There is a pickle error when submitting a spark job that references a member

[jira] [Created] (SPARK-21097) Dynamic allocation will preserve cached data

2017-06-14 Thread Brad (JIRA)
Brad created SPARK-21097: Summary: Dynamic allocation will preserve cached data Key: SPARK-21097 URL: https://issues.apache.org/jira/browse/SPARK-21097 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-21096) Pickle error when passing a member variable to Spark executors

2017-06-14 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Irina Truong updated SPARK-21096: - Description: There is a pickle error when submitting a spark job that references a member

  1   2   >