[jira] [Commented] (SPARK-16595) Spark History server Rest Api gives Application not found error for yarn-cluster mode

2016-07-17 Thread Yesha Vora (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381760#comment-15381760 ] Yesha Vora commented on SPARK-16595: [~sowen], SPARK-15923 is referring to yarn-client mode. I

[jira] [Resolved] (SPARK-16588) Deprecate monotonicallyIncreasingId in Scala

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16588. - Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > Deprecate

[jira] [Updated] (SPARK-16588) Deprecate monotonicallyIncreasingId in Scala

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16588: Summary: Deprecate monotonicallyIncreasingId in Scala (was: Deprecate ) > Deprecate

[jira] [Assigned] (SPARK-16588) Deprecate monotonicallyIncreasingId in Scala

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reassigned SPARK-16588: --- Assignee: Reynold Xin > Deprecate monotonicallyIncreasingId in Scala >

[jira] [Updated] (SPARK-16588) Deprecate

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16588: Summary: Deprecate (was: Missed API fix for a function name mismatched between FunctionRegistry

[jira] [Reopened] (SPARK-16595) Spark History server Rest Api gives Application not found error for yarn-cluster mode

2016-07-17 Thread Yesha Vora (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yesha Vora reopened SPARK-16595: > Spark History server Rest Api gives Application not found error for > yarn-cluster mode >

[jira] [Updated] (SPARK-16595) Spark History server Rest Api gives Application not found error

2016-07-17 Thread Yesha Vora (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yesha Vora updated SPARK-16595: --- Description: Scenario: * Start SparkPi application in Spark1 using yarn-cluster mode

[jira] [Updated] (SPARK-16595) Spark History server Rest Api gives Application not found error for yarn-cluster mode

2016-07-17 Thread Yesha Vora (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yesha Vora updated SPARK-16595: --- Summary: Spark History server Rest Api gives Application not found error for yarn-cluster mode

[jira] [Commented] (SPARK-16321) Pyspark 2.0 performance drop vs pyspark 1.6

2016-07-17 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381755#comment-15381755 ] Maciej Bryński commented on SPARK-16321: Yep. I'll try. > Pyspark 2.0 performance drop vs

[jira] [Commented] (SPARK-16321) Pyspark 2.0 performance drop vs pyspark 1.6

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381749#comment-15381749 ] Reynold Xin commented on SPARK-16321: - Thanks - that's a great find. Can you take a look at what

[jira] [Commented] (SPARK-16321) Pyspark 2.0 performance drop vs pyspark 1.6

2016-07-17 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381747#comment-15381747 ] Maciej Bryński commented on SPARK-16321: I did some more investigation. I started to test

[jira] [Commented] (SPARK-16597) DataFrame DateType is written as an int(Days since epoch) by csv writer

2016-07-17 Thread Dean Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381719#comment-15381719 ] Dean Chen commented on SPARK-16597: --- Yes, closing as a dupe. > DataFrame DateType is written as an

[jira] [Closed] (SPARK-16597) DataFrame DateType is written as an int(Days since epoch) by csv writer

2016-07-17 Thread Dean Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dean Chen closed SPARK-16597. - Resolution: Duplicate > DataFrame DateType is written as an int(Days since epoch) by csv writer >

[jira] [Commented] (SPARK-16301) Analyzer rule for resolving using joins should respect case sensitivity setting

2016-07-17 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381705#comment-15381705 ] Shivaram Venkataraman commented on SPARK-16301: --- [~yhuai] [~davies] The PR looks to have

[jira] [Commented] (SPARK-16597) DataFrame DateType is written as an int(Days since epoch) by csv writer

2016-07-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381696#comment-15381696 ] Hyukjin Kwon commented on SPARK-16597: -- I guess this is a duplicated of SPARK-16216. > DataFrame

[jira] [Assigned] (SPARK-16598) Added a test case for verifying the table identifier parsing

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16598: Assignee: (was: Apache Spark) > Added a test case for verifying the table identifier

[jira] [Assigned] (SPARK-16598) Added a test case for verifying the table identifier parsing

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16598: Assignee: Apache Spark > Added a test case for verifying the table identifier parsing >

[jira] [Commented] (SPARK-16598) Added a test case for verifying the table identifier parsing

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381667#comment-15381667 ] Apache Spark commented on SPARK-16598: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Updated] (SPARK-16598) Added a test case for verifying the table identifier parsing

2016-07-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16598: Summary: Added a test case for verifying the table identifier parsing (was: Added a test case for

[jira] [Created] (SPARK-16598) Added a test case for verifying the table identifier parsing.

2016-07-17 Thread Xiao Li (JIRA)
Xiao Li created SPARK-16598: --- Summary: Added a test case for verifying the table identifier parsing. Key: SPARK-16598 URL: https://issues.apache.org/jira/browse/SPARK-16598 Project: Spark Issue

[jira] [Created] (SPARK-16597) DataFrame DateType is written as an int(Days since epoch) by csv writer

2016-07-17 Thread Dean Chen (JIRA)
Dean Chen created SPARK-16597: - Summary: DataFrame DateType is written as an int(Days since epoch) by csv writer Key: SPARK-16597 URL: https://issues.apache.org/jira/browse/SPARK-16597 Project: Spark

[jira] [Comment Edited] (SPARK-15810) Aggregator doesn't play nice with Option

2016-07-17 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381637#comment-15381637 ] koert kuipers edited comment on SPARK-15810 at 7/18/16 2:28 AM: i believe

[jira] [Commented] (SPARK-15810) Aggregator doesn't play nice with Option

2016-07-17 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381637#comment-15381637 ] koert kuipers commented on SPARK-15810: --- i believe the issue null shows up in both scala and java

[jira] [Commented] (SPARK-15810) Aggregator doesn't play nice with Option

2016-07-17 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381633#comment-15381633 ] koert kuipers commented on SPARK-15810: --- ok thats an improvement, because i got the same odd schema

[jira] [Updated] (SPARK-16027) Fix SparkR session unit test

2016-07-17 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-16027: -- Assignee: Felix Cheung (was: Apache Spark) > Fix SparkR session unit test >

[jira] [Resolved] (SPARK-16027) Fix SparkR session unit test

2016-07-17 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-16027. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request

[jira] [Updated] (SPARK-16593) Provide a pre-fetch mechanism to accelerate shuffle stage.

2016-07-17 Thread Biao Ma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Biao Ma updated SPARK-16593: Summary: Provide a pre-fetch mechanism to accelerate shuffle stage. (was: a) > Provide a pre-fetch

[jira] [Updated] (SPARK-16593) a

2016-07-17 Thread Biao Ma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Biao Ma updated SPARK-16593: Summary: a (was: Provide a pre-fetch mechanism to accelerate shuffle stage.) > a > - > >

[jira] [Updated] (SPARK-12420) Have a built-in CSV data source implementation

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12420: Assignee: (was: Hyukjin Kwon) > Have a built-in CSV data source implementation >

[jira] [Comment Edited] (SPARK-16533) Spark application not handling preemption messages

2016-07-17 Thread Emaad Manzoor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381616#comment-15381616 ] Emaad Manzoor edited comment on SPARK-16533 at 7/18/16 1:39 AM: I had the

[jira] [Commented] (SPARK-16533) Spark application not handling preemption messages

2016-07-17 Thread Emaad Manzoor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381616#comment-15381616 ] Emaad Manzoor commented on SPARK-16533: --- I had the same issue running on EC2 with single-core

[jira] [Assigned] (SPARK-10683) Source code missing for SparkR test JAR

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10683: Assignee: (was: Apache Spark) > Source code missing for SparkR test JAR >

[jira] [Assigned] (SPARK-16510) Move SparkR test JAR into Spark, include its source code

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16510: Assignee: Apache Spark (was: Shivaram Venkataraman) > Move SparkR test JAR into Spark,

[jira] [Commented] (SPARK-16510) Move SparkR test JAR into Spark, include its source code

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381585#comment-15381585 ] Apache Spark commented on SPARK-16510: -- User 'shivaram' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-3728) RandomForest: Learn models too large to store in memory

2016-07-17 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381583#comment-15381583 ] Xusen Yin edited comment on SPARK-3728 at 7/17/16 11:46 PM: Not now. Because I

[jira] [Assigned] (SPARK-10683) Source code missing for SparkR test JAR

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10683: Assignee: Apache Spark > Source code missing for SparkR test JAR >

[jira] [Commented] (SPARK-10683) Source code missing for SparkR test JAR

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381584#comment-15381584 ] Apache Spark commented on SPARK-10683: -- User 'shivaram' has created a pull request for this issue:

[jira] [Assigned] (SPARK-16510) Move SparkR test JAR into Spark, include its source code

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16510: Assignee: Shivaram Venkataraman (was: Apache Spark) > Move SparkR test JAR into Spark,

[jira] [Commented] (SPARK-3728) RandomForest: Learn models too large to store in memory

2016-07-17 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381583#comment-15381583 ] Xusen Yin commented on SPARK-3728: -- Not now. Because I thought the BFS style could reach the best

[jira] [Assigned] (SPARK-16510) Move SparkR test JAR into Spark, include its source code

2016-07-17 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman reassigned SPARK-16510: - Assignee: Shivaram Venkataraman > Move SparkR test JAR into Spark,

[jira] [Commented] (SPARK-16510) Move SparkR test JAR into Spark, include its source code

2016-07-17 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381580#comment-15381580 ] Shivaram Venkataraman commented on SPARK-16510: --- I actually found a better way to do this

[jira] [Commented] (SPARK-16589) Chained cartesian produces incorrect number of records

2016-07-17 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381557#comment-15381557 ] Dongjoon Hyun commented on SPARK-16589: --- Oh, Indeed, there is a bug of PySpark. Could you make a PR

[jira] [Assigned] (SPARK-16596) Refactor DataSourceScanExec to do partition discovery at execution instead of planning time

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16596: Assignee: Apache Spark > Refactor DataSourceScanExec to do partition discovery at

[jira] [Assigned] (SPARK-16596) Refactor DataSourceScanExec to do partition discovery at execution instead of planning time

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16596: Assignee: (was: Apache Spark) > Refactor DataSourceScanExec to do partition discovery

[jira] [Commented] (SPARK-16596) Refactor DataSourceScanExec to do partition discovery at execution instead of planning time

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381553#comment-15381553 ] Apache Spark commented on SPARK-16596: -- User 'ericl' has created a pull request for this issue:

[jira] [Created] (SPARK-16596) Refactor DataSourceScanExec to do partition discovery at execution instead of planning time

2016-07-17 Thread Eric Liang (JIRA)
Eric Liang created SPARK-16596: -- Summary: Refactor DataSourceScanExec to do partition discovery at execution instead of planning time Key: SPARK-16596 URL: https://issues.apache.org/jira/browse/SPARK-16596

[jira] [Commented] (SPARK-16576) Move plan SQL generation code from SQLBuilder into logical operators

2016-07-17 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381525#comment-15381525 ] Dongjoon Hyun commented on SPARK-16576: --- Oh, I see. I could access from the Spark shell, so I was

[jira] [Commented] (SPARK-16576) Move plan SQL generation code from SQLBuilder into logical operators

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381518#comment-15381518 ] Reynold Xin commented on SPARK-16576: - Everything in catalyst module is private. > Move plan SQL

[jira] [Commented] (SPARK-16576) Move plan SQL generation code from SQLBuilder into logical operators

2016-07-17 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381516#comment-15381516 ] Dongjoon Hyun commented on SPARK-16576: --- After moving the SQL generation codes into Logical

[jira] [Commented] (SPARK-14816) Update MLlib, GraphX, SparkR websites for 2.0

2016-07-17 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381506#comment-15381506 ] Shivaram Venkataraman commented on SPARK-14816: --- We didn't create a website for SparkR as

[jira] [Resolved] (SPARK-16595) Spark History server Rest Api gives Application not found error

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16595. --- Resolution: Duplicate Um, you already reported this. Are you not following this existing thread? >

[jira] [Created] (SPARK-16595) Spark History server Rest Api gives Application not found error

2016-07-17 Thread Yesha Vora (JIRA)
Yesha Vora created SPARK-16595: -- Summary: Spark History server Rest Api gives Application not found error Key: SPARK-16595 URL: https://issues.apache.org/jira/browse/SPARK-16595 Project: Spark

[jira] [Commented] (SPARK-14816) Update MLlib, GraphX, SparkR websites for 2.0

2016-07-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381438#comment-15381438 ] Joseph K. Bradley commented on SPARK-14816: --- I'd say the updates for

[jira] [Assigned] (SPARK-16594) Physical Plan Differences when Table Scan Having Duplicate Columns

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16594: Assignee: (was: Apache Spark) > Physical Plan Differences when Table Scan Having

[jira] [Commented] (SPARK-16594) Physical Plan Differences when Table Scan Having Duplicate Columns

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381418#comment-15381418 ] Apache Spark commented on SPARK-16594: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-16594) Physical Plan Differences when Table Scan Having Duplicate Columns

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16594: Assignee: Apache Spark > Physical Plan Differences when Table Scan Having Duplicate

[jira] [Updated] (SPARK-16594) Physical Plan Differences when Table Scan Having Duplicate Columns

2016-07-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16594: Description: Currently, we keep two implementations for planning scans over data sources. There is one

[jira] [Created] (SPARK-16594) Physical Plan Differences when Table Scan Having Duplicate Columns

2016-07-17 Thread Xiao Li (JIRA)
Xiao Li created SPARK-16594: --- Summary: Physical Plan Differences when Table Scan Having Duplicate Columns Key: SPARK-16594 URL: https://issues.apache.org/jira/browse/SPARK-16594 Project: Spark

[jira] [Assigned] (SPARK-16593) Provide a pre-fetch mechanism to accelerate shuffle stage.

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16593: Assignee: Apache Spark > Provide a pre-fetch mechanism to accelerate shuffle stage. >

[jira] [Commented] (SPARK-16593) Provide a pre-fetch mechanism to accelerate shuffle stage.

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381391#comment-15381391 ] Apache Spark commented on SPARK-16593: -- User 'f7753' has created a pull request for this issue:

[jira] [Assigned] (SPARK-16593) Provide a pre-fetch mechanism to accelerate shuffle stage.

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16593: Assignee: (was: Apache Spark) > Provide a pre-fetch mechanism to accelerate shuffle

[jira] [Created] (SPARK-16593) Provide a pre-fetch mechanism to accelerate shuffle stage.

2016-07-17 Thread Biao Ma (JIRA)
Biao Ma created SPARK-16593: --- Summary: Provide a pre-fetch mechanism to accelerate shuffle stage. Key: SPARK-16593 URL: https://issues.apache.org/jira/browse/SPARK-16593 Project: Spark Issue

[jira] [Commented] (SPARK-16283) Implement percentile_approx SQL function

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381335#comment-15381335 ] Apache Spark commented on SPARK-16283: -- User 'lw-lin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-16283) Implement percentile_approx SQL function

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16283: Assignee: (was: Apache Spark) > Implement percentile_approx SQL function >

[jira] [Assigned] (SPARK-16283) Implement percentile_approx SQL function

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16283: Assignee: Apache Spark > Implement percentile_approx SQL function >

[jira] [Commented] (SPARK-13634) Assigning spark context to variable results in serialization error

2016-07-17 Thread Rahul Palamuttam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381236#comment-15381236 ] Rahul Palamuttam commented on SPARK-13634: -- Understood and thank you for explaining. I agree

[jira] [Issue Comment Deleted] (SPARK-16592) Improving ml.Logistic Regression on speed and scalability

2016-07-17 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-16592: --- Comment: was deleted (was: sparse data support) > Improving ml.Logistic Regression on speed and

[jira] [Commented] (SPARK-16592) Improving ml.Logistic Regression on speed and scalability

2016-07-17 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381225#comment-15381225 ] yuhao yang commented on SPARK-16592: sparse data support > Improving ml.Logistic Regression on speed

[jira] [Commented] (SPARK-16588) Missed API fix for a function name mismatched between FunctionRegistry and functions.scala

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381224#comment-15381224 ] Apache Spark commented on SPARK-16588: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-16588) Missed API fix for a function name mismatched between FunctionRegistry and functions.scala

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16588: Assignee: (was: Apache Spark) > Missed API fix for a function name mismatched between

[jira] [Assigned] (SPARK-16588) Missed API fix for a function name mismatched between FunctionRegistry and functions.scala

2016-07-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16588: Assignee: Apache Spark > Missed API fix for a function name mismatched between

[jira] [Comment Edited] (SPARK-16592) Improving ml.Logistic Regression on speed and scalability

2016-07-17 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381222#comment-15381222 ] yuhao yang edited comment on SPARK-16592 at 7/17/16 8:19 AM: - Placeholder for

[jira] [Created] (SPARK-16592) Improving ml.Logistic Regression on speed and scalability

2016-07-17 Thread yuhao yang (JIRA)
yuhao yang created SPARK-16592: -- Summary: Improving ml.Logistic Regression on speed and scalability Key: SPARK-16592 URL: https://issues.apache.org/jira/browse/SPARK-16592 Project: Spark Issue

[jira] [Commented] (SPARK-16592) Improving ml.Logistic Regression on speed and scalability

2016-07-17 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381222#comment-15381222 ] yuhao yang commented on SPARK-16592: Placeholder for list of primary ongoing efforts: > Improving

[jira] [Reopened] (SPARK-16466) names() function allows creation of column name containing "-". filter() function subsequently fails

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-16466: --- Minor: I prefer using "Fixed" when there's a resolution to point to. Here we're not sure what if any

[jira] [Resolved] (SPARK-16466) names() function allows creation of column name containing "-". filter() function subsequently fails

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16466. --- Resolution: Not A Problem > names() function allows creation of column name containing "-".

[jira] [Updated] (SPARK-12420) Have a built-in CSV data source implementation

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12420: -- Assignee: Hossein Falaki > Have a built-in CSV data source implementation >

[jira] [Updated] (SPARK-12420) Have a built-in CSV data source implementation

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12420: -- Assignee: Hyukjin Kwon (was: Hossein Falaki) > Have a built-in CSV data source implementation >

[jira] [Updated] (SPARK-12436) If all values of a JSON field is null, JSON's inferSchema should return NullType instead of StringType

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12436: -- Target Version/s: (was: 2.0.0) > If all values of a JSON field is null, JSON's inferSchema should

[jira] [Updated] (SPARK-12373) Type coercion rule of dividing two decimal values may choose an intermediate precision that does not have enough number of digits at the left of decimal point

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12373: -- Target Version/s: (was: 2.0.0) > Type coercion rule of dividing two decimal values may choose an

[jira] [Updated] (SPARK-12437) Reserved words (like table) throws error when writing a data frame to JDBC

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12437: -- Target Version/s: (was: 2.0.0) > Reserved words (like table) throws error when writing a data frame

[jira] [Updated] (SPARK-16585) Update inner fields of complex types in dataframes

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16585: -- Labels: (was: build features) Priority: Minor (was: Blocker) Fix Version/s:

[jira] [Updated] (SPARK-16591) HadoopFsRelation will list , cache all parquet file paths

2016-07-17 Thread cen yuhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cen yuhai updated SPARK-16591: -- Description: HadoopFsRelation has a fileStatusCache which list all paths and then cache all

[jira] [Resolved] (SPARK-16585) Update inner fields of complex types in dataframes

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16585. --- Resolution: Invalid > Update inner fields of complex types in dataframes >

[jira] [Resolved] (SPARK-16570) Not able to access table's data after ALTER TABLE RENAME in Spark 1.6.2

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16570. --- Resolution: Duplicate Fix Version/s: (was: 1.6.2) Target Version/s: (was:

[jira] [Updated] (SPARK-16591) HadoopFsRelation will list , cache all parquet file paths

2016-07-17 Thread cen yuhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cen yuhai updated SPARK-16591: -- Shepherd: Cheng Lian (was: lianwenbo) > HadoopFsRelation will list , cache all parquet file paths >

[jira] [Updated] (SPARK-16591) HadoopFsRelation will list , cache all parquet file paths

2016-07-17 Thread cen yuhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cen yuhai updated SPARK-16591: -- Shepherd: lianwenbo > HadoopFsRelation will list , cache all parquet file paths >

[jira] [Created] (SPARK-16591) HadoopFsRelation will list , cache all parquet file paths

2016-07-17 Thread cen yuhai (JIRA)
cen yuhai created SPARK-16591: - Summary: HadoopFsRelation will list , cache all parquet file paths Key: SPARK-16591 URL: https://issues.apache.org/jira/browse/SPARK-16591 Project: Spark Issue

[jira] [Comment Edited] (SPARK-13634) Assigning spark context to variable results in serialization error

2016-07-17 Thread Rahul Palamuttam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381168#comment-15381168 ] Rahul Palamuttam edited comment on SPARK-13634 at 7/17/16 7:41 AM: ---

[jira] [Resolved] (SPARK-15393) Writing empty Dataframes doesn't save any _metadata files

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15393. --- Resolution: Duplicate Target Version/s: (was: 2.0.0) > Writing empty Dataframes doesn't

[jira] [Commented] (SPARK-13634) Assigning spark context to variable results in serialization error

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381169#comment-15381169 ] Sean Owen commented on SPARK-13634: --- Go ahead, though in general I think it's pretty implicit that you

[jira] [Commented] (SPARK-13634) Assigning spark context to variable results in serialization error

2016-07-17 Thread Rahul Palamuttam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381168#comment-15381168 ] Rahul Palamuttam commented on SPARK-13634: -- Kai Chen, thank you. I apologize for not responding

[jira] [Comment Edited] (SPARK-13634) Assigning spark context to variable results in serialization error

2016-07-17 Thread Rahul Palamuttam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177093#comment-15177093 ] Rahul Palamuttam edited comment on SPARK-13634 at 7/17/16 7:24 AM: ---

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-07-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381167#comment-15381167 ] Sean Owen commented on SPARK-12261: --- We need logs showing the actual error. If this is local mode, the

[jira] [Commented] (SPARK-16452) basic INFORMATION_SCHEMA support

2016-07-17 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381164#comment-15381164 ] Dongjoon Hyun commented on SPARK-16452: --- Could you review the PR again? > basic INFORMATION_SCHEMA

[jira] [Commented] (SPARK-16452) basic INFORMATION_SCHEMA support

2016-07-17 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381161#comment-15381161 ] Dongjoon Hyun commented on SPARK-16452: --- Yep. It's closed. > basic INFORMATION_SCHEMA support >

[jira] [Closed] (SPARK-16492) Fill VIEW_DEFINITION column in INFORMATION_SCHEMA.views

2016-07-17 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-16492. - Resolution: Invalid > Fill VIEW_DEFINITION column in INFORMATION_SCHEMA.views >

[jira] [Commented] (SPARK-16588) Missed API fix for a function name mismatched between FunctionRegistry and functions.scala

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381152#comment-15381152 ] Reynold Xin commented on SPARK-16588: - Yes we should deprecate the monotonicallyIncreasingId >

[jira] [Commented] (SPARK-16452) basic INFORMATION_SCHEMA support

2016-07-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381149#comment-15381149 ] Reynold Xin commented on SPARK-16452: - Go ahead. Thanks. > basic INFORMATION_SCHEMA support >

[jira] [Commented] (SPARK-16452) basic INFORMATION_SCHEMA support

2016-07-17 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15381148#comment-15381148 ] Dongjoon Hyun commented on SPARK-16452: --- Oh, may I close SPARK-16492 as INVALID or LATER? > basic

  1   2   >