[jira] [Assigned] (SPARK-20264) asm should be non-test dependency in sql/core

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20264: Assignee: Reynold Xin (was: Apache Spark) > asm should be non-test dependency in

[jira] [Assigned] (SPARK-20264) asm should be non-test dependency in sql/core

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20264: Assignee: Apache Spark (was: Reynold Xin) > asm should be non-test dependency in

[jira] [Commented] (SPARK-20264) asm should be non-test dependency in sql/core

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961700#comment-15961700 ] Apache Spark commented on SPARK-20264: -- User 'rxin' has created a pull request for this issue:

[jira] [Created] (SPARK-20264) asm should be non-test dependency in sql/core

2017-04-07 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-20264: --- Summary: asm should be non-test dependency in sql/core Key: SPARK-20264 URL: https://issues.apache.org/jira/browse/SPARK-20264 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18055) Dataset.flatMap can't work with types from customized jar

2017-04-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961687#comment-15961687 ] Wenchen Fan commented on SPARK-18055: - I think this is a different issue, can you open a new ticket

[jira] [Commented] (SPARK-19352) Sorting issues on relatively big datasets

2017-04-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961686#comment-15961686 ] Wenchen Fan commented on SPARK-19352: - I don't think Spark will provide API support for this

[jira] [Closed] (SPARK-20262) AssertNotNull should throw NullPointerException

2017-04-07 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li closed SPARK-20262. --- Resolution: Fixed Fix Version/s: 2.2.0 2.1.2 > AssertNotNull should throw

[jira] [Commented] (SPARK-20259) Support push down join optimizations in DataFrameReader when loading from JDBC

2017-04-07 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961677#comment-15961677 ] Hyukjin Kwon commented on SPARK-20259: -- Could you describe the current status and why it should be

[jira] [Commented] (SPARK-19935) SparkSQL unsupports to create a hive table which is mapped for HBase table

2017-04-07 Thread Xiaochen Ouyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961670#comment-15961670 ] Xiaochen Ouyang commented on SPARK-19935: - Not yet! I tried to work on this issue, but now only

[jira] [Comment Edited] (SPARK-19935) SparkSQL unsupports to create a hive table which is mapped for HBase table

2017-04-07 Thread Xiaochen Ouyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961670#comment-15961670 ] Xiaochen Ouyang edited comment on SPARK-19935 at 4/8/17 4:04 AM: -

[jira] [Resolved] (SPARK-20246) Should check determinism when pushing predicates down through aggregation

2017-04-07 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20246. - Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.2.0 2.1.2

[jira] [Updated] (SPARK-20263) create empty dataframes in sparkR

2017-04-07 Thread Ott Toomet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ott Toomet updated SPARK-20263: --- Priority: Minor (was: Trivial) Description: SparkR 2.1 does not support creating empty

[jira] [Created] (SPARK-20263) create empty dataframes in sparkR

2017-04-07 Thread Ott Toomet (JIRA)
Ott Toomet created SPARK-20263: -- Summary: create empty dataframes in sparkR Key: SPARK-20263 URL: https://issues.apache.org/jira/browse/SPARK-20263 Project: Spark Issue Type: Wish

[jira] [Assigned] (SPARK-20262) AssertNotNull should throw NullPointerException

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20262: Assignee: Reynold Xin (was: Apache Spark) > AssertNotNull should throw

[jira] [Assigned] (SPARK-20262) AssertNotNull should throw NullPointerException

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20262: Assignee: Apache Spark (was: Reynold Xin) > AssertNotNull should throw

[jira] [Created] (SPARK-20262) AssertNotNull should throw NullPointerException

2017-04-07 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-20262: --- Summary: AssertNotNull should throw NullPointerException Key: SPARK-20262 URL: https://issues.apache.org/jira/browse/SPARK-20262 Project: Spark Issue Type:

[jira] [Commented] (SPARK-20262) AssertNotNull should throw NullPointerException

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961584#comment-15961584 ] Apache Spark commented on SPARK-20262: -- User 'rxin' has created a pull request for this issue:

[jira] [Created] (SPARK-20261) EventLoggingListener may not truly flush the logger when a compression codec is used

2017-04-07 Thread Brian Cho (JIRA)
Brian Cho created SPARK-20261: - Summary: EventLoggingListener may not truly flush the logger when a compression codec is used Key: SPARK-20261 URL: https://issues.apache.org/jira/browse/SPARK-20261

[jira] [Assigned] (SPARK-20260) MLUtils parseLibSVMRecord has incorrect string interpolation for error message

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20260: Assignee: (was: Apache Spark) > MLUtils parseLibSVMRecord has incorrect string

[jira] [Assigned] (SPARK-20260) MLUtils parseLibSVMRecord has incorrect string interpolation for error message

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20260: Assignee: Apache Spark > MLUtils parseLibSVMRecord has incorrect string interpolation for

[jira] [Commented] (SPARK-20260) MLUtils parseLibSVMRecord has incorrect string interpolation for error message

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961491#comment-15961491 ] Apache Spark commented on SPARK-20260: -- User 'vijaykramesh' has created a pull request for this

[jira] [Created] (SPARK-20260) MLUtils parseLibSVMRecord has incorrect string interpolation for error message

2017-04-07 Thread Vijay Krishna Ramesh (JIRA)
Vijay Krishna Ramesh created SPARK-20260: Summary: MLUtils parseLibSVMRecord has incorrect string interpolation for error message Key: SPARK-20260 URL: https://issues.apache.org/jira/browse/SPARK-20260

[jira] [Resolved] (SPARK-20255) FileIndex hierarchy inconsistency

2017-04-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-20255. - Resolution: Fixed Assignee: Adrian Ionescu Fix Version/s: 2.2.0 > FileIndex

[jira] [Updated] (SPARK-20258) SparkR logistic regression example did not converge in programming guide

2017-04-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-20258: - Fix Version/s: 2.2.0 > SparkR logistic regression example did not converge in programming guide

[jira] [Resolved] (SPARK-20258) SparkR logistic regression example did not converge in programming guide

2017-04-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-20258. -- Resolution: Fixed Assignee: Wayne Zhang > SparkR logistic regression example did not

[jira] [Commented] (SPARK-18934) Writing to dynamic partitions does not preserve sort order if spill occurs

2017-04-07 Thread Charles Pritchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961390#comment-15961390 ] Charles Pritchard commented on SPARK-18934: --- Possibly fixed in:

[jira] [Commented] (SPARK-19352) Sorting issues on relatively big datasets

2017-04-07 Thread Charles Pritchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961389#comment-15961389 ] Charles Pritchard commented on SPARK-19352: --- [~cloud_fan] Is there something on the roadmap to

[jira] [Created] (SPARK-20259) Support push down join optimizations in DataFrameReader when loading from JDBC

2017-04-07 Thread John Muller (JIRA)
John Muller created SPARK-20259: --- Summary: Support push down join optimizations in DataFrameReader when loading from JDBC Key: SPARK-20259 URL: https://issues.apache.org/jira/browse/SPARK-20259

[jira] [Assigned] (SPARK-20258) SparkR logistic regression example did not converge in programming guide

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20258: Assignee: Apache Spark > SparkR logistic regression example did not converge in

[jira] [Commented] (SPARK-20258) SparkR logistic regression example did not converge in programming guide

2017-04-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961274#comment-15961274 ] Felix Cheung commented on SPARK-20258: -- Thanks! > SparkR logistic regression example did not

[jira] [Commented] (SPARK-20258) SparkR logistic regression example did not converge in programming guide

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961273#comment-15961273 ] Apache Spark commented on SPARK-20258: -- User 'actuaryzhang' has created a pull request for this

[jira] [Assigned] (SPARK-20258) SparkR logistic regression example did not converge in programming guide

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20258: Assignee: (was: Apache Spark) > SparkR logistic regression example did not converge

[jira] [Created] (SPARK-20258) SparkR logistic regression example did not converge in programming guide

2017-04-07 Thread Wayne Zhang (JIRA)
Wayne Zhang created SPARK-20258: --- Summary: SparkR logistic regression example did not converge in programming guide Key: SPARK-20258 URL: https://issues.apache.org/jira/browse/SPARK-20258 Project:

[jira] [Created] (SPARK-20257) Fix test for directory created to work when running as R CMD check

2017-04-07 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-20257: Summary: Fix test for directory created to work when running as R CMD check Key: SPARK-20257 URL: https://issues.apache.org/jira/browse/SPARK-20257 Project: Spark

[jira] [Commented] (SPARK-20257) Fix test for directory created to work when running as R CMD check

2017-04-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961221#comment-15961221 ] Felix Cheung commented on SPARK-20257: -- Please see the PR https://github.com/apache/spark/pull/17516

[jira] [Updated] (SPARK-20197) CRAN check fail with package installation

2017-04-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-20197: - Target Version/s: 2.1.1, 2.2.0 Fix Version/s: 2.2.0 2.1.1 > CRAN

[jira] [Commented] (SPARK-20256) Fail to start SparkContext/SparkSession with Hive support enabled when user does not have read/write privilege to Hive metastore warehouse dir

2017-04-07 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961193#comment-15961193 ] Xin Wu commented on SPARK-20256: I am working on a fix and creating simulated test cases for this issue.

[jira] [Updated] (SPARK-20256) Fail to start SparkContext/SparkSession with Hive support enabled when user does not have read/write privilege to Hive metastore warehouse dir

2017-04-07 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Wu updated SPARK-20256: --- Description: In a cluster setup with production Hive running, when the user wants to run spark-shell using

[jira] [Resolved] (SPARK-20026) Document R GLM Tweedie family support in programming guide and code example

2017-04-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-20026. -- Resolution: Fixed Assignee: Wayne Zhang Fix Version/s: 2.2.0 > Document R GLM

[jira] [Created] (SPARK-20256) Fail to start SparkContext/SparkSession with Hive support enabled when user does not have read/write privilege to Hive metastore warehouse dir

2017-04-07 Thread Xin Wu (JIRA)
Xin Wu created SPARK-20256: -- Summary: Fail to start SparkContext/SparkSession with Hive support enabled when user does not have read/write privilege to Hive metastore warehouse dir Key: SPARK-20256 URL:

[jira] [Commented] (SPARK-18055) Dataset.flatMap can't work with types from customized jar

2017-04-07 Thread Paul Zaczkieiwcz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961180#comment-15961180 ] Paul Zaczkieiwcz commented on SPARK-18055: -- [~marmbrus]: I ran into this issue when using a

[jira] [Assigned] (SPARK-20255) FileIndex hierarchy inconsistency

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20255: Assignee: Apache Spark > FileIndex hierarchy inconsistency >

[jira] [Commented] (SPARK-20255) FileIndex hierarchy inconsistency

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961173#comment-15961173 ] Apache Spark commented on SPARK-20255: -- User 'adrian-ionescu' has created a pull request for this

[jira] [Assigned] (SPARK-20255) FileIndex hierarchy inconsistency

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20255: Assignee: (was: Apache Spark) > FileIndex hierarchy inconsistency >

[jira] [Commented] (SPARK-20253) Remove unnecessary nullchecks of a return value from Spark runtime routines in generated Java code

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961171#comment-15961171 ] Apache Spark commented on SPARK-20253: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20253) Remove unnecessary nullchecks of a return value from Spark runtime routines in generated Java code

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20253: Assignee: (was: Apache Spark) > Remove unnecessary nullchecks of a return value from

[jira] [Assigned] (SPARK-20253) Remove unnecessary nullchecks of a return value from Spark runtime routines in generated Java code

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20253: Assignee: Apache Spark > Remove unnecessary nullchecks of a return value from Spark

[jira] [Commented] (SPARK-20254) SPARK-19716 generates unnecessary data conversion for Dataset with primitive array

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961167#comment-15961167 ] Apache Spark commented on SPARK-20254: -- User 'kiszk' has created a pull request for this issue:

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-04-07 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961170#comment-15961170 ] Andrew Ash commented on SPARK-20144: This is a regression from 1.6 to the 2.x line. [~marmbrus]

[jira] [Created] (SPARK-20255) FileIndex hierarchy inconsistency

2017-04-07 Thread Adrian Ionescu (JIRA)
Adrian Ionescu created SPARK-20255: -- Summary: FileIndex hierarchy inconsistency Key: SPARK-20255 URL: https://issues.apache.org/jira/browse/SPARK-20255 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-20254) SPARK-19716 generates unnecessary data conversion for Dataset with primitive array

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20254: Assignee: (was: Apache Spark) > SPARK-19716 generates unnecessary data conversion for

[jira] [Assigned] (SPARK-20254) SPARK-19716 generates unnecessary data conversion for Dataset with primitive array

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20254: Assignee: Apache Spark > SPARK-19716 generates unnecessary data conversion for Dataset

[jira] [Commented] (SPARK-20254) SPARK-19716 generates unnecessary data conversion for Dataset with primitive array

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961126#comment-15961126 ] Apache Spark commented on SPARK-20254: -- User 'kiszk' has created a pull request for this issue:

[jira] [Updated] (SPARK-20254) SPARK-19716 generates unnecessary data conversion for Dataset with primitive array

2017-04-07 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki updated SPARK-20254: - Summary: SPARK-19716 generates unnecessary data conversion for Dataset with primitive

[jira] [Issue Comment Deleted] (SPARK-20243) DebugFilesystem.assertNoOpenStreams thread race

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20243: -- Comment: was deleted (was: This needs detail to be a JIRA.) > DebugFilesystem.assertNoOpenStreams

[jira] [Commented] (SPARK-19991) FileSegmentManagedBuffer performance improvement.

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961110#comment-15961110 ] Apache Spark commented on SPARK-19991: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-20219) Schedule tasks based on size of input from ScheduledRDD

2017-04-07 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15961104#comment-15961104 ] Imran Rashid commented on SPARK-20219: -- I'm not saying I don't think this is a good proposal. I'm

[jira] [Updated] (SPARK-20254) SPARK-19716 generates inefficient Java code from a primitive array of Dataset

2017-04-07 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki updated SPARK-20254: - Description: Since {{unresolvedmapobjects}} is newly introduced by SPARK-19716, the

[jira] [Updated] (SPARK-20254) SPARK-19716 generates inefficient Java code from a primitive array of Dataset

2017-04-07 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki updated SPARK-20254: - Summary: SPARK-19716 generates inefficient Java code from a primitive array of Dataset

[jira] [Created] (SPARK-20254) SPARK-19716 generate inefficient Java code from a primitive array of Dataset

2017-04-07 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-20254: Summary: SPARK-19716 generate inefficient Java code from a primitive array of Dataset Key: SPARK-20254 URL: https://issues.apache.org/jira/browse/SPARK-20254

[jira] [Assigned] (SPARK-19518) IGNORE NULLS in first_value / last_value should be supported in SQL statements

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19518: Assignee: Apache Spark > IGNORE NULLS in first_value / last_value should be supported in

[jira] [Assigned] (SPARK-19518) IGNORE NULLS in first_value / last_value should be supported in SQL statements

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19518: Assignee: (was: Apache Spark) > IGNORE NULLS in first_value / last_value should be

[jira] [Commented] (SPARK-19518) IGNORE NULLS in first_value / last_value should be supported in SQL statements

2017-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960985#comment-15960985 ] Apache Spark commented on SPARK-19518: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Closed] (SPARK-20181) Avoid noisy Jetty WARN log when failing to bind a port

2017-04-07 Thread Derek Dagit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Derek Dagit closed SPARK-20181. --- Resolution: Invalid This is no longer an issue in master because the log level is already set such

[jira] [Comment Edited] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge edited comment on SPARK-20227 at 4/7/17 3:28 PM: -- Well, you

[jira] [Commented] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960965#comment-15960965 ] Sean Owen commented on SPARK-20227: --- See https://issues.apache.org/jira/browse/SPARK-20226 for

[jira] [Comment Edited] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge edited comment on SPARK-20227 at 4/7/17 3:26 PM: -- Well, you

[jira] [Comment Edited] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge edited comment on SPARK-20227 at 4/7/17 3:26 PM: -- Well, you

[jira] [Comment Edited] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge edited comment on SPARK-20227 at 4/7/17 3:24 PM: -- Well, you

[jira] [Comment Edited] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge edited comment on SPARK-20227 at 4/7/17 3:24 PM: -- Well, you

[jira] [Comment Edited] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge edited comment on SPARK-20227 at 4/7/17 3:24 PM: -- Well, you

[jira] [Comment Edited] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge edited comment on SPARK-20227 at 4/7/17 3:23 PM: -- Well, you

[jira] [Comment Edited] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge edited comment on SPARK-20227 at 4/7/17 3:22 PM: -- Well, you

[jira] [Commented] (SPARK-20227) Job hangs when joining a lot of aggregated columns

2017-04-07 Thread Quentin Auge (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960953#comment-15960953 ] Quentin Auge commented on SPARK-20227: -- Well, you were right to ask. After further investigation, it

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2017-04-07 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960925#comment-15960925 ] Steve Loughran commented on SPARK-2984: --- For s3a commits, HADOOP-13786 is going to be the fix. This

[jira] [Comment Edited] (SPARK-16784) Configurable log4j settings

2017-04-07 Thread Torsten Scholak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960922#comment-15960922 ] Torsten Scholak edited comment on SPARK-16784 at 4/7/17 3:01 PM: - I

[jira] [Commented] (SPARK-16784) Configurable log4j settings

2017-04-07 Thread Torsten Scholak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960922#comment-15960922 ] Torsten Scholak commented on SPARK-16784: - I having this exact problem. I need to be able to

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2017-04-07 Thread Hemang Nagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960906#comment-15960906 ] Hemang Nagar commented on SPARK-2984: - Is there any work going on this issue, or anything related to

[jira] [Commented] (SPARK-20226) Call to sqlContext.cacheTable takes an incredibly long time in some cases

2017-04-07 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960868#comment-15960868 ] Barry Becker commented on SPARK-20226: -- Only 11 columns. I did not want to wait for 10 or 20 minutes

[jira] [Created] (SPARK-20253) Remove unnecessary nullchecks of a return value from Spark runtime routines in generated Java code

2017-04-07 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-20253: Summary: Remove unnecessary nullchecks of a return value from Spark runtime routines in generated Java code Key: SPARK-20253 URL:

[jira] [Commented] (SPARK-20226) Call to sqlContext.cacheTable takes an incredibly long time in some cases

2017-04-07 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960835#comment-15960835 ] Liang-Chi Hsieh commented on SPARK-20226: - How many columns are added in above runs? I didn't see

[jira] [Commented] (SPARK-20226) Call to sqlContext.cacheTable takes an incredibly long time in some cases

2017-04-07 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960806#comment-15960806 ] Barry Becker commented on SPARK-20226: -- OK, I set the flag using

[jira] [Commented] (SPARK-20252) java.lang.ClassNotFoundException: $line22.$read$$iwC$$iwC$movie_row

2017-04-07 Thread Peter Mead (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960762#comment-15960762 ] Peter Mead commented on SPARK-20252: I'm Not sure how this explains how it work the first (and every)

[jira] [Resolved] (SPARK-20252) java.lang.ClassNotFoundException: $line22.$read$$iwC$$iwC$movie_row

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20252. --- Resolution: Duplicate This is basically a limitation of how the shell and classloaders work. Simpler

[jira] [Commented] (SPARK-20251) Spark streaming skips batches in a case of failure

2017-04-07 Thread Roman Studenikin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960703#comment-15960703 ] Roman Studenikin commented on SPARK-20251: -- we've spent quite a lot of time investigating this

[jira] [Commented] (SPARK-19935) SparkSQL unsupports to create a hive table which is mapped for HBase table

2017-04-07 Thread sydt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960682#comment-15960682 ] sydt commented on SPARK-19935: -- Have you resolve this problem about create table in sparksql for hbase table

[jira] [Created] (SPARK-20252) java.lang.ClassNotFoundException: $line22.$read$$iwC$$iwC$movie_row

2017-04-07 Thread Peter Mead (JIRA)
Peter Mead created SPARK-20252: -- Summary: java.lang.ClassNotFoundException: $line22.$read$$iwC$$iwC$movie_row Key: SPARK-20252 URL: https://issues.apache.org/jira/browse/SPARK-20252 Project: Spark

[jira] [Resolved] (SPARK-19900) [Standalone] Master registers application again when driver relaunched

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19900. --- Resolution: Cannot Reproduce > [Standalone] Master registers application again when driver

[jira] [Commented] (SPARK-20251) Spark streaming skips batches in a case of failure

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960671#comment-15960671 ] Sean Owen commented on SPARK-20251: --- This depends on too many things, like how you've set up your app

[jira] [Resolved] (SPARK-20218) '/applications/[app-id]/stages' in REST API,add description.

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20218. --- Resolution: Fixed Fix Version/s: 2.1.2 2.2.0 Issue resolved by pull

[jira] [Assigned] (SPARK-20218) '/applications/[app-id]/stages' in REST API,add description.

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-20218: - Assignee: guoxiaolongzte Priority: Trivial (was: Minor) > '/applications/[app-id]/stages'

[jira] [Updated] (SPARK-20251) Spark streaming skips batches in a case of failure

2017-04-07 Thread Roman Studenikin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Roman Studenikin updated SPARK-20251: - Description: We are experiencing strange behaviour of spark streaming application.

[jira] [Created] (SPARK-20251) Spark streaming skips batches in a case of failure

2017-04-07 Thread Roman Studenikin (JIRA)
Roman Studenikin created SPARK-20251: Summary: Spark streaming skips batches in a case of failure Key: SPARK-20251 URL: https://issues.apache.org/jira/browse/SPARK-20251 Project: Spark

[jira] [Commented] (SPARK-20219) Schedule tasks based on size of input from ScheduledRDD

2017-04-07 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960608#comment-15960608 ] jin xing commented on SPARK-20219: -- [~kayousterhout] [~irashid] Thanks a lot for taking look at this :)

[jira] [Updated] (SPARK-19282) RandomForestRegressionModel summary should expose getMaxDepth

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19282: -- Fix Version/s: (was: 2.2.0) > RandomForestRegressionModel summary should expose getMaxDepth >

[jira] [Updated] (SPARK-19522) --executor-memory flag doesn't work in local-cluster mode

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19522: -- Target Version/s: 2.0.3, 2.1.2, 2.2.0 (was: 2.0.3, 2.1.1) > --executor-memory flag doesn't work in

[jira] [Updated] (SPARK-19035) rand() function in case when cause failed

2017-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19035: -- Target Version/s: 2.0.3, 2.1.2, 2.2.0 (was: 2.0.3, 2.1.1, 2.2.0) > rand() function in case when cause

[jira] [Updated] (SPARK-20219) Schedule tasks based on size of input from ScheduledRDD

2017-04-07 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jin xing updated SPARK-20219: - Attachment: screenshot-1.png > Schedule tasks based on size of input from ScheduledRDD >

[jira] [Updated] (SPARK-20250) Improper OOM error when a task been killed while spilling data

2017-04-07 Thread Feng Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feng Zhu updated SPARK-20250: - Description: When a task is calling spill() but it receives a killing request from driver (e.g.,

[jira] [Updated] (SPARK-20250) Improper OOM error when a task been killed while spilling data

2017-04-07 Thread Feng Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feng Zhu updated SPARK-20250: - Description: While a task is calling spill() when it receives a killing request from driver (e.g.,

  1   2   >