[jira] [Closed] (SPARK-22089) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-21 Thread guichaoxian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guichaoxian closed SPARK-22089. --- Resolution: Fixed > There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex >

[jira] [Assigned] (SPARK-21981) Python API for ClusteringEvaluator

2017-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-21981: --- Assignee: Marco Gaido > Python API for ClusteringEvaluator >

[jira] [Resolved] (SPARK-21981) Python API for ClusteringEvaluator

2017-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-21981. - Resolution: Fixed Fix Version/s: 2.3.0 > Python API for ClusteringEvaluator >

[jira] [Resolved] (SPARK-22094) processAllAvailable should not block forever when a query is stopped

2017-09-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-22094. -- Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 > processAllAvailable

[jira] [Assigned] (SPARK-21766) DataFrame toPandas() raises ValueError with nullable int columns

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21766: Assignee: (was: Apache Spark) > DataFrame toPandas() raises ValueError with nullable

[jira] [Assigned] (SPARK-21766) DataFrame toPandas() raises ValueError with nullable int columns

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21766: Assignee: Apache Spark > DataFrame toPandas() raises ValueError with nullable int columns

[jira] [Updated] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes

2017-09-21 Thread Vincent (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vincent updated SPARK-22096: Attachment: performance data for NB.png > use aggregateByKeyLocally to save one stage in calculating

[jira] [Updated] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes

2017-09-21 Thread Vincent (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vincent updated SPARK-22096: Description: NaiveBayes currently takes aggreateByKey followed by a collect to calculate frequency for

[jira] [Updated] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes

2017-09-21 Thread Vincent (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vincent updated SPARK-22096: Description: NaiveBayes currently takes aggreateByKey followed by a collect to calculate frequency for

[jira] [Commented] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175894#comment-16175894 ] Apache Spark commented on SPARK-22096: -- User 'VinceShieh' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22096: Assignee: (was: Apache Spark) > use aggregateByKeyLocally to save one stage in

[jira] [Assigned] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22096: Assignee: Apache Spark > use aggregateByKeyLocally to save one stage in calculating

[jira] [Assigned] (SPARK-22098) Add aggregateByKeyLocally in RDD

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22098: Assignee: (was: Apache Spark) > Add aggregateByKeyLocally in RDD >

[jira] [Assigned] (SPARK-22098) Add aggregateByKeyLocally in RDD

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22098: Assignee: Apache Spark > Add aggregateByKeyLocally in RDD >

[jira] [Commented] (SPARK-22098) Add aggregateByKeyLocally in RDD

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175892#comment-16175892 ] Apache Spark commented on SPARK-22098: -- User 'ConeyLiu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22097) Call serializationStream.close after we requested enough memory

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22097: Assignee: Apache Spark > Call serializationStream.close after we requested enough memory

[jira] [Commented] (SPARK-22097) Call serializationStream.close after we requested enough memory

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175887#comment-16175887 ] Apache Spark commented on SPARK-22097: -- User 'ConeyLiu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22097) Call serializationStream.close after we requested enough memory

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22097: Assignee: (was: Apache Spark) > Call serializationStream.close after we requested

[jira] [Created] (SPARK-22098) Add aggregateByKeyLocally in RDD

2017-09-21 Thread Vincent (JIRA)
Vincent created SPARK-22098: --- Summary: Add aggregateByKeyLocally in RDD Key: SPARK-22098 URL: https://issues.apache.org/jira/browse/SPARK-22098 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-22097) Call serializationStream.close after we requested enough memory

2017-09-21 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-22097: Summary: Call serializationStream.close after we requested enough memory Key: SPARK-22097 URL: https://issues.apache.org/jira/browse/SPARK-22097 Project: Spark

[jira] [Created] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes

2017-09-21 Thread Vincent (JIRA)
Vincent created SPARK-22096: --- Summary: use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes Key: SPARK-22096 URL: https://issues.apache.org/jira/browse/SPARK-22096

[jira] [Commented] (SPARK-13030) Change OneHotEncoder to Estimator

2017-09-21 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175833#comment-16175833 ] zhengruifeng commented on SPARK-13030: -- I approve that one-hot encoder should be a estimtor, and

[jira] [Commented] (SPARK-22093) UtilsSuite "resolveURIs with multiple paths" test always cancelled

2017-09-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175794#comment-16175794 ] Hyukjin Kwon commented on SPARK-22093: -- Would it make sense to just remove that {{assume}}? It looks

[jira] [Closed] (SPARK-22095) java.util.NoSuchElementException: key not found: _PYSPARK_DRIVER_CALLBACK_HOST

2017-09-21 Thread softwarevamp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] softwarevamp closed SPARK-22095. Resolution: Not A Problem > java.util.NoSuchElementException: key not found:

[jira] [Comment Edited] (SPARK-22095) java.util.NoSuchElementException: key not found: _PYSPARK_DRIVER_CALLBACK_HOST

2017-09-21 Thread softwarevamp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175773#comment-16175773 ] softwarevamp edited comment on SPARK-22095 at 9/22/17 1:29 AM: --- i am sorry

[jira] [Commented] (SPARK-19357) Parallel Model Evaluation for ML Tuning: Scala

2017-09-21 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175774#comment-16175774 ] Weichen Xu commented on SPARK-19357: [~josephkb] I thought about this, the desgin: `Estimator:: def

[jira] [Commented] (SPARK-22095) java.util.NoSuchElementException: key not found: _PYSPARK_DRIVER_CALLBACK_HOST

2017-09-21 Thread softwarevamp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175773#comment-16175773 ] softwarevamp commented on SPARK-22095: -- i am sorry maybe my fault: spark-submit --packages ...

[jira] [Commented] (SPARK-22095) java.util.NoSuchElementException: key not found: _PYSPARK_DRIVER_CALLBACK_HOST

2017-09-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175770#comment-16175770 ] Jeff Zhang commented on SPARK-22095: Could you tell how to reproduce this issue ? >

[jira] [Created] (SPARK-22095) java.util.NoSuchElementException: key not found: _PYSPARK_DRIVER_CALLBACK_HOST

2017-09-21 Thread softwarevamp (JIRA)
softwarevamp created SPARK-22095: Summary: java.util.NoSuchElementException: key not found: _PYSPARK_DRIVER_CALLBACK_HOST Key: SPARK-22095 URL: https://issues.apache.org/jira/browse/SPARK-22095

[jira] [Commented] (SPARK-22094) processAllAvailable should not block forever when a query is stopped

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175687#comment-16175687 ] Apache Spark commented on SPARK-22094: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22094) processAllAvailable should not block forever when a query is stopped

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22094: Assignee: Apache Spark (was: Shixiong Zhu) > processAllAvailable should not block

[jira] [Assigned] (SPARK-22094) processAllAvailable should not block forever when a query is stopped

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22094: Assignee: Shixiong Zhu (was: Apache Spark) > processAllAvailable should not block

[jira] [Created] (SPARK-22094) processAllAvailable should not block forever when a query is stopped

2017-09-21 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-22094: Summary: processAllAvailable should not block forever when a query is stopped Key: SPARK-22094 URL: https://issues.apache.org/jira/browse/SPARK-22094 Project: Spark

[jira] [Comment Edited] (SPARK-19357) Parallel Model Evaluation for ML Tuning: Scala

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175629#comment-16175629 ] Joseph K. Bradley edited comment on SPARK-19357 at 9/21/17 11:14 PM: -

[jira] [Commented] (SPARK-19357) Parallel Model Evaluation for ML Tuning: Scala

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175629#comment-16175629 ] Joseph K. Bradley commented on SPARK-19357: --- [~bryanc], [~nick.pentre...@gmail.com],

[jira] [Updated] (SPARK-21928) ClassNotFoundException for custom Kryo registrator class during serde in netty threads

2017-09-21 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-21928: --- Fix Version/s: 2.1.2 > ClassNotFoundException for custom Kryo registrator class during serde

[jira] [Resolved] (SPARK-22053) Implement stream-stream inner join in Append mode

2017-09-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-22053. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 19271

[jira] [Comment Edited] (SPARK-14236) UDAF does not use incomingSchema for update Method

2017-09-21 Thread Guilherme Braccialli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175541#comment-16175541 ] Guilherme Braccialli edited comment on SPARK-14236 at 9/21/17 10:29 PM:

[jira] [Updated] (SPARK-22060) CrossValidator/TrainValidationSplit parallelism param persist/load bug

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22060: -- Target Version/s: 2.3.0 > CrossValidator/TrainValidationSplit parallelism param

[jira] [Assigned] (SPARK-22060) CrossValidator/TrainValidationSplit parallelism param persist/load bug

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-22060: - Assignee: Weichen Xu > CrossValidator/TrainValidationSplit parallelism param

[jira] [Updated] (SPARK-22060) CrossValidator/TrainValidationSplit parallelism param persist/load bug

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22060: -- Shepherd: Joseph K. Bradley > CrossValidator/TrainValidationSplit parallelism param

[jira] [Comment Edited] (SPARK-14236) UDAF does not use incomingSchema for update Method

2017-09-21 Thread Guilherme Braccialli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175541#comment-16175541 ] Guilherme Braccialli edited comment on SPARK-14236 at 9/21/17 10:29 PM:

[jira] [Commented] (SPARK-14236) UDAF does not use incomingSchema for update Method

2017-09-21 Thread Guilherme Braccialli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175541#comment-16175541 ] Guilherme Braccialli commented on SPARK-14236: -- +1 to implement this. as a workaround I'm

[jira] [Commented] (SPARK-22077) RpcEndpointAddress fails to parse spark URL if it is an ipv6 address.

2017-09-21 Thread Eric Vandenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175479#comment-16175479 ] Eric Vandenberg commented on SPARK-22077: - Yes, it worked when I overloaded with "localhost" so

[jira] [Commented] (SPARK-21928) ClassNotFoundException for custom Kryo registrator class during serde in netty threads

2017-09-21 Thread John Brock (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175452#comment-16175452 ] John Brock commented on SPARK-21928: Excellent, thanks for looking into this. >

[jira] [Updated] (SPARK-22083) When dropping multiple blocks to disk, Spark should release all locks on a failure

2017-09-21 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-22083: - Description: {{MemoryStore.evictBlocksToFreeSpace}} first [acquires writer locks on all the

[jira] [Commented] (SPARK-21928) ClassNotFoundException for custom Kryo registrator class during serde in netty threads

2017-09-21 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175410#comment-16175410 ] Imran Rashid commented on SPARK-21928: -- thanks [~jbrock], thats great. I think this is fully

[jira] [Updated] (SPARK-21928) ClassNotFoundException for custom Kryo registrator class during serde in netty threads

2017-09-21 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-21928: - Summary: ClassNotFoundException for custom Kryo registrator class during serde in netty threads

[jira] [Updated] (SPARK-21928) ML LogisticRegression training occasionally produces java.lang.ClassNotFoundException when attempting to load custom Kryo registrator class

2017-09-21 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-21928: - Description: >From SPARK-13990 & SPARK-13926, Spark's SerializerManager has its own instance

[jira] [Commented] (SPARK-21928) ML LogisticRegression training occasionally produces java.lang.ClassNotFoundException when attempting to load custom Kryo registrator class

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175385#comment-16175385 ] Apache Spark commented on SPARK-21928: -- User 'squito' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22071) Improve release build scripts to check correct JAVA version is being used for build

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22071: Assignee: Apache Spark (was: holdenk) > Improve release build scripts to check correct

[jira] [Commented] (SPARK-22072) Allow the same shell params to be used for all of the different steps in release-build

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175352#comment-16175352 ] Apache Spark commented on SPARK-22072: -- User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22071) Improve release build scripts to check correct JAVA version is being used for build

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22071: Assignee: holdenk (was: Apache Spark) > Improve release build scripts to check correct

[jira] [Commented] (SPARK-22071) Improve release build scripts to check correct JAVA version is being used for build

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175354#comment-16175354 ] Apache Spark commented on SPARK-22071: -- User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22072) Allow the same shell params to be used for all of the different steps in release-build

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22072: Assignee: Apache Spark (was: holdenk) > Allow the same shell params to be used for all

[jira] [Assigned] (SPARK-22072) Allow the same shell params to be used for all of the different steps in release-build

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22072: Assignee: holdenk (was: Apache Spark) > Allow the same shell params to be used for all

[jira] [Commented] (SPARK-22083) When dropping multiple blocks to disk, Spark should release all locks on a failure

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175309#comment-16175309 ] Apache Spark commented on SPARK-22083: -- User 'squito' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22083) When dropping multiple blocks to disk, Spark should release all locks on a failure

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22083: Assignee: (was: Apache Spark) > When dropping multiple blocks to disk, Spark should

[jira] [Assigned] (SPARK-22083) When dropping multiple blocks to disk, Spark should release all locks on a failure

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22083: Assignee: Apache Spark > When dropping multiple blocks to disk, Spark should release all

[jira] [Resolved] (SPARK-22009) Using treeAggregate improve some algs

2017-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22009. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19232

[jira] [Commented] (SPARK-18136) Make PySpark pip install works on windows

2017-09-21 Thread Jakub Nowacki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175303#comment-16175303 ] Jakub Nowacki commented on SPARK-18136: --- I've tried using Windows command {{mklink}} to create

[jira] [Assigned] (SPARK-22009) Using treeAggregate improve some algs

2017-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22009: - Assignee: zhengruifeng > Using treeAggregate improve some algs >

[jira] [Assigned] (SPARK-22075) GBTs forgot to unpersist datasets cached by Checkpointer

2017-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22075: - Assignee: zhengruifeng Priority: Minor (was: Major) Issue Type: Improvement (was:

[jira] [Resolved] (SPARK-22075) GBTs forgot to unpersist datasets cached by Checkpointer

2017-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22075. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19288

[jira] [Resolved] (SPARK-22088) Incorrect scalastyle comment causes wrong styles in stringExpressions

2017-09-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22088. - Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 2.3.0 > Incorrect scalastyle

[jira] [Commented] (SPARK-18136) Make PySpark pip install works on windows

2017-09-21 Thread Jakub Nowacki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175276#comment-16175276 ] Jakub Nowacki commented on SPARK-18136: --- [PR|https://github.com/apache/spark/pull/19310] fixes how

[jira] [Commented] (SPARK-18136) Make PySpark pip install works on windows

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175258#comment-16175258 ] Apache Spark commented on SPARK-18136: -- User 'jsnowacki' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18136) Make PySpark pip install works on windows

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18136: Assignee: (was: Apache Spark) > Make PySpark pip install works on windows >

[jira] [Assigned] (SPARK-18136) Make PySpark pip install works on windows

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18136: Assignee: Apache Spark > Make PySpark pip install works on windows >

[jira] [Commented] (SPARK-19558) Provide a config option to attach QueryExecutionListener to SparkSession

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175173#comment-16175173 ] Apache Spark commented on SPARK-19558: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19558) Provide a config option to attach QueryExecutionListener to SparkSession

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19558: Assignee: (was: Apache Spark) > Provide a config option to attach

[jira] [Assigned] (SPARK-19558) Provide a config option to attach QueryExecutionListener to SparkSession

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19558: Assignee: Apache Spark > Provide a config option to attach QueryExecutionListener to

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-09-21 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175158#comment-16175158 ] Timothy Hunter commented on SPARK-21866: Putting this code under {{org.apache.spark.ml.image}}

[jira] [Resolved] (SPARK-21928) ML LogisticRegression training occasionally produces java.lang.ClassNotFoundException when attempting to load custom Kryo registrator class

2017-09-21 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-21928. Resolution: Fixed Assignee: Imran Rashid Fix Version/s: 2.3.0

[jira] [Resolved] (SPARK-22061) Add pipeline model of SVM

2017-09-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-22061. Resolution: Won't Fix > Add pipeline model of SVM > - > >

[jira] [Commented] (SPARK-22061) Add pipeline model of SVM

2017-09-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175124#comment-16175124 ] Nick Pentreath commented on SPARK-22061: Agreed, this already exists. I closed this issue. > Add

[jira] [Created] (SPARK-22093) UtilsSuite "resolveURIs with multiple paths" test always cancelled

2017-09-21 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-22093: -- Summary: UtilsSuite "resolveURIs with multiple paths" test always cancelled Key: SPARK-22093 URL: https://issues.apache.org/jira/browse/SPARK-22093 Project:

[jira] [Updated] (SPARK-22083) When dropping multiple blocks to disk, Spark should release all locks on a failure

2017-09-21 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-22083: - Description: {{MemoryStore.evictBlocksToFreeSpace}} first [acquires writer locks on all the

[jira] [Commented] (SPARK-22092) Reallocation in OffHeapColumnVector.reserveInternal corrupts array data

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175087#comment-16175087 ] Apache Spark commented on SPARK-22092: -- User 'ala' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22092) Reallocation in OffHeapColumnVector.reserveInternal corrupts array data

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22092: Assignee: Apache Spark > Reallocation in OffHeapColumnVector.reserveInternal corrupts

[jira] [Assigned] (SPARK-22092) Reallocation in OffHeapColumnVector.reserveInternal corrupts array data

2017-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22092: Assignee: (was: Apache Spark) > Reallocation in OffHeapColumnVector.reserveInternal

[jira] [Created] (SPARK-22092) Reallocation in OffHeapColumnVector.reserveInternal corrupts array data

2017-09-21 Thread Ala Luszczak (JIRA)
Ala Luszczak created SPARK-22092: Summary: Reallocation in OffHeapColumnVector.reserveInternal corrupts array data Key: SPARK-22092 URL: https://issues.apache.org/jira/browse/SPARK-22092 Project:

[jira] [Commented] (SPARK-21928) ML LogisticRegression training occasionally produces java.lang.ClassNotFoundException when attempting to load custom Kryo registrator class

2017-09-21 Thread John Brock (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175066#comment-16175066 ] John Brock commented on SPARK-21928: It does! I see this in the log right before an executor got

[jira] [Closed] (SPARK-22091) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-21 Thread guichaoxian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guichaoxian closed SPARK-22091. --- Resolution: Duplicate > There is no need for fileStatusCache to invalidateAll when

[jira] [Closed] (SPARK-22090) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-21 Thread guichaoxian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guichaoxian closed SPARK-22090. --- > There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex > refresh >

[jira] [Resolved] (SPARK-21653) Complement SQL expression document

2017-09-21 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh resolved SPARK-21653. - Resolution: Fixed > Complement SQL expression document >

[jira] [Resolved] (SPARK-22086) Add expression description for CASE WHEN

2017-09-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22086. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19304

[jira] [Assigned] (SPARK-22086) Add expression description for CASE WHEN

2017-09-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-22086: Assignee: Liang-Chi Hsieh > Add expression description for CASE WHEN >

[jira] [Resolved] (SPARK-17997) Aggregation function for counting distinct values for multiple intervals

2017-09-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-17997. - Resolution: Fixed > Aggregation function for counting distinct values for multiple intervals >

[jira] [Reopened] (SPARK-17997) Aggregation function for counting distinct values for multiple intervals

2017-09-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reopened SPARK-17997: - Assignee: Zhenhua Wang > Aggregation function for counting distinct values for multiple

[jira] [Resolved] (SPARK-22090) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22090. --- Resolution: Duplicate > There is no need for fileStatusCache to invalidateAll when

[jira] [Created] (SPARK-22091) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-21 Thread guichaoxian (JIRA)
guichaoxian created SPARK-22091: --- Summary: There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh Key: SPARK-22091 URL: https://issues.apache.org/jira/browse/SPARK-22091

[jira] [Created] (SPARK-22090) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-21 Thread guichaoxian (JIRA)
guichaoxian created SPARK-22090: --- Summary: There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh Key: SPARK-22090 URL: https://issues.apache.org/jira/browse/SPARK-22090

[jira] [Created] (SPARK-22089) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-21 Thread guichaoxian (JIRA)
guichaoxian created SPARK-22089: --- Summary: There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh Key: SPARK-22089 URL: https://issues.apache.org/jira/browse/SPARK-22089

[jira] [Commented] (SPARK-22034) CrossValidator's training and testing set with different set of labels, resulting in encoder transform error

2017-09-21 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16174729#comment-16174729 ] Weichen Xu commented on SPARK-22034: Do you mean a pipeline including stage VectorIndexer + stage

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2017-09-21 Thread Marco Veluscek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16174727#comment-16174727 ] Marco Veluscek commented on SPARK-16845: Hello, I have just encountered a similar issue when

[jira] [Commented] (SPARK-19141) VectorAssembler metadata causing memory issues

2017-09-21 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16174685#comment-16174685 ] Weichen Xu commented on SPARK-19141: Maybe we need design a sparse format of AttributeGroup for

[jira] [Comment Edited] (SPARK-18727) Support schema evolution as new files are inserted into table

2017-09-21 Thread Serge Smertin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16174660#comment-16174660 ] Serge Smertin edited comment on SPARK-18727 at 9/21/17 12:31 PM: - i have

[jira] [Commented] (SPARK-18727) Support schema evolution as new files are inserted into table

2017-09-21 Thread Serge Smertin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16174660#comment-16174660 ] Serge Smertin commented on SPARK-18727: --- in one of the use-cases for project in [#comment-15987668]

[jira] [Commented] (SPARK-21418) NoSuchElementException: None.get in DataSourceScanExec with sun.io.serialization.extendedDebugInfo=true

2017-09-21 Thread Artem Kupchinskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16174642#comment-16174642 ] Artem Kupchinskiy commented on SPARK-21418: --- There is still a place in FileSourceScanExec.scala

  1   2   >