[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471545#comment-16471545 ] shahid commented on SPARK-15784: Hi [~josephkb] , I can work on it. > Add Power Iteratio

[jira] [Commented] (SPARK-21187) Complete support for remaining Spark data types in Arrow Converters

2018-05-10 Thread Eric Wohlstadter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471519#comment-16471519 ] Eric Wohlstadter commented on SPARK-21187: -- [~bryanc] [~hyukjin.kwon] Hi Bryan,

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471479#comment-16471479 ] Yinan Li commented on SPARK-24248: -- I think it's both more robust and easier to implemen

[jira] [Updated] (SPARK-24173) Flaky Test: VersionsSuite

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24173: -- Description: *BRANCH-2.2* - https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(D

[jira] [Updated] (SPARK-24211) Flaky test: StreamingOuterJoinSuite

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24211: -- Description: *windowed left outer join* - https://amplab.cs.berkeley.edu/jenkins/view/Spark%20

[jira] [Resolved] (SPARK-24197) add array_sort function

2018-05-10 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24197. -- Resolution: Fixed Fix Version/s: 2.4.0 Fixed in https://github.com/apache/spark/pull/212

[jira] [Commented] (SPARK-11150) Dynamic partition pruning

2018-05-10 Thread Henry Robinson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471311#comment-16471311 ] Henry Robinson commented on SPARK-11150: The title of this JIRA is 'dynamic parti

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471295#comment-16471295 ] Matt Cheah commented on SPARK-24248: I see - I suppose if the watch connection drops,

[jira] [Updated] (SPARK-24233) union operation on read of dataframe does nor produce correct result

2018-05-10 Thread smohr003 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] smohr003 updated SPARK-24233: - Description: I know that I can use wild card * to read all subfolders. But, I am trying to use .par and

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471288#comment-16471288 ] Yinan Li commented on SPARK-24248: -- Just realized one thing: solely relying on the watc

[jira] [Commented] (SPARK-24204) Verify a write schema in Json/Orc/ParquetFileFormat

2018-05-10 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471283#comment-16471283 ] Takeshi Yamamuro commented on SPARK-24204: -- ok, I'll do it later. Thanks for the

[jira] [Comment Edited] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471280#comment-16471280 ] Matt Cheah edited comment on SPARK-24248 at 5/10/18 11:18 PM: -

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471280#comment-16471280 ] Matt Cheah commented on SPARK-24248: I thought about it a bit more, and believe that

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471259#comment-16471259 ] Yinan Li commented on SPARK-24248: -- Actually even if the fabric8 client does not support

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471244#comment-16471244 ] Yinan Li commented on SPARK-24248: -- It's potentially possible to get rid of the in-memor

[jira] [Commented] (SPARK-24198) add slice function

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471215#comment-16471215 ] Apache Spark commented on SPARK-24198: -- User 'mn-mikke' has created a pull request f

[jira] [Assigned] (SPARK-24198) add slice function

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24198: Assignee: Apache Spark > add slice function > -- > > Key:

[jira] [Assigned] (SPARK-24198) add slice function

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24198: Assignee: (was: Apache Spark) > add slice function > -- > >

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471206#comment-16471206 ] Matt Cheah commented on SPARK-24248: [~foxish] [~liyinan926] curious as to what you t

[jira] [Created] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-24248: -- Summary: [K8S] Use the Kubernetes cluster as the backing store for the state of pods Key: SPARK-24248 URL: https://issues.apache.org/jira/browse/SPARK-24248 Project: Spar

[jira] [Created] (SPARK-24247) [K8S] currentNodeToLocalTaskCount is unused in KubernetesClusterSchedulerBackend

2018-05-10 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-24247: -- Summary: [K8S] currentNodeToLocalTaskCount is unused in KubernetesClusterSchedulerBackend Key: SPARK-24247 URL: https://issues.apache.org/jira/browse/SPARK-24247 Project:

[jira] [Resolved] (SPARK-10878) Race condition when resolving Maven coordinates via Ivy

2018-05-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-10878. Resolution: Fixed Assignee: Kazuaki Ishizaki Fix Version/s: 2.4.0

[jira] [Issue Comment Deleted] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shahid updated SPARK-24217: --- Comment: was deleted (was: Thanks for the clarification Joseph K. Bradley Is it really required to append th

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471128#comment-16471128 ] shahid edited comment on SPARK-24217 at 5/10/18 9:28 PM: - Thanks

[jira] [Resolved] (SPARK-19181) SparkListenerSuite.local metrics fails when average executorDeserializeTime is too short.

2018-05-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-19181. Resolution: Fixed Fix Version/s: 2.3.1 2.4.0 Issue resolved by pu

[jira] [Assigned] (SPARK-19181) SparkListenerSuite.local metrics fails when average executorDeserializeTime is too short.

2018-05-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-19181: -- Assignee: Attila Zsolt Piros > SparkListenerSuite.local metrics fails when average exe

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471128#comment-16471128 ] shahid edited comment on SPARK-24217 at 5/10/18 9:26 PM: - Thanks

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471128#comment-16471128 ] shahid edited comment on SPARK-24217 at 5/10/18 9:23 PM: - Thanks

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471128#comment-16471128 ] shahid edited comment on SPARK-24217 at 5/10/18 9:19 PM: - Thanks

[jira] [Assigned] (SPARK-24246) Improve AnalysisException by setting the cause when it's available

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24246: Assignee: Shixiong Zhu (was: Apache Spark) > Improve AnalysisException by setting the cau

[jira] [Assigned] (SPARK-24246) Improve AnalysisException by setting the cause when it's available

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24246: Assignee: Apache Spark (was: Shixiong Zhu) > Improve AnalysisException by setting the cau

[jira] [Commented] (SPARK-24246) Improve AnalysisException by setting the cause when it's available

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471131#comment-16471131 ] Apache Spark commented on SPARK-24246: -- User 'zsxwing' has created a pull request fo

[jira] [Issue Comment Deleted] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shahid updated SPARK-24217: --- Comment: was deleted (was: Thanks for the clarification.  Is it really required to append the result with th

[jira] [Commented] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471128#comment-16471128 ] shahid commented on SPARK-24217: Thanks for the clarification Joseph K. Bradley Is it r

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471040#comment-16471040 ] shahid edited comment on SPARK-24217 at 5/10/18 9:15 PM: - Thanks

[jira] [Created] (SPARK-24246) Improve AnalysisException by setting the cause when it's available

2018-05-10 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-24246: Summary: Improve AnalysisException by setting the cause when it's available Key: SPARK-24246 URL: https://issues.apache.org/jira/browse/SPARK-24246 Project: Spark

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471040#comment-16471040 ] shahid edited comment on SPARK-24217 at 5/10/18 9:11 PM: - Thanks

[jira] [Updated] (SPARK-23681) Switch OrcFileFormat to newer hadoop.mapreduce output classes

2018-05-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-23681: --- Target Version/s: (was: 2.3.1) > Switch OrcFileFormat to newer hadoop.mapreduce output clas

[jira] [Commented] (SPARK-24245) Flaky test: KafkaContinuousSinkSuite

2018-05-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471077#comment-16471077 ] Marcelo Vanzin commented on SPARK-24245: Lowering since it doesn't seem that flak

[jira] [Updated] (SPARK-24245) Flaky test: KafkaContinuousSinkSuite

2018-05-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-24245: --- Target Version/s: (was: 2.3.1) > Flaky test: KafkaContinuousSinkSuite > ---

[jira] [Updated] (SPARK-24245) Flaky test: KafkaContinuousSinkSuite

2018-05-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-24245: --- Priority: Major (was: Critical) > Flaky test: KafkaContinuousSinkSuite > ---

[jira] [Created] (SPARK-24245) Flaky test: KafkaContinuousSinkSuite

2018-05-10 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-24245: -- Summary: Flaky test: KafkaContinuousSinkSuite Key: SPARK-24245 URL: https://issues.apache.org/jira/browse/SPARK-24245 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread spark_user (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16471040#comment-16471040 ] spark_user commented on SPARK-24217: Thanks for the clarification. I am closing the P

[jira] [Updated] (SPARK-24137) [K8s] Mount temporary directories in emptydir volumes

2018-05-10 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yinan Li updated SPARK-24137: - Fix Version/s: (was: 2.3.1) > [K8s] Mount temporary directories in emptydir volumes > ---

[jira] [Assigned] (SPARK-24230) With Parquet 1.10 upgrade has errors in the vectorized reader

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24230: Assignee: Apache Spark > With Parquet 1.10 upgrade has errors in the vectorized reader > -

[jira] [Commented] (SPARK-24244) Parse only required columns of CSV file

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470947#comment-16470947 ] Apache Spark commented on SPARK-24244: -- User 'MaxGekk' has created a pull request fo

[jira] [Commented] (SPARK-24230) With Parquet 1.10 upgrade has errors in the vectorized reader

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470948#comment-16470948 ] Apache Spark commented on SPARK-24230: -- User 'rdblue' has created a pull request for

[jira] [Assigned] (SPARK-24244) Parse only required columns of CSV file

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24244: Assignee: Apache Spark > Parse only required columns of CSV file > ---

[jira] [Assigned] (SPARK-24244) Parse only required columns of CSV file

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24244: Assignee: (was: Apache Spark) > Parse only required columns of CSV file >

[jira] [Assigned] (SPARK-24230) With Parquet 1.10 upgrade has errors in the vectorized reader

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24230: Assignee: (was: Apache Spark) > With Parquet 1.10 upgrade has errors in the vectorized

[jira] [Created] (SPARK-24244) Parse only required columns of CSV file

2018-05-10 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-24244: -- Summary: Parse only required columns of CSV file Key: SPARK-24244 URL: https://issues.apache.org/jira/browse/SPARK-24244 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-24228) Fix the lint error

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24228: -- Priority: Minor (was: Major) > Fix the lint error > -- > > Key

[jira] [Updated] (SPARK-24137) [K8s] Mount temporary directories in emptydir volumes

2018-05-10 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yinan Li updated SPARK-24137: - Fix Version/s: 2.3.1 > [K8s] Mount temporary directories in emptydir volumes > --

[jira] [Assigned] (SPARK-24137) [K8s] Mount temporary directories in emptydir volumes

2018-05-10 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anirudh Ramanathan reassigned SPARK-24137: -- Assignee: Matt Cheah > [K8s] Mount temporary directories in emptydir volumes >

[jira] [Resolved] (SPARK-24137) [K8s] Mount temporary directories in emptydir volumes

2018-05-10 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anirudh Ramanathan resolved SPARK-24137. Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 21238

[jira] [Updated] (SPARK-24225) Support closing AutoClosable objects in MemoryStore so Broadcast Variables can be released properly

2018-05-10 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-24225: Shepherd: DB Tsai Priority: Major (was: Minor) > Support closing AutoClosable objects in MemoryStore s

[jira] [Updated] (SPARK-24225) Support closing AutoClosable objects in MemoryStore so Broadcast Variables can be released properly

2018-05-10 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-24225: Issue Type: New Feature (was: Improvement) > Support closing AutoClosable objects in MemoryStore so Broadc

[jira] [Assigned] (SPARK-24225) Support closing AutoClosable objects in MemoryStore so Broadcast Variables can be released properly

2018-05-10 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-24225: --- Assignee: Doug Rohrer > Support closing AutoClosable objects in MemoryStore so Broadcast Variables

[jira] [Commented] (SPARK-23519) Create View Commands Fails with The view output (col1,col1) contains duplicate column name

2018-05-10 Thread Franck Tago (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470843#comment-16470843 ] Franck Tago commented on SPARK-23519: - I do not agree with the 'typical database' cla

[jira] [Updated] (SPARK-23519) Create View Commands Fails with The view output (col1,col1) contains duplicate column name

2018-05-10 Thread Franck Tago (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franck Tago updated SPARK-23519: Attachment: image-2018-05-10-10-48-57-259.png > Create View Commands Fails with The view output (c

[jira] [Commented] (SPARK-24243) Expose exceptions from InProcessAppHandle

2018-05-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470797#comment-16470797 ] Marcelo Vanzin commented on SPARK-24243: Sure. As long as the child process handl

[jira] [Commented] (SPARK-23206) Additional Memory Tuning Metrics

2018-05-10 Thread Edwina Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470789#comment-16470789 ] Edwina Lu commented on SPARK-23206: --- [~irashid], I do not have the rest of the changes

[jira] [Comment Edited] (SPARK-23206) Additional Memory Tuning Metrics

2018-05-10 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470777#comment-16470777 ] Felix Cheung edited comment on SPARK-23206 at 5/10/18 5:20 PM:

[jira] [Commented] (SPARK-23206) Additional Memory Tuning Metrics

2018-05-10 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470777#comment-16470777 ] Felix Cheung commented on SPARK-23206: -- yes, for use network and disk IO stats. We h

[jira] [Comment Edited] (SPARK-23458) Flaky test: OrcQuerySuite

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470735#comment-16470735 ] Dongjoon Hyun edited comment on SPARK-23458 at 5/10/18 5:10 PM: ---

[jira] [Commented] (SPARK-23458) Flaky test: OrcQuerySuite

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470735#comment-16470735 ] Dongjoon Hyun commented on SPARK-23458: --- Oh, I missed your ping here, [~smilegator]

[jira] [Commented] (SPARK-24213) Power Iteration Clustering in the SparkML throws exception, when the ID is IntType

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470705#comment-16470705 ] Joseph K. Bradley commented on SPARK-24213: --- On the topic of eating my words, p

[jira] [Commented] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470704#comment-16470704 ] Joseph K. Bradley commented on SPARK-24217: --- On the topic of eating my words, p

[jira] [Comment Edited] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470701#comment-16470701 ] Joseph K. Bradley edited comment on SPARK-15784 at 5/10/18 4:45 PM: ---

[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470701#comment-16470701 ] Joseph K. Bradley commented on SPARK-15784: --- So... we originally agreed to make

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16469562#comment-16469562 ] Joseph K. Bradley edited comment on SPARK-24217 at 5/10/18 4:37 PM: ---

[jira] [Updated] (SPARK-24068) CSV schema inferring doesn't work for compressed files

2018-05-10 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-24068: - Fix Version/s: 2.3.1 > CSV schema inferring doesn't work for compressed files > -

[jira] [Comment Edited] (SPARK-24204) Verify a write schema in Json/Orc/ParquetFileFormat

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470634#comment-16470634 ] Dongjoon Hyun edited comment on SPARK-24204 at 5/10/18 4:29 PM: ---

[jira] [Updated] (SPARK-24233) union operation on read of dataframe does nor produce correct result

2018-05-10 Thread smohr003 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] smohr003 updated SPARK-24233: - Description: I know that I can use wild card * to read all subfolders. But, I am trying to use .par and

[jira] [Comment Edited] (SPARK-24204) Verify a write schema in Json/Orc/ParquetFileFormat

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470634#comment-16470634 ] Dongjoon Hyun edited comment on SPARK-24204 at 5/10/18 4:28 PM: ---

[jira] [Updated] (SPARK-24204) Verify a write schema in Json/Orc/ParquetFileFormat

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24204: -- Description: *SUMMARY* - CSV: Raising analysis exception. - JSON: dropping columns with null ty

[jira] [Updated] (SPARK-24204) Verify a write schema in Json/Orc/ParquetFileFormat

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24204: -- Summary: Verify a write schema in Json/Orc/ParquetFileFormat (was: Verify a write schema in Or

[jira] [Updated] (SPARK-24204) Verify a write schema in Orc/ParquetFileFormat

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24204: -- Summary: Verify a write schema in Orc/ParquetFileFormat (was: Verify a write schema in OrcFile

[jira] [Commented] (SPARK-24204) Verify a write schema in OrcFileFormat

2018-05-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470634#comment-16470634 ] Dongjoon Hyun commented on SPARK-24204: --- Thank you for pinging me, [~maropu]. Could

[jira] [Assigned] (SPARK-24197) add array_sort function

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24197: Assignee: Apache Spark > add array_sort function > --- > >

[jira] [Commented] (SPARK-24197) add array_sort function

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470616#comment-16470616 ] Apache Spark commented on SPARK-24197: -- User 'mn-mikke' has created a pull request f

[jira] [Assigned] (SPARK-24197) add array_sort function

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24197: Assignee: (was: Apache Spark) > add array_sort function > --- > >

[jira] [Commented] (SPARK-24036) Stateful operators in continuous processing

2018-05-10 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470552#comment-16470552 ] Jose Torres commented on SPARK-24036: - My concern isn't that we'll have to write more

[jira] [Commented] (SPARK-24237) continuous shuffle dependency

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470546#comment-16470546 ] Apache Spark commented on SPARK-24237: -- User 'xuanyuanking' has created a pull reque

[jira] [Assigned] (SPARK-24237) continuous shuffle dependency

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24237: Assignee: Apache Spark > continuous shuffle dependency > - > >

[jira] [Assigned] (SPARK-24237) continuous shuffle dependency

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24237: Assignee: (was: Apache Spark) > continuous shuffle dependency > --

[jira] [Commented] (SPARK-24227) Not able to submit spark job to kubernetes on 2.3

2018-05-10 Thread Felipe Cavalcanti (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470493#comment-16470493 ] Felipe Cavalcanti commented on SPARK-24227: --- solved it, spark was using the bas

[jira] [Commented] (SPARK-24068) CSV schema inferring doesn't work for compressed files

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470361#comment-16470361 ] Apache Spark commented on SPARK-24068: -- User 'MaxGekk' has created a pull request fo

[jira] [Comment Edited] (SPARK-24232) Allow referring to kubernetes secrets as env variable

2018-05-10 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470333#comment-16470333 ] Stavros Kontopoulos edited comment on SPARK-24232 at 5/10/18 12:43 PM:

[jira] [Commented] (SPARK-24243) Expose exceptions from InProcessAppHandle

2018-05-10 Thread Sahil Takiar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470339#comment-16470339 ] Sahil Takiar commented on SPARK-24243: -- [~vanzin] would adding something like this b

[jira] [Comment Edited] (SPARK-11150) Dynamic partition pruning

2018-05-10 Thread tim geary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16469551#comment-16469551 ] tim geary edited comment on SPARK-11150 at 5/10/18 12:42 PM: -

[jira] [Created] (SPARK-24243) Expose exceptions from InProcessAppHandle

2018-05-10 Thread Sahil Takiar (JIRA)
Sahil Takiar created SPARK-24243: Summary: Expose exceptions from InProcessAppHandle Key: SPARK-24243 URL: https://issues.apache.org/jira/browse/SPARK-24243 Project: Spark Issue Type: Improve

[jira] [Commented] (SPARK-24232) Allow referring to kubernetes secrets as env variable

2018-05-10 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470333#comment-16470333 ] Stavros Kontopoulos commented on SPARK-24232: - Check also what we use on meso

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread spark_user (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16469858#comment-16469858 ] spark_user edited comment on SPARK-24217 at 5/10/18 12:22 PM: -

[jira] [Resolved] (SPARK-23907) Support regr_* functions

2018-05-10 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-23907. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21054 [https://g

[jira] [Assigned] (SPARK-23907) Support regr_* functions

2018-05-10 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-23907: - Assignee: Marco Gaido > Support regr_* functions > > >

[jira] [Commented] (SPARK-24242) RangeExec should have correct outputOrdering

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470201#comment-16470201 ] Apache Spark commented on SPARK-24242: -- User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-24242) RangeExec should have correct outputOrdering

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24242: Assignee: Apache Spark > RangeExec should have correct outputOrdering > --

[jira] [Assigned] (SPARK-24242) RangeExec should have correct outputOrdering

2018-05-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24242: Assignee: (was: Apache Spark) > RangeExec should have correct outputOrdering > ---

[jira] [Created] (SPARK-24242) RangeExec should have correct outputOrdering

2018-05-10 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-24242: --- Summary: RangeExec should have correct outputOrdering Key: SPARK-24242 URL: https://issues.apache.org/jira/browse/SPARK-24242 Project: Spark Issue Type
