[jira] [Commented] (SPARK-11474) Options to jdbc load are lower cased

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990335#comment-14990335 ] Apache Spark commented on SPARK-11474: -- User 'huaxingao' has created a pull request for this issue:

[jira] [Commented] (SPARK-3789) [GRAPHX] Python bindings for GraphX

2015-11-04 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990450#comment-14990450 ] Ankur Dave commented on SPARK-3789: --- I don't think there's any danger of graph analytics going away. On

[jira] [Commented] (SPARK-9372) For a join operator, rows with null equal join key expression can be filtered out early

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990531#comment-14990531 ] Apache Spark commented on SPARK-9372: - User 'vidma' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11510: Assignee: Apache Spark (was: Reynold Xin) > Remove some SQL aggregation tests >

[jira] [Resolved] (SPARK-10949) Upgrade Snappy Java to 1.1.2

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-10949. - Resolution: Fixed Assignee: Josh Rosen Fix Version/s: 1.6.0 > Upgrade Snappy

[jira] [Created] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-11510: --- Summary: Remove some SQL aggregation tests Key: SPARK-11510 URL: https://issues.apache.org/jira/browse/SPARK-11510 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-3789) [GRAPHX] Python bindings for GraphX

2015-11-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990365#comment-14990365 ] Sean Owen commented on SPARK-3789: -- I suppose I am arguing there isn't actually much demand for GraphX

[jira] [Commented] (SPARK-3789) [GRAPHX] Python bindings for GraphX

2015-11-04 Thread Michael Malak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990391#comment-14990391 ] Michael Malak commented on SPARK-3789: -- My publisher tells me the MEAP for Spark GraphX In Action has

[jira] [Resolved] (SPARK-11475) DataFrame API saveAsTable() does not work well for HDFS HA

2015-11-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11475. --- Resolution: Not A Problem > DataFrame API saveAsTable() does not work well for HDFS HA >

[jira] [Resolved] (SPARK-11505) Break aggregate functions into multiple files

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-11505. - Resolution: Fixed Fix Version/s: 1.6.0 > Break aggregate functions into multiple files >

[jira] [Created] (SPARK-11507) Error thrown when using BlockMatrix.add

2015-11-04 Thread Kareem Alhazred (JIRA)
Kareem Alhazred created SPARK-11507: --- Summary: Error thrown when using BlockMatrix.add Key: SPARK-11507 URL: https://issues.apache.org/jira/browse/SPARK-11507 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11193) Spark 1.5+ Kinesis Streaming - ClassCastException when starting KinesisReceiver

2015-11-04 Thread Phil Kallos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990574#comment-14990574 ] Phil Kallos commented on SPARK-11193: - Any chance a fix for this will make the 1.6 release milestone?

[jira] [Commented] (SPARK-3789) [GRAPHX] Python bindings for GraphX

2015-11-04 Thread Darren Govoni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990509#comment-14990509 ] Darren Govoni commented on SPARK-3789: -- I think reasons for this are dominated mostly by lack of

[jira] [Created] (SPARK-11508) Add Python API for repartition and sortWithinPartitions

2015-11-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-11508: --- Summary: Add Python API for repartition and sortWithinPartitions Key: SPARK-11508 URL: https://issues.apache.org/jira/browse/SPARK-11508 Project: Spark Issue

[jira] [Commented] (SPARK-10838) Repeat to join one DataFrame twice,there will be AnalysisException.

2015-11-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990544#comment-14990544 ] Xiao Li commented on SPARK-10838: - In 1.5.1, both failed with the same exception. Exception in thread

[jira] [Created] (SPARK-11509) ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script

2015-11-04 Thread Andrew Davidson (JIRA)
Andrew Davidson created SPARK-11509: --- Summary: ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script Key: SPARK-11509 URL:

[jira] [Updated] (SPARK-11459) Allow configuring checkpoint dir, filenames

2015-11-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11459: -- Priority: Minor (was: Major) What's the use case for this? you can already control the directory, but

[jira] [Commented] (SPARK-11459) Allow configuring checkpoint dir, filenames

2015-11-04 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990545#comment-14990545 ] Rekha Joshi commented on SPARK-11459: - imo [~rdub], Having UUID is usual practice to avoid collision,

[jira] [Created] (SPARK-11511) Creating an InputDStream but not using it throws NPE

2015-11-04 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-11511: Summary: Creating an InputDStream but not using it throws NPE Key: SPARK-11511 URL: https://issues.apache.org/jira/browse/SPARK-11511 Project: Spark Issue

[jira] [Comment Edited] (SPARK-5068) When the path not found in the hdfs,we can't get the result

2015-11-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990352#comment-14990352 ] Xiao Li edited comment on SPARK-5068 at 11/4/15 8:34 PM: - Now, the default value

[jira] [Commented] (SPARK-5068) When the path not found in the hdfs,we can't get the result

2015-11-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990352#comment-14990352 ] Xiao Li commented on SPARK-5068: Now, the default value of this feature is off. You can turn it on and do

[jira] [Resolved] (SPARK-11504) API audit for distributeBy and localSort

2015-11-04 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-11504. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9470

[jira] [Updated] (SPARK-11507) Error thrown when using BlockMatrix.add

2015-11-04 Thread Kareem Alhazred (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kareem Alhazred updated SPARK-11507: Description: In certain situations when adding two block matrices, I get an error

[jira] [Commented] (SPARK-3789) [GRAPHX] Python bindings for GraphX

2015-11-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990575#comment-14990575 ] Sean Owen commented on SPARK-3789: -- I'll be quiet now; I like my opinions but I dragged this off-topic. I

[jira] [Commented] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990576#comment-14990576 ] Apache Spark commented on SPARK-10788: -- User 'sethah' has created a pull request for this issue:

[jira] [Commented] (SPARK-10387) Code generation for decision tree

2015-11-04 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990492#comment-14990492 ] holdenk commented on SPARK-10387: - So for now I'm working on doing this with quasi quotes, but we should

[jira] [Assigned] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10788: Assignee: Apache Spark > Decision Tree duplicates bins for unordered categorical features

[jira] [Assigned] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10788: Assignee: (was: Apache Spark) > Decision Tree duplicates bins for unordered

[jira] [Resolved] (SPARK-11493) Remove Bitset in BytesToBytesMap

2015-11-04 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-11493. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9452

[jira] [Assigned] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11510: Assignee: Reynold Xin (was: Apache Spark) > Remove some SQL aggregation tests >

[jira] [Commented] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990615#comment-14990615 ] Apache Spark commented on SPARK-11510: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11511) Creating an InputDStream but not using it throws NPE

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11511: Assignee: (was: Apache Spark) > Creating an InputDStream but not using it throws NPE

[jira] [Commented] (SPARK-11511) Creating an InputDStream but not using it throws NPE

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990639#comment-14990639 ] Apache Spark commented on SPARK-11511: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11511) Creating an InputDStream but not using it throws NPE

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11511: Assignee: Apache Spark > Creating an InputDStream but not using it throws NPE >

[jira] [Commented] (SPARK-11509) ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script

2015-11-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990600#comment-14990600 ] Sean Owen commented on SPARK-11509: --- This ultimately means the initialization failed. In this situation

[jira] [Commented] (SPARK-7542) Support off-heap sort buffer in UnsafeExternalSorter

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990761#comment-14990761 ] Apache Spark commented on SPARK-7542: - User 'davies' has created a pull request for this issue:

[jira] [Commented] (SPARK-10387) Code generation for decision tree

2015-11-04 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990809#comment-14990809 ] holdenk commented on SPARK-10387: - Progress - although I'm a little uncertain of what the best API is for

[jira] [Commented] (SPARK-9722) Pass random seed to spark.ml DecisionTree*

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990866#comment-14990866 ] Joseph K. Bradley commented on SPARK-9722: -- [~yuu.ishik...@gmail.com] Thanks for the PR! Sorry I

[jira] [Created] (SPARK-11514) Pass random seed to spark.ml DecisionTree*

2015-11-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-11514: - Summary: Pass random seed to spark.ml DecisionTree* Key: SPARK-11514 URL: https://issues.apache.org/jira/browse/SPARK-11514 Project: Spark Issue

[jira] [Assigned] (SPARK-10371) Optimize sequential projections

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10371: Assignee: Apache Spark > Optimize sequential projections >

[jira] [Commented] (SPARK-10371) Optimize sequential projections

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990894#comment-14990894 ] Apache Spark commented on SPARK-10371: -- User 'nongli' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10371) Optimize sequential projections

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10371: Assignee: (was: Apache Spark) > Optimize sequential projections >

[jira] [Commented] (SPARK-10785) Scale QuantileDiscretizer using distributed binning

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990930#comment-14990930 ] Joseph K. Bradley commented on SPARK-10785: --- Yes, we should sample still. Extensions to

[jira] [Commented] (SPARK-10785) Scale QuantileDiscretizer using distributed binning

2015-11-04 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990948#comment-14990948 ] holdenk commented on SPARK-10785: - So looking at the tree work it looks like just did a grouByKey for

[jira] [Commented] (SPARK-11459) Allow configuring checkpoint dir, filenames

2015-11-04 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990676#comment-14990676 ] Ryan Williams commented on SPARK-11459: --- I'm mostly interested in saving RDDs to disk with

[jira] [Created] (SPARK-11512) Bucket Join

2015-11-04 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-11512: - Summary: Bucket Join Key: SPARK-11512 URL: https://issues.apache.org/jira/browse/SPARK-11512 Project: Spark Issue Type: Sub-task Components: SQL

[jira] [Commented] (SPARK-9465) Could not read parquet table after recreating it with the same table name

2015-11-04 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990890#comment-14990890 ] Xin Wu commented on SPARK-9465: --- I tried on both 1.5.1 and 1.6.0, I can not recreate the issue {code}

[jira] [Commented] (SPARK-10809) Single-document topicDistributions method for LocalLDAModel

2015-11-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990974#comment-14990974 ] yuhao yang commented on SPARK-10809: working on this. > Single-document topicDistributions method

[jira] [Resolved] (SPARK-10028) Add Python API for PrefixSpan

2015-11-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10028. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9469

[jira] [Assigned] (SPARK-7542) Support off-heap sort buffer in UnsafeExternalSorter

2015-11-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-7542: - Assignee: Davies Liu > Support off-heap sort buffer in UnsafeExternalSorter >

[jira] [Resolved] (SPARK-11307) Reduce memory consumption of OutputCommitCoordinator bookkeeping structures

2015-11-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11307. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9274

[jira] [Commented] (SPARK-9465) Could not read parquet table after recreating it with the same table name

2015-11-04 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990909#comment-14990909 ] Xin Wu commented on SPARK-9465: --- I can not recreate the issue on 1.5.1 or 1.6.0.. {code} scala>

[jira] [Commented] (SPARK-11509) ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script

2015-11-04 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990746#comment-14990746 ] Andrew Davidson commented on SPARK-11509: - I forgot to mentioned. on my cluster master I was able

[jira] [Resolved] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-11510. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9475

[jira] [Assigned] (SPARK-11513) Remove the internal implicit conversion from LogicalPlan to DataFrame

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11513: Assignee: Reynold Xin (was: Apache Spark) > Remove the internal implicit conversion from

[jira] [Resolved] (SPARK-11491) Use Scala 2.10.5

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-11491. - Resolution: Fixed Fix Version/s: 1.6.0 > Use Scala 2.10.5 > > >

[jira] [Created] (SPARK-11516) Spark application cannot be found from JSON API even though it exists

2015-11-04 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-11516: -- Summary: Spark application cannot be found from JSON API even though it exists Key: SPARK-11516 URL: https://issues.apache.org/jira/browse/SPARK-11516 Project: Spark

[jira] [Assigned] (SPARK-10809) Single-document topicDistributions method for LocalLDAModel

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10809: Assignee: (was: Apache Spark) > Single-document topicDistributions method for

[jira] [Commented] (SPARK-10809) Single-document topicDistributions method for LocalLDAModel

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990975#comment-14990975 ] Apache Spark commented on SPARK-10809: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10809) Single-document topicDistributions method for LocalLDAModel

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10809: Assignee: Apache Spark > Single-document topicDistributions method for LocalLDAModel >

[jira] [Commented] (SPARK-6521) executors in the same node read local shuffle file

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990791#comment-14990791 ] Apache Spark commented on SPARK-6521: - User 'maropu' has created a pull request for this issue:

[jira] [Resolved] (SPARK-6001) K-Means clusterer should return the assignments of input points to clusters

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-6001. -- Resolution: Fixed Assignee: Yu Ishikawa Fix Version/s: 1.5.0 Yep,

[jira] [Commented] (SPARK-11512) Bucket Join

2015-11-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990861#comment-14990861 ] Marcelo Vanzin commented on SPARK-11512: Isn't this the same as in SPARK-5292? > Bucket Join >

[jira] [Resolved] (SPARK-11398) unnecessary def dialectClassName in HiveContext, and misleading dialect conf at the start of spark-sql

2015-11-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11398. Resolution: Fixed Fix Version/s: 1.6.0 > unnecessary def dialectClassName in HiveContext,

[jira] [Issue Comment Deleted] (SPARK-9465) Could not read parquet table after recreating it with the same table name

2015-11-04 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Wu updated SPARK-9465: -- Comment: was deleted (was: I can not recreate the issue on 1.5.1 or 1.6.0.. {code} scala>

[jira] [Commented] (SPARK-11509) ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script

2015-11-04 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990742#comment-14990742 ] Andrew Davidson commented on SPARK-11509: - yes , it appears the show stopper issue I am facing is

[jira] [Commented] (SPARK-10648) Spark-SQL JDBC fails to set a default precision and scale when they are not defined in an oracle schema.

2015-11-04 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990785#comment-14990785 ] Yin Huai commented on SPARK-10648: -- https://github.com/apache/spark/pull/8780#issuecomment-145598968 and

[jira] [Commented] (SPARK-7425) spark.ml Predictor should support other numeric types for label

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990917#comment-14990917 ] Joseph K. Bradley commented on SPARK-7425: -- The VectorUDT usage for features should be a separate

[jira] [Commented] (SPARK-10745) Separate configs between shuffle and RPC

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990926#comment-14990926 ] Apache Spark commented on SPARK-10745: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10745) Separate configs between shuffle and RPC

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10745: Assignee: Apache Spark > Separate configs between shuffle and RPC >

[jira] [Assigned] (SPARK-10745) Separate configs between shuffle and RPC

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10745: Assignee: (was: Apache Spark) > Separate configs between shuffle and RPC >

[jira] [Created] (SPARK-11515) QuantileDiscretizer should take random seed

2015-11-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-11515: - Summary: QuantileDiscretizer should take random seed Key: SPARK-11515 URL: https://issues.apache.org/jira/browse/SPARK-11515 Project: Spark Issue

[jira] [Commented] (SPARK-11453) append data to partitioned table will messes up the result

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990951#comment-14990951 ] Apache Spark commented on SPARK-11453: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-11517) Calc partitions in parallel for multiple partitions table

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990962#comment-14990962 ] Apache Spark commented on SPARK-11517: -- User 'zhichao-li' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11517) Calc partitions in parallel for multiple partitions table

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11517: Assignee: (was: Apache Spark) > Calc partitions in parallel for multiple partitions

[jira] [Created] (SPARK-11517) Calc partitions in parallel for multiple partitions table

2015-11-04 Thread zhichao-li (JIRA)
zhichao-li created SPARK-11517: -- Summary: Calc partitions in parallel for multiple partitions table Key: SPARK-11517 URL: https://issues.apache.org/jira/browse/SPARK-11517 Project: Spark Issue

[jira] [Assigned] (SPARK-11517) Calc partitions in parallel for multiple partitions table

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11517: Assignee: Apache Spark > Calc partitions in parallel for multiple partitions table >

[jira] [Commented] (SPARK-11499) Spark History Server UI should respect protocol when doing redirection

2015-11-04 Thread Lukasz Jastrzebski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990674#comment-14990674 ] Lukasz Jastrzebski commented on SPARK-11499: There is also

[jira] [Commented] (SPARK-11103) Parquet filters push-down may cause exception when schema merging is turned on

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990793#comment-14990793 ] Reynold Xin commented on SPARK-11103: - I think this was included in 1.5.2 > Parquet filters

[jira] [Commented] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990795#comment-14990795 ] Reynold Xin commented on SPARK-11303: - This made it into 1.5.2. > sample (without replacement) +

[jira] [Updated] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-11303: Fix Version/s: 1.5.2 > sample (without replacement) + filter returns wrong results in DataFrame >

[jira] [Closed] (SPARK-7332) RpcCallContext.sender has a different name from the original sender's name

2015-11-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu closed SPARK-7332. --- Resolution: Won't Fix They are internal APIs and not exposed to the user. > RpcCallContext.sender

[jira] [Commented] (SPARK-9722) Pass random seed to spark.ml RandomForest findSplitsBins

2015-11-04 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990877#comment-14990877 ] Yu Ishikawa commented on SPARK-9722: [~josephkb] I'll add a seed Param to {{DecisionTreeClassifier}}

[jira] [Created] (SPARK-11513) Remove the internal implicit conversion from LogicalPlan to DataFrame

2015-11-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-11513: --- Summary: Remove the internal implicit conversion from LogicalPlan to DataFrame Key: SPARK-11513 URL: https://issues.apache.org/jira/browse/SPARK-11513 Project: Spark

[jira] [Commented] (SPARK-10309) Some tasks failed with Unable to acquire memory

2015-11-04 Thread Abhishek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990851#comment-14990851 ] Abhishek commented on SPARK-10309: -- Is there any work around for this issue. We migrated from 1.1 to 1.5

[jira] [Assigned] (SPARK-11513) Remove the internal implicit conversion from LogicalPlan to DataFrame

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11513: Assignee: Apache Spark (was: Reynold Xin) > Remove the internal implicit conversion from

[jira] [Commented] (SPARK-11513) Remove the internal implicit conversion from LogicalPlan to DataFrame

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990853#comment-14990853 ] Apache Spark commented on SPARK-11513: -- User 'rxin' has created a pull request for this issue:

[jira] [Commented] (SPARK-11512) Bucket Join

2015-11-04 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990867#comment-14990867 ] Cheng Hao commented on SPARK-11512: --- Oh, yes, but SPARK-5292 is only about to support the Hive bucket,

[jira] [Updated] (SPARK-9722) Pass random seed to spark.ml RandomForest findSplitsBins

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9722: - Summary: Pass random seed to spark.ml RandomForest findSplitsBins (was: Pass random seed

[jira] [Commented] (SPARK-11512) Bucket Join

2015-11-04 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990868#comment-14990868 ] Cheng Hao commented on SPARK-11512: --- We need to support the "bucket" for DataSource API. > Bucket Join

[jira] [Commented] (SPARK-9722) Pass random seed to spark.ml RandomForest findSplitsBins

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990899#comment-14990899 ] Joseph K. Bradley commented on SPARK-9722: -- Great, thank you! > Pass random seed to spark.ml

[jira] [Created] (SPARK-11522) input_file_name() returns "" for external tables

2015-11-04 Thread Simeon Simeonov (JIRA)
Simeon Simeonov created SPARK-11522: --- Summary: input_file_name() returns "" for external tables Key: SPARK-11522 URL: https://issues.apache.org/jira/browse/SPARK-11522 Project: Spark Issue

[jira] [Commented] (SPARK-2533) Show summary of locality level of completed tasks in the each stage page of web UI

2015-11-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14991197#comment-14991197 ] Jean-Baptiste Onofré commented on SPARK-2533: - New clean PR. > Show summary of locality level

[jira] [Created] (SPARK-11518) The script spark-submit.cmd can not handle spark directory with space.

2015-11-04 Thread Cele Liu (JIRA)
Cele Liu created SPARK-11518: Summary: The script spark-submit.cmd can not handle spark directory with space. Key: SPARK-11518 URL: https://issues.apache.org/jira/browse/SPARK-11518 Project: Spark

[jira] [Commented] (SPARK-9273) Add Convolutional Neural network to Spark MLlib

2015-11-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14991018#comment-14991018 ] yuhao yang commented on SPARK-9273: --- Hi [~avulanov]. I've refactored the CNN in

[jira] [Commented] (SPARK-11193) Spark 1.5+ Kinesis Streaming - ClassCastException when starting KinesisReceiver

2015-11-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-11193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14991160#comment-14991160 ] Jean-Baptiste Onofré commented on SPARK-11193: -- Hi Phil, I'm testing a fix on Kryo right

[jira] [Created] (SPARK-11523) spark_partition_id() considered invalid function

2015-11-04 Thread Simeon Simeonov (JIRA)
Simeon Simeonov created SPARK-11523: --- Summary: spark_partition_id() considered invalid function Key: SPARK-11523 URL: https://issues.apache.org/jira/browse/SPARK-11523 Project: Spark Issue

[jira] [Created] (SPARK-11519) Spark MemoryStore with hadoop SequenceFile cache the values is same record.

2015-11-04 Thread xukaiqiang (JIRA)
xukaiqiang created SPARK-11519: -- Summary: Spark MemoryStore with hadoop SequenceFile cache the values is same record. Key: SPARK-11519 URL: https://issues.apache.org/jira/browse/SPARK-11519 Project:

[jira] [Commented] (SPARK-10838) Repeat to join one DataFrame twice,there will be AnalysisException.

2015-11-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14991108#comment-14991108 ] Xiao Li commented on SPARK-10838: - The fix is ready. Writing unit test cases now. > Repeat to join one

[jira] [Commented] (SPARK-2533) Show summary of locality level of completed tasks in the each stage page of web UI

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14991187#comment-14991187 ] Apache Spark commented on SPARK-2533: - User 'jbonofre' has created a pull request for this issue:

  1   2   3   >