[jira] [Assigned] (SPARK-14019) Remove noop SortOrder in Sort

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14019: Assignee: (was: Apache Spark) > Remove noop SortOrder in Sort >

[jira] [Commented] (SPARK-14019) Remove noop SortOrder in Sort

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202599#comment-15202599 ] Apache Spark commented on SPARK-14019: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14019) Remove noop SortOrder in Sort

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14019: Assignee: Apache Spark > Remove noop SortOrder in Sort > - >

[jira] [Assigned] (SPARK-13897) GroupedData vs GroupedDataset naming is confusing

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13897: Assignee: Reynold Xin (was: Apache Spark) > GroupedData vs GroupedDataset naming is

[jira] [Commented] (SPARK-13897) GroupedData vs GroupedDataset naming is confusing

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202603#comment-15202603 ] Apache Spark commented on SPARK-13897: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13897) GroupedData vs GroupedDataset naming is confusing

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13897: Assignee: Apache Spark (was: Reynold Xin) > GroupedData vs GroupedDataset naming is

[jira] [Assigned] (SPARK-14018) BenchmarkWholeStageCodegen should accept 64-bit num records

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14018: Assignee: Reynold Xin (was: Apache Spark) > BenchmarkWholeStageCodegen should accept

[jira] [Commented] (SPARK-13908) Limit not pushed down

2016-03-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202604#comment-15202604 ] Liang-Chi Hsieh commented on SPARK-13908: - Rethinking this issue, I think it should not be related to

[jira] [Created] (SPARK-14019) Remove noop SortOrder in Sort

2016-03-19 Thread Xiao Li (JIRA)
Xiao Li created SPARK-14019: --- Summary: Remove noop SortOrder in Sort Key: SPARK-14019 URL: https://issues.apache.org/jira/browse/SPARK-14019 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-13968) Use MurmurHash3 for hashing String features

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202590#comment-15202590 ] Nick Pentreath commented on SPARK-13968: Ah I didn't pick up the old ticket, thanks. On Fri, 18

[jira] [Updated] (SPARK-13941) kafka.cluster.BrokerEndPoint cannot be cast to kafka.cluster.Broker

2016-03-19 Thread Hurshal Patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hurshal Patel updated SPARK-13941: -- Description: I am connecting to a Kafka cluster with the following (anonymized) code: {code}

[jira] [Assigned] (SPARK-13579) Stop building assemblies for Spark

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13579: Assignee: Apache Spark > Stop building assemblies for Spark >

[jira] [Commented] (SPARK-13821) TPC-DS Query 20 fails to compile

2016-03-19 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201828#comment-15201828 ] Dilip Biswal commented on SPARK-13821: -- [~roycecil] Thanks Roy !! > TPC-DS Query 20 fails to

[jira] [Commented] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201859#comment-15201859 ] Yin Huai commented on SPARK-14006: -- cc [~shivaram] > Builds of 1.6 branch fail R style check >

[jira] [Resolved] (SPARK-13403) HiveConf used for SparkSQL is not based on the Hadoop configuration

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13403. - Resolution: Fixed Assignee: Ryan Blue Fix Version/s: 2.0.0 > HiveConf used for

[jira] [Created] (SPARK-13958) Executor OOM due to unbounded growth of pointer array in Sorter

2016-03-19 Thread Sital Kedia (JIRA)
Sital Kedia created SPARK-13958: --- Summary: Executor OOM due to unbounded growth of pointer array in Sorter Key: SPARK-13958 URL: https://issues.apache.org/jira/browse/SPARK-13958 Project: Spark

[jira] [Commented] (SPARK-13862) TPCDS query 49 returns wrong results compared to TPC official result set

2016-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198585#comment-15198585 ] Xiao Li commented on SPARK-13862: - I will also take this. Thanks! > TPCDS query 49 returns wrong results

[jira] [Commented] (SPARK-13986) Make `DeveloperApi`-annotated things public

2016-03-19 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200525#comment-15200525 ] Timothy Hunter commented on SPARK-13986: [~dongjoon] how did you find the conflicting annotation?

[jira] [Issue Comment Deleted] (SPARK-13041) Add a driver history ui link and a mesos sandbox link on the dispatcher's ui page for each driver

2016-03-19 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stavros Kontopoulos updated SPARK-13041: Comment: was deleted (was: There is a requirement for: "history server links in

[jira] [Comment Edited] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2016-03-19 Thread Hiten Patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198556#comment-15198556 ] Hiten Patel edited comment on SPARK-5594 at 3/17/16 1:31 AM: - Yes, this was

[jira] [Assigned] (SPARK-14021) Support custom context derived from HiveContext for SparkSQLEnv

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14021: Assignee: (was: Apache Spark) > Support custom context derived from HiveContext for

[jira] [Resolved] (SPARK-13827) Can't add subquery to an operator with same-name outputs while generate SQL string

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-13827. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11658

[jira] [Commented] (SPARK-13952) spark.ml GBT algs need to use random seed

2016-03-19 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201703#comment-15201703 ] Seth Hendrickson commented on SPARK-13952: -- Yes, I can work on it. > spark.ml GBT algs need to

[jira] [Resolved] (SPARK-13954) spar-shell starts with exceptions

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13954. --- Resolution: Not A Problem Please read

[jira] [Commented] (SPARK-913) log the size of each shuffle block in block manager

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201280#comment-15201280 ] Apache Spark commented on SPARK-913: User 'devaraj-kavali' has created a pull request for this issue:

[jira] [Commented] (SPARK-13979) Killed executor is respawned without AWS keys in standalone spark cluster

2016-03-19 Thread Mitesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200183#comment-15200183 ] Mitesh commented on SPARK-13979: I'm seeing this too. It's really annoying because I set the s3 access and

[jira] [Updated] (SPARK-14021) Support custom context derived from HiveContext for SparkSQLEnv

2016-03-19 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrian Wang updated SPARK-14021: Description: This is to create a custom context for command bin/spark-sql and

[jira] [Resolved] (SPARK-13948) MiMa Check should catch if the visibility change to `private`

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13948. - Resolution: Fixed Fix Version/s: 2.0.0 > MiMa Check should catch if the visibility

[jira] [Assigned] (SPARK-14021) Support custom context derived from HiveContext for SparkSQLEnv

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14021: Assignee: Apache Spark > Support custom context derived from HiveContext for SparkSQLEnv

[jira] [Commented] (SPARK-14005) Make RDD more compatible with Scala's collection

2016-03-19 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202658#comment-15202658 ] zhengruifeng commented on SPARK-14005: -- I think easiness to implement should not be the reason to

[jira] [Commented] (SPARK-3249) Fix links in ScalaDoc that cause warning messages in `sbt/sbt unidoc`

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201270#comment-15201270 ] Sean Owen commented on SPARK-3249: -- It's definitely still an issue. I remember trying to fix this and

[jira] [Assigned] (SPARK-14000) case class with a tuple field can't work in Dataset

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14000: Assignee: (was: Apache Spark) > case class with a tuple field can't work in Dataset >

[jira] [Commented] (SPARK-13951) PySpark ml.pipeline support export/import - nested Piplines

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202319#comment-15202319 ] Apache Spark commented on SPARK-13951: -- User 'yinxusen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13970) Add Non-Negative Matrix Factorization to MLlib

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13970: Assignee: (was: Apache Spark) > Add Non-Negative Matrix Factorization to MLlib >

[jira] [Updated] (SPARK-13948) MiMa Check should catch if the visibility change to `private`

2016-03-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-13948: --- Assignee: (was: Josh Rosen) > MiMa Check should catch if the visibility change to

[jira] [Comment Edited] (SPARK-13986) Make `DeveloperApi`-annotated things public

2016-03-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200559#comment-15200559 ] Dongjoon Hyun edited comment on SPARK-13986 at 3/17/16 10:53 PM: - Oh,

[jira] [Created] (SPARK-14021) Support custom context derived from HiveContext for SparkSQLEnv

2016-03-19 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-14021: --- Summary: Support custom context derived from HiveContext for SparkSQLEnv Key: SPARK-14021 URL: https://issues.apache.org/jira/browse/SPARK-14021 Project: Spark

[jira] [Resolved] (SPARK-13838) Clear variable code to prevent it to be re-evaluated in BoundAttribute

2016-03-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13838. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11674

[jira] [Commented] (SPARK-13969) Extend input format that feature hashing can handle

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201257#comment-15201257 ] Nick Pentreath commented on SPARK-13969: What I have in mind is something like the following:

[jira] [Commented] (SPARK-12719) SQL generation support for generators (including UDTF)

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197975#comment-15197975 ] Apache Spark commented on SPARK-12719: -- User 'yhuai' has created a pull request for this issue:

[jira] [Commented] (SPARK-13863) TPCDS query 66 returns wrong results compared to TPC official result set

2016-03-19 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200149#comment-15200149 ] JESSE CHEN commented on SPARK-13863: Going to validate this also on my cluster. Nice find. > TPCDS

[jira] [Commented] (SPARK-13958) Executor OOM due to unbounded growth of pointer array in Sorter

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200360#comment-15200360 ] Apache Spark commented on SPARK-13958: -- User 'sitalkedia' has created a pull request for this issue:

[jira] [Commented] (SPARK-13987) Build fails due to scala version mismatch between

2016-03-19 Thread Jean-Baptiste Onofré (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200539#comment-15200539 ] Jean-Baptiste Onofré commented on SPARK-13987: -- Nevermind, it works fine following the

[jira] [Created] (SPARK-13963) Add binary toggle Param to ml.HashingTF

2016-03-19 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-13963: -- Summary: Add binary toggle Param to ml.HashingTF Key: SPARK-13963 URL: https://issues.apache.org/jira/browse/SPARK-13963 Project: Spark Issue Type: New

[jira] [Assigned] (SPARK-13965) Driver should kill the other running task attempts if any one task attempt succeeds for the same task

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13965: Assignee: Apache Spark > Driver should kill the other running task attempts if any one

[jira] [Updated] (SPARK-13942) Remove Shark-related docs for 2.x

2016-03-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-13942: -- Description: `Shark` was merged into `Spark SQL` since [July

[jira] [Commented] (SPARK-13862) TPCDS query 49 returns wrong results compared to TPC official result set

2016-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198595#comment-15198595 ] Xiao Li commented on SPARK-13862: - Nope. The PR is still open. https://github.com/apache/spark/pull/10731

[jira] [Resolved] (SPARK-12719) SQL generation support for generators (including UDTF)

2016-03-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-12719. - Resolution: Fixed > SQL generation support for generators (including UDTF) >

[jira] [Commented] (SPARK-13984) Schema verification always fail when using remote Hive metastore

2016-03-19 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202191#comment-15202191 ] Rekha Joshi commented on SPARK-13984: - Hi [~Jianfeng Hu] Implicitly modifying schema is disabled by

[jira] [Assigned] (SPARK-13977) Bring back ShuffledHashJoin

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13977: Assignee: Davies Liu (was: Apache Spark) > Bring back ShuffledHashJoin >

[jira] [Commented] (SPARK-13934) SqlParser.parseTableIdentifier cannot recognize table name start with scientific notation

2016-03-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197285#comment-15197285 ] Herman van Hovell commented on SPARK-13934: --- We overhauled much of the parser infrastructure

[jira] [Assigned] (SPARK-13951) PySpark ml.pipeline support export/import - nested Piplines

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13951: Assignee: (was: Apache Spark) > PySpark ml.pipeline support export/import - nested

[jira] [Issue Comment Deleted] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-19 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated SPARK-14006: Comment: was deleted (was: pushing a pull request in few mins.thanks!) > Builds of 1.6 branch

[jira] [Commented] (SPARK-14004) AttributeReference and Alias should only use their first qualifier to build SQL representations

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201313#comment-15201313 ] Apache Spark commented on SPARK-14004: -- User 'liancheng' has created a pull request for this issue:

[jira] [Closed] (SPARK-13987) Build fails due to scala version mismatch between

2016-03-19 Thread Jean-Baptiste Onofré (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jean-Baptiste Onofré closed SPARK-13987. Resolution: Not A Problem > Build fails due to scala version mismatch between >

[jira] [Assigned] (SPARK-12183) Remove spark.mllib tree, forest implementations and use spark.ml

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-12183: - Assignee: Joseph K. Bradley > Remove spark.mllib tree, forest implementations

[jira] [Updated] (SPARK-13935) Other clients' connection hang up when someone do huge load

2016-03-19 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Wang updated SPARK-13935: - Affects Version/s: 1.6.0 1.6.1 > Other clients' connection hang up when someone

[jira] [Commented] (SPARK-13972) hive tests should fail if SQL generation failed

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201820#comment-15201820 ] Apache Spark commented on SPARK-13972: -- User 'yhuai' has created a pull request for this issue:

[jira] [Updated] (SPARK-13986) Remove `DeveloperApi`-annotation for non-publics

2016-03-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-13986: -- Summary: Remove `DeveloperApi`-annotation for non-publics (was: Make `DeveloperApi`-annotated

[jira] [Created] (SPARK-13982) SparkR - KMeans predict: Output column name of features is an unclear, automatic genetared text

2016-03-19 Thread Narine Kokhlikyan (JIRA)
Narine Kokhlikyan created SPARK-13982: - Summary: SparkR - KMeans predict: Output column name of features is an unclear, automatic genetared text Key: SPARK-13982 URL:

[jira] [Comment Edited] (SPARK-13955) Spark in yarn mode fails

2016-03-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199927#comment-15199927 ] Marcelo Vanzin edited comment on SPARK-13955 at 3/17/16 4:59 PM: - How did

[jira] [Commented] (SPARK-13955) Spark in yarn mode fails

2016-03-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199927#comment-15199927 ] Marcelo Vanzin commented on SPARK-13955: How did you build Spark? Did you do "mvn package" or

[jira] [Created] (SPARK-13970) Add Non-Negative Matrix Factorization to MLlib

2016-03-19 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13970: Summary: Add Non-Negative Matrix Factorization to MLlib Key: SPARK-13970 URL: https://issues.apache.org/jira/browse/SPARK-13970 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12148) SparkR: rename DataFrame to SparkDataFrame

2016-03-19 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200940#comment-15200940 ] Sun Rui commented on SPARK-12148: - An issue reported in the Spark user list may be related to this naming

[jira] [Created] (SPARK-14001) support multi-children Union in SQLBuilder

2016-03-19 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-14001: --- Summary: support multi-children Union in SQLBuilder Key: SPARK-14001 URL: https://issues.apache.org/jira/browse/SPARK-14001 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-13458) Datasets cannot be sorted

2016-03-19 Thread Oliver Beattie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201112#comment-15201112 ] Oliver Beattie commented on SPARK-13458: Yes, it looks like these methods were added a week ago

[jira] [Resolved] (SPARK-13427) Support USING clause in JOIN

2016-03-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-13427. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11297

[jira] [Created] (SPARK-14004) AttributeReference and Alias should only use their first qualifier to build SQL representations

2016-03-19 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-14004: -- Summary: AttributeReference and Alias should only use their first qualifier to build SQL representations Key: SPARK-14004 URL: https://issues.apache.org/jira/browse/SPARK-14004

[jira] [Assigned] (SPARK-13996) Add more not null attributes for Filter codegen

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13996: Assignee: Apache Spark > Add more not null attributes for Filter codegen >

[jira] [Commented] (SPARK-13865) TPCDS query 87 returns wrong results compared to TPC official result set

2016-03-19 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199136#comment-15199136 ] Dilip Biswal commented on SPARK-13865: -- [~smilegator] Quick update on this .. This also seems

[jira] [Updated] (SPARK-13955) Spark in yarn mode fails

2016-03-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-13955: --- Description: I ran spark-shell in yarn client, but from the logs seems the spark assembly jar is

[jira] [Commented] (SPARK-13877) Consider removing Kafka modules from Spark / Spark Streaming

2016-03-19 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200188#comment-15200188 ] Mark Grover commented on SPARK-13877: - Yeah, that totally makes sense. I agree that it's a big change

[jira] [Resolved] (SPARK-13915) Allow bin/spark-submit to be called via symbolic link

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13915. --- Resolution: Not A Problem > Allow bin/spark-submit to be called via symbolic link >

[jira] [Assigned] (SPARK-13981) Improve Filter generated code to defer variable evaluation within operator

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13981: Assignee: Apache Spark > Improve Filter generated code to defer variable evaluation

[jira] [Assigned] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14006: Assignee: (was: Apache Spark) > Builds of 1.6 branch fail R style check >

[jira] [Updated] (SPARK-13989) Remove non-vectorized/unsafe-row parquet record reader

2016-03-19 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-13989: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-14008 > Remove

[jira] [Commented] (SPARK-13313) Strongly connected components doesn't find all strongly connected components

2016-03-19 Thread Petar Zecevic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200028#comment-15200028 ] Petar Zecevic commented on SPARK-13313: --- Ok, thanks for reporting. I'll look into this. >

[jira] [Assigned] (SPARK-13289) Word2Vec generate infinite distances when numIterations>5

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13289: Assignee: (was: Apache Spark) > Word2Vec generate infinite distances when

[jira] [Commented] (SPARK-13932) CUBE Query with filter (HAVING) and condition (IF) raises an AnalysisException

2016-03-19 Thread Tien-Dung LE (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199689#comment-15199689 ] Tien-Dung LE commented on SPARK-13932: -- The error is still there in the latest spark code version

[jira] [Updated] (SPARK-14003) Multi-session can not work when one session is moving files for "INSERT ... SELECT" clause

2016-03-19 Thread Weizhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weizhong updated SPARK-14003: - Summary: Multi-session can not work when one session is moving files for "INSERT ... SELECT" clause

[jira] [Commented] (SPARK-3308) Ability to read JSON Arrays as tables

2016-03-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198543#comment-15198543 ] Hyukjin Kwon commented on SPARK-3308: - I removed the PR link,

[jira] [Assigned] (SPARK-13858) TPCDS query 21 returns wrong results compared to TPC official result set

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13858: Assignee: (was: Apache Spark) > TPCDS query 21 returns wrong results compared to TPC

[jira] [Commented] (SPARK-13963) Add binary toggle Param to ml.HashingTF

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200047#comment-15200047 ] Nick Pentreath commented on SPARK-13963: Sure, assigned to you. > Add binary toggle Param to

[jira] [Updated] (SPARK-13969) Extend input format that feature hashing can handle

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-13969: --- Description: Currently {{HashingTF}} works like {{CountVectorizer}} (the equivalent in

[jira] [Created] (SPARK-13957) Support group by position in SQL

2016-03-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-13957: --- Summary: Support group by position in SQL Key: SPARK-13957 URL: https://issues.apache.org/jira/browse/SPARK-13957 Project: Spark Issue Type: Improvement

[jira] [Comment Edited] (SPARK-13969) Extend input format that feature hashing can handle

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202007#comment-15202007 ] Joseph K. Bradley edited comment on SPARK-13969 at 3/18/16 7:35 PM: I

[jira] [Commented] (SPARK-13831) TPC-DS Query 35 fails with the following compile error

2016-03-19 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197944#comment-15197944 ] kevin yu commented on SPARK-13831: -- The same query will fail on Spark SQL 2.0. And the failure can

[jira] [Resolved] (SPARK-11011) UserDefinedType serialization should be strongly typed

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11011. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11379

[jira] [Updated] (SPARK-13978) [GSoC 2016] Build monitoring UI and infrastructure for Spark SQL and structured streaming

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13978: - Labels: GSOC2016 mentor (was: GSOC2016) > [GSoC 2016] Build monitoring UI and infrastructure for Spark

[jira] [Commented] (SPARK-13938) word2phrase feature created in ML

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197890#comment-15197890 ] Apache Spark commented on SPARK-13938: -- User 's4weng' has created a pull request for this issue:

[jira] [Commented] (SPARK-13997) Use Hadoop 2.0 default value for compression in data sources

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200984#comment-15200984 ] Apache Spark commented on SPARK-13997: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Updated] (SPARK-12789) Support order by position in SQL

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12789: Description: This is to support order by position in SQL, e.g. {noformat} select c1, c2, c3 from

[jira] [Created] (SPARK-13941) kafka.cluster.BrokerEndPoint cannot be cast to kafka.cluster.Broker

2016-03-19 Thread Hurshal Patel (JIRA)
Hurshal Patel created SPARK-13941: - Summary: kafka.cluster.BrokerEndPoint cannot be cast to kafka.cluster.Broker Key: SPARK-13941 URL: https://issues.apache.org/jira/browse/SPARK-13941 Project: Spark

[jira] [Created] (SPARK-14002) SQLBuilder should add subquery to Aggregate child when necessary

2016-03-19 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-14002: -- Summary: SQLBuilder should add subquery to Aggregate child when necessary Key: SPARK-14002 URL: https://issues.apache.org/jira/browse/SPARK-14002 Project: Spark

[jira] [Commented] (SPARK-13996) Add more not null attributes for Filter codegen

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200946#comment-15200946 ] Apache Spark commented on SPARK-13996: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-13923) Implement SessionCatalog to manage temp functions and tables

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198529#comment-15198529 ] Yin Huai commented on SPARK-13923: -- Please note that the temp function part of the pr is a placeholder.

[jira] [Commented] (SPARK-13934) SqlParser.parseTableIdentifier cannot recognize table name start with scientific notation

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197241#comment-15197241 ] Sean Owen commented on SPARK-13934: --- What does scientific notation have to do with it? it's a table

[jira] [Assigned] (SPARK-13973) `ipython notebook` is going away...

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13973: Assignee: Apache Spark > `ipython notebook` is going away... >

[jira] [Assigned] (SPARK-13950) Generate code for sort merge left/right outer join

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13950: Assignee: Apache Spark (was: Davies Liu) > Generate code for sort merge left/right outer

[jira] [Commented] (SPARK-13963) Add binary toggle Param to ml.HashingTF

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200048#comment-15200048 ] Nick Pentreath commented on SPARK-13963: Sure, assigned to you. > Add binary toggle Param to
