[jira] [Created] (SPARK-15634) SQL repl is bricked if a function is registered with a non-existent jar

2016-05-27 Thread Eric Liang (JIRA)
Eric Liang created SPARK-15634: -- Summary: SQL repl is bricked if a function is registered with a non-existent jar Key: SPARK-15634 URL: https://issues.apache.org/jira/browse/SPARK-15634 Project: Spark

[jira] [Updated] (SPARK-15623) 2.0 python coverage ml.feature

2016-05-27 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15623: - Summary: 2.0 python coverage ml.feature (was: 2.0 python converage ml.feature) > 2.0 python cov

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-05-27 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304869#comment-15304869 ] Benjamin Fradet commented on SPARK-15581: - [~josephkb] Just out of curiosity: I d

[jira] [Updated] (SPARK-15633) Make package name for Java tests consistent

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15633: Summary: Make package name for Java tests consistent (was: Make package name for Java 8 tests cons

[jira] [Assigned] (SPARK-15633) Make package name for Java 8 tests consistent

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15633: Assignee: Reynold Xin (was: Apache Spark) > Make package name for Java 8 tests consistent

[jira] [Commented] (SPARK-15633) Make package name for Java 8 tests consistent

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304865#comment-15304865 ] Apache Spark commented on SPARK-15633: -- User 'rxin' has created a pull request for t

[jira] [Assigned] (SPARK-15633) Make package name for Java 8 tests consistent

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15633: Assignee: Apache Spark (was: Reynold Xin) > Make package name for Java 8 tests consistent

[jira] [Updated] (SPARK-15633) Make package name for Java 8 tests consistent

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15633: Priority: Minor (was: Major) > Make package name for Java 8 tests consistent > ---

[jira] [Created] (SPARK-15633) Make package name for Java 8 tests consistent

2016-05-27 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15633: --- Summary: Make package name for Java 8 tests consistent Key: SPARK-15633 URL: https://issues.apache.org/jira/browse/SPARK-15633 Project: Spark Issue Type: Sub-t

[jira] [Created] (SPARK-15632) Dataset typed filter operation changes query plan schema

2016-05-27 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-15632: -- Summary: Dataset typed filter operation changes query plan schema Key: SPARK-15632 URL: https://issues.apache.org/jira/browse/SPARK-15632 Project: Spark Issue Ty

[jira] [Updated] (SPARK-15441) dataset outer join seems to return incorrect result

2016-05-27 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-15441: Issue Type: Sub-task (was: Bug) Parent: SPARK-15631 > dataset outer join seems to return i

[jira] [Updated] (SPARK-15112) Dataset filter returns garbage

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15112: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-15631 > Dataset filter returns garbage > -

[jira] [Updated] (SPARK-15550) Dataset.show() doesn't disply inner nested structs properly

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15550: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-15631 > Dataset.show() doesn't disply inner ne

[jira] [Updated] (SPARK-15547) Encoder validation is too strict for inner nested structs

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15547: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-15631 > Encoder validation is too strict for i

[jira] [Created] (SPARK-15631) Dataset and encoder bug fixes

2016-05-27 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-15631: -- Summary: Dataset and encoder bug fixes Key: SPARK-15631 URL: https://issues.apache.org/jira/browse/SPARK-15631 Project: Spark Issue Type: Bug Component

[jira] [Updated] (SPARK-15604) Spark-SQL: Get com.esotericsoftware.kryo.KryoException: java.lang.NullPointerException when runing query_1.sql of TPC-DS

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15604: Description: When I run query_1.sql of TPC-DS like: bin/spark-sql --master spark://192.168.30.78:70

[jira] [Updated] (SPARK-15604) Spark-SQL: Get com.esotericsoftware.kryo.KryoException: java.lang.NullPointerException when runing query_1.sql of TPC-DS

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15604: Affects Version/s: (was: 2.1.0) > Spark-SQL: Get com.esotericsoftware.kryo.KryoException: > ja

[jira] [Updated] (SPARK-15604) Spark-SQL: Get com.esotericsoftware.kryo.KryoException: java.lang.NullPointerException when runing query_1.sql of TPC-DS

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15604: Target Version/s: 2.0.0 > Spark-SQL: Get com.esotericsoftware.kryo.KryoException: > java.lang.Null

[jira] [Comment Edited] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304782#comment-15304782 ] Dilip Biswal edited comment on SPARK-15557 at 5/27/16 9:09 PM:

[jira] [Commented] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304782#comment-15304782 ] Dilip Biswal commented on SPARK-15557: -- I am looking into this issue. > expression

[jira] [Commented] (SPARK-15509) R MLlib algorithms should support input columns "features" and "label"

2016-05-27 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304777#comment-15304777 ] Xin Ren commented on SPARK-15509: - Hi [~josephkb], I tried many times but cannot reproduc

[jira] [Resolved] (SPARK-15413) Change `toBreeze` to `asBreeze` in Vector and Matrix

2016-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-15413. --- Resolution: Fixed Issue resolved by pull request 13198 [https://github.com/apache/spa

[jira] [Updated] (SPARK-15623) 2.0 python converage ml.feature

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-15623: Component/s: PySpark ML > 2.0 python converage ml.feature > --

[jira] [Commented] (SPARK-15625) 2.0 python converage ml.classification module

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304734#comment-15304734 ] holdenk commented on SPARK-15625: - This audit is complete (outstanding PRs and issue in p

[jira] [Created] (SPARK-15630) 2.0 python converage ml root module

2016-05-27 Thread holdenk (JIRA)
holdenk created SPARK-15630: --- Summary: 2.0 python converage ml root module Key: SPARK-15630 URL: https://issues.apache.org/jira/browse/SPARK-15630 Project: Spark Issue Type: Improvement C

[jira] [Commented] (SPARK-14813) ML 2.0 QA: API: Python API coverage

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304728#comment-15304728 ] holdenk commented on SPARK-14813: - I'm thinking we should skip read/write missing in comp

[jira] [Commented] (SPARK-15627) 2.0 python converage ml.tuning module

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304723#comment-15304723 ] holdenk commented on SPARK-15627: - ml.tuning audit complete > 2.0 python converage ml.tu

[jira] [Commented] (SPARK-15623) 2.0 python converage ml.feature

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304722#comment-15304722 ] holdenk commented on SPARK-15623: - cc [~bryanc] can you just double check/confirm that yo

[jira] [Commented] (SPARK-6932) A Prototype of Parameter Server

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304721#comment-15304721 ] Reynold Xin commented on SPARK-6932: Thanks for sharing this, Rolf! It'd be great if

[jira] [Updated] (SPARK-15628) pyspark.ml.evaluation module

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-15628: Component/s: PySpark ML > pyspark.ml.evaluation module > > >

[jira] [Commented] (SPARK-15628) pyspark.ml.evaluation module

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304720#comment-15304720 ] holdenk commented on SPARK-15628: - API Audit of this component complete > pyspark.ml.eva

[jira] [Resolved] (SPARK-15008) Python ML persistence integration test: OneVsRest

2016-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-15008. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12875 [h

[jira] [Updated] (SPARK-15484) Document Iteratively reweighted least squares (IRLS) in user guide

2016-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15484: -- Assignee: Yanbo Liang > Document Iteratively reweighted least squares (IRLS) in user gu

[jira] [Resolved] (SPARK-15484) Document Iteratively reweighted least squares (IRLS) in user guide

2016-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-15484. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13262 [h

[jira] [Created] (SPARK-15628) pyspark.ml.evaluation module

2016-05-27 Thread holdenk (JIRA)
holdenk created SPARK-15628: --- Summary: pyspark.ml.evaluation module Key: SPARK-15628 URL: https://issues.apache.org/jira/browse/SPARK-15628 Project: Spark Issue Type: Improvement Report

[jira] [Created] (SPARK-15629) 2.0 python converage pyspark.ml.linalg

2016-05-27 Thread holdenk (JIRA)
holdenk created SPARK-15629: --- Summary: 2.0 python converage pyspark.ml.linalg Key: SPARK-15629 URL: https://issues.apache.org/jira/browse/SPARK-15629 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-11959) Document normal equation solver for ordinary least squares in user guide

2016-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11959. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13262 [h

[jira] [Commented] (SPARK-15589) Anaylze simple PySpark closures and generate SQL expressions

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304692#comment-15304692 ] holdenk commented on SPARK-15589: - Of course needs to wait for the Python Dataset API to

[jira] [Created] (SPARK-15627) 2.0 python converage ml.tuning module

2016-05-27 Thread holdenk (JIRA)
holdenk created SPARK-15627: --- Summary: 2.0 python converage ml.tuning module Key: SPARK-15627 URL: https://issues.apache.org/jira/browse/SPARK-15627 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-15626) 2.0 python converage ml.regression module

2016-05-27 Thread holdenk (JIRA)
holdenk created SPARK-15626: --- Summary: 2.0 python converage ml.regression module Key: SPARK-15626 URL: https://issues.apache.org/jira/browse/SPARK-15626 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-15624) 2.0 python converage ml.recommendation module

2016-05-27 Thread holdenk (JIRA)
holdenk created SPARK-15624: --- Summary: 2.0 python converage ml.recommendation module Key: SPARK-15624 URL: https://issues.apache.org/jira/browse/SPARK-15624 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-15625) 2.0 python converage ml.classification module

2016-05-27 Thread holdenk (JIRA)
holdenk created SPARK-15625: --- Summary: 2.0 python converage ml.classification module Key: SPARK-15625 URL: https://issues.apache.org/jira/browse/SPARK-15625 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-15623) 2.0 python converage ml.feature

2016-05-27 Thread holdenk (JIRA)
holdenk created SPARK-15623: --- Summary: 2.0 python converage ml.feature Key: SPARK-15623 URL: https://issues.apache.org/jira/browse/SPARK-15623 Project: Spark Issue Type: Improvement Rep

[jira] [Commented] (SPARK-15112) Dataset filter returns garbage

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304686#comment-15304686 ] Apache Spark commented on SPARK-15112: -- User 'liancheng' has created a pull request

[jira] [Commented] (SPARK-14813) ML 2.0 QA: API: Python API coverage

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304681#comment-15304681 ] holdenk commented on SPARK-14813: - No worries, I'll break it up then. > ML 2.0 QA: API:

[jira] [Resolved] (SPARK-15186) Add user guide for Generalized Linear Regression.

2016-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-15186. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13139 [h

[jira] [Created] (SPARK-15622) Janino's classloader has an unexpected behavior when its parent classloader throws an ClassNotFoundException with a cause set

2016-05-27 Thread Yin Huai (JIRA)
Yin Huai created SPARK-15622: Summary: Janino's classloader has an unexpected behavior when its parent classloader throws an ClassNotFoundException with a cause set Key: SPARK-15622 URL: https://issues.apache.org/jira

[jira] [Created] (SPARK-15621) BatchEvalPythonExec fails with OOM

2016-05-27 Thread Krisztian Szucs (JIRA)
Krisztian Szucs created SPARK-15621: --- Summary: BatchEvalPythonExec fails with OOM Key: SPARK-15621 URL: https://issues.apache.org/jira/browse/SPARK-15621 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-15620) Dataset.map creates a dataset that can't be self-joined

2016-05-27 Thread Tim Gautier (JIRA)
Tim Gautier created SPARK-15620: --- Summary: Dataset.map creates a dataset that can't be self-joined Key: SPARK-15620 URL: https://issues.apache.org/jira/browse/SPARK-15620 Project: Spark Issue T

[jira] [Commented] (SPARK-15465) AnalysisException: cannot cast StructType to VectorUDT

2016-05-27 Thread Dmitry Zhukov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304632#comment-15304632 ] Dmitry Zhukov commented on SPARK-15465: --- [~josephkb] with .rdd transform it works p

[jira] [Updated] (SPARK-15619) spark builds filling up /tmp

2016-05-27 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp updated SPARK-15619: Description: spark builds aren't cleaning up /tmp after they run... it's hard to pinpoint EXACTLY

[jira] [Created] (SPARK-15619) spark builds filling up /tmp

2016-05-27 Thread shane knapp (JIRA)
shane knapp created SPARK-15619: --- Summary: spark builds filling up /tmp Key: SPARK-15619 URL: https://issues.apache.org/jira/browse/SPARK-15619 Project: Spark Issue Type: Bug Componen

[jira] [Commented] (SPARK-15176) Job Scheduling Within Application Suffers from Priority Inversion

2016-05-27 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304589#comment-15304589 ] Mark Hamstra commented on SPARK-15176: -- It's not an unreasonable use case, and is si

[jira] [Updated] (SPARK-15551) Scaladoc for KeyValueGroupedDataset points to old method

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15551: Issue Type: Sub-task (was: Documentation) Parent: SPARK-15426 > Scaladoc for KeyValueGroup

[jira] [Resolved] (SPARK-14400) ScriptTransformation does not fail the job for bad user command

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14400. - Resolution: Fixed Fix Version/s: 2.0.0 > ScriptTransformation does not fail the job for ba

[jira] [Commented] (SPARK-15575) Remove breeze from dependencies?

2016-05-27 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304546#comment-15304546 ] Nick Pentreath commented on SPARK-15575: What specifically are the "performance i

[jira] [Commented] (SPARK-15078) Add all TPCDS 1.4 benchmark queries for SparkSQL

2016-05-27 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304529#comment-15304529 ] Sameer Agarwal commented on SPARK-15078: [~ovidiumarcu] Can you please share the

[jira] [Resolved] (SPARK-15531) spark-class tries to use too much memory when running Launcher

2016-05-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-15531. Resolution: Fixed Assignee: Sean Owen Fix Version/s: 2.0.0 > spark-class tr

[jira] [Commented] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304509#comment-15304509 ] Dongjoon Hyun commented on SPARK-15618: --- Thank you! Right. That's better. > Use Sp

[jira] [Commented] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304506#comment-15304506 ] Andrew Or commented on SPARK-15618: --- it needs to be internal. At least it should be pri

[jira] [Resolved] (SPARK-15569) Executors spending significant time in DiskObjectWriter.updateBytesWritten function

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15569. --- Resolution: Fixed Assignee: Sital Kedia Fix Version/s: 2.0.0 Target Versi

[jira] [Commented] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304498#comment-15304498 ] Dongjoon Hyun commented on SPARK-15618: --- Is it okay if I remove `private[sql]` from

[jira] [Commented] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304486#comment-15304486 ] Dongjoon Hyun commented on SPARK-15618: --- Thank you for creating JIRA for this. I'll

[jira] [Updated] (SPARK-15599) Document createDataset functions in SparkSession

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15599: -- Affects Version/s: 2.0.0 Target Version/s: 2.0.0 Component/s: Documentation > Document c

[jira] [Resolved] (SPARK-15599) Document createDataset functions in SparkSession

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15599. --- Resolution: Fixed Fix Version/s: 2.0.0 > Document createDataset functions in SparkSession > --

[jira] [Updated] (SPARK-15599) Document createDataset functions in SparkSession

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15599: -- Assignee: Sameer Agarwal > Document createDataset functions in SparkSession > -

[jira] [Resolved] (SPARK-15584) Abstract duplicate code: "spark.sql.sources." properties

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15584. --- Resolution: Fixed Fix Version/s: 2.0.0 > Abstract duplicate code: "spark.sql.sources." propert

[jira] [Updated] (SPARK-15603) Replace SQLContext with SparkSession in ML/MLLib

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15603: -- Fix Version/s: 2.0.0 > Replace SQLContext with SparkSession in ML/MLLib > -

[jira] [Updated] (SPARK-15603) Replace SQLContext with SparkSession in ML/MLLib

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15603: -- Assignee: Dongjoon Hyun > Replace SQLContext with SparkSession in ML/MLLib > --

[jira] [Commented] (SPARK-13233) Python Dataset

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304478#comment-15304478 ] holdenk commented on SPARK-13233: - So curious - is this targeted for 2.0 or are we planni

[jira] [Updated] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15618: -- Priority: Minor (was: Major) > Use SparkSession.builder.sparkContext(...) in tests where possible > --

[jira] [Updated] (SPARK-15603) Replace SQLContext with SparkSession in ML/MLLib

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15603: -- Affects Version/s: 2.0.0 > Replace SQLContext with SparkSession in ML/MLLib > -

[jira] [Created] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Andrew Or (JIRA)
Andrew Or created SPARK-15618: - Summary: Use SparkSession.builder.sparkContext(...) in tests where possible Key: SPARK-15618 URL: https://issues.apache.org/jira/browse/SPARK-15618 Project: Spark

[jira] [Commented] (SPARK-12776) Implement Python API for Datasets

2016-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304477#comment-15304477 ] holdenk commented on SPARK-12776: - I think this might be duplicated by SPARK-13233, altho

[jira] [Created] (SPARK-15617) Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score

2016-05-27 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-15617: - Summary: Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score Key: SPARK-15617 URL: https://issues.apache.org/jira/browse

[jira] [Updated] (SPARK-15613) Incorrect days to millis conversion

2016-05-27 Thread Dmitry Bushev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Bushev updated SPARK-15613: -- Description: There is an issue with {{DateTimeUtils.daysToMillis}} implementation. It affects

[jira] [Updated] (SPARK-15613) Incorrect days to millis conversion

2016-05-27 Thread Dmitry Bushev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Bushev updated SPARK-15613: -- Description: There is an issue with {{DateTimeUtils.daysToMillis}} implementation. It affects

[jira] [Commented] (SPARK-15431) Support LIST FILE(s)|JAR(s) command natively

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304418#comment-15304418 ] Apache Spark commented on SPARK-15431: -- User 'xwu0226' has created a pull request fo

[jira] [Updated] (SPARK-14361) Support EXCLUDE clause in Window function framing

2016-05-27 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Wu updated SPARK-14361: --- Issue Type: New Feature (was: Improvement) > Support EXCLUDE clause in Window function framing > ---

[jira] [Updated] (SPARK-15565) The default value of spark.sql.warehouse.dir needs to explicitly point to local filesystem

2016-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15565: - Description: The default value of {{spark.sql.warehouse.dir}} is {{System.getProperty("user.dir")/wareh

[jira] [Updated] (SPARK-15565) The default value of spark.sql.warehouse.dir needs to explicitly point to local filesystem

2016-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15565: - Assignee: Xiao Li > The default value of spark.sql.warehouse.dir needs to explicitly point to > local fi

[jira] [Resolved] (SPARK-15565) The default value of spark.sql.warehouse.dir needs to explicitly point to local filesystem

2016-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15565. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13348 [https://github.com/

[jira] [Updated] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-05-27 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-15616: - Description: Currently if some partitions of a partitioned table are used in join operation we r

[jira] [Created] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-05-27 Thread Lianhui Wang (JIRA)
Lianhui Wang created SPARK-15616: Summary: Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available. Key: SPARK-15616 URL: https://issues.apache.org/

[jira] [Created] (SPARK-15615) Support for creating a dataframe from JSON in Dataset[String]

2016-05-27 Thread PJ Fanning (JIRA)
PJ Fanning created SPARK-15615: -- Summary: Support for creating a dataframe from JSON in Dataset[String] Key: SPARK-15615 URL: https://issues.apache.org/jira/browse/SPARK-15615 Project: Spark

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304267#comment-15304267 ] Shivaram Venkataraman commented on SPARK-15585: --- [~maropu] Can you also add

[jira] [Commented] (SPARK-15489) Dataset kryo encoder fails on Collections$UnmodifiableCollection

2016-05-27 Thread Amit Sela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304222#comment-15304222 ] Amit Sela commented on SPARK-15489: --- I would expect this to be related to KryoSerialize

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-05-27 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304143#comment-15304143 ] Cody Koeninger commented on SPARK-12177: This issue already does link to Mark's P

[jira] [Updated] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-05-27 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cody Koeninger updated SPARK-12177: --- Summary: Update KafkaDStreams to new Kafka 0.10 Consumer API (was: Update KafkaDStreams to n

[jira] [Commented] (SPARK-15575) Remove breeze from dependencies?

2016-05-27 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304135#comment-15304135 ] koert kuipers commented on SPARK-15575: --- we can help out porting breeze to scala 2.

[jira] [Comment Edited] (SPARK-15614) ml.feature should support default value of input column

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304124#comment-15304124 ] Yanbo Liang edited comment on SPARK-15614 at 5/27/16 2:33 PM: -

[jira] [Commented] (SPARK-15614) ml.feature should support default value of input column

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304124#comment-15304124 ] Yanbo Liang commented on SPARK-15614: - I vote -1. * The ML pipeline will firstly use

[jira] [Commented] (SPARK-14813) ML 2.0 QA: API: Python API coverage

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304100#comment-15304100 ] Yanbo Liang commented on SPARK-14813: - [~holdenk] Sorry for late response. I'm focuse

[jira] [Assigned] (SPARK-15531) spark-class tries to use too much memory when running Launcher

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15531: Assignee: (was: Apache Spark) > spark-class tries to use too much memory when running

[jira] [Commented] (SPARK-15531) spark-class tries to use too much memory when running Launcher

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304054#comment-15304054 ] Apache Spark commented on SPARK-15531: -- User 'srowen' has created a pull request for

[jira] [Assigned] (SPARK-15531) spark-class tries to use too much memory when running Launcher

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15531: Assignee: Apache Spark > spark-class tries to use too much memory when running Launcher >

[jira] [Resolved] (SPARK-6932) A Prototype of Parameter Server

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6932. -- Resolution: Won't Fix So, after this much inactivity, I assume this will exist if anywhere as a project

[jira] [Resolved] (SPARK-15564) App name is the main class name in Spark streaming jobs

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15564. --- Resolution: Not A Problem > App name is the main class name in Spark streaming jobs > ---

[jira] [Commented] (SPARK-15575) Remove breeze from dependencies?

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304017#comment-15304017 ] Sean Owen commented on SPARK-15575: --- Hm, is Breeze really not supporting 2.12? It seem

[jira] [Resolved] (SPARK-15602) spark

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15602. --- Resolution: Invalid Fix Version/s: (was: 1.4.0) Target Version/s: (was: 1.4.0)

<    1   2   3   >