[jira] [Commented] (SPARK-14248) Get the path hierarchy from root to leaf in the BisectingKMeansModel

2016-03-29 Thread Lakesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216747#comment-15216747 ] Lakesh commented on SPARK-14248: [~josephkb]. It would be a very small addition. You can

[jira] [Commented] (SPARK-14246) vars not updated after Scala script reload

2016-03-29 Thread Jim Powers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216734#comment-15216734 ] Jim Powers commented on SPARK-14246: It appears that this is a Scala problem and not

[jira] [Closed] (SPARK-14250) Parquet import failure: No predefined schema found

2016-03-29 Thread Georg Heiler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Georg Heiler closed SPARK-14250. Resolution: Won't Fix > Parquet import failure: No predefined schema found > --

[jira] [Commented] (SPARK-14250) Parquet import failure: No predefined schema found

2016-03-29 Thread Georg Heiler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216740#comment-15216740 ] Georg Heiler commented on SPARK-14250: -- The local file seems to have been corrupted

[jira] [Commented] (SPARK-14246) vars not updated after Scala script reload

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216737#comment-15216737 ] Sean Owen commented on SPARK-14246: --- Huh. That is strange. Yes it does look like someth

[jira] [Resolved] (SPARK-14246) vars not updated after Scala script reload

2016-03-29 Thread Jim Powers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Powers resolved SPARK-14246. Resolution: Fixed Fix Version/s: 1.6.0 1.6.1 2.0.0 Thi

[jira] [Updated] (SPARK-14246) vars not updated after Scala script reload

2016-03-29 Thread Jim Powers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Powers updated SPARK-14246: --- Attachment: Fail.scala Fail.scala -- exhibits the problem with regular scala invoked with -Yrepl-cla

[jira] [Updated] (SPARK-14240) PySpark Standalone Application hangs without any Error message

2016-03-29 Thread Sayak Ghosh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sayak Ghosh updated SPARK-14240: Attachment: DAG visualisation of pending stages.png event timeline.png Other two fi

[jira] [Updated] (SPARK-14251) Add SQL command for printing out generated code for debugging

2016-03-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14251: Description: SPARK-14227 adds a programatic way to dump generated code. In pure SQL environment th

[jira] [Created] (SPARK-14251) Add SQL command for printing out generated code for debugging

2016-03-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-14251: --- Summary: Add SQL command for printing out generated code for debugging Key: SPARK-14251 URL: https://issues.apache.org/jira/browse/SPARK-14251 Project: Spark

[jira] [Updated] (SPARK-14227) Add method for printing out generated code for debugging

2016-03-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14227: Summary: Add method for printing out generated code for debugging (was: [SQL] Add method for print

[jira] [Closed] (SPARK-14247) Spark does not compile with CDH-5.4.x due to the possible bug of ivy.....

2016-03-29 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nan Zhu closed SPARK-14247. --- Resolution: Not A Problem > Spark does not compile with CDH-5.4.x due to the possible bug of ivy. > -

[jira] [Comment Edited] (SPARK-14247) Spark does not compile with CDH-5.4.x due to the possible bug of ivy.....

2016-03-29 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216719#comment-15216719 ] Nan Zhu edited comment on SPARK-14247 at 3/29/16 7:39 PM: -- thank

[jira] [Commented] (SPARK-14247) Spark does not compile with CDH-5.4.x due to the possible bug of ivy.....

2016-03-29 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216719#comment-15216719 ] Nan Zhu commented on SPARK-14247: - thanks [~sowen], it seems that change the hadoop.versi

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216705#comment-15216705 ] Sean Owen commented on SPARK-14103: --- Printing the bad line is helpful but would be grea

[jira] [Created] (SPARK-14250) Parquet import failure: No predefined schema found

2016-03-29 Thread Georg Heiler (JIRA)
Georg Heiler created SPARK-14250: Summary: Parquet import failure: No predefined schema found Key: SPARK-14250 URL: https://issues.apache.org/jira/browse/SPARK-14250 Project: Spark Issue Type

[jira] [Commented] (SPARK-13802) Fields order in Row(**kwargs) is not consistent with Schema.toInternal method

2016-03-29 Thread Jason C Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216695#comment-15216695 ] Jason C Lee commented on SPARK-13802: - I tried making a fix where I treat a kwarg row

[jira] [Updated] (SPARK-14181) TrainValidationSplit should have HasSeed

2016-03-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14181: -- Shepherd: Joseph K. Bradley Assignee: Xusen Yin Target Version/s

[jira] [Commented] (SPARK-14249) Change MLReader.read to be a property for PySpark

2016-03-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216689#comment-15216689 ] Joseph K. Bradley commented on SPARK-14249: --- Question: Is it odd to have a clas

[jira] [Created] (SPARK-14249) Change MLReader.read to be a property for PySpark

2016-03-29 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14249: - Summary: Change MLReader.read to be a property for PySpark Key: SPARK-14249 URL: https://issues.apache.org/jira/browse/SPARK-14249 Project: Spark I

[jira] [Commented] (SPARK-14225) Cap the length of toCommentSafeString at 128 chars

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216687#comment-15216687 ] Apache Spark commented on SPARK-14225: -- User 'sameeragarwal' has created a pull requ

[jira] [Commented] (SPARK-14247) Spark does not compile with CDH-5.4.x due to the possible bug of ivy.....

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216672#comment-15216672 ] Sean Owen commented on SPARK-14247: --- hadoop-core is the old artifact name from pre-YARN

[jira] [Commented] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2016-03-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216668#comment-15216668 ] Joseph K. Bradley commented on SPARK-14174: --- Quick note: Do you have a use case

[jira] [Comment Edited] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2016-03-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216668#comment-15216668 ] Joseph K. Bradley edited comment on SPARK-14174 at 3/29/16 7:10 PM: ---

[jira] [Commented] (SPARK-14248) Get the path hierarchy from root to leaf in the BisectingKMeansModel

2016-03-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216670#comment-15216670 ] Joseph K. Bradley commented on SPARK-14248: --- [~yuu.ishik...@gmail.com] What do

[jira] [Updated] (SPARK-14247) Spark does not compile with CDH-5.4.x due to the possible bug of ivy.....

2016-03-29 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nan Zhu updated SPARK-14247: Priority: Minor (was: Major) > Spark does not compile with CDH-5.4.x due to the possible bug of ivy. >

[jira] [Commented] (SPARK-14247) Spark does not compile with CDH-5.4.x due to the possible bug of ivy.....

2016-03-29 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216661#comment-15216661 ] Nan Zhu commented on SPARK-14247: - [~srowen] I always blindly copied "CDH.*" string from

[jira] [Assigned] (SPARK-13710) Spark shell shows ERROR when launching on Windows

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13710: Assignee: Apache Spark > Spark shell shows ERROR when launching on Windows > -

[jira] [Comment Edited] (SPARK-14240) PySpark Standalone Application hangs without any Error message

2016-03-29 Thread Sayak Ghosh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216655#comment-15216655 ] Sayak Ghosh edited comment on SPARK-14240 at 3/29/16 7:08 PM: -

[jira] [Commented] (SPARK-13710) Spark shell shows ERROR when launching on Windows

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216659#comment-15216659 ] Apache Spark commented on SPARK-13710: -- User 'michellemay' has created a pull reques

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216660#comment-15216660 ] Shubhanshu Mishra commented on SPARK-14103: --- [~srowen] yes, you are right. The

[jira] [Assigned] (SPARK-13710) Spark shell shows ERROR when launching on Windows

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13710: Assignee: (was: Apache Spark) > Spark shell shows ERROR when launching on Windows > --

[jira] [Updated] (SPARK-14240) PySpark Standalone Application hangs without any Error message

2016-03-29 Thread Sayak Ghosh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sayak Ghosh updated SPARK-14240: Attachment: error_terminal.png Now it is giving this. Is it memory issue? > PySpark Standalone Ap

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216639#comment-15216639 ] Sean Owen commented on SPARK-14103: --- I'm saying that this is already done by {{TextInpu

[jira] [Commented] (SPARK-14247) Spark does not compile with CDH-5.4.x due to the possible bug of ivy.....

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216628#comment-15216628 ] Sean Owen commented on SPARK-14247: --- This is an Ivy / Maven problem, no? You're also po

[jira] [Commented] (SPARK-14246) vars not updated after Scala script reload

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216619#comment-15216619 ] Sean Owen commented on SPARK-14246: --- I suspect this ends up being some classloader issu

[jira] [Updated] (SPARK-14246) vars not updated after Scala script reload

2016-03-29 Thread Jim Powers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Powers updated SPARK-14246: --- Description: Attached are two scripts. The problem only exhibits itself with Spark 1.6.0, 1.6.1, an

[jira] [Created] (SPARK-14248) Get the path hierarchy from root to leaf in the BisectingKMeansModel

2016-03-29 Thread Lakesh (JIRA)
Lakesh created SPARK-14248: -- Summary: Get the path hierarchy from root to leaf in the BisectingKMeansModel Key: SPARK-14248 URL: https://issues.apache.org/jira/browse/SPARK-14248 Project: Spark Iss

[jira] [Created] (SPARK-14247) Spark does not compile with CDH-5.4.x due to the possible bug of ivy.....

2016-03-29 Thread Nan Zhu (JIRA)
Nan Zhu created SPARK-14247: --- Summary: Spark does not compile with CDH-5.4.x due to the possible bug of ivy. Key: SPARK-14247 URL: https://issues.apache.org/jira/browse/SPARK-14247 Project: Spark

[jira] [Commented] (SPARK-14240) PySpark Standalone Application hangs without any Error message

2016-03-29 Thread Sayak Ghosh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216580#comment-15216580 ] Sayak Ghosh commented on SPARK-14240: - Slow means at the end of the application the e

[jira] [Resolved] (SPARK-14120) Import/Export commands (Exception)

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-14120. --- Resolution: Fixed Assignee: Andrew Or Fix Version/s: 2.0.0 > Import/Export commands (

[jira] [Resolved] (SPARK-14119) Role management commands (Exception)

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-14119. --- Resolution: Fixed Fix Version/s: 2.0.0 > Role management commands (Exception) > --

[jira] [Assigned] (SPARK-14119) Role management commands (Exception)

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or reassigned SPARK-14119: - Assignee: Andrew Or > Role management commands (Exception) > ---

[jira] [Updated] (SPARK-14246) vars not updated after Scala script reload

2016-03-29 Thread Jim Powers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Powers updated SPARK-14246: --- Attachment: Null.scala reproduce_transient_npe.scala reproduce_transient_npe.scala --

[jira] [Resolved] (SPARK-14122) Show commands (Exception)

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-14122. --- Resolution: Fixed Assignee: Andrew Or Fix Version/s: 2.0.0 > Show commands (Exception

[jira] [Created] (SPARK-14246) vars not updated after Scala script reload

2016-03-29 Thread Jim Powers (JIRA)
Jim Powers created SPARK-14246: -- Summary: vars not updated after Scala script reload Key: SPARK-14246 URL: https://issues.apache.org/jira/browse/SPARK-14246 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216547#comment-15216547 ] Shubhanshu Mishra commented on SPARK-14103: --- Another issue with your [#comment-

[jira] [Created] (SPARK-14245) webUI should display the user

2016-03-29 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-14245: - Summary: webUI should display the user Key: SPARK-14245 URL: https://issues.apache.org/jira/browse/SPARK-14245 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216541#comment-15216541 ] Shubhanshu Mishra commented on SPARK-14103: --- Ok I tried your suggestion of incr

[jira] [Updated] (SPARK-13864) TPCDS query 74 returns wrong results compared to TPC official result set

2016-03-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13864: Fix Version/s: 2.0.0 > TPCDS query 74 returns wrong results compared to TPC official result set >

[jira] [Updated] (SPARK-13862) TPCDS query 49 returns wrong results compared to TPC official result set

2016-03-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13862: Fix Version/s: 2.0.0 > TPCDS query 49 returns wrong results compared to TPC official result set >

[jira] [Resolved] (SPARK-10570) Add Spark version endpoint to standalone JSON API

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10570. --- Resolution: Fixed Assignee: Jakob Odersky Fix Version/s: 2.0.0 Target Ver

[jira] [Resolved] (SPARK-14232) Event timeline on job page doesn't show if an executor is removed with multiple line reason

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-14232. --- Resolution: Fixed Fix Version/s: 2.0.0 1.6.2 Target Version/s:

[jira] [Updated] (SPARK-14232) Event timeline on job page doesn't show if an executor is removed with multiple line reason

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14232: -- Assignee: Carson Wang > Event timeline on job page doesn't show if an executor is removed with > multi

[jira] [Commented] (SPARK-14087) PySpark ML JavaModel does not properly own params after being fit

2016-03-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216510#comment-15216510 ] Joseph K. Bradley commented on SPARK-14087: --- We may hold off on this until maki

[jira] [Commented] (SPARK-13862) TPCDS query 49 returns wrong results compared to TPC official result set

2016-03-29 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216500#comment-15216500 ] JESSE CHEN commented on SPARK-13862: PR fixed the issue. New result is ordered correc

[jira] [Updated] (SPARK-14095) LogisticRegression fails when a DataFrame has only a one-class label

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14095: -- Assignee: Feynman Liang > LogisticRegression fails when a DataFrame has only a one-class label > --

[jira] [Commented] (SPARK-14240) PySpark Standalone Application hangs without any Error message

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216497#comment-15216497 ] Sean Owen commented on SPARK-14240: --- It could be. So far you're just reporting that thi

[jira] [Closed] (SPARK-14095) LogisticRegression fails when a DataFrame has only a one-class label

2016-03-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14095. - Resolution: Fixed Fix Version/s: 2.0.0 The linked bug fix should be for this same

[jira] [Closed] (SPARK-13862) TPCDS query 49 returns wrong results compared to TPC official result set

2016-03-29 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JESSE CHEN closed SPARK-13862. -- PR fixed this issue. Thanks, [~smilegator] > TPCDS query 49 returns wrong results compared to TPC official

[jira] [Closed] (SPARK-13864) TPCDS query 74 returns wrong results compared to TPC official result set

2016-03-29 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JESSE CHEN closed SPARK-13864. -- PR fixed the issue. Nice work, [~smilegator] > TPCDS query 74 returns wrong results compared to TPC offici

[jira] [Updated] (SPARK-12384) Allow -Xms to be set differently then -Xmx

2016-03-29 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-12384: -- Assignee: Dhruve Ashar > Allow -Xms to be set differently then -Xmx > -

[jira] [Commented] (SPARK-13864) TPCDS query 74 returns wrong results compared to TPC official result set

2016-03-29 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216483#comment-15216483 ] JESSE CHEN commented on SPARK-13864: Validated successfully. Returned the correct res

[jira] [Commented] (SPARK-14240) PySpark Standalone Application hangs without any Error message

2016-03-29 Thread Sayak Ghosh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216475#comment-15216475 ] Sayak Ghosh commented on SPARK-14240: - As I am new to this environment, I cannot get

[jira] [Updated] (SPARK-13845) BlockStatus and StreamBlockId keep on growing result driver OOM

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-13845: -- Assignee: jeanlyn > BlockStatus and StreamBlockId keep on growing result driver OOM > -

[jira] [Closed] (SPARK-14076) Naive Bayes should output attributes in predictions

2016-03-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14076. - Resolution: Duplicate Closing since this is part of an existing JIRA > Naive Bayes shoul

[jira] [Updated] (SPARK-14243) updatedBlockStatuses does not update correctly when removing blocks

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14243: -- Assignee: jeanlyn > updatedBlockStatuses does not update correctly when removing blocks > -

[jira] [Commented] (SPARK-14243) updatedBlockStatuses does not update correctly when removing blocks

2016-03-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216448#comment-15216448 ] Andrew Or commented on SPARK-14243: --- Thanks I've assigned this to you. > updatedBlockS

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216437#comment-15216437 ] Shubhanshu Mishra commented on SPARK-14103: --- I just double checked using the fo

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216400#comment-15216400 ] Sean Owen commented on SPARK-14103: --- You show the length of one line there, not the max

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216394#comment-15216394 ] Shubhanshu Mishra commented on SPARK-14103: --- [~srowen] In [#comment-15215064] c

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216378#comment-15216378 ] Sean Owen commented on SPARK-14103: --- Did you compute the maximum line length? that's no

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216350#comment-15216350 ] Shubhanshu Mishra commented on SPARK-14103: --- [~srowen] As I have mentioned abov

[jira] [Commented] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-29 Thread Oscar Boykin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216346#comment-15216346 ] Oscar Boykin commented on SPARK-11416: -- Ok. I see the issue now (I overlooked your r

[jira] [Commented] (SPARK-14188) Running HiveQuery through Spark-Shell getting org.apache.spark.sql.AnalysisException: Table not found:

2016-03-29 Thread Sowmyanarayanan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216342#comment-15216342 ] Sowmyanarayanan commented on SPARK-14188: - Hi Sorry for creating a blocker.I make

[jira] [Updated] (SPARK-14188) Running HiveQuery through Spark-Shell getting org.apache.spark.sql.AnalysisException: Table not found:

2016-03-29 Thread Sowmyanarayanan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sowmyanarayanan updated SPARK-14188: Priority: Minor (was: Blocker) > Running HiveQuery through Spark-Shell getting > org.apac

[jira] [Commented] (SPARK-14244) Physical Window operator uses global SizeBasedWindowFunction.n attribute generated on both driver and executor side

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216329#comment-15216329 ] Apache Spark commented on SPARK-14244: -- User 'liancheng' has created a pull request

[jira] [Assigned] (SPARK-14244) Physical Window operator uses global SizeBasedWindowFunction.n attribute generated on both driver and executor side

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14244: Assignee: Apache Spark (was: Cheng Lian) > Physical Window operator uses global SizeBased

[jira] [Assigned] (SPARK-14244) Physical Window operator uses global SizeBasedWindowFunction.n attribute generated on both driver and executor side

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14244: Assignee: Cheng Lian (was: Apache Spark) > Physical Window operator uses global SizeBased

[jira] [Created] (SPARK-14244) Physical Window operator uses global SizeBasedWindowFunction.n attribute generated on both driver and executor side

2016-03-29 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-14244: -- Summary: Physical Window operator uses global SizeBasedWindowFunction.n attribute generated on both driver and executor side Key: SPARK-14244 URL: https://issues.apache.org/jira/brows

[jira] [Commented] (SPARK-14057) sql time stamps do not respect time zones

2016-03-29 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216293#comment-15216293 ] Andrew Davidson commented on SPARK-14057: - Hi Vijay I am fairly new to this also

[jira] [Commented] (SPARK-14209) Application failure during preemption.

2016-03-29 Thread Miles Crawford (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216283#comment-15216283 ] Miles Crawford commented on SPARK-14209: It will be useful to know whether the is

[jira] [Updated] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14154: -- Priority: Minor (was: Major) > Simplify the implementation for Kolmogorov–Smirnov test > -

[jira] [Resolved] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14154. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11954 [https://github.co

[jira] [Updated] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14154: -- Assignee: yuhao yang > Simplify the implementation for Kolmogorov–Smirnov test > --

[jira] [Resolved] (SPARK-14183) UnsupportedOperationException: empty.max when fitting CrossValidator model

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14183. --- Resolution: Not A Problem OK looks like something else made this a moot point > UnsupportedOperation

[jira] [Commented] (SPARK-12800) Subtle bug on Spark Yarn Client under Kerberos Security Mode

2016-03-29 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216243#comment-15216243 ] Thomas Graves commented on SPARK-12800: --- You are talking about launching a job usin

[jira] [Commented] (SPARK-14243) updatedBlockStatuses does not update correctly when removing blocks

2016-03-29 Thread jeanlyn (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216229#comment-15216229 ] jeanlyn commented on SPARK-14243: - [~andrewor14] Let me know if the descriptions does no

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216228#comment-15216228 ] Sean Owen commented on SPARK-14103: --- It shouldn't be platform dependent; configurable m

[jira] [Updated] (SPARK-14243) updatedBlockStatuses does not update correctly when removing blocks

2016-03-29 Thread jeanlyn (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jeanlyn updated SPARK-14243: Summary: updatedBlockStatuses does not update correctly when removing blocks (was: updatedBlockStatuses do

[jira] [Created] (SPARK-14243) updatedBlockStatuses does not update correctly

2016-03-29 Thread jeanlyn (JIRA)
jeanlyn created SPARK-14243: --- Summary: updatedBlockStatuses does not update correctly Key: SPARK-14243 URL: https://issues.apache.org/jira/browse/SPARK-14243 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14059: Assignee: (was: Apache Spark) > Define R wrappers under org.apache.spark.ml.r > --

[jira] [Assigned] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14059: Assignee: Apache Spark > Define R wrappers under org.apache.spark.ml.r > -

[jira] [Commented] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216212#comment-15216212 ] Apache Spark commented on SPARK-14059: -- User 'yanboliang' has created a pull request

[jira] [Commented] (SPARK-14183) UnsupportedOperationException: empty.max when fitting CrossValidator model

2016-03-29 Thread Yong Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216206#comment-15216206 ] Yong Tang commented on SPARK-14183: --- With the latest master build the message changes t

[jira] [Updated] (SPARK-14242) avoid too many copies in network when a network frame is large

2016-03-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14242: -- Priority: Major (was: Critical) Component/s: Input/Output > avoid too many copies in network wh

[jira] [Commented] (SPARK-14165) NoSuchElementException: None.get when joining DataFrames with Seq of fields of different case

2016-03-29 Thread Subhobrata Dey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216187#comment-15216187 ] Subhobrata Dey commented on SPARK-14165: Hi [~jlaskowski], I think the issue is

[jira] [Commented] (SPARK-14242) avoid too many copies in network when a network frame is large

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216166#comment-15216166 ] Apache Spark commented on SPARK-14242: -- User 'liyezhang556520' has created a pull re

[jira] [Assigned] (SPARK-14242) avoid too many copies in network when a network frame is large

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14242: Assignee: Apache Spark > avoid too many copies in network when a network frame is large >

[jira] [Assigned] (SPARK-14242) avoid too many copies in network when a network frame is large

2016-03-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14242: Assignee: (was: Apache Spark) > avoid too many copies in network when a network frame

<    1   2   3   >