[jira] [Assigned] (SPARK-14607) Partition pruning is case sensitive even with HiveContext

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14607: Assignee: Apache Spark > Partition pruning is case sensitive even with HiveContext > -

[jira] [Assigned] (SPARK-14484) Fail to create parquet filter if the column name does not match exactly

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14484: Assignee: (was: Apache Spark) > Fail to create parquet filter if the column name does

[jira] [Assigned] (SPARK-14607) Partition pruning is case sensitive even with HiveContext

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14607: Assignee: (was: Apache Spark) > Partition pruning is case sensitive even with HiveCont

[jira] [Commented] (SPARK-14484) Fail to create parquet filter if the column name does not match exactly

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240131#comment-15240131 ] Apache Spark commented on SPARK-14484: -- User 'davies' has created a pull request for

[jira] [Commented] (SPARK-7146) Should ML sharedParams be a public API?

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240127#comment-15240127 ] Joseph K. Bradley commented on SPARK-7146: -- I just did an audit of our current sh

[jira] [Updated] (SPARK-7146) Should ML sharedParams be a public API?

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7146: - Description: Discussion: Should the Param traits in sharedParams.scala be public? Pros: *

[jira] [Commented] (SPARK-14599) BaggedPoint should support weighted instances.

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240115#comment-15240115 ] Apache Spark commented on SPARK-14599: -- User 'sethah' has created a pull request for

[jira] [Assigned] (SPARK-14599) BaggedPoint should support weighted instances.

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14599: Assignee: (was: Apache Spark) > BaggedPoint should support weighted instances. > -

[jira] [Assigned] (SPARK-14599) BaggedPoint should support weighted instances.

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14599: Assignee: Apache Spark > BaggedPoint should support weighted instances. >

[jira] [Updated] (SPARK-14610) Remove superfluous split from random forest findSplitsForContinousFeature

2016-04-13 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seth Hendrickson updated SPARK-14610: - Description: Currently, the method findSplitsForContinuousFeature in random forest produc

[jira] [Commented] (SPARK-14610) Remove superfluous split from random forest findSplitsForContinousFeature

2016-04-13 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240105#comment-15240105 ] Seth Hendrickson commented on SPARK-14610: -- One thing to note, is that fixing th

[jira] [Created] (SPARK-14610) Remove superfluous split from random forest findSplitsForContinousFeature

2016-04-13 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-14610: Summary: Remove superfluous split from random forest findSplitsForContinousFeature Key: SPARK-14610 URL: https://issues.apache.org/jira/browse/SPARK-14610 Pro

[jira] [Resolved] (SPARK-14574) Pure Java modules should not have _2.xx suffixes in their package names

2016-04-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-14574. Resolution: Later Resolving as "later" since this is prohibitively costly to fix and I don't have

[jira] [Created] (SPARK-14609) LOAD DATA

2016-04-13 Thread Yin Huai (JIRA)
Yin Huai created SPARK-14609: Summary: LOAD DATA Key: SPARK-14609 URL: https://issues.apache.org/jira/browse/SPARK-14609 Project: Spark Issue Type: Sub-task Components: SQL

[jira] [Updated] (SPARK-9961) ML prediction abstractions should have defaultEvaluator fields

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9961: - Target Version/s: 2.0.0 (was: ) > ML prediction abstractions should have defaultEvaluator

[jira] [Updated] (SPARK-9961) ML prediction abstractions should have defaultEvaluator fields

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9961: - Description: Predictor and PredictionModel should have abstract defaultEvaluator methods

[jira] [Commented] (SPARK-14606) Different maxBins value for categorical and continuous features in RandomForest implementation.

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240058#comment-15240058 ] Joseph K. Bradley commented on SPARK-14606: --- We should choose a good way to sup

[jira] [Updated] (SPARK-14606) Different maxBins value for categorical and continuous features in RandomForest implementation.

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14606: -- Fix Version/s: (was: 2.0.0) > Different maxBins value for categorical and continuou

[jira] [Updated] (SPARK-14606) Different maxBins value for categorical and continuous features in RandomForest implementation.

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14606: -- Affects Version/s: (was: 1.6.1) (was: 1.5.2)

[jira] [Commented] (SPARK-10574) HashingTF should use MurmurHash3

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240048#comment-15240048 ] Joseph K. Bradley commented on SPARK-10574: --- [~yanboliang] Will you have time t

[jira] [Updated] (SPARK-14606) Different maxBins value for categorical and continuous features in RandomForest implementation.

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14606: -- Shepherd: (was: Xiangrui Meng) > Different maxBins value for categorical and continuo

[jira] [Commented] (SPARK-14560) Cooperative Memory Management for Spillables

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240047#comment-15240047 ] Apache Spark commented on SPARK-14560: -- User 'squito' has created a pull request for

[jira] [Assigned] (SPARK-14560) Cooperative Memory Management for Spillables

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14560: Assignee: Apache Spark (was: Imran Rashid) > Cooperative Memory Management for Spillables

[jira] [Assigned] (SPARK-14560) Cooperative Memory Management for Spillables

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14560: Assignee: Imran Rashid (was: Apache Spark) > Cooperative Memory Management for Spillables

[jira] [Commented] (SPARK-10574) HashingTF should use MurmurHash3

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240044#comment-15240044 ] Joseph K. Bradley commented on SPARK-10574: --- Yeah, I'd like to prioritize switc

[jira] [Commented] (SPARK-14209) Application failure during preemption.

2016-04-13 Thread Miles Crawford (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240040#comment-15240040 ] Miles Crawford commented on SPARK-14209: Just had a very similar error when a hos

[jira] [Created] (SPARK-14608) transformSchema needs better documentation

2016-04-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14608: - Summary: transformSchema needs better documentation Key: SPARK-14608 URL: https://issues.apache.org/jira/browse/SPARK-14608 Project: Spark Issue Ty

[jira] [Updated] (SPARK-14445) Show columns/partitions

2016-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14445: -- Assignee: Dilip Biswal > Show columns/partitions > --- > > Key: SPA

[jira] [Updated] (SPARK-14445) Support native execution of SHOW COLUMNS and SHOW PARTITIONS command

2016-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14445: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-14118 > Support native execution of SHOW

[jira] [Updated] (SPARK-14445) Show columns/partitions

2016-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14445: -- Summary: Show columns/partitions (was: show columns/partitions) > Show columns/partitions > --

[jira] [Updated] (SPARK-14445) show columns/partitions

2016-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14445: -- Summary: show columns/partitions (was: Support native execution of SHOW COLUMNS and SHOW PARTITIONS c

[jira] [Commented] (SPARK-13434) Reduce Spark RandomForest memory footprint

2016-04-13 Thread Shirish Tatikonda (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240020#comment-15240020 ] Shirish Tatikonda commented on SPARK-13434: --- [~josephkb] Just curious if there

[jira] [Resolved] (SPARK-14472) Cleanup PySpark-ML Java wrapper classes so that JavaWrapper will inherit from JavaCallable

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14472. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12304 [h

[jira] [Resolved] (SPARK-13089) spark.ml Naive Bayes user guide

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-13089. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11015 [h

[jira] [Resolved] (SPARK-14509) Add python CountVectorizerExample

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14509. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11917 [h

[jira] [Created] (SPARK-14607) Partition pruning is case sensitive even with HiveContext

2016-04-13 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14607: -- Summary: Partition pruning is case sensitive even with HiveContext Key: SPARK-14607 URL: https://issues.apache.org/jira/browse/SPARK-14607 Project: Spark Issue T

[jira] [Resolved] (SPARK-14375) Unit test for spark.ml KMeansSummary

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14375. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12254 [h

[jira] [Resolved] (SPARK-14461) GLM training summaries should provide solver

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14461. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12253 [h

[jira] [Resolved] (SPARK-10386) Model import/export for PrefixSpan

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-10386. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10664 [h

[jira] [Assigned] (SPARK-14605) Python spark.ml classes should use unicode uid

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14605: Assignee: Joseph K. Bradley (was: Apache Spark) > Python spark.ml classes should use unic

[jira] [Assigned] (SPARK-14605) Python spark.ml classes should use unicode uid

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14605: Assignee: Apache Spark (was: Joseph K. Bradley) > Python spark.ml classes should use unic

[jira] [Commented] (SPARK-14605) Python spark.ml classes should use unicode uid

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239945#comment-15239945 ] Apache Spark commented on SPARK-14605: -- User 'jkbradley' has created a pull request

[jira] [Resolved] (SPARK-14581) Improve filter push down

2016-04-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14581. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12342 [https://github.

[jira] [Commented] (SPARK-11157) Allow Spark to be built without assemblies

2016-04-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239927#comment-15239927 ] Steve Loughran commented on SPARK-11157: thanks. Although it's not directly a YAR

[jira] [Commented] (SPARK-14548) Support !> and !< operator in Spark SQL

2016-04-13 Thread Jia Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239894#comment-15239894 ] Jia Li commented on SPARK-14548: Microsoft SQL Server, IBM DB2 and Sybase Adaptive Server

[jira] [Created] (SPARK-14606) Different maxBins value for categorical and continuous features in RandomForest implementation.

2016-04-13 Thread Rahul Tanwani (JIRA)
Rahul Tanwani created SPARK-14606: - Summary: Different maxBins value for categorical and continuous features in RandomForest implementation. Key: SPARK-14606 URL: https://issues.apache.org/jira/browse/SPARK-14606

[jira] [Updated] (SPARK-14499) Add tests to make sure drop partitions of an external table will not delete data

2016-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14499: -- Assignee: Xiao Li > Add tests to make sure drop partitions of an external table will not delete > data

[jira] [Assigned] (SPARK-14441) Consolidate DDL tests

2016-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or reassigned SPARK-14441: - Assignee: Andrew Or > Consolidate DDL tests > - > > Key: SPA

[jira] [Created] (SPARK-14605) Python spark.ml classes should use unicode uid

2016-04-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14605: - Summary: Python spark.ml classes should use unicode uid Key: SPARK-14605 URL: https://issues.apache.org/jira/browse/SPARK-14605 Project: Spark Issu

[jira] [Commented] (SPARK-12154) Upgrade to Jersey 2

2016-04-13 Thread David Wood (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239871#comment-15239871 ] David Wood commented on SPARK-12154: Just some experience noted here. I'm running to

[jira] [Closed] (SPARK-14590) Update pull request template with link to jira

2016-04-13 Thread Luciano Resende (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luciano Resende closed SPARK-14590. --- Resolution: Won't Fix > Update pull request template with link to jira >

[jira] [Commented] (SPARK-14489) RegressionEvaluator returns NaN for ALS in Spark ml

2016-04-13 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239847#comment-15239847 ] Nick Pentreath commented on SPARK-14489: In the live setting you definitely want

[jira] [Commented] (SPARK-14299) Scala ML examples code merge and clean up

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239802#comment-15239802 ] Apache Spark commented on SPARK-14299: -- User 'yinxusen' has created a pull request f

[jira] [Commented] (SPARK-14571) Log instrumentation in ALS

2016-04-13 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239794#comment-15239794 ] Timothy Hunter commented on SPARK-14571: SPARK-14568 has been merged, so it shoul

[jira] [Resolved] (SPARK-6725) Model export/import for Pipeline API (Scala)

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-6725. -- Resolution: Fixed Fix Version/s: 2.0.0 I'm marking this complete since we now hav

[jira] [Resolved] (SPARK-13783) Model export/import for spark.ml: GBTs

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-13783. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12230 [h

[jira] [Commented] (SPARK-14570) Log instrumentation in Random forests

2016-04-13 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239775#comment-15239775 ] Timothy Hunter commented on SPARK-14570: SPARK-14568 has been merged, so it shoul

[jira] [Resolved] (SPARK-14598) Can spark-mllib upgrade to Jersey 2.x

2016-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14598. --- Resolution: Duplicate Have a look through JIRA first please > Can spark-mllib upgrade to Jersey 2.x

[jira] [Commented] (SPARK-14569) Log instrumentation in KMeans

2016-04-13 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239751#comment-15239751 ] Timothy Hunter commented on SPARK-14569: SPARK-14568 has been merged, so it shoul

[jira] [Created] (SPARK-14604) Modify design of ML model summaries

2016-04-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14604: - Summary: Modify design of ML model summaries Key: SPARK-14604 URL: https://issues.apache.org/jira/browse/SPARK-14604 Project: Spark Issue Type: Imp

[jira] [Updated] (SPARK-14461) GLM training summaries should provide solver

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14461: -- Shepherd: Joseph K. Bradley Assignee: Yanbo Liang Target Version

[jira] [Resolved] (SPARK-14388) Create Table

2016-04-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14388. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12271 [https://github.com/

[jira] [Resolved] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14568. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12331 [h

[jira] [Commented] (SPARK-14603) SessionCatalog needs to check if a metadata operation is valid

2016-04-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239718#comment-15239718 ] Xiao Li commented on SPARK-14603: - Sure, will do it. Thanks! > SessionCatalog needs to c

[jira] [Assigned] (SPARK-14601) Minor doc/usage changes related to removal of Spark assembly

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14601: Assignee: Apache Spark > Minor doc/usage changes related to removal of Spark assembly > --

[jira] [Commented] (SPARK-14601) Minor doc/usage changes related to removal of Spark assembly

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239710#comment-15239710 ] Apache Spark commented on SPARK-14601: -- User 'markgrover' has created a pull request

[jira] [Assigned] (SPARK-14601) Minor doc/usage changes related to removal of Spark assembly

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14601: Assignee: (was: Apache Spark) > Minor doc/usage changes related to removal of Spark as

[jira] [Commented] (SPARK-11157) Allow Spark to be built without assemblies

2016-04-13 Thread Sebastian Kochman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239708#comment-15239708 ] Sebastian Kochman commented on SPARK-11157: --- I have filed a Spark bug: https://

[jira] [Created] (SPARK-14602) [YARN+Windows] Setting SPARK_YARN_CACHE_FILES exceeds command line length limit on Windows

2016-04-13 Thread Sebastian Kochman (JIRA)
Sebastian Kochman created SPARK-14602: - Summary: [YARN+Windows] Setting SPARK_YARN_CACHE_FILES exceeds command line length limit on Windows Key: SPARK-14602 URL: https://issues.apache.org/jira/browse/SPARK-146

[jira] [Created] (SPARK-14603) SessionCatalog needs to check if a metadata operation is valid

2016-04-13 Thread Yin Huai (JIRA)
Yin Huai created SPARK-14603: Summary: SessionCatalog needs to check if a metadata operation is valid Key: SPARK-14603 URL: https://issues.apache.org/jira/browse/SPARK-14603 Project: Spark Issue

[jira] [Created] (SPARK-14601) Minor doc/usage changes related to removal of Spark assembly

2016-04-13 Thread Mark Grover (JIRA)
Mark Grover created SPARK-14601: --- Summary: Minor doc/usage changes related to removal of Spark assembly Key: SPARK-14601 URL: https://issues.apache.org/jira/browse/SPARK-14601 Project: Spark I

[jira] [Commented] (SPARK-14600) Push predicates through Expand

2016-04-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239688#comment-15239688 ] Davies Liu commented on SPARK-14600: cc [~cloud_fan] > Push predicates through Expan

[jira] [Commented] (SPARK-14306) PySpark ml.classification OneVsRest support export/import

2016-04-13 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239678#comment-15239678 ] Xusen Yin commented on SPARK-14306: --- Yes, but blocked by this https://github.com/apache

[jira] [Commented] (SPARK-14389) OOM during BroadcastNestedLoopJoin

2016-04-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239687#comment-15239687 ] Cheng Lian commented on SPARK-14389: Exception thrown by UnsafeRow.copy() inside BNL

[jira] [Created] (SPARK-14600) Push predicates through Expand

2016-04-13 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14600: -- Summary: Push predicates through Expand Key: SPARK-14600 URL: https://issues.apache.org/jira/browse/SPARK-14600 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-14599) BaggedPoint should support weighted instances.

2016-04-13 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-14599: Summary: BaggedPoint should support weighted instances. Key: SPARK-14599 URL: https://issues.apache.org/jira/browse/SPARK-14599 Project: Spark Issue

[jira] [Commented] (SPARK-14599) BaggedPoint should support weighted instances.

2016-04-13 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239685#comment-15239685 ] Seth Hendrickson commented on SPARK-14599: -- I will submit a PR for this shortly.

[jira] [Commented] (SPARK-14598) Can spark-mllib upgrade to Jersey 2.x

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239682#comment-15239682 ] Joseph K. Bradley commented on SPARK-14598: --- I don't think MLlib uses Jersey, b

[jira] [Updated] (SPARK-14598) Can spark-mllib upgrade to Jersey 2.x

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14598: -- Component/s: (was: MLlib) Spark Core Build > Can

[jira] [Commented] (SPARK-14306) PySpark ml.classification OneVsRest support export/import

2016-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239673#comment-15239673 ] Joseph K. Bradley commented on SPARK-14306: --- Ping! Are you still working on th

[jira] [Commented] (SPARK-14495) Distinct aggregation cannot be used in the having clause

2016-04-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239674#comment-15239674 ] Cheng Lian commented on SPARK-14495: This ticket is for branch-1.6. > Distinct aggre

[jira] [Commented] (SPARK-13253) Error aliasing array columns.

2016-04-13 Thread Rakesh Chalasani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239668#comment-15239668 ] Rakesh Chalasani commented on SPARK-13253: -- Hi [~cloud_fan], just checked it on

[jira] [Closed] (SPARK-13253) Error aliasing array columns.

2016-04-13 Thread Rakesh Chalasani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rakesh Chalasani closed SPARK-13253. Resolution: Resolved > Error aliasing array columns. > - > >

[jira] [Commented] (SPARK-14516) Clustering evaluator

2016-04-13 Thread Ahmed Kamal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239652#comment-15239652 ] Ahmed Kamal commented on SPARK-14516: - I will go through the Mlib code to familiarize

[jira] [Created] (SPARK-14598) Can spark-mllib upgrade to Jersey 2.x

2016-04-13 Thread David Wood (JIRA)
David Wood created SPARK-14598: -- Summary: Can spark-mllib upgrade to Jersey 2.x Key: SPARK-14598 URL: https://issues.apache.org/jira/browse/SPARK-14598 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-13753) Column nullable is derived incorrectly

2016-04-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239637#comment-15239637 ] Cheng Lian commented on SPARK-13753: [~jingweilu] Could you please provide the schema

[jira] [Commented] (SPARK-14463) read.text broken for partitioned tables

2016-04-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239630#comment-15239630 ] Reynold Xin commented on SPARK-14463: - The problem is that the return type is String,

[jira] [Commented] (SPARK-14463) read.text broken for partitioned tables

2016-04-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239617#comment-15239617 ] Cheng Lian commented on SPARK-14463: Seems that this is because {{buildReader()}} doe

[jira] [Commented] (SPARK-14139) Dataset loses nullability in operations with RowEncoder

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239613#comment-15239613 ] Apache Spark commented on SPARK-14139: -- User 'cloud-fan' has created a pull request

[jira] [Commented] (SPARK-14585) Provide accessor methods for Pipeline stages

2016-04-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239595#comment-15239595 ] yuhao yang commented on SPARK-14585: I think you basically have implemented it... >

[jira] [Commented] (SPARK-14560) Cooperative Memory Management for Spillables

2016-04-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239573#comment-15239573 ] Imran Rashid commented on SPARK-14560: -- Here's a reproduction. This could almost ce

[jira] [Commented] (SPARK-14434) User guide doc and examples for GaussianMixture in spark.ml

2016-04-13 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239563#comment-15239563 ] Miao Wang commented on SPARK-14434: --- I will work on this once I commit [SPARK-14433] Py

[jira] [Commented] (SPARK-14560) Cooperative Memory Management for Spillables

2016-04-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239559#comment-15239559 ] Imran Rashid commented on SPARK-14560: -- After some doing some more testing, I realiz

[jira] [Commented] (SPARK-14433) PySpark ml GaussianMixture

2016-04-13 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239549#comment-15239549 ] Miao Wang commented on SPARK-14433: --- Coding 70% completed. Designing test cases. > Py

[jira] [Commented] (SPARK-14388) Create Table

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239535#comment-15239535 ] Apache Spark commented on SPARK-14388: -- User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239531#comment-15239531 ] Xiangrui Meng commented on SPARK-14154: --- [~yuhaoyan] Thanks for the benchmark! I re

[jira] [Closed] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-14154. - Resolution: Not A Problem Fix Version/s: (was: 2.0.0) > Simplify the implementation fo

[jira] [Updated] (SPARK-14572) Update Config Doc to specify -Xms in extraJavaOptions

2016-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14572: -- Priority: Trivial (was: Minor) Fix Version/s: (was: 2.0.0) > Update Config Doc to specify

[jira] [Assigned] (SPARK-14572) Update Config Doc to specify -Xms in extraJavaOptions

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14572: Assignee: (was: Apache Spark) > Update Config Doc to specify -Xms in extraJavaOptions

[jira] [Commented] (SPARK-14572) Update Config Doc to specify -Xms in extraJavaOptions

2016-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239486#comment-15239486 ] Apache Spark commented on SPARK-14572: -- User 'dhruve' has created a pull request for

<    1   2   3   >