[jira] [Resolved] (SPARK-15198) Support for filter push down for boolean types in ORC

2016-07-04 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15198. Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 12972 [https://github.

[jira] [Updated] (SPARK-16354) Illegal Inputs In LIMIT or TABLESAMPLE

2016-07-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16354: Description: {noformat} SELECT * FROM testData TABLESAMPLE (-1 rows) SELECT * FROM testData LIMIT -1 {nofor

[jira] [Updated] (SPARK-16354) Illegal Inputs In LIMIT or TABLESAMPLE

2016-07-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16354: Summary: Illegal Inputs In LIMIT or TABLESAMPLE (was: Negative Inputs In LIMIT or TABLESAMPLE) > Illegal

[jira] [Commented] (SPARK-16354) Negative Inputs In LIMIT or TABLESAMPLE

2016-07-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362084#comment-15362084 ] Xiao Li commented on SPARK-16354: - Failure. > Negative Inputs In LIMIT or TABLESAMPLE >

[jira] [Updated] (SPARK-15198) Support for filter push down for boolean types in ORC

2016-07-04 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15198: --- Assignee: Hyukjin Kwon > Support for filter push down for boolean types in ORC >

[jira] [Issue Comment Deleted] (SPARK-15440) Add CSRF Filter for REST APIs to Spark

2016-07-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15440: Comment: was deleted (was: User 'yanboliang' has created a pull request for this issue: https://git

[jira] [Issue Comment Deleted] (SPARK-15440) Add CSRF Filter for REST APIs to Spark

2016-07-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15440: Comment: was deleted (was: User 'yanboliang' has created a pull request for this issue: https://git

[jira] [Commented] (SPARK-15440) Add CSRF Filter for REST APIs to Spark

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362025#comment-15362025 ] Apache Spark commented on SPARK-15440: -- User 'yanboliang' has created a pull request

[jira] [Commented] (SPARK-15440) Add CSRF Filter for REST APIs to Spark

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362021#comment-15362021 ] Apache Spark commented on SPARK-15440: -- User 'yanboliang' has created a pull request

[jira] [Commented] (SPARK-15790) Audit @Since annotations in ML

2016-07-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362019#comment-15362019 ] Yanbo Liang commented on SPARK-15790: - [~mlnick] Can this catch up with 2.0.0? This i

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-07-04 Thread Takao Magoori (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362014#comment-15362014 ] Takao Magoori commented on SPARK-13587: --- Sorry. It seems there is no isolated site-

[jira] [Commented] (SPARK-6764) Add wheel package support for PySpark

2016-07-04 Thread Takao Magoori (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362015#comment-15362015 ] Takao Magoori commented on SPARK-6764: -- Sorry. It seems there is no isolated site-pac

[jira] [Created] (SPARK-16373) Joda Datetime , unable to validate the year format

2016-07-04 Thread UnussKhan (JIRA)
UnussKhan created SPARK-16373: - Summary: Joda Datetime , unable to validate the year format Key: SPARK-16373 URL: https://issues.apache.org/jira/browse/SPARK-16373 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-16372) Retag RDD to tallSkinnyQR of RowMatrix

2016-07-04 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-16372: -- Summary: Retag RDD to tallSkinnyQR of RowMatrix (was: RowMatrix constructor should use retag for Java

[jira] [Updated] (SPARK-16360) Speed up SQL query performance by removing redundant `executePlan` call in `Dataset`

2016-07-04 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-16360: -- Description: Currently, there are a few reports about Spark 2.0 query performance regression f

[jira] [Comment Edited] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-04 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361946#comment-15361946 ] Liwei Lin edited comment on SPARK-16371 at 7/5/16 3:35 AM: --- Hi

[jira] [Comment Edited] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-04 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361946#comment-15361946 ] Liwei Lin edited comment on SPARK-16371 at 7/5/16 3:32 AM: --- Hi

[jira] [Commented] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-04 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361946#comment-15361946 ] Liwei Lin commented on SPARK-16371: --- Hi [~maver1ck], I can not reproduce this issue (se

[jira] [Comment Edited] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-04 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361946#comment-15361946 ] Liwei Lin edited comment on SPARK-16371 at 7/5/16 3:30 AM: --- Hi

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-07-04 Thread Takao Magoori (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361940#comment-15361940 ] Takao Magoori commented on SPARK-13587: --- What is the reason why virtualenv is requi

[jira] [Commented] (SPARK-6764) Add wheel package support for PySpark

2016-07-04 Thread Takao Magoori (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361922#comment-15361922 ] Takao Magoori commented on SPARK-6764: -- Sorry all, I have been busy on my projects fo

[jira] [Commented] (SPARK-16342) Add a new Configurable Token Manager for Spark Running on YARN

2016-07-04 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361917#comment-15361917 ] Saisai Shao commented on SPARK-16342: - [~tgraves] [~vanzin] [~ste...@apache.org] woul

[jira] [Updated] (SPARK-16342) Add a new Configurable Token Manager for Spark Running on YARN

2016-07-04 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-16342: Description: Current Spark on YARN token management has some problems: 1. Supported service is har

[jira] [Updated] (SPARK-15968) HiveMetastoreCatalog does not correctly validate partitioned metastore relation when searching the internal table cache

2016-07-04 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-15968: Assignee: Michael Allman > HiveMetastoreCatalog does not correctly validate partitioned metastore

[jira] [Resolved] (SPARK-15968) HiveMetastoreCatalog does not correctly validate partitioned metastore relation when searching the internal table cache

2016-07-04 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-15968. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 13818 [https://githu

[jira] [Issue Comment Deleted] (SPARK-16361) It takes a long time for gc when building cube with many fields

2016-07-04 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lichenglin updated SPARK-16361: --- Comment: was deleted (was: I have set master url in java application. here is a copy from spark mast

[jira] [Commented] (SPARK-16361) It takes a long time for gc when building cube with many fields

2016-07-04 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361869#comment-15361869 ] lichenglin commented on SPARK-16361: I have set master url in java application. here

[jira] [Commented] (SPARK-16361) It takes a long time for gc when building cube with many fields

2016-07-04 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361870#comment-15361870 ] lichenglin commented on SPARK-16361: I have set master url in java application. here

[jira] [Assigned] (SPARK-16372) RowMatrix constructor should use retag for Java compatibility

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16372: Assignee: (was: Apache Spark) > RowMatrix constructor should use retag for Java compat

[jira] [Commented] (SPARK-16372) RowMatrix constructor should use retag for Java compatibility

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361849#comment-15361849 ] Apache Spark commented on SPARK-16372: -- User 'yinxusen' has created a pull request f

[jira] [Assigned] (SPARK-16372) RowMatrix constructor should use retag for Java compatibility

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16372: Assignee: Apache Spark > RowMatrix constructor should use retag for Java compatibility > -

[jira] [Commented] (SPARK-16232) Getting error by making columns using DataFrame

2016-07-04 Thread Inam Ur Rehman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361841#comment-15361841 ] Inam Ur Rehman commented on SPARK-16232: [~srowen] How did you resolved this issu

[jira] [Created] (SPARK-16372) RowMatrix constructor should use retag for Java compatibility

2016-07-04 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-16372: - Summary: RowMatrix constructor should use retag for Java compatibility Key: SPARK-16372 URL: https://issues.apache.org/jira/browse/SPARK-16372 Project: Spark Issu

[jira] [Commented] (SPARK-16372) RowMatrix constructor should use retag for Java compatibility

2016-07-04 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361822#comment-15361822 ] Xusen Yin commented on SPARK-16372: --- SPARK-11497 fixed this for PySpark. > RowMatrix c

[jira] [Updated] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16371: --- Summary: IS NOT NULL clause gives false for nested not empty column (was: IS NOT NULL clause

[jira] [Created] (SPARK-16371) IS NOT NULL clause gives false for nested column

2016-07-04 Thread JIRA
Maciej Bryński created SPARK-16371: -- Summary: IS NOT NULL clause gives false for nested column Key: SPARK-16371 URL: https://issues.apache.org/jira/browse/SPARK-16371 Project: Spark Issue Ty

[jira] [Commented] (SPARK-13645) DAG Diagram not shown properly in Chrome

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-13645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361807#comment-15361807 ] Maciej Bryński commented on SPARK-13645: I have the same problem. > DAG Diagram

[jira] [Assigned] (SPARK-16369) tallSkinnyQR of RowMatrix should aware of empty partition

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16369: Assignee: Apache Spark > tallSkinnyQR of RowMatrix should aware of empty partition > -

[jira] [Commented] (SPARK-16369) tallSkinnyQR of RowMatrix should aware of empty partition

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361806#comment-15361806 ] Apache Spark commented on SPARK-16369: -- User 'yinxusen' has created a pull request f

[jira] [Assigned] (SPARK-16369) tallSkinnyQR of RowMatrix should aware of empty partition

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16369: Assignee: (was: Apache Spark) > tallSkinnyQR of RowMatrix should aware of empty partit

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Labels: newbie wh (was: newbie) > Wheelhouse Support for PySpark > -- > >

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Labels: newbie python python-wheel wheelhouse (was: newbie wh) > Wheelhouse Support for PySpark >

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Description: *Rational* Is it recommended, in order to deploying Scala packages written in Scala, to build big

[jira] [Commented] (SPARK-16363) Spark-submit doesn't work with IAM Roles

2016-07-04 Thread Ashic Mahtab (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361798#comment-15361798 ] Ashic Mahtab commented on SPARK-16363: -- I just tried setting the two exports...still

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Issue Type: New Feature (was: Improvement) > Wheelhouse Support for PySpark > -- >

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Priority: Critical (was: Major) > Spark 2.0 slower than 1.6 when querying nested columns > -

[jira] [Commented] (SPARK-16370) Union queries with side effects should be executed eagerly

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361790#comment-15361790 ] Apache Spark commented on SPARK-16370: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Assigned] (SPARK-16370) Union queries with side effects should be executed eagerly

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16370: Assignee: (was: Apache Spark) > Union queries with side effects should be executed eag

[jira] [Assigned] (SPARK-16370) Union queries with side effects should be executed eagerly

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16370: Assignee: Apache Spark > Union queries with side effects should be executed eagerly >

[jira] [Created] (SPARK-16370) Union queries with side effects should be executed eagerly

2016-07-04 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-16370: - Summary: Union queries with side effects should be executed eagerly Key: SPARK-16370 URL: https://issues.apache.org/jira/browse/SPARK-16370 Project: Spark

[jira] [Updated] (SPARK-7159) Support multiclass logistic regression in spark.ml

2016-07-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7159: --- Shepherd: DB Tsai > Support multiclass logistic regression in spark.ml > -

[jira] [Updated] (SPARK-7159) Support multiclass logistic regression in spark.ml

2016-07-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7159: --- Assignee: Seth Hendrickson (was: DB Tsai) > Support multiclass logistic regression in spark.ml >

[jira] [Commented] (SPARK-10597) MultivariateOnlineSummarizer for weighted instances

2016-07-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361786#comment-15361786 ] DB Tsai commented on SPARK-10597: - I don't have a plan to add it as part of built-in feat

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400 parti

[jira] [Comment Edited] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361781#comment-15361781 ] Maciej Bryński edited comment on SPARK-16320 at 7/4/16 9:17 PM: ---

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400 parti

[jira] [Commented] (SPARK-16363) Spark-submit doesn't work with IAM Roles

2016-07-04 Thread Ashic Mahtab (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361783#comment-15361783 ] Ashic Mahtab commented on SPARK-16363: -- That will possibly work. However, this issue

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361781#comment-15361781 ] Maciej Bryński commented on SPARK-16320: [~rxin] I created benchmark script and a

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400 parti

[jira] [Created] (SPARK-16369) tallSkinnyQR of RowMatrix should aware of empty partition

2016-07-04 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-16369: - Summary: tallSkinnyQR of RowMatrix should aware of empty partition Key: SPARK-16369 URL: https://issues.apache.org/jira/browse/SPARK-16369 Project: Spark Issue Typ

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400 parti

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400 parti

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400 parti

[jira] [Updated] (SPARK-11496) Parallel implementation of personalized pagerank

2016-07-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-11496: Issue Type: New Feature (was: Improvement) > Parallel implementation of personalized pagerank > --

[jira] [Updated] (SPARK-11496) Parallel implementation of personalized pagerank

2016-07-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-11496: Affects Version/s: (was: 1.5.1) 2.1.0 > Parallel implementation of personalized

[jira] [Issue Comment Deleted] (SPARK-15897) Function Registry should just take in FunctionIdentifier for type safety and avoid duplicating

2016-07-04 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandeep Singh updated SPARK-15897: -- Comment: was deleted (was: I'm working on this, will create a PR soon.) > Function Registry sh

[jira] [Resolved] (SPARK-16232) Getting error by making columns using DataFrame

2016-07-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16232. --- Resolution: Invalid > Getting error by making columns using DataFrame >

[jira] [Updated] (SPARK-16353) Intended javadoc options are not honored for Java unidoc

2016-07-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16353: -- Assignee: Michael Allman > Intended javadoc options are not honored for Java unidoc > -

[jira] [Resolved] (SPARK-16353) Intended javadoc options are not honored for Java unidoc

2016-07-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16353. --- Resolution: Fixed Fix Version/s: 2.0.1 1.6.3 Issue resolved by pull request

[jira] [Commented] (SPARK-14815) ML, Graph, R 2.0 QA: Update user guide for new features & APIs

2016-07-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361745#comment-15361745 ] yuhao yang commented on SPARK-14815: This can be closed. Thank you all for the contri

[jira] [Commented] (SPARK-16206) Defining our own folds using CrossValidator

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361737#comment-15361737 ] Rémi Delassus commented on SPARK-16206: --- >You can implement whatever you want to pr

[jira] [Commented] (SPARK-16363) Spark-submit doesn't work with IAM Roles

2016-07-04 Thread sandeep purohit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361698#comment-15361698 ] sandeep purohit commented on SPARK-16363: - Are you set AWS keys before deploying

[jira] [Commented] (SPARK-16368) Strange Errors When Creating View With Unmatched Column Num

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361696#comment-15361696 ] Apache Spark commented on SPARK-16368: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-16368) Strange Errors When Creating View With Unmatched Column Num

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16368: Assignee: (was: Apache Spark) > Strange Errors When Creating View With Unmatched Colum

[jira] [Assigned] (SPARK-16368) Strange Errors When Creating View With Unmatched Column Num

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16368: Assignee: Apache Spark > Strange Errors When Creating View With Unmatched Column Num > ---

[jira] [Created] (SPARK-16368) Strange Errors When Creating View With Unmatched Column Num

2016-07-04 Thread Xiao Li (JIRA)
Xiao Li created SPARK-16368: --- Summary: Strange Errors When Creating View With Unmatched Column Num Key: SPARK-16368 URL: https://issues.apache.org/jira/browse/SPARK-16368 Project: Spark Issue Type

[jira] [Assigned] (SPARK-16354) Negative Inputs In LIMIT or TABLESAMPLE

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16354: Assignee: (was: Apache Spark) > Negative Inputs In LIMIT or TABLESAMPLE >

[jira] [Assigned] (SPARK-16354) Negative Inputs In LIMIT or TABLESAMPLE

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16354: Assignee: Apache Spark > Negative Inputs In LIMIT or TABLESAMPLE > ---

[jira] [Assigned] (SPARK-16355) Incorrect Statistics when Queries Containing LIMIT/TABLESAMPLE 0

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16355: Assignee: (was: Apache Spark) > Incorrect Statistics when Queries Containing LIMIT/TAB

[jira] [Commented] (SPARK-16355) Incorrect Statistics when Queries Containing LIMIT/TABLESAMPLE 0

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361671#comment-15361671 ] Apache Spark commented on SPARK-16355: -- User 'gatorsmile' has created a pull request

[jira] [Commented] (SPARK-16354) Negative Inputs In LIMIT or TABLESAMPLE

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361672#comment-15361672 ] Apache Spark commented on SPARK-16354: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-16355) Incorrect Statistics when Queries Containing LIMIT/TABLESAMPLE 0

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16355: Assignee: Apache Spark > Incorrect Statistics when Queries Containing LIMIT/TABLESAMPLE 0

[jira] [Commented] (SPARK-16364) Allow spark-submit to upload jars to nodes in cluster mode

2016-07-04 Thread Ashic Mahtab (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361556#comment-15361556 ] Ashic Mahtab commented on SPARK-16364: -- Would it be more reasonable to have spark-su

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Description: *Rational* Is it recommended, in order to deploying Scala packages written in Scala, to build big

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Description: *Rational* Is it recommended, in order to deploying Scala packages written in Scala, to build big

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Description: *Rational* Is it recommended, in order to deploying Scala packages written in Scala, to build big

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Description: *Rational* Is it recommended, in order to deploying Scala packages written in Scala, to build big

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Description: *Rational* Is it recommended, in order to deploying Scala packages written in Scala, to build big

[jira] [Assigned] (SPARK-16366) Time comparison failures in SparkR unit tests

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16366: Assignee: (was: Apache Spark) > Time comparison failures in SparkR unit tests > --

[jira] [Assigned] (SPARK-16366) Time comparison failures in SparkR unit tests

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16366: Assignee: Apache Spark > Time comparison failures in SparkR unit tests > -

[jira] [Commented] (SPARK-16366) Time comparison failures in SparkR unit tests

2016-07-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361463#comment-15361463 ] Apache Spark commented on SPARK-16366: -- User 'sun-rui' has created a pull request fo

[jira] [Updated] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Semet updated SPARK-16367: -- Description: *Rational* Is it recommended, in order to deploying Scala packages written in Scala, to build big

[jira] [Created] (SPARK-16367) Wheelhouse Support for PySpark

2016-07-04 Thread Semet (JIRA)
Semet created SPARK-16367: - Summary: Wheelhouse Support for PySpark Key: SPARK-16367 URL: https://issues.apache.org/jira/browse/SPARK-16367 Project: Spark Issue Type: Improvement Components

[jira] [Created] (SPARK-16366) Time comparison failures in SparkR unit tests

2016-07-04 Thread Sun Rui (JIRA)
Sun Rui created SPARK-16366: --- Summary: Time comparison failures in SparkR unit tests Key: SPARK-16366 URL: https://issues.apache.org/jira/browse/SPARK-16366 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6764) Add wheel package support for PySpark

2016-07-04 Thread Semet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361408#comment-15361408 ] Semet commented on SPARK-6764: -- Hello I am working on a new proposal for complete wheel suppo

[jira] [Comment Edited] (SPARK-14810) ML, Graph 2.0 QA: API: Binary incompatible changes

2016-07-04 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15293316#comment-15293316 ] Nick Pentreath edited comment on SPARK-14810 at 7/4/16 2:31 PM: ---

[jira] [Resolved] (SPARK-14810) ML, Graph 2.0 QA: API: Binary incompatible changes

2016-07-04 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-14810. Resolution: Fixed > ML, Graph 2.0 QA: API: Binary incompatible changes > --

[jira] [Commented] (SPARK-13448) Document MLlib behavior changes in Spark 2.0

2016-07-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361370#comment-15361370 ] Yanbo Liang commented on SPARK-13448: - No remaining should be documented, I will reso

[jira] [Resolved] (SPARK-13448) Document MLlib behavior changes in Spark 2.0

2016-07-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-13448. - Resolution: Fixed Fix Version/s: 2.0.0 > Document MLlib behavior changes in Spark 2.0 > --

[jira] [Commented] (SPARK-14815) ML, Graph, R 2.0 QA: Update user guide for new features & APIs

2016-07-04 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361333#comment-15361333 ] Nick Pentreath commented on SPARK-14815: [~yuhaoyan] is this done now? > ML, Gra

  1   2   >