[jira] [Commented] (SPARK-8866) Use 1 microsecond (us) precision for TimestampType

2015-07-07 Thread Yijie Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616290#comment-14616290 ] Yijie Shen commented on SPARK-8866: --- I'll take this one. > Use 1 microsecond (us) preci

[jira] [Commented] (SPARK-8864) Date/time function and data type design

2015-07-07 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616293#comment-14616293 ] Adrian Wang commented on SPARK-8864: Then we are using a Long for us. Long can be up t

[jira] [Updated] (SPARK-8866) Use 1 microsecond (us) precision for TimestampType

2015-07-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8866: --- Assignee: Yijie Shen > Use 1 microsecond (us) precision for TimestampType > --

[jira] [Commented] (SPARK-8864) Date/time function and data type design

2015-07-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616296#comment-14616296 ] Reynold Xin commented on SPARK-8864: Are you suggesting we use a single 8 byte long to

[jira] [Commented] (SPARK-8864) Date/time function and data type design

2015-07-07 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616299#comment-14616299 ] Adrian Wang commented on SPARK-8864: no, that's not enough. > Date/time function and

[jira] [Commented] (SPARK-8864) Date/time function and data type design

2015-07-07 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616301#comment-14616301 ] Adrian Wang commented on SPARK-8864: just provide the precise of current design for yo

[jira] [Comment Edited] (SPARK-8864) Date/time function and data type design

2015-07-07 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616293#comment-14616293 ] Adrian Wang edited comment on SPARK-8864 at 7/7/15 7:34 AM: Th

[jira] [Commented] (SPARK-8685) dataframe left joins are not working as expected in pyspark

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616340#comment-14616340 ] Apache Spark commented on SPARK-8685: - User 'davies' has created a pull request for th

[jira] [Updated] (SPARK-6912) Throw an AnalysisException when unsupported Java Map types used in Hive UDF

2015-07-07 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-6912: Summary: Throw an AnalysisException when unsupported Java Map types used in Hive UDF (was:

[jira] [Commented] (SPARK-6912) Throw an AnalysisException when unsupported Java Map types used in Hive UDF

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616346#comment-14616346 ] Apache Spark commented on SPARK-6912: - User 'maropu' has created a pull request for th

[jira] [Assigned] (SPARK-6912) Throw an AnalysisException when unsupported Java Map types used in Hive UDF

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6912: --- Assignee: Apache Spark > Throw an AnalysisException when unsupported Java Map types used in H

[jira] [Assigned] (SPARK-6912) Throw an AnalysisException when unsupported Java Map types used in Hive UDF

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6912: --- Assignee: (was: Apache Spark) > Throw an AnalysisException when unsupported Java Map type

[jira] [Updated] (SPARK-8223) math function: shiftleft

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8223: - Assignee: Tarek Auel (was: zhichao-li) > math function: shiftleft > > >

[jira] [Updated] (SPARK-8224) math function: shiftright

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8224: - Assignee: Tarek Auel (was: zhichao-li) > math function: shiftright > - > >

[jira] [Commented] (SPARK-6487) Add sequential pattern mining algorithm to Spark MLlib

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616354#comment-14616354 ] Apache Spark commented on SPARK-6487: - User 'zhangjiajin' has created a pull request f

[jira] [Assigned] (SPARK-6487) Add sequential pattern mining algorithm to Spark MLlib

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6487: --- Assignee: Zhang JiaJin (was: Apache Spark) > Add sequential pattern mining algorithm to Spar

[jira] [Assigned] (SPARK-6487) Add sequential pattern mining algorithm to Spark MLlib

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6487: --- Assignee: Apache Spark (was: Zhang JiaJin) > Add sequential pattern mining algorithm to Spar

[jira] [Created] (SPARK-8867) Show the UDF usage for user.

2015-07-07 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-8867: Summary: Show the UDF usage for user. Key: SPARK-8867 URL: https://issues.apache.org/jira/browse/SPARK-8867 Project: Spark Issue Type: Task Components: SQL

[jira] [Commented] (SPARK-8867) Show the UDF usage for user.

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616357#comment-14616357 ] Apache Spark commented on SPARK-8867: - User 'chenghao-intel' has created a pull reques

[jira] [Assigned] (SPARK-8867) Show the UDF usage for user.

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8867: --- Assignee: Apache Spark > Show the UDF usage for user. > > >

[jira] [Assigned] (SPARK-8867) Show the UDF usage for user.

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8867: --- Assignee: (was: Apache Spark) > Show the UDF usage for user. > --

[jira] [Updated] (SPARK-8865) Fix bug: init SimpleConsumerConfig with kafka params

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8865: - Description: (was: [~guowei2] Again, please read https://cwiki.apache.org/confluence/display/SPARK/Con

[jira] [Updated] (SPARK-8865) Fix bug: init SimpleConsumerConfig with kafka params

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8865: - Priority: Minor (was: Major) Fix Version/s: (was: 1.4.0) Description: [~guowei2] Again,

[jira] [Created] (SPARK-8868) SqlSerializer2 can go into infinite loop when row consists only of NullType columns

2015-07-07 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-8868: - Summary: SqlSerializer2 can go into infinite loop when row consists only of NullType columns Key: SPARK-8868 URL: https://issues.apache.org/jira/browse/SPARK-8868 Project:

[jira] [Updated] (SPARK-7884) Move block deserialization from BlockStoreShuffleFetcher to ShuffleReader

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7884: - Assignee: Matt Massie > Move block deserialization from BlockStoreShuffleFetcher to ShuffleReader > --

[jira] [Updated] (SPARK-8019) [SparkR] Create worker R processes with a command other then Rscript

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8019: - Assignee: Michael Sannella > [SparkR] Create worker R processes with a command other then Rscript > --

[jira] [Updated] (SPARK-8649) Mapr repository is not defined properly

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8649: - Assignee: Ashok Kumar > Mapr repository is not defined properly > ---

[jira] [Updated] (SPARK-8639) Instructions for executing jekyll in docs/README.md could be slightly more clear, typo in docs/api.md

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8639: - Assignee: Rosstin Murphy > Instructions for executing jekyll in docs/README.md could be slightly more > c

[jira] [Updated] (SPARK-8754) YarnClientSchedulerBackend doesn't stop gracefully in failure conditions

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8754: - Assignee: Devaraj K > YarnClientSchedulerBackend doesn't stop gracefully in failure conditions > -

[jira] [Updated] (SPARK-8726) Wrong spark.executor.memory when using different EC2 master and worker machine types

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8726: - Assignee: Stefano Parmesan > Wrong spark.executor.memory when using different EC2 master and worker > mac

[jira] [Updated] (SPARK-8851) in Yarn client mode, Client.scala does not login even when credentials are specified

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8851: - Component/s: YARN [~hshreedharan] component please > in Yarn client mode, Client.scala does not login eve

[jira] [Commented] (SPARK-7944) Spark-Shell 2.11 1.4.0-RC-03 does not add jars to class path

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616480#comment-14616480 ] Sean Owen commented on SPARK-7944: -- [~brkyvz] Can you comment on the PR? https://github.c

[jira] [Closed] (SPARK-8039) OOM when using DataFrame join operation

2015-07-07 Thread Will Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Chen closed SPARK-8039. Resolution: Fixed > OOM when using DataFrame join operation > --- > >

[jira] [Commented] (SPARK-6731) Upgrade Apache commons-math3 to 3.4.1

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616504#comment-14616504 ] Apache Spark commented on SPARK-6731: - User 'srowen' has created a pull request for th

[jira] [Commented] (SPARK-8840) Float type coercion with hiveContext

2015-07-07 Thread Evgeny SInelnikov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616528#comment-14616528 ] Evgeny SInelnikov commented on SPARK-8840: -- No, this work in R (sparkR shell from

[jira] [Comment Edited] (SPARK-8840) Float type coercion with hiveContext

2015-07-07 Thread Evgeny SInelnikov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616528#comment-14616528 ] Evgeny SInelnikov edited comment on SPARK-8840 at 7/7/15 11:11 AM: -

[jira] [Created] (SPARK-8869) DataFrameWriter save action makes DataFrameReader load failed

2015-07-07 Thread Hao Ren (JIRA)
Hao Ren created SPARK-8869: -- Summary: DataFrameWriter save action makes DataFrameReader load failed Key: SPARK-8869 URL: https://issues.apache.org/jira/browse/SPARK-8869 Project: Spark Issue Type:

[jira] [Updated] (SPARK-8869) DataFrameWriter save action makes DataFrameReader load failed

2015-07-07 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hao Ren updated SPARK-8869: --- Description: Given the following code, the action is save on DataFrame writer. However, it blocks, no errors,

[jira] [Commented] (SPARK-8596) Install and configure RStudio server on Spark EC2

2015-07-07 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616616#comment-14616616 ] Vincent Warmerdam commented on SPARK-8596: -- made the changes. confirmed that rstu

[jira] [Resolved] (SPARK-8788) Java unit test for PCA transformer

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8788. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7184 [https://githu

[jira] [Updated] (SPARK-8788) Java unit test for PCA transformer

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8788: - Assignee: Yanbo Liang > Java unit test for PCA transformer > -- >

[jira] [Updated] (SPARK-8788) Java unit test for PCA transformer

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8788: - Target Version/s: 1.5.0 > Java unit test for PCA transformer > --

[jira] [Resolved] (SPARK-8570) Improve MLlib Local Matrix Documentation.

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8570. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6958 [https://githu

[jira] [Created] (SPARK-8870) Use SQLContext.getOrCreate in model save/load

2015-07-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8870: Summary: Use SQLContext.getOrCreate in model save/load Key: SPARK-8870 URL: https://issues.apache.org/jira/browse/SPARK-8870 Project: Spark Issue Type: Impro

[jira] [Updated] (SPARK-8445) MLlib 1.5 Roadmap

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8445: - Description: We expect to see many MLlib contributors for the 1.5 release. To scale out the devel

[jira] [Updated] (SPARK-8870) Use SQLContext.getOrCreate in model save/load

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8870: - Remaining Estimate: 2h Original Estimate: 2h > Use SQLContext.getOrCreate in model save/load

[jira] [Updated] (SPARK-8744) StringIndexerModel should have public constructor

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8744: - Target Version/s: 1.5.0 > StringIndexerModel should have public constructor >

[jira] [Updated] (SPARK-8445) MLlib 1.5 Roadmap

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8445: - Description: We expect to see many MLlib contributors for the 1.5 release. To scale out the devel

[jira] [Updated] (SPARK-8445) MLlib 1.5 Roadmap

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8445: - Description: We expect to see many MLlib contributors for the 1.5 release. To scale out the devel

[jira] [Created] (SPARK-8871) Add maximal frequent itemsets filter in Spark MLib FPGrowth

2015-07-07 Thread Jonathan Svirsky (JIRA)
Jonathan Svirsky created SPARK-8871: --- Summary: Add maximal frequent itemsets filter in Spark MLib FPGrowth Key: SPARK-8871 URL: https://issues.apache.org/jira/browse/SPARK-8871 Project: Spark

[jira] [Created] (SPARK-8872) Improve FPGrowthSuite with equivalent R code

2015-07-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-8872: Summary: Improve FPGrowthSuite with equivalent R code Key: SPARK-8872 URL: https://issues.apache.org/jira/browse/SPARK-8872 Project: Spark Issue Type: Improv

[jira] [Updated] (SPARK-8872) Improve FPGrowthSuite with equivalent R code

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8872: - Description: In `FPGrowthSuite`, we only tested output with minSupport 0.5, where the expected out

[jira] [Assigned] (SPARK-8868) SqlSerializer2 can go into infinite loop when row consists only of NullType columns

2015-07-07 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai reassigned SPARK-8868: --- Assignee: Yin Huai > SqlSerializer2 can go into infinite loop when row consists only of NullType > c

[jira] [Resolved] (SPARK-8711) Add additional methods to JavaModel wrappers in trees

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8711. -- Resolution: Fixed Fix Version/s: 1.5.0 > Add additional methods to JavaModel wrappers in

[jira] [Comment Edited] (SPARK-8596) Install and configure RStudio server on Spark EC2

2015-07-07 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616616#comment-14616616 ] Vincent Warmerdam edited comment on SPARK-8596 at 7/7/15 3:59 PM: --

[jira] [Resolved] (SPARK-8823) Optimizations for sparse vector products in pyspark.mllib.linalg

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8823. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7222 [https://githu

[jira] [Updated] (SPARK-8871) Add maximal frequent itemsets filter in Spark MLib FPGrowth

2015-07-07 Thread Jonathan Svirsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Svirsky updated SPARK-8871: Description: Maximal frequent itemsets can be exctracted as all root-to-leaf paths(sets) fro

[jira] [Commented] (SPARK-8840) Float type coercion with hiveContext

2015-07-07 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616918#comment-14616918 ] Shivaram Venkataraman commented on SPARK-8840: -- Sorry my question is can you

[jira] [Commented] (SPARK-8596) Install and configure RStudio server on Spark EC2

2015-07-07 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616924#comment-14616924 ] Shivaram Venkataraman commented on SPARK-8596: -- You can test this by launchin

[jira] [Commented] (SPARK-8868) SqlSerializer2 can go into infinite loop when row consists only of NullType columns

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14616933#comment-14616933 ] Apache Spark commented on SPARK-8868: - User 'yhuai' has created a pull request for thi

[jira] [Assigned] (SPARK-8868) SqlSerializer2 can go into infinite loop when row consists only of NullType columns

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8868: --- Assignee: Apache Spark (was: Yin Huai) > SqlSerializer2 can go into infinite loop when row c

[jira] [Updated] (SPARK-8868) SqlSerializer2 can go into infinite loop when row consists only of NullType columns

2015-07-07 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-8868: Shepherd: Josh Rosen > SqlSerializer2 can go into infinite loop when row consists only of NullType > column

[jira] [Updated] (SPARK-8868) SqlSerializer2 can go into infinite loop when row consists only of NullType columns

2015-07-07 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-8868: Target Version/s: 1.5.0 > SqlSerializer2 can go into infinite loop when row consists only of NullType > col

[jira] [Assigned] (SPARK-8868) SqlSerializer2 can go into infinite loop when row consists only of NullType columns

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8868: --- Assignee: Yin Huai (was: Apache Spark) > SqlSerializer2 can go into infinite loop when row c

[jira] [Resolved] (SPARK-8821) The ec2 script doesn't run on python 3 with an utf8 env

2015-07-07 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-8821. -- Resolution: Fixed Fix Version/s: 1.5.0 1.4.2 Issue res

[jira] [Updated] (SPARK-8821) The ec2 script doesn't run on python 3 with an utf8 env

2015-07-07 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-8821: - Assignee: Simon Hafner > The ec2 script doesn't run on python 3 with an utf8 env >

[jira] [Created] (SPARK-8873) Support cleaning up shuffle files for drivers launched with Mesos

2015-07-07 Thread Timothy Chen (JIRA)
Timothy Chen created SPARK-8873: --- Summary: Support cleaning up shuffle files for drivers launched with Mesos Key: SPARK-8873 URL: https://issues.apache.org/jira/browse/SPARK-8873 Project: Spark

[jira] [Updated] (SPARK-6485) Add CoordinateMatrix/RowMatrix/IndexedRowMatrix in PySpark

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6485: - Assignee: (was: Manoj Kumar) > Add CoordinateMatrix/RowMatrix/IndexedRowMatrix in PySpark > --

[jira] [Updated] (SPARK-6485) Add CoordinateMatrix/RowMatrix/IndexedRowMatrix in PySpark

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6485: - Assignee: Manoj Kumar > Add CoordinateMatrix/RowMatrix/IndexedRowMatrix in PySpark > -

[jira] [Commented] (SPARK-6485) Add CoordinateMatrix/RowMatrix/IndexedRowMatrix in PySpark

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617015#comment-14617015 ] Xiangrui Meng commented on SPARK-6485: -- [~mwdus...@us.ibm.com] Thanks for working on

[jira] [Comment Edited] (SPARK-6485) Add CoordinateMatrix/RowMatrix/IndexedRowMatrix in PySpark

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617015#comment-14617015 ] Xiangrui Meng edited comment on SPARK-6485 at 7/7/15 5:20 PM: --

[jira] [Updated] (SPARK-8704) Add missing methods in StandardScaler

2015-07-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-8704: --- Summary: Add missing methods in StandardScaler (was: Add additional methods to wrappers in ml.pyspark

[jira] [Updated] (SPARK-8704) Add missing methods in StandardScaler (ML and PySpark)

2015-07-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-8704: --- Summary: Add missing methods in StandardScaler (ML and PySpark) (was: Add missing methods in Standard

[jira] [Commented] (SPARK-8385) java.lang.UnsupportedOperationException: Not implemented by the TFS FileSystem implementation

2015-07-07 Thread Jerrick Hoang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617067#comment-14617067 ] Jerrick Hoang commented on SPARK-8385: -- [~angel2014] I'm running into the same issues

[jira] [Updated] (SPARK-8839) Thrift Sever will throw `java.util.NoSuchElementException: key not found` exception when many clients connect it

2015-07-07 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8839: -- Shepherd: Yi Tian > Thrift Sever will throw `java.util.NoSuchElementException: key not found` > excepti

[jira] [Commented] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-07 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617115#comment-14617115 ] Matt Cheah commented on SPARK-7917: --- [~sowen] was there a patch specifically written in

[jira] [Commented] (SPARK-8872) Improve FPGrowthSuite with equivalent R code

2015-07-07 Thread Kashif Rasul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617123#comment-14617123 ] Kashif Rasul commented on SPARK-8872: - I would like to work on this. > Improve FPGrow

[jira] [Resolved] (SPARK-8559) Support association rule generation in FPGrowth

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8559. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7005 [https://githu

[jira] [Commented] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617159#comment-14617159 ] Sean Owen commented on SPARK-7917: -- I'm thinking of the two I mentioned above, in particu

[jira] [Updated] (SPARK-7555) User guide update for ElasticNet

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7555: - Shepherd: Joseph K. Bradley > User guide update for ElasticNet >

[jira] [Commented] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-07 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617166#comment-14617166 ] Matt Cheah commented on SPARK-7917: --- Definitely not 7503 - the PR there only did things

[jira] [Comment Edited] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-07 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617166#comment-14617166 ] Matt Cheah edited comment on SPARK-7917 at 7/7/15 6:45 PM: --- Defi

[jira] [Updated] (SPARK-7422) Add argmax to Vector, SparseVector

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7422: - Shepherd: Xiangrui Meng > Add argmax to Vector, SparseVector > --

[jira] [Updated] (SPARK-8674) [WIP] 2-sample, 2-sided Kolmogorov Smirnov Test Implementation

2015-07-07 Thread Jose Cambronero (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jose Cambronero updated SPARK-8674: --- Summary: [WIP] 2-sample, 2-sided Kolmogorov Smirnov Test Implementation (was: 2-sample, 2-sid

[jira] [Commented] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617177#comment-14617177 ] Sean Owen commented on SPARK-7917: -- Right, this is about standalone. There's https://git

[jira] [Commented] (SPARK-8743) Deregister Codahale metrics for streaming when StreamingContext is closed

2015-07-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617184#comment-14617184 ] Juan Rodríguez Hortalá commented on SPARK-8743: --- Hi, I guess you already h

[jira] [Commented] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-07 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617202#comment-14617202 ] Matt Cheah commented on SPARK-7917: --- Just wanted to clarify: Worker shutdown, or executo

[jira] [Updated] (SPARK-8704) Add missing methods in StandardScaler (ML and PySpark)

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8704: - Description: Add std, mean to StandardScalerModel (was: std, mean to StandardScalerModel ~~getVec

[jira] [Resolved] (SPARK-8704) Add missing methods in StandardScaler (ML and PySpark)

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8704. -- Resolution: Fixed Assignee: Manoj Kumar Fix Version/s: 1.5.0 Target

[jira] [Updated] (SPARK-8704) Add missing methods in StandardScaler (ML and PySpark)

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8704: - Description: std, mean to StandardScalerModel ~~getVectors, findSynonyms to Word2Vec Model~~ ~~set

[jira] [Created] (SPARK-8874) Add missing methods in Word2Vec ML

2015-07-07 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-8874: -- Summary: Add missing methods in Word2Vec ML Key: SPARK-8874 URL: https://issues.apache.org/jira/browse/SPARK-8874 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-8840) Float type coercion with hiveContext

2015-07-07 Thread Evgeny SInelnikov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617267#comment-14617267 ] Evgeny SInelnikov commented on SPARK-8840: -- I tested it on SparkSQL - problem not

[jira] [Updated] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5016: - Shepherd: Xiangrui Meng > GaussianMixtureEM should distribute matrix inverse for large numFeatures

[jira] [Updated] (SPARK-7879) KMeans API for spark.ml Pipelines

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7879: - Priority: Critical (was: Major) > KMeans API for spark.ml Pipelines > ---

[jira] [Updated] (SPARK-8484) Add TrainValidationSplit to ml.tuning

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8484: - Priority: Critical (was: Major) > Add TrainValidationSplit to ml.tuning > ---

[jira] [Updated] (SPARK-6487) Add sequential pattern mining algorithm to Spark MLlib

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6487: - Priority: Critical (was: Major) > Add sequential pattern mining algorithm to Spark MLlib > --

[jira] [Updated] (SPARK-6517) Implement the Algorithm of Hierarchical Clustering

2015-07-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6517: - Priority: Critical (was: Major) > Implement the Algorithm of Hierarchical Clustering > --

[jira] [Commented] (SPARK-7917) Spark doesn't clean up Application Directories (local dirs)

2015-07-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617278#comment-14617278 ] Sean Owen commented on SPARK-7917: -- Oops I mean executor. At least, I'm looking at Utils.

[jira] [Updated] (SPARK-8874) Add missing methods in Word2Vec ML

2015-07-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-8874: --- Component/s: PySpark ML > Add missing methods in Word2Vec ML > --

[jira] [Commented] (SPARK-8874) Add missing methods in Word2Vec ML

2015-07-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617282#comment-14617282 ] Apache Spark commented on SPARK-8874: - User 'MechCoder' has created a pull request for

  1   2   3   >