[jira] [Commented] (SPARK-5310) Update SQL programming guide for 1.3

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363024#comment-14363024 ] Cheng Lian commented on SPARK-5310: --- [~marmbrus], [~rxin] and [~yhuai] do we want to

[jira] [Commented] (SPARK-6355) Spark standalone cluster does not support local:/ url for jar file

2015-03-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363084#comment-14363084 ] Sean Owen commented on SPARK-6355: -- Is {{local}} a valid URI scheme? I hadn't seen that

[jira] [Updated] (SPARK-6356) Support the ROLLUP/CUBE/GROUPING SETS/grouping() in SQLContext

2015-03-16 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yadong Qi updated SPARK-6356: - Description: Support for the expression below: ``` GROUP BY expression list WITH ROLLUP GROUP BY

[jira] [Created] (SPARK-6354) Replace the plan which is part of cached query

2015-03-16 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6354: -- Summary: Replace the plan which is part of cached query Key: SPARK-6354 URL: https://issues.apache.org/jira/browse/SPARK-6354 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6301) Unable to load external jars while submitting Spark Job

2015-03-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363077#comment-14363077 ] Sean Owen commented on SPARK-6301: -- [~rajupats91] I closed this as a duplicate of

[jira] [Created] (SPARK-6355) Spark standalone cluster does not support local:/ url for jar file

2015-03-16 Thread Jesper Lundgren (JIRA)
Jesper Lundgren created SPARK-6355: -- Summary: Spark standalone cluster does not support local:/ url for jar file Key: SPARK-6355 URL: https://issues.apache.org/jira/browse/SPARK-6355 Project: Spark

[jira] [Updated] (SPARK-6356) Support the ROLLUP/CUBE/GROUPING SETS/grouping() in SQLContext

2015-03-16 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yadong Qi updated SPARK-6356: - Description: Support for the expression below: ``` GROUP BY expression list WITH ROLLUP GROUP BY

[jira] [Commented] (SPARK-6354) Replace the plan which is part of cached query

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362991#comment-14362991 ] Apache Spark commented on SPARK-6354: - User 'viirya' has created a pull request for

[jira] [Closed] (SPARK-6301) Unable to load external jars while submitting Spark Job

2015-03-16 Thread raju patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] raju patel closed SPARK-6301. - Resolution: Duplicate Unable to load external jars while submitting Spark Job

[jira] [Commented] (SPARK-6319) DISTINCT doesn't work for binary type

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363011#comment-14363011 ] Cheng Lian commented on SPARK-6319: --- SPARK-5553 is somewhat related to this one. By

[jira] [Commented] (SPARK-6356) Support the ROLLUP/CUBE/GROUPING SETS/grouping() in SQLContext

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363137#comment-14363137 ] Apache Spark commented on SPARK-6356: - User 'watermen' has created a pull request for

[jira] [Commented] (SPARK-6358) Spark-submit error when using PYSPARK_PYTHON enviromnental variable

2015-03-16 Thread dustin davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363406#comment-14363406 ] dustin davidson commented on SPARK-6358: [~srowen] I agree that is what it looks

[jira] [Created] (SPARK-6360) For Spark 1.2, after any RDD transformations, calling saveAsParquetFile over a SchemaRDD with decimal column throws

2015-03-16 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-6360: - Summary: For Spark 1.2, after any RDD transformations, calling saveAsParquetFile over a SchemaRDD with decimal column throws Key: SPARK-6360 URL:

[jira] [Commented] (SPARK-5393) Flood of util.RackResolver log messages after SPARK-1714

2015-03-16 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363423#comment-14363423 ] Sandy Ryza commented on SPARK-5393: --- Hi [~djp], there's no special reason we didn't fix

[jira] [Updated] (SPARK-6360) For Spark 1.1 and 1.2, after any RDD transformations, calling saveAsParquetFile over a SchemaRDD with decimal or UDT column throws

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6360: -- Summary: For Spark 1.1 and 1.2, after any RDD transformations, calling saveAsParquetFile over a

[jira] [Commented] (SPARK-6358) Spark-submit error when using PYSPARK_PYTHON enviromnental variable

2015-03-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363411#comment-14363411 ] Sean Owen commented on SPARK-6358: -- What are its permissions? what user owns it and what

[jira] [Updated] (SPARK-6360) For Spark 1.2, after any RDD transformations, calling saveAsParquetFile over a SchemaRDD with decimal or UDT column throws

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6360: -- Summary: For Spark 1.2, after any RDD transformations, calling saveAsParquetFile over a SchemaRDD with

[jira] [Updated] (SPARK-6360) For Spark 1.2, after any RDD transformations, calling saveAsParquetFile over a SchemaRDD with decimal or UDT column throws

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6360: -- Description: Spark shell session for reproduction (use {{:paste}}): {noformat} import

[jira] [Commented] (SPARK-5393) Flood of util.RackResolver log messages after SPARK-1714

2015-03-16 Thread Junping Du (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363451#comment-14363451 ] Junping Du commented on SPARK-5393: --- Thanks [~sandyr] for confirmation on this! I will

[jira] [Commented] (SPARK-5563) LDA with online variational inference

2015-03-16 Thread Matthew Willson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363308#comment-14363308 ] Matthew Willson commented on SPARK-5563: Definitely keen on this! In case it's

[jira] [Commented] (SPARK-6282) Strange Python import error when using random() in a lambda function

2015-03-16 Thread Pavel Laskov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363340#comment-14363340 ] Pavel Laskov commented on SPARK-6282: - Hi Davies, Yes, I was also quite baffled that

[jira] [Commented] (SPARK-6358) Spark-submit error when using PYSPARK_PYTHON enviromnental variable

2015-03-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363356#comment-14363356 ] Sean Owen commented on SPARK-6358: -- That just looks like an error from your environment,

[jira] [Created] (SPARK-6357) Add unapply in EdgeContext

2015-03-16 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-6357: --- Summary: Add unapply in EdgeContext Key: SPARK-6357 URL: https://issues.apache.org/jira/browse/SPARK-6357 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6357) Add unapply in EdgeContext

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363292#comment-14363292 ] Apache Spark commented on SPARK-6357: - User 'maropu' has created a pull request for

[jira] [Commented] (SPARK-6245) jsonRDD() of empty RDD results in exception

2015-03-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363307#comment-14363307 ] Sean Owen commented on SPARK-6245: -- Seems reasonable to me. I had avoided back-porting

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2015-03-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363351#comment-14363351 ] Imran Rashid commented on SPARK-4746: - gonna wait a few days to see if anyone is

[jira] [Created] (SPARK-6358) Spark-submit error when using PYSPARK_PYTHON enviromnental variable

2015-03-16 Thread dustin davidson (JIRA)
dustin davidson created SPARK-6358: -- Summary: Spark-submit error when using PYSPARK_PYTHON enviromnental variable Key: SPARK-6358 URL: https://issues.apache.org/jira/browse/SPARK-6358 Project: Spark

[jira] [Commented] (SPARK-6245) jsonRDD() of empty RDD results in exception

2015-03-16 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363290#comment-14363290 ] Matthew Farrellee commented on SPARK-6245: -- [~srowen] thanks for fixing this.

[jira] [Updated] (SPARK-6345) Model update propagation during prediction in Streaming Regression

2015-03-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6345: - Assignee: Jeremy Freeman Model update propagation during prediction in Streaming Regression

[jira] [Created] (SPARK-6359) Expose IMain binding as part of ILoop Developer API

2015-03-16 Thread Dan McClary (JIRA)
Dan McClary created SPARK-6359: -- Summary: Expose IMain binding as part of ILoop Developer API Key: SPARK-6359 URL: https://issues.apache.org/jira/browse/SPARK-6359 Project: Spark Issue Type:

[jira] [Closed] (SPARK-4746) integration tests should be separated from faster unit tests

2015-03-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid closed SPARK-4746. --- Resolution: Fixed integration tests should be separated from faster unit tests

[jira] [Commented] (SPARK-6226) Support model save/load in Python's KMeans

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363500#comment-14363500 ] Apache Spark commented on SPARK-6226: - User 'mengxr' has created a pull request for

[jira] [Created] (SPARK-6361) Support adding a column with metadata in DataFrames

2015-03-16 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6361: Summary: Support adding a column with metadata in DataFrames Key: SPARK-6361 URL: https://issues.apache.org/jira/browse/SPARK-6361 Project: Spark Issue

[jira] [Commented] (SPARK-5563) LDA with online variational inference

2015-03-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363578#comment-14363578 ] Joseph K. Bradley commented on SPARK-5563: -- [~matthjw] That's a good point: For

[jira] [Comment Edited] (SPARK-6269) Using a different implementation of java array reflection for size estimation

2015-03-16 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363632#comment-14363632 ] Matt Cheah edited comment on SPARK-6269 at 3/16/15 6:12 PM: I

[jira] [Commented] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363629#comment-14363629 ] Xiangrui Meng commented on SPARK-6334: --

[jira] [Updated] (SPARK-2087) Clean Multi-user semantics for thrift JDBC/ODBC server.

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-2087: -- Issue Type: Improvement (was: Bug) Clean Multi-user semantics for thrift JDBC/ODBC server.

[jira] [Commented] (SPARK-6315) SparkSQL 1.3.0 (RC3) fails to read parquet file generated by 1.1.1

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363523#comment-14363523 ] Cheng Lian commented on SPARK-6315: --- Hi [~Michael Davies], really sorry for the trouble!

[jira] [Commented] (SPARK-6358) Spark-submit error when using PYSPARK_PYTHON enviromnental variable

2015-03-16 Thread dustin davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363542#comment-14363542 ] dustin davidson commented on SPARK-6358: Got me. The weird thing is I run the

[jira] [Updated] (SPARK-6362) Broken pipe error when training a RandomForest on a union of two RDDs

2015-03-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6362: --- Component/s: PySpark Broken pipe error when training a RandomForest on a union of two RDDs

[jira] [Commented] (SPARK-6269) Using a different implementation of java array reflection for size estimation

2015-03-16 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363632#comment-14363632 ] Matt Cheah commented on SPARK-6269: --- I updated the code since then so the updated micro

[jira] [Commented] (SPARK-6358) Spark-submit error when using PYSPARK_PYTHON enviromnental variable

2015-03-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363527#comment-14363527 ] Sean Owen commented on SPARK-6358: -- Hm, Spark executors are also run as hdadmin or

[jira] [Commented] (SPARK-6293) SQLContext.implicits should provide automatic conversion for RDD[Row]

2015-03-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363572#comment-14363572 ] Joseph K. Bradley commented on SPARK-6293: -- [~smacat] That issue of failure is

[jira] [Updated] (SPARK-2087) Clean Multi-user semantics for thrift JDBC/ODBC server.

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-2087: -- Affects Version/s: 1.3.0 1.0.2 1.1.1

[jira] [Updated] (SPARK-2087) Clean Multi-user semantics for thrift JDBC/ODBC server.

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-2087: -- Assignee: Cheng Hao Clean Multi-user semantics for thrift JDBC/ODBC server.

[jira] [Comment Edited] (SPARK-6358) Spark-submit error when using PYSPARK_PYTHON enviromnental variable

2015-03-16 Thread dustin davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363498#comment-14363498 ] dustin davidson edited comment on SPARK-6358 at 3/16/15 5:12 PM:

[jira] [Created] (SPARK-6362) Broken pipe error when training a RandomForest on a union of two RDDs

2015-03-16 Thread Pavel Laskov (JIRA)
Pavel Laskov created SPARK-6362: --- Summary: Broken pipe error when training a RandomForest on a union of two RDDs Key: SPARK-6362 URL: https://issues.apache.org/jira/browse/SPARK-6362 Project: Spark

[jira] [Updated] (SPARK-6229) Support SASL encryption in network/common module

2015-03-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6229: --- Summary: Support SASL encryption in network/common module (was: Support encryption in network/common

[jira] [Resolved] (SPARK-5712) Semicolon at end of a comment line

2015-03-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-5712. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4500

[jira] [Commented] (SPARK-6247) Certain self joins cannot be analyzed

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364606#comment-14364606 ] Apache Spark commented on SPARK-6247: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-6379) Support an implicit conversion from udfname to an UDF defined in SQLContext

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364581#comment-14364581 ] Apache Spark commented on SPARK-6379: - User 'maropu' has created a pull request for

[jira] [Commented] (SPARK-6229) Support encryption in network/common module

2015-03-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364555#comment-14364555 ] Reynold Xin commented on SPARK-6229: I haven't read through the various discussions

[jira] [Created] (SPARK-6378) srcAttr in graph.triplets don't update when the size of graph is huge

2015-03-16 Thread zhangzhenyue (JIRA)
zhangzhenyue created SPARK-6378: --- Summary: srcAttr in graph.triplets don't update when the size of graph is huge Key: SPARK-6378 URL: https://issues.apache.org/jira/browse/SPARK-6378 Project: Spark

[jira] [Commented] (SPARK-6304) Checkpointing doesn't retain driver port

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364574#comment-14364574 ] Apache Spark commented on SPARK-6304: - User 'jerryshao' has created a pull request for

[jira] [Commented] (SPARK-6250) Types are now reserved words in DDL parser.

2015-03-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364603#comment-14364603 ] Yin Huai commented on SPARK-6250: - [~nitay] Just a quick update. I tried {code}

[jira] [Updated] (SPARK-6350) Make mesosExecutorCores configurable in mesos fine-grained mode

2015-03-16 Thread Jongyoul Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jongyoul Lee updated SPARK-6350: Summary: Make mesosExecutorCores configurable in mesos fine-grained mode (was: make

[jira] [Commented] (SPARK-6229) Support encryption in network/common module

2015-03-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363898#comment-14363898 ] Marcelo Vanzin commented on SPARK-6229: --- HI [~adav], thanks for the comments. I'm

[jira] [Commented] (SPARK-6319) DISTINCT doesn't work for binary type

2015-03-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364209#comment-14364209 ] Michael Armbrust commented on SPARK-6319: - I'm going to bump this to 1.4.0. I

[jira] [Commented] (SPARK-6362) Broken pipe error when training a RandomForest on a union of two RDDs

2015-03-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364231#comment-14364231 ] Joseph K. Bradley commented on SPARK-6362: -- This may be caused by [SPARK-5973],

[jira] [Created] (SPARK-6365) jetty-security needed for SPARK_PREPEND_CLASSES to work

2015-03-16 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-6365: --- Summary: jetty-security needed for SPARK_PREPEND_CLASSES to work Key: SPARK-6365 URL: https://issues.apache.org/jira/browse/SPARK-6365 Project: Spark Issue

[jira] [Commented] (SPARK-6348) Enable useFeatureScaling in SVMWithSGD

2015-03-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364226#comment-14364226 ] Joseph K. Bradley commented on SPARK-6348: -- Providing this feature sounds good.

[jira] [Created] (SPARK-6368) Build a specialized serializer for Exchange operator.

2015-03-16 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6368: --- Summary: Build a specialized serializer for Exchange operator. Key: SPARK-6368 URL: https://issues.apache.org/jira/browse/SPARK-6368 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6319) DISTINCT doesn't work for binary type

2015-03-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6319: Priority: Major (was: Blocker) DISTINCT doesn't work for binary type

[jira] [Commented] (SPARK-6340) mllib.IDF for LabelPoints

2015-03-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364219#comment-14364219 ] Joseph K. Bradley commented on SPARK-6340: -- It is possible to keep track, as

[jira] [Commented] (SPARK-4556) binary distribution assembly can't run in local mode

2015-03-16 Thread Eric Aldinger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363935#comment-14363935 ] Eric Aldinger commented on SPARK-4556: -- This is also an issue with Spark 1.3.0. I

[jira] [Commented] (SPARK-6366) In Python API, the default save mode for save and saveAsTable should be error instead of append.

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364003#comment-14364003 ] Apache Spark commented on SPARK-6366: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-6319) DISTINCT doesn't work for binary type

2015-03-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364202#comment-14364202 ] Yin Huai commented on SPARK-6319: - Had a discussion with [~marmbrus] on it. We agree that

[jira] [Commented] (SPARK-6345) Model update propagation during prediction in Streaming Regression

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362823#comment-14362823 ] Apache Spark commented on SPARK-6345: - User 'freeman-lab' has created a pull request

[jira] [Created] (SPARK-6349) Add probability estimates in SVMModel predict result

2015-03-16 Thread tanyinyan (JIRA)
tanyinyan created SPARK-6349: Summary: Add probability estimates in SVMModel predict result Key: SPARK-6349 URL: https://issues.apache.org/jira/browse/SPARK-6349 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-3203) ClassNotFoundException in spark-shell with Cassandra

2015-03-16 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362783#comment-14362783 ] Kevin (Sangwoo) Kim commented on SPARK-3203: I've found workaround, in this

[jira] [Reopened] (SPARK-6301) Unable to load external jars while submitting Spark Job

2015-03-16 Thread raju patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] raju patel reopened SPARK-6301: --- Unable to load external jars while submitting Spark Job

[jira] [Commented] (SPARK-6309) Add MatrixUDT to support dense/sparse matrices in DataFrames

2015-03-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362794#comment-14362794 ] Manoj Kumar commented on SPARK-6309: Can this be assigned to me, since this feature

[jira] [Created] (SPARK-6348) Enable useFeatureScaling in SVMWithSGD

2015-03-16 Thread tanyinyan (JIRA)
tanyinyan created SPARK-6348: Summary: Enable useFeatureScaling in SVMWithSGD Key: SPARK-6348 URL: https://issues.apache.org/jira/browse/SPARK-6348 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6313) Fetch File Lock file creation doesnt work when Spark working dir is on a NFS mount

2015-03-16 Thread Nathan McCarthy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362820#comment-14362820 ] Nathan McCarthy commented on SPARK-6313: Thanks for the feedback guys. The config

[jira] [Commented] (SPARK-6350) make mesosExecutorCores configurable in mesos fine-grained mode

2015-03-16 Thread Jongyoul Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362841#comment-14362841 ] Jongyoul Lee commented on SPARK-6350: - Assign it to me, please. make

[jira] [Created] (SPARK-6350) make mesosExecutorCores configurable in mesos fine-grained mode

2015-03-16 Thread Jongyoul Lee (JIRA)
Jongyoul Lee created SPARK-6350: --- Summary: make mesosExecutorCores configurable in mesos fine-grained mode Key: SPARK-6350 URL: https://issues.apache.org/jira/browse/SPARK-6350 Project: Spark

[jira] [Commented] (SPARK-4414) SparkContext.wholeTextFiles Doesn't work with S3 Buckets

2015-03-16 Thread madhukara phatak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362856#comment-14362856 ] madhukara phatak commented on SPARK-4414: - Hi, Just ran your example on my local

[jira] [Commented] (SPARK-5523) TaskMetrics and TaskInfo have innumerable copies of the hostname string

2015-03-16 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362867#comment-14362867 ] Saisai Shao commented on SPARK-5523: After investigating a little on the

[jira] [Commented] (SPARK-3203) ClassNotFoundException in spark-shell with Cassandra

2015-03-16 Thread Kevin (Sangwoo) Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362785#comment-14362785 ] Kevin (Sangwoo) Kim commented on SPARK-3203: BTW, this is an issue on Spark

[jira] [Commented] (SPARK-6313) Fetch File Lock file creation doesnt work when Spark working dir is on a NFS mount

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362818#comment-14362818 ] Apache Spark commented on SPARK-6313: - User 'nemccarthy' has created a pull request

[jira] [Commented] (SPARK-6293) SQLContext.implicits should provide automatic conversion for RDD[Row]

2015-03-16 Thread Chen Song (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362824#comment-14362824 ] Chen Song commented on SPARK-6293: -- The schema is preserved in the row and rdd doesn't

[jira] [Commented] (SPARK-6291) GLM toString should not output full weight vector

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362860#comment-14362860 ] Apache Spark commented on SPARK-6291: - User 'yanboliang' has created a pull request

[jira] [Comment Edited] (SPARK-5523) TaskMetrics and TaskInfo have innumerable copies of the hostname string

2015-03-16 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362867#comment-14362867 ] Saisai Shao edited comment on SPARK-5523 at 3/16/15 7:36 AM: -

[jira] [Created] (SPARK-6351) ParquetRelation2

2015-03-16 Thread Pei-Lun Lee (JIRA)
Pei-Lun Lee created SPARK-6351: -- Summary: ParquetRelation2 Key: SPARK-6351 URL: https://issues.apache.org/jira/browse/SPARK-6351 Project: Spark Issue Type: Bug Components: SQL

[jira] [Updated] (SPARK-6351) ParquetRelation2 does not support paths for different file systems

2015-03-16 Thread Pei-Lun Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pei-Lun Lee updated SPARK-6351: --- Summary: ParquetRelation2 does not support paths for different file systems (was: ParquetRelation2 )

[jira] [Commented] (SPARK-6363) make scala 2.11 default language

2015-03-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363663#comment-14363663 ] Sean Owen commented on SPARK-6363: -- Spark is already cross-built for 2.10 and 2.11, and

[jira] [Comment Edited] (SPARK-6323) Large rank matrix factorization with Nonlinear loss and constraints

2015-03-16 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14360956#comment-14360956 ] Debasish Das edited comment on SPARK-6323 at 3/16/15 6:30 PM: --

[jira] [Resolved] (SPARK-6330) newParquetRelation gets incorrect FileSystem

2015-03-16 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson resolved SPARK-6330. --- Resolution: Fixed Fix Version/s: 1.3.1 Assignee: Volodymyr Lyubinets

[jira] [Updated] (SPARK-6330) newParquetRelation gets incorrect FileSystem

2015-03-16 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson updated SPARK-6330: -- Fix Version/s: 1.4.0 newParquetRelation gets incorrect FileSystem

[jira] [Created] (SPARK-6363) make scala 2.11 default language

2015-03-16 Thread antonkulaga (JIRA)
antonkulaga created SPARK-6363: -- Summary: make scala 2.11 default language Key: SPARK-6363 URL: https://issues.apache.org/jira/browse/SPARK-6363 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-6364) hashCode and equals for Matrices

2015-03-16 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6364: Summary: hashCode and equals for Matrices Key: SPARK-6364 URL: https://issues.apache.org/jira/browse/SPARK-6364 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-6245) jsonRDD() of empty RDD results in exception

2015-03-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6245: - Fix Version/s: 1.3.1 jsonRDD() of empty RDD results in exception

[jira] [Resolved] (SPARK-6351) ParquetRelation2 does not support paths for different file systems

2015-03-16 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson resolved SPARK-6351. --- Resolution: Duplicate Fix Version/s: 1.3.1 Assignee: Volodymyr Lyubinets

[jira] [Commented] (SPARK-6304) Checkpointing doesn't retain driver port

2015-03-16 Thread Marius Soutier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363047#comment-14363047 ] Marius Soutier commented on SPARK-6304: --- Yeah but if the user doesn't set the port,

[jira] [Updated] (SPARK-6356) Support the ROLLUP/CUBE/GROUPING SETS/grouping() in SQLContext

2015-03-16 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yadong Qi updated SPARK-6356: - Description: Support for the expression below: ` GROUP BY expression list WITH ROLLUP GROUP BY expression

[jira] [Commented] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-03-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363767#comment-14363767 ] Manoj Kumar commented on SPARK-6192: [~mengxr] Google Summer of Code applications are

[jira] [Resolved] (SPARK-6077) Multiple spark streaming tabs on UI when reuse the same sparkcontext

2015-03-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-6077. -- Resolution: Fixed Fix Version/s: 1.3.1 1.4.0 Multiple spark

[jira] [Updated] (SPARK-6346) Use faster converging optimization method in MLlib

2015-03-16 Thread Reza Zadeh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reza Zadeh updated SPARK-6346: -- Description: According to experiments in SPARK-1503, the LBFGS algorithm converges much faster than

[jira] [Commented] (SPARK-6304) Checkpointing doesn't retain driver port

2015-03-16 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362907#comment-14362907 ] Saisai Shao commented on SPARK-6304: Hi [~msoutier], seldom user will set this

[jira] [Commented] (SPARK-6351) ParquetRelation2 does not support paths for different file systems

2015-03-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362908#comment-14362908 ] Apache Spark commented on SPARK-6351: - User 'ypcat' has created a pull request for

<    1   2   3   >