[jira] [Resolved] (SPARK-4683) Add a beeline.cmd to run on Windows

2014-12-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4683. Resolution: Fixed Fix Version/s: 1.2.0 > Add a beeline.cmd to run on Windows > --

[jira] [Commented] (SPARK-4747) Move JobProgressListener out of org.apache.spark.ui.jobs

2014-12-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234434#comment-14234434 ] Marcelo Vanzin commented on SPARK-4747: --- I don't really have a recommendation aside

[jira] [Commented] (SPARK-4739) spark.files.userClassPathFirst does not work in local[*] mode

2014-12-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234433#comment-14234433 ] Marcelo Vanzin commented on SPARK-4739: --- BTW my fix for SPARK-2996 (https://github.c

[jira] [Commented] (SPARK-4747) Move JobProgressListener out of org.apache.spark.ui.jobs

2014-12-04 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234422#comment-14234422 ] Ryan Williams commented on SPARK-4747: -- [~vanzin] let me know what package you think

[jira] [Created] (SPARK-4747) Move JobProgressListener out of org.apache.spark.ui.jobs

2014-12-04 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-4747: Summary: Move JobProgressListener out of org.apache.spark.ui.jobs Key: SPARK-4747 URL: https://issues.apache.org/jira/browse/SPARK-4747 Project: Spark Issue

[jira] [Commented] (SPARK-546) Support full outer join and multiple join in a single shuffle

2014-12-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234404#comment-14234404 ] Reynold Xin commented on SPARK-546: --- Actually my experience implementing full join in a s

[jira] [Commented] (SPARK-4727) Add "dimensional" RDDs (time series, spatial)

2014-12-04 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234399#comment-14234399 ] RJ Nowling commented on SPARK-4727: --- Thanks, Jeremy! Your work may cover my needs, and

[jira] [Created] (SPARK-4746) integration tests should be separated from faster unit tests

2014-12-04 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-4746: --- Summary: integration tests should be separated from faster unit tests Key: SPARK-4746 URL: https://issues.apache.org/jira/browse/SPARK-4746 Project: Spark I

[jira] [Commented] (SPARK-546) Support full outer join and multiple join in a single shuffle

2014-12-04 Thread Thiago Souza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234383#comment-14234383 ] Thiago Souza commented on SPARK-546: What about #2? Did you file a new ticket? I'm qui

[jira] [Commented] (SPARK-4616) SPARK_CONF_DIR is not effective in spark-submit

2014-12-04 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234347#comment-14234347 ] Brennon York commented on SPARK-4616: - [~pwendell] could you review this? Since this a

[jira] [Commented] (SPARK-4298) The spark-submit cannot read Main-Class from Manifest.

2014-12-04 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234346#comment-14234346 ] Brennon York commented on SPARK-4298: - [~pwendell] could you take a look at this? This

[jira] [Commented] (SPARK-4181) Create separate options to control the client-mode AM resource allocation request

2014-12-04 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234333#comment-14234333 ] Thomas Graves commented on SPARK-4181: -- ok. as you discovered extraJavaOptions and po

[jira] [Commented] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-04 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234307#comment-14234307 ] Yana Kadiyska commented on SPARK-4702: -- Just confirming that https://github.com/apach

[jira] [Commented] (SPARK-4181) Create separate options to control the client-mode AM resource allocation request

2014-12-04 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234305#comment-14234305 ] WangTaoTheTonic commented on SPARK-4181: Maybe I didn't describe exactly here. Wha

[jira] [Commented] (SPARK-4181) Create separate options to control the client-mode AM resource allocation request

2014-12-04 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234289#comment-14234289 ] Thomas Graves commented on SPARK-4181: -- What exactly is the change you are proposing

[jira] [Commented] (SPARK-1010) Update all unit tests to use SparkConf instead of system properties

2014-12-04 Thread liu chang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234272#comment-14234272 ] liu chang commented on SPARK-1010: -- please assign to me, I will fix it. > Update all uni

[jira] [Commented] (SPARK-4727) Add "dimensional" RDDs (time series, spatial)

2014-12-04 Thread Jeremy Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234269#comment-14234269 ] Jeremy Freeman commented on SPARK-4727: --- Great to brainstorm about this RJ! To som

[jira] [Commented] (SPARK-4745) get_existing_cluster() doesn't work with additional security groups

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234240#comment-14234240 ] Apache Spark commented on SPARK-4745: - User 'alexdebrie' has created a pull request fo

[jira] [Created] (SPARK-4745) get_existing_cluster() doesn't work with additional security groups

2014-12-04 Thread Alex DeBrie (JIRA)
Alex DeBrie created SPARK-4745: -- Summary: get_existing_cluster() doesn't work with additional security groups Key: SPARK-4745 URL: https://issues.apache.org/jira/browse/SPARK-4745 Project: Spark

[jira] [Updated] (SPARK-4740) Netty's network bandwidth is much lower than NIO in spark-perf and Netty takes longer running time

2014-12-04 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhang, Liye updated SPARK-4740: --- Attachment: Spark-perf Test Report.pdf > Netty's network bandwidth is much lower than NIO in spark-per

[jira] [Commented] (SPARK-1953) yarn client mode Application Master memory size is same as driver memory size

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234212#comment-14234212 ] Apache Spark commented on SPARK-1953: - User 'WangTaoTheTonic' has created a pull reque

[jira] [Commented] (SPARK-2188) Support sbt/sbt for Windows

2014-12-04 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234209#comment-14234209 ] Masayoshi TSUZUKI commented on SPARK-2188: -- We have some bugs reported on JIRA ab

[jira] [Commented] (SPARK-4744) Short Circuit evaluation for AND & OR in code gen

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234182#comment-14234182 ] Apache Spark commented on SPARK-4744: - User 'chenghao-intel' has created a pull reques

[jira] [Created] (SPARK-4744) Short Circuit evaluation for AND & OR in code gen

2014-12-04 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-4744: Summary: Short Circuit evaluation for AND & OR in code gen Key: SPARK-4744 URL: https://issues.apache.org/jira/browse/SPARK-4744 Project: Spark Issue Type: Improveme

[jira] [Commented] (SPARK-4743) Use SparkEnv.serializer instead of closureSerializer in aggregateByKey and foldByKey

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234165#comment-14234165 ] Apache Spark commented on SPARK-4743: - User 'IvanVergiliev' has created a pull request

[jira] [Created] (SPARK-4743) Use SparkEnv.serializer instead of closureSerializer in aggregateByKey and foldByKey

2014-12-04 Thread Ivan Vergiliev (JIRA)
Ivan Vergiliev created SPARK-4743: - Summary: Use SparkEnv.serializer instead of closureSerializer in aggregateByKey and foldByKey Key: SPARK-4743 URL: https://issues.apache.org/jira/browse/SPARK-4743

[jira] [Commented] (SPARK-4735) Spark SQL UDF doesn't support 0 arguments.

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234153#comment-14234153 ] Apache Spark commented on SPARK-4735: - User 'potix2' has created a pull request for th

[jira] [Commented] (SPARK-4734) [Streaming]limit the file Dstream size for each batch

2014-12-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234146#comment-14234146 ] Sean Owen commented on SPARK-4734: -- I don't quite understand this suggestion. In general,

[jira] [Commented] (SPARK-2188) Support sbt/sbt for Windows

2014-12-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234144#comment-14234144 ] Sean Owen commented on SPARK-2188: -- I tend to agree, the build complexity is very high al

[jira] [Commented] (SPARK-4726) NotSerializableException thrown on SystemDefaultHttpClient with stack not related to my functions

2014-12-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234142#comment-14234142 ] Sean Owen commented on SPARK-4726: -- You can use it, you just can't serialize these object

[jira] [Commented] (SPARK-4494) IDFModel.transform() add support for single vector

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234091#comment-14234091 ] Apache Spark commented on SPARK-4494: - User 'yu-iskw' has created a pull request for t

[jira] [Commented] (SPARK-4742) The name of Parquet File generated by AppendingParquetOutputFormat should be zero padded

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234081#comment-14234081 ] Apache Spark commented on SPARK-4742: - User 'sasakitoa' has created a pull request for

[jira] [Created] (SPARK-4742) The name of Parquet File generated by AppendingParquetOutputFormat should be zero padded

2014-12-04 Thread Sasaki Toru (JIRA)
Sasaki Toru created SPARK-4742: -- Summary: The name of Parquet File generated by AppendingParquetOutputFormat should be zero padded Key: SPARK-4742 URL: https://issues.apache.org/jira/browse/SPARK-4742 Pr

[jira] [Resolved] (SPARK-4575) Documentation for the pipeline features

2014-12-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4575. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3588 [https://githu

[jira] [Updated] (SPARK-4685) Update JavaDoc settings to include spark.ml and all spark.mllib subpackages in the right sections

2014-12-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4685: - Assignee: Kai Sasaki > Update JavaDoc settings to include spark.ml and all spark.mllib subpackages

[jira] [Resolved] (SPARK-4685) Update JavaDoc settings to include spark.ml and all spark.mllib subpackages in the right sections

2014-12-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4685. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3598 [https://githu

[jira] [Resolved] (SPARK-4719) Consolidate various narrow dep RDD classes with MapPartitionsRDD

2014-12-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-4719. Resolution: Fixed Fix Version/s: 1.3.0 > Consolidate various narrow dep RDD classes with MapP

[jira] [Commented] (SPARK-4741) Do not destroy and re-create FileInputStream

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234016#comment-14234016 ] Apache Spark commented on SPARK-4741: - User 'viirya' has created a pull request for th

[jira] [Created] (SPARK-4741) Do not destroy and re-create FileInputStream

2014-12-04 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4741: -- Summary: Do not destroy and re-create FileInputStream Key: SPARK-4741 URL: https://issues.apache.org/jira/browse/SPARK-4741 Project: Spark Issue Type: Im

[jira] [Updated] (SPARK-4740) Netty's network bandwidth is much lower than NIO in spark-perf and Netty takes longer running time

2014-12-04 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-4740: --- Affects Version/s: 1.2.0 > Netty's network bandwidth is much lower than NIO in spark-perf and Netty >

[jira] [Commented] (SPARK-4683) Add a beeline.cmd to run on Windows

2014-12-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14233995#comment-14233995 ] Apache Spark commented on SPARK-4683: - User 'liancheng' has created a pull request for
