[jira] [Resolved] (SPARK-2247) Data frame (or Pandas) like API for structured data

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-2247. Resolution: Duplicate Assignee: Reynold Xin Target Version/s: 1.3.0 Data

[jira] [Updated] (SPARK-5097) Adding data frame APIs to SchemaRDD

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5097: --- Description: SchemaRDD, through its DSL, already has many of the functionalities provided by common

[jira] [Commented] (SPARK-2247) Data frame (or Pandas) like API for structured data

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265784#comment-14265784 ] Reynold Xin commented on SPARK-2247: Ok I uploaded a design doc in

[jira] [Updated] (SPARK-5097) Adding data frame APIs to SchemaRDD

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5097: --- Description: SchemaRDD, through its DSL, already provides common data frame functionalities.

[jira] [Resolved] (SPARK-4843) Squash ExecutorRunnable and ExecutorRunnableUtil hierarchy in yarn module

2015-01-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4843. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3696

[jira] [Updated] (SPARK-5098) Number of running tasks become negative after tasks lost

2015-01-05 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-5098: -- Description: 15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on

[jira] [Commented] (SPARK-3452) Maven build should skip publishing artifacts people shouldn't depend on

2015-01-05 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265749#comment-14265749 ] Aniket Bhatnagar commented on SPARK-3452: - I would like this to be revisited. The

[jira] [Created] (SPARK-5097) Adding data frame APIs to SchemaRDD

2015-01-05 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5097: -- Summary: Adding data frame APIs to SchemaRDD Key: SPARK-5097 URL: https://issues.apache.org/jira/browse/SPARK-5097 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-5097) Adding data frame APIs to SchemaRDD

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5097: --- Attachment: DesignDocAddingDataFrameAPIstoSchemaRDD.pdf Adding data frame APIs to SchemaRDD

[jira] [Created] (SPARK-5098) Number of running tasks become negative after tasks lost

2015-01-05 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5098: - Summary: Number of running tasks become negative after tasks lost Key: SPARK-5098 URL: https://issues.apache.org/jira/browse/SPARK-5098 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2015-01-05 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265796#comment-14265796 ] Timothy Chen commented on SPARK-5095: - I think instead of configuring the number of

[jira] [Commented] (SPARK-4940) Support more evenly distributing cores for Mesos mode

2015-01-05 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265802#comment-14265802 ] Timothy Chen commented on SPARK-4940: - So I assume you're specifiying coarse grain

[jira] [Commented] (SPARK-4960) Interceptor pattern in receivers

2015-01-05 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1426#comment-1426 ] Tathagata Das commented on SPARK-4960: -- Both [~c...@koeninger.org] and [~jerryshao]

[jira] [Commented] (SPARK-4960) Interceptor pattern in receivers

2015-01-05 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264450#comment-14264450 ] Tathagata Das commented on SPARK-4960: -- [~ted.m] This interceptor pattern discussion

[jira] [Comment Edited] (SPARK-4960) Interceptor pattern in receivers

2015-01-05 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1426#comment-1426 ] Tathagata Das edited comment on SPARK-4960 at 1/5/15 10:15 AM:

[jira] [Commented] (SPARK-4908) Spark SQL built for Hive 13 fails under concurrent metadata queries

2015-01-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264469#comment-14264469 ] Cheng Lian commented on SPARK-4908: --- Would like to add a comment about the root cause of

[jira] [Updated] (SPARK-5068) When the path not found in the hdfs,we can't get the result

2015-01-05 Thread jeanlyn (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jeanlyn updated SPARK-5068: --- Fix Version/s: (was: 1.2.1) When the path not found in the hdfs,we can't get the result

[jira] [Commented] (SPARK-5066) Can not get all key that has same hashcode when reading key ordered from different Streaming.

2015-01-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264508#comment-14264508 ] Sean Owen commented on SPARK-5066: -- I'm not clear what this issue is trying to report.

[jira] [Resolved] (SPARK-5082) Minor typo in the Tuning Spark document about Data Serialization

2015-01-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5082. -- Resolution: Not a Problem This was already fixed by

[jira] [Commented] (SPARK-1529) Support setting spark.local.dirs to a hadoop FileSystem

2015-01-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264527#comment-14264527 ] Sean Owen commented on SPARK-1529: -- OK, but, why do these files *have* to go on a

[jira] [Commented] (SPARK-5073) spark.storage.memoryMapThreshold have two default value

2015-01-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264535#comment-14264535 ] Sean Owen commented on SPARK-5073: -- The documentation suggests that 8192 is the intended

[jira] [Updated] (SPARK-5009) allCaseVersions function in SqlLexical leads to StackOverflow Exception

2015-01-05 Thread shengli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shengli updated SPARK-5009: --- Description: Recently I found a bug when I add new feature in SqlParser. Which is : If I define a KeyWord

[jira] [Commented] (SPARK-5073) spark.storage.memoryMapThreshold have two default value

2015-01-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264554#comment-14264554 ] Apache Spark commented on SPARK-5073: - User 'Lewuathe' has created a pull request for

[jira] [Commented] (SPARK-4940) Document or Support more evenly distributing cores for Mesos mode

2015-01-05 Thread Gerard Maas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264579#comment-14264579 ] Gerard Maas commented on SPARK-4940: From the perspective of evenly allocating Spark

[jira] [Commented] (SPARK-5073) spark.storage.memoryMapThreshold have two default value

2015-01-05 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264558#comment-14264558 ] Kai Sasaki commented on SPARK-5073: --- I did not notice above comment. Sorry, I've just

[jira] [Commented] (SPARK-4897) Python 3 support

2015-01-05 Thread Matthew Cornell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264643#comment-14264643 ] Matthew Cornell commented on SPARK-4897: Please!! Python 3 support

[jira] [Updated] (SPARK-5085) netty shuffle service causing connection timeouts

2015-01-05 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephen Haberman updated SPARK-5085: Description: In Spark 1.2.0, the netty backend is causing our report's cluster to lock up

[jira] [Updated] (SPARK-5089) Vector conversion broken for non-float64 arrays

2015-01-05 Thread Jeremy Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Freeman updated SPARK-5089: -- Description: Prior to performing many MLlib operations in PySpark (e.g. KMeans), data are

[jira] [Updated] (SPARK-5089) Vector conversion broken for non-float64 arrays

2015-01-05 Thread Jeremy Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Freeman updated SPARK-5089: -- Description: Prior to performing many MLlib operations in PySpark (e.g. KMeans), data are

[jira] [Created] (SPARK-5089) Vector conversion broken for non-float64 arrays

2015-01-05 Thread Jeremy Freeman (JIRA)
Jeremy Freeman created SPARK-5089: - Summary: Vector conversion broken for non-float64 arrays Key: SPARK-5089 URL: https://issues.apache.org/jira/browse/SPARK-5089 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5089) Vector conversion broken for non-float64 arrays

2015-01-05 Thread Jeremy Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy Freeman updated SPARK-5089: -- Description: Prior to performing many MLlib operations in PySpark (e.g. KMeans), data are

[jira] [Commented] (SPARK-4898) Replace cloudpickle with Dill

2015-01-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264801#comment-14264801 ] Nicholas Chammas commented on SPARK-4898: - cc [~davies] Replace cloudpickle with

[jira] [Commented] (SPARK-4905) Flaky FlumeStreamSuite test: org.apache.spark.streaming.flume.FlumeStreamSuite.flume input stream

2015-01-05 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265228#comment-14265228 ] Hari Shreedharan commented on SPARK-4905: - I can't reproduce this - but once

[jira] [Commented] (SPARK-5085) netty shuffle service causing connection timeouts

2015-01-05 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265194#comment-14265194 ] Stephen Haberman commented on SPARK-5085: - I think I've found a good clue: {code}

[jira] [Commented] (SPARK-4737) Prevent serialization errors from ever crashing the DAG scheduler

2015-01-05 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265267#comment-14265267 ] Matt Cheah commented on SPARK-4737: --- I will be out of the office with limited access to

[jira] [Updated] (SPARK-5055) Minor typos on the downloads page

2015-01-05 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neelesh Srinivas Salian updated SPARK-5055: --- Attachment: Spark_DownloadsPage_FixedTypos.html Here is the HTML page with

[jira] [Commented] (SPARK-927) PySpark sample() doesn't work if numpy is installed on master but not on workers

2015-01-05 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265290#comment-14265290 ] Matthew Farrellee commented on SPARK-927: - PR #2313 was subsumed by PR #3351, which

[jira] [Commented] (SPARK-4943) Parsing error for query with table name having dot

2015-01-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265157#comment-14265157 ] Michael Armbrust commented on SPARK-4943: - I wouldn't say that the notion of

[jira] [Commented] (SPARK-4943) Parsing error for query with table name having dot

2015-01-05 Thread Alex Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265212#comment-14265212 ] Alex Liu commented on SPARK-4943: - Catalog part of table identifier should be handled by

[jira] [Resolved] (SPARK-927) PySpark sample() doesn't work if numpy is installed on master but not on workers

2015-01-05 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee resolved SPARK-927. - Resolution: Fixed Fix Version/s: 1.2.0 PySpark sample() doesn't work if numpy is

[jira] [Commented] (SPARK-4905) Flaky FlumeStreamSuite test: org.apache.spark.streaming.flume.FlumeStreamSuite.flume input stream

2015-01-05 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265304#comment-14265304 ] Tathagata Das commented on SPARK-4905: -- What is the reason behind such a behavior

[jira] [Created] (SPARK-5094) Python API for gradient-boosted trees

2015-01-05 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5094: Summary: Python API for gradient-boosted trees Key: SPARK-5094 URL: https://issues.apache.org/jira/browse/SPARK-5094 Project: Spark Issue Type: New Feature

[jira] [Comment Edited] (SPARK-5085) netty shuffle service causing connection timeouts

2015-01-05 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265194#comment-14265194 ] Stephen Haberman edited comment on SPARK-5085 at 1/5/15 10:14 PM:

[jira] [Commented] (SPARK-4943) Parsing error for query with table name having dot

2015-01-05 Thread Alex Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265224#comment-14265224 ] Alex Liu commented on SPARK-4943: - For each catalog, the configuration settings should

[jira] [Updated] (SPARK-4737) Prevent serialization errors from ever crashing the DAG scheduler

2015-01-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4737: --- Affects Version/s: 1.0.2 1.1.1 Prevent serialization errors from ever

[jira] [Resolved] (SPARK-5093) Make network related timeouts consistent

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5093. Resolution: Fixed Fix Version/s: 1.3.0 Make network related timeouts consistent

[jira] [Commented] (SPARK-4905) Flaky FlumeStreamSuite test: org.apache.spark.streaming.flume.FlumeStreamSuite.flume input stream

2015-01-05 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265322#comment-14265322 ] Hari Shreedharan commented on SPARK-4905: - I am not sure. It might have something

[jira] [Commented] (SPARK-4943) Parsing error for query with table name having dot

2015-01-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265335#comment-14265335 ] Michael Armbrust commented on SPARK-4943: - Thanks for your comments Alex. Are you

[jira] [Commented] (SPARK-5055) Minor typos on the downloads page

2015-01-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265331#comment-14265331 ] Sean Owen commented on SPARK-5055: -- Normally the right way to propose a change is to open

[jira] [Comment Edited] (SPARK-2352) [MLLIB] Add Artificial Neural Network (ANN) to Spark

2015-01-05 Thread Bert Greevenbosch (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265344#comment-14265344 ] Bert Greevenbosch edited comment on SPARK-2352 at 1/5/15 11:36 PM:

[jira] [Resolved] (SPARK-5040) Support expressing unresolved attributes using $attribute name notation in SQL DSL

2015-01-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5040. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3862

[jira] [Updated] (SPARK-4940) Support more evenly distributing cores for Mesos mode

2015-01-05 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Chen updated SPARK-4940: Summary: Support more evenly distributing cores for Mesos mode (was: Document or Support more

[jira] [Updated] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2015-01-05 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Chen updated SPARK-5095: Description: Currently in coarse grained mesos mode, it's expected that we only launch one Mesos

[jira] [Commented] (SPARK-2352) [MLLIB] Add Artificial Neural Network (ANN) to Spark

2015-01-05 Thread Bert Greevenbosch (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265344#comment-14265344 ] Bert Greevenbosch commented on SPARK-2352: -- Hi Nathan, Great to year of your

[jira] [Commented] (SPARK-1517) Publish nightly snapshots of documentation, maven artifacts, and binary builds

2015-01-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265390#comment-14265390 ] Sean Owen commented on SPARK-1517: -- Recap: old URL was building-with-maven.html, new URL

[jira] [Updated] (SPARK-4296) Throw Expression not in GROUP BY when using same expression in group by clause and select clause

2015-01-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4296: Priority: Blocker (was: Critical) Throw Expression not in GROUP BY when using same

[jira] [Commented] (SPARK-4943) Parsing error for query with table name having dot

2015-01-05 Thread Alex Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265351#comment-14265351 ] Alex Liu commented on SPARK-4943: - No changes to your approach. Regarding

[jira] [Commented] (SPARK-4960) Interceptor pattern in receivers

2015-01-05 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265533#comment-14265533 ] Tathagata Das commented on SPARK-4960: -- The reason we have I am suggesting the

[jira] [Commented] (SPARK-1517) Publish nightly snapshots of documentation, maven artifacts, and binary builds

2015-01-05 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265355#comment-14265355 ] Ryan Williams commented on SPARK-1517: -- Hey [~pwendell], any updates here? The

[jira] [Created] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2015-01-05 Thread Timothy Chen (JIRA)
Timothy Chen created SPARK-5095: --- Summary: Support launching multiple mesos executors in coarse grained mesos mode Key: SPARK-5095 URL: https://issues.apache.org/jira/browse/SPARK-5095 Project: Spark

[jira] [Updated] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2015-01-05 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Chen updated SPARK-5095: Component/s: Mesos Support launching multiple mesos executors in coarse grained mesos mode

[jira] [Issue Comment Deleted] (SPARK-4737) Prevent serialization errors from ever crashing the DAG scheduler

2015-01-05 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-4737: -- Comment: was deleted (was: I will be out of the office with limited access to e-mail from January 05

[jira] [Commented] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2015-01-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265416#comment-14265416 ] Patrick Wendell commented on SPARK-4687: I spent some more time looking at this

[jira] [Commented] (SPARK-1517) Publish nightly snapshots of documentation, maven artifacts, and binary builds

2015-01-05 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265419#comment-14265419 ] Ryan Williams commented on SPARK-1517: -- Agreed that the redirect you speak of should

[jira] [Commented] (SPARK-5085) netty shuffle service causing connection timeouts

2015-01-05 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265472#comment-14265472 ] Stephen Haberman commented on SPARK-5085: - Looks like this is probably a known

[jira] [Commented] (SPARK-4960) Interceptor pattern in receivers

2015-01-05 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265508#comment-14265508 ] Saisai Shao commented on SPARK-4960: Hi TD, thanks a lot for your suggestions. What I

[jira] [Commented] (SPARK-5082) Minor typo in the Tuning Spark document about Data Serialization

2015-01-05 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265549#comment-14265549 ] yangping wu commented on SPARK-5082: Yes, I also found the pull after I created the

[jira] [Commented] (SPARK-4960) Interceptor pattern in receivers

2015-01-05 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265548#comment-14265548 ] Saisai Shao commented on SPARK-4960: Thanks TD, this sounds reasonable, I will

[jira] [Commented] (SPARK-5082) Minor typo in the Tuning Spark document about Data Serialization

2015-01-05 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265550#comment-14265550 ] yangping wu commented on SPARK-5082: Yes, I also found the pull after I created the

[jira] [Issue Comment Deleted] (SPARK-5082) Minor typo in the Tuning Spark document about Data Serialization

2015-01-05 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yangping wu updated SPARK-5082: --- Comment: was deleted (was: Yes, I also found the pull after I created the issue.) Minor typo in the

[jira] [Reopened] (SPARK-4258) NPE with new Parquet Filters

2015-01-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reopened SPARK-4258: - Oops this got closed by your documentation fix. Reopening. NPE with new Parquet Filters

[jira] [Commented] (SPARK-4960) Interceptor pattern in receivers

2015-01-05 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265570#comment-14265570 ] Cody Koeninger commented on SPARK-4960: --- At that point, it sounds like you're

[jira] [Created] (SPARK-5096) SparkBuild.scala assumes you are at the spark root dir

2015-01-05 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-5096: --- Summary: SparkBuild.scala assumes you are at the spark root dir Key: SPARK-5096 URL: https://issues.apache.org/jira/browse/SPARK-5096 Project: Spark

[jira] [Commented] (SPARK-5096) SparkBuild.scala assumes you are at the spark root dir

2015-01-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265666#comment-14265666 ] Apache Spark commented on SPARK-5096: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-5085) netty shuffle service causing connection timeouts

2015-01-05 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265642#comment-14265642 ] Stephen Haberman commented on SPARK-5085: - I've confirmed that adding: sudo

[jira] [Closed] (SPARK-5085) netty shuffle service causing connection timeouts

2015-01-05 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephen Haberman closed SPARK-5085. --- Resolution: Invalid netty shuffle service causing connection timeouts

[jira] [Commented] (SPARK-4960) Interceptor pattern in receivers

2015-01-05 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265589#comment-14265589 ] Tathagata Das commented on SPARK-4960: -- That is a good question. What we could do is

[jira] [Updated] (SPARK-5090) The improvement of python converter for hbase

2015-01-05 Thread Gen TANG (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gen TANG updated SPARK-5090: Description: The python converter `HBaseResultToStringConverter` provided in the HBaseConverter.scala

[jira] [Resolved] (SPARK-5057) Log message in failed askWithReply attempts

2015-01-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5057. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3875

[jira] [Updated] (SPARK-5057) Log message in failed askWithReply attempts

2015-01-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5057: -- Summary: Log message in failed askWithReply attempts (was: Add more details in log when using actor to

[jira] [Updated] (SPARK-5057) Log message in failed askWithReply attempts

2015-01-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5057: -- Assignee: WangTaoTheTonic Log message in failed askWithReply attempts

[jira] [Updated] (SPARK-4465) runAsSparkUser doesn't affect TaskRunner in Mesos environment at all.

2015-01-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4465: -- Assignee: Jongyoul Lee runAsSparkUser doesn't affect TaskRunner in Mesos environment at all.

[jira] [Resolved] (SPARK-4465) runAsSparkUser doesn't affect TaskRunner in Mesos environment at all.

2015-01-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4465. --- Resolution: Fixed Issue resolved by pull request 3741 [https://github.com/apache/spark/pull/3741]

[jira] [Commented] (SPARK-5089) Vector conversion broken for non-float64 arrays

2015-01-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265025#comment-14265025 ] Apache Spark commented on SPARK-5089: - User 'freeman-lab' has created a pull request

[jira] [Created] (SPARK-5091) Hooks for PySpark tasks

2015-01-05 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5091: - Summary: Hooks for PySpark tasks Key: SPARK-5091 URL: https://issues.apache.org/jira/browse/SPARK-5091 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-5092) Selecting from a nested structure with SparkSQL should return a nested structure

2015-01-05 Thread Brad Willard (JIRA)
Brad Willard created SPARK-5092: --- Summary: Selecting from a nested structure with SparkSQL should return a nested structure Key: SPARK-5092 URL: https://issues.apache.org/jira/browse/SPARK-5092

[jira] [Commented] (SPARK-4850) GROUP BY can't work if the schema of SchemaRDD contains struct or array type

2015-01-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264862#comment-14264862 ] Cheng Lian commented on SPARK-4850: --- [~debugger87] The query you provided in the

[jira] [Closed] (SPARK-4850) GROUP BY can't work if the schema of SchemaRDD contains struct or array type

2015-01-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian closed SPARK-4850. - Resolution: Invalid Not a bug. GROUP BY can't work if the schema of SchemaRDD contains struct or array

[jira] [Resolved] (SPARK-4296) Throw Expression not in GROUP BY when using same expression in group by clause and select clause

2015-01-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-4296. --- Resolution: Duplicate Fix Version/s: 1.2.0 Target Version/s: 1.2.0 (was: 1.3.0)

[jira] [Reopened] (SPARK-4296) Throw Expression not in GROUP BY when using same expression in group by clause and select clause

2015-01-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reopened SPARK-4296: --- Sorry, missed David's comment below. Throw Expression not in GROUP BY when using same expression in

[jira] [Resolved] (SPARK-4688) Have a single shared network timeout in Spark

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-4688. Resolution: Fixed Fix Version/s: 1.3.0 Have a single shared network timeout in Spark

[jira] [Commented] (SPARK-4387) Refactoring python profiling code to make it extensible

2015-01-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264919#comment-14264919 ] Apache Spark commented on SPARK-4387: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-4905) Flaky FlumeStreamSuite test: org.apache.spark.streaming.flume.FlumeStreamSuite.flume input stream

2015-01-05 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264936#comment-14264936 ] Hari Shreedharan commented on SPARK-4905: - Taking a look now. Flaky

[jira] [Commented] (SPARK-4688) Have a single shared network timeout in Spark

2015-01-05 Thread Varun Saxena (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264942#comment-14264942 ] Varun Saxena commented on SPARK-4688: - Thanks [~rxin] for the commit. Mind assigning

[jira] [Comment Edited] (SPARK-4688) Have a single shared network timeout in Spark

2015-01-05 Thread Varun Saxena (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264942#comment-14264942 ] Varun Saxena edited comment on SPARK-4688 at 1/5/15 7:20 PM: -

[jira] [Commented] (SPARK-4757) Yarn-client failed to start due to Wrong FS error in distCacheMgr.addResource

2015-01-05 Thread Chris Albright (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264979#comment-14264979 ] Chris Albright commented on SPARK-4757: --- Is there an ETA on when this might make it

[jira] [Updated] (SPARK-4688) Have a single shared network timeout in Spark

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-4688: --- Assignee: Varun Saxena Have a single shared network timeout in Spark

[jira] [Commented] (SPARK-4688) Have a single shared network timeout in Spark

2015-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264987#comment-14264987 ] Reynold Xin commented on SPARK-4688: Done - thanks for doing this. Have a single

[jira] [Created] (SPARK-5090) The improvement of python converter for hbase

2015-01-05 Thread Gen TANG (JIRA)
Gen TANG created SPARK-5090: --- Summary: The improvement of python converter for hbase Key: SPARK-5090 URL: https://issues.apache.org/jira/browse/SPARK-5090 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4943) Parsing error for query with table name having dot

2015-01-05 Thread Alex Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265089#comment-14265089 ] Alex Liu commented on SPARK-4943: - The approach of {code} case class

  1   2   >