[jira] [Resolved] (SPARK-6536) Add IN to python Column

2015-03-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6536. Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Target Version/s:

[jira] [Closed] (SPARK-6547) Missing import Files in InsertIntoHiveTableSuite.scala

2015-03-26 Thread Zhichao Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhichao Zhang closed SPARK-6547. - Resolution: Duplicate It is duplicate [SPARK-6546](https://issues.apache.org/jira/browse/SPARK-654

[jira] [Assigned] (SPARK-6117) describe function for summary statistics

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6117: --- Assignee: (was: Apache Spark) > describe function for summary statistics > --

[jira] [Assigned] (SPARK-6117) describe function for summary statistics

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6117: --- Assignee: Apache Spark > describe function for summary statistics > -

[jira] [Commented] (SPARK-6117) describe function for summary statistics

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381527#comment-14381527 ] Apache Spark commented on SPARK-6117: - User 'rxin' has created a pull request for this

[jira] [Assigned] (SPARK-6521) executors in the same node read local shuffle file

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6521: --- Assignee: Apache Spark > executors in the same node read local shuffle file > ---

[jira] [Assigned] (SPARK-6521) executors in the same node read local shuffle file

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6521: --- Assignee: (was: Apache Spark) > executors in the same node read local shuffle file >

[jira] [Updated] (SPARK-6548) Adding stddev to DataFrame functions

2015-03-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6548: --- Labels: DataFrame starter (was: starter) > Adding stddev to DataFrame functions > ---

[jira] [Created] (SPARK-6548) Adding stddev to DataFrame functions

2015-03-26 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-6548: -- Summary: Adding stddev to DataFrame functions Key: SPARK-6548 URL: https://issues.apache.org/jira/browse/SPARK-6548 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-6548) Adding stddev to DataFrame functions

2015-03-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6548: --- Fix Version/s: 1.4.0 > Adding stddev to DataFrame functions > > >

[jira] [Commented] (SPARK-4587) Model export/import

2015-03-26 Thread zhangyouhua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381544#comment-14381544 ] zhangyouhua commented on SPARK-4587: “Sorry, this file is invalid so it cannot be disp

[jira] [Created] (SPARK-6549) Spark console logger logs to stderr by default

2015-03-26 Thread Pavel Sakun (JIRA)
Pavel Sakun created SPARK-6549: -- Summary: Spark console logger logs to stderr by default Key: SPARK-6549 URL: https://issues.apache.org/jira/browse/SPARK-6549 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6550: -- Summary: Add PreAnalyzer to keep logical plan consistent across DataFrame Key: SPARK-6550 URL: https://issues.apache.org/jira/browse/SPARK-6550 Project: Spark

[jira] [Updated] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-6550: --- Description: h2. Problems In some cases, the expressions in a logical plan will be modified t

[jira] [Commented] (SPARK-6549) Spark console logger logs to stderr by default

2015-03-26 Thread Pavel Sakun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381568#comment-14381568 ] Pavel Sakun commented on SPARK-6549: Pull request: https://github.com/apache/spark/pul

[jira] [Assigned] (SPARK-6549) Spark console logger logs to stderr by default

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6549: --- Assignee: (was: Apache Spark) > Spark console logger logs to stderr by default >

[jira] [Commented] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381571#comment-14381571 ] Apache Spark commented on SPARK-6550: - User 'viirya' has created a pull request for th

[jira] [Assigned] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6550: --- Assignee: Apache Spark > Add PreAnalyzer to keep logical plan consistent across DataFrame > -

[jira] [Assigned] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6550: --- Assignee: (was: Apache Spark) > Add PreAnalyzer to keep logical plan consistent across Da

[jira] [Assigned] (SPARK-6549) Spark console logger logs to stderr by default

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6549: --- Assignee: Apache Spark > Spark console logger logs to stderr by default > ---

[jira] [Commented] (SPARK-6549) Spark console logger logs to stderr by default

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381570#comment-14381570 ] Apache Spark commented on SPARK-6549: - User 'pavel-sakun' has created a pull request f

[jira] [Updated] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-6550: --- Description: h2. Problems In some cases, the expressions in a logical plan will be modified t

[jira] [Commented] (SPARK-6480) histogram() bucket function is wrong in some simple edge cases

2015-03-26 Thread Frank Rosner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381588#comment-14381588 ] Frank Rosner commented on SPARK-6480: - [~srowen] will do today! > histogram() bucket

[jira] [Created] (SPARK-6551) Incorrect aggregate results if op(...) mutates first argument

2015-03-26 Thread Jarno Seppanen (JIRA)
Jarno Seppanen created SPARK-6551: - Summary: Incorrect aggregate results if op(...) mutates first argument Key: SPARK-6551 URL: https://issues.apache.org/jira/browse/SPARK-6551 Project: Spark

[jira] [Updated] (SPARK-6546) Using the wrong code that will make spark compile failed!!

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6546: -- Assignee: DoingDone9 > Using the wrong code that will make spark compile failed!! > ---

[jira] [Created] (SPARK-6552) expose start-slave.sh to user and update outdated doc

2015-03-26 Thread Tao Wang (JIRA)
Tao Wang created SPARK-6552: --- Summary: expose start-slave.sh to user and update outdated doc Key: SPARK-6552 URL: https://issues.apache.org/jira/browse/SPARK-6552 Project: Spark Issue Type: Improve

[jira] [Updated] (SPARK-6551) Incorrect aggregate results if seqOp(...) mutates its first argument

2015-03-26 Thread Jarno Seppanen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jarno Seppanen updated SPARK-6551: -- Description: Python RDD.aggregate method doesn't match its documentation w.r.t. seqOp mutating

[jira] [Updated] (SPARK-6552) expose start-slave.sh to user and update outdated doc

2015-03-26 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Wang updated SPARK-6552: Component/s: Documentation > expose start-slave.sh to user and update outdated doc > ---

[jira] [Resolved] (SPARK-6546) Using the wrong code that will make spark compile failed!!

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6546. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5198 [https://github.com/

[jira] [Updated] (SPARK-6546) Using the wrong code that will make spark compile failed!!

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6546: -- Affects Version/s: 1.4.0 > Using the wrong code that will make spark compile failed!! > ---

[jira] [Updated] (SPARK-6546) Build failure caused by PR #5029 together with #4289

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6546: -- Component/s: (was: Build) SQL Description: PR [#4289|https://gith

[jira] [Commented] (SPARK-6552) expose start-slave.sh to user and update outdated doc

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381604#comment-14381604 ] Apache Spark commented on SPARK-6552: - User 'WangTaoTheTonic' has created a pull reque

[jira] [Commented] (SPARK-6546) Build failure caused by PR #5029 together with #4289

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381606#comment-14381606 ] Cheng Lian commented on SPARK-6546: --- Updated ticket title and description to refect the

[jira] [Assigned] (SPARK-6552) expose start-slave.sh to user and update outdated doc

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6552: --- Assignee: (was: Apache Spark) > expose start-slave.sh to user and update outdated doc > -

[jira] [Assigned] (SPARK-6552) expose start-slave.sh to user and update outdated doc

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6552: --- Assignee: Apache Spark > expose start-slave.sh to user and update outdated doc >

[jira] [Created] (SPARK-6553) Support for functools.partial as UserDefinedFunction

2015-03-26 Thread Kalle Jepsen (JIRA)
Kalle Jepsen created SPARK-6553: --- Summary: Support for functools.partial as UserDefinedFunction Key: SPARK-6553 URL: https://issues.apache.org/jira/browse/SPARK-6553 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-6553) Support for functools.partial as UserDefinedFunction

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6553: --- Assignee: (was: Apache Spark) > Support for functools.partial as UserDefinedFunction > --

[jira] [Commented] (SPARK-6553) Support for functools.partial as UserDefinedFunction

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381616#comment-14381616 ] Apache Spark commented on SPARK-6553: - User 'ksonj' has created a pull request for thi

[jira] [Assigned] (SPARK-6553) Support for functools.partial as UserDefinedFunction

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6553: --- Assignee: Apache Spark > Support for functools.partial as UserDefinedFunction > -

[jira] [Commented] (SPARK-6435) spark-shell --jars option does not add all jars to classpath

2015-03-26 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381641#comment-14381641 ] Masayoshi TSUZUKI commented on SPARK-6435: -- I think {{"%~2"==""}} way is better f

[jira] [Commented] (SPARK-2213) Sort Merge Join

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381650#comment-14381650 ] Apache Spark commented on SPARK-2213: - User 'adrian-wang' has created a pull request f

[jira] [Commented] (SPARK-4830) Spark Streaming Java Application : java.lang.ClassNotFoundException

2015-03-26 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381663#comment-14381663 ] sam commented on SPARK-4830: Could also be related to https://issues.apache.org/jira/browse/SP

[jira] [Assigned] (SPARK-6471) Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6471: --- Assignee: Apache Spark > Metastore schema should only be a subset of parquet schema to suppor

[jira] [Assigned] (SPARK-6471) Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6471: --- Assignee: (was: Apache Spark) > Metastore schema should only be a subset of parquet schem

[jira] [Created] (SPARK-6554) Cannot use partition columns in where clause

2015-03-26 Thread Jon Chase (JIRA)
Jon Chase created SPARK-6554: Summary: Cannot use partition columns in where clause Key: SPARK-6554 URL: https://issues.apache.org/jira/browse/SPARK-6554 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-6465) GenericRowWithSchema: KryoException: Class cannot be created (missing no-arg constructor):

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6465. --- Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Issue resolved by pull request

[jira] [Commented] (SPARK-6481) Set "In Progress" when a PR is opened for an issue

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381703#comment-14381703 ] Cheng Lian commented on SPARK-6481: --- Maybe unrelated to this issue, but I saw a lot of J

[jira] [Updated] (SPARK-6549) Spark console logger logs to stderr by default

2015-03-26 Thread Pavel Sakun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pavel Sakun updated SPARK-6549: --- Description: Spark's console logger is configured to log message with INFO level to stderr by default

[jira] [Created] (SPARK-6555) Override equals and hashCode in MetastoreRelation

2015-03-26 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-6555: - Summary: Override equals and hashCode in MetastoreRelation Key: SPARK-6555 URL: https://issues.apache.org/jira/browse/SPARK-6555 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-6544) Problem with Avro and Kryo Serialization

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6544: --- Assignee: Apache Spark > Problem with Avro and Kryo Serialization > -

[jira] [Assigned] (SPARK-6544) Problem with Avro and Kryo Serialization

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6544: --- Assignee: (was: Apache Spark) > Problem with Avro and Kryo Serialization > --

[jira] [Commented] (SPARK-6554) Cannot use partition columns in where clause

2015-03-26 Thread Jon Chase (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381720#comment-14381720 ] Jon Chase commented on SPARK-6554: -- Here's a test case to reproduce the issue: {code} @T

[jira] [Created] (SPARK-6556) Fix wrong parsing logic of executorTimeoutMs and checkTimeoutIntervalMs in HeartbeatReceiver

2015-03-26 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-6556: --- Summary: Fix wrong parsing logic of executorTimeoutMs and checkTimeoutIntervalMs in HeartbeatReceiver Key: SPARK-6556 URL: https://issues.apache.org/jira/browse/SPARK-6556

[jira] [Updated] (SPARK-6556) Fix wrong parsing logic of executorTimeoutMs and checkTimeoutIntervalMs in HeartbeatReceiver

2015-03-26 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-6556: Description: The current reading logic of "executorTimeoutMs" is: {code} private val executorTimeou

[jira] [Commented] (SPARK-6556) Fix wrong parsing logic of executorTimeoutMs and checkTimeoutIntervalMs in HeartbeatReceiver

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381769#comment-14381769 ] Apache Spark commented on SPARK-6556: - User 'zsxwing' has created a pull request for t

[jira] [Assigned] (SPARK-6556) Fix wrong parsing logic of executorTimeoutMs and checkTimeoutIntervalMs in HeartbeatReceiver

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6556: --- Assignee: Apache Spark > Fix wrong parsing logic of executorTimeoutMs and checkTimeoutInterva

[jira] [Assigned] (SPARK-6556) Fix wrong parsing logic of executorTimeoutMs and checkTimeoutIntervalMs in HeartbeatReceiver

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6556: --- Assignee: (was: Apache Spark) > Fix wrong parsing logic of executorTimeoutMs and checkTim

[jira] [Commented] (SPARK-6481) Set "In Progress" when a PR is opened for an issue

2015-03-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381771#comment-14381771 ] Sean Owen commented on SPARK-6481: -- [~nchammas] Agree, I really like this, though it's ge

[jira] [Updated] (SPARK-6554) Cannot use partition columns in where clause

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6554: -- Description: I'm having trouble referencing partition columns in my queries with Parquet. In the foll

[jira] [Updated] (SPARK-6554) Cannot use partition columns in where clause

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6554: -- Target Version/s: 1.3.1, 1.4.0 > Cannot use partition columns in where clause >

[jira] [Assigned] (SPARK-6554) Cannot use partition columns in where clause

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reassigned SPARK-6554: - Assignee: Cheng Lian > Cannot use partition columns in where clause > ---

[jira] [Commented] (SPARK-6554) Cannot use partition columns in where clause

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381788#comment-14381788 ] Cheng Lian commented on SPARK-6554: --- Hi [~jonchase], did you happen to turn on Parquet f

[jira] [Updated] (SPARK-6554) Cannot use partition columns in where clause

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6554: -- Priority: Critical (was: Major) > Cannot use partition columns in where clause > --

[jira] [Updated] (SPARK-6554) Cannot use partition columns in where clause when Parquet filter push-down is enabled

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6554: -- Summary: Cannot use partition columns in where clause when Parquet filter push-down is enabled (was: Ca

[jira] [Commented] (SPARK-6435) spark-shell --jars option does not add all jars to classpath

2015-03-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381797#comment-14381797 ] Sean Owen commented on SPARK-6435: -- Yes this is a moot point in 1.4 and after, but I'd lo

[jira] [Commented] (SPARK-6551) Incorrect aggregate results if seqOp(...) mutates its first argument

2015-03-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381807#comment-14381807 ] Sean Owen commented on SPARK-6551: -- FWIW an equivalent example works as expected in Scala

[jira] [Commented] (SPARK-6481) Set "In Progress" when a PR is opened for an issue

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381811#comment-14381811 ] Cheng Lian commented on SPARK-6481: --- Aha, so I'm not the only one! Although I just start

[jira] [Commented] (SPARK-6554) Cannot use partition columns in where clause when Parquet filter push-down is enabled

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381814#comment-14381814 ] Apache Spark commented on SPARK-6554: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-6554) Cannot use partition columns in where clause when Parquet filter push-down is enabled

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6554: --- Assignee: Cheng Lian (was: Apache Spark) > Cannot use partition columns in where clause when

[jira] [Assigned] (SPARK-6554) Cannot use partition columns in where clause when Parquet filter push-down is enabled

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6554: --- Assignee: Apache Spark (was: Cheng Lian) > Cannot use partition columns in where clause when

[jira] [Resolved] (SPARK-6515) Use while(true) in OpenHashSet.getPos

2015-03-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6515. -- Resolution: Fixed Fix Version/s: 1.4.0 (Looks like this was merged in https://github.com/apache/

[jira] [Resolved] (SPARK-6468) Fix the race condition of subDirs in DiskBlockManager

2015-03-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6468. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5136 [https://github.com/ap

[jira] [Updated] (SPARK-6468) Fix the race condition of subDirs in DiskBlockManager

2015-03-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6468: - Assignee: Shixiong Zhu > Fix the race condition of subDirs in DiskBlockManager > -

[jira] [Assigned] (SPARK-6480) histogram() bucket function is wrong in some simple edge cases

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6480: --- Assignee: Sean Owen (was: Apache Spark) > histogram() bucket function is wrong in some simpl

[jira] [Assigned] (SPARK-6480) histogram() bucket function is wrong in some simple edge cases

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6480: --- Assignee: Apache Spark (was: Sean Owen) > histogram() bucket function is wrong in some simpl

[jira] [Commented] (SPARK-6554) Cannot use partition columns in where clause when Parquet filter push-down is enabled

2015-03-26 Thread Jon Chase (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381831#comment-14381831 ] Jon Chase commented on SPARK-6554: -- "spark.sql.parquet.filterPushdown" was the problem.

[jira] [Commented] (SPARK-6508) error handling issue running python in yarn cluster mode

2015-03-26 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381838#comment-14381838 ] Thomas Graves commented on SPARK-6508: -- [~TomStewart] are you running on yarn? If so

[jira] [Resolved] (SPARK-6471) Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6471. --- Resolution: Fixed Fix Version/s: 1.3.1 Issue resolved by pull request 5141 [https://github.com/

[jira] [Updated] (SPARK-6471) Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6471: -- Assignee: Yash Datta > Metastore schema should only be a subset of parquet schema to support > dropping

[jira] [Commented] (SPARK-6506) python support yarn cluster mode requires SPARK_HOME to be set

2015-03-26 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381844#comment-14381844 ] Lianhui Wang commented on SPARK-6506: - hi [~tgraves] I use 1.3.0 to run. if i donot se

[jira] [Updated] (SPARK-6471) Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6471: -- Priority: Blocker (was: Major) Target Version/s: 1.3.1, 1.4.0 > Metastore schema should onl

[jira] [Commented] (SPARK-6471) Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381846#comment-14381846 ] Cheng Lian commented on SPARK-6471: --- Bumped to blocker level since this is actually a re

[jira] [Comment Edited] (SPARK-6506) python support yarn cluster mode requires SPARK_HOME to be set

2015-03-26 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381844#comment-14381844 ] Lianhui Wang edited comment on SPARK-6506 at 3/26/15 1:17 PM: --

[jira] [Commented] (SPARK-6554) Cannot use partition columns in where clause when Parquet filter push-down is enabled

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381850#comment-14381850 ] Cheng Lian commented on SPARK-6554: --- Marked this as critical rather than blocker mostly

[jira] [Comment Edited] (SPARK-6506) python support yarn cluster mode requires SPARK_HOME to be set

2015-03-26 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381844#comment-14381844 ] Lianhui Wang edited comment on SPARK-6506 at 3/26/15 1:18 PM: --

[jira] [Commented] (SPARK-6506) python support yarn cluster mode requires SPARK_HOME to be set

2015-03-26 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381860#comment-14381860 ] Thomas Graves commented on SPARK-6506: -- If you are running on yarn you just have to s

[jira] [Commented] (SPARK-6554) Cannot use partition columns in where clause when Parquet filter push-down is enabled

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381862#comment-14381862 ] Cheng Lian commented on SPARK-6554: --- Parquet filter push-down isn't enabled by default i

[jira] [Updated] (SPARK-6554) Cannot use partition columns in where clause when Parquet filter push-down is enabled

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6554: -- Issue Type: Sub-task (was: Bug) Parent: SPARK-5463 > Cannot use partition columns in where clau

[jira] [Updated] (SPARK-5463) Fix Parquet filter push-down

2015-03-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5463: -- Affects Version/s: 1.3.0 > Fix Parquet filter push-down > > >

[jira] [Resolved] (SPARK-6491) Spark will put the current working dir to the CLASSPATH

2015-03-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6491. -- Resolution: Fixed Fix Version/s: 1.3.1 Assignee: Liangliang Gu Resolved by https://githu

[jira] [Assigned] (SPARK-6538) Add missing nullable Metastore fields when merging a Parquet schema

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6538: --- Assignee: (was: Apache Spark) > Add missing nullable Metastore fields when merging a Parq

[jira] [Assigned] (SPARK-6538) Add missing nullable Metastore fields when merging a Parquet schema

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6538: --- Assignee: Apache Spark > Add missing nullable Metastore fields when merging a Parquet schema

[jira] [Commented] (SPARK-6532) LDAModel.scala fails scalastyle on Windows

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381897#comment-14381897 ] Apache Spark commented on SPARK-6532: - User 'srowen' has created a pull request for th

[jira] [Commented] (SPARK-6548) Adding stddev to DataFrame functions

2015-03-26 Thread sdfox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381901#comment-14381901 ] sdfox commented on SPARK-6548: -- I will take it. > Adding stddev to DataFrame functions > ---

[jira] [Assigned] (SPARK-2475) Check whether #cores > #receivers in local mode

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2475: --- Assignee: (was: Apache Spark) > Check whether #cores > #receivers in local mode > ---

[jira] [Assigned] (SPARK-2475) Check whether #cores > #receivers in local mode

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2475: --- Assignee: Apache Spark > Check whether #cores > #receivers in local mode > --

[jira] [Commented] (SPARK-2475) Check whether #cores > #receivers in local mode

2015-03-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381906#comment-14381906 ] Apache Spark commented on SPARK-2475: - User 'ArcherShao' has created a pull request fo

[jira] [Commented] (SPARK-6481) Set "In Progress" when a PR is opened for an issue

2015-03-26 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381934#comment-14381934 ] Nicholas Chammas commented on SPARK-6481: - Im willing to update this if there is a

[jira] [Commented] (SPARK-6481) Set "In Progress" when a PR is opened for an issue

2015-03-26 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381939#comment-14381939 ] Nicholas Chammas commented on SPARK-6481: - Also, there was a one time mass update

[jira] [Commented] (SPARK-6481) Set "In Progress" when a PR is opened for an issue

2015-03-26 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381944#comment-14381944 ] Josh Rosen commented on SPARK-6481: --- Actually, I haven't triggered the mass update quite

  1   2   3   >