[jira] [Commented] (SPARK-5752) Don't implicitly convert RDDs directly to DataFrames

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317761#comment-14317761 ] Apache Spark commented on SPARK-5752: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5761) Revamp StandaloneRestProtocolSuite

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317762#comment-14317762 ] Apache Spark commented on SPARK-5761: - User 'andrewor14' has created a pull request fo

[jira] [Commented] (SPARK-5760) StandaloneRestClient/Server error behavior is incorrect

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317760#comment-14317760 ] Apache Spark commented on SPARK-5760: - User 'andrewor14' has created a pull request fo

[jira] [Created] (SPARK-5761) Revamp StandaloneRestProtocolSuite

2015-02-11 Thread Andrew Or (JIRA)
Andrew Or created SPARK-5761: Summary: Revamp StandaloneRestProtocolSuite Key: SPARK-5761 URL: https://issues.apache.org/jira/browse/SPARK-5761 Project: Spark Issue Type: Bug Components

[jira] [Created] (SPARK-5760) StandaloneRestClient/Server error behavior is incorrect

2015-02-11 Thread Andrew Or (JIRA)
Andrew Or created SPARK-5760: Summary: StandaloneRestClient/Server error behavior is incorrect Key: SPARK-5760 URL: https://issues.apache.org/jira/browse/SPARK-5760 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-575) Maintain a cache of JARs on each node to avoid unnecessary copying

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317741#comment-14317741 ] Apache Spark commented on SPARK-575: User 'mengxr' has created a pull request for this

[jira] [Commented] (SPARK-5759) ExecutorRunnable should catch YarnException while NMClient start container

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317734#comment-14317734 ] Apache Spark commented on SPARK-5759: - User 'lianhuiwang' has created a pull request f

[jira] [Created] (SPARK-5759) ExecutorRunnable should catch YarnException while NMClient start container

2015-02-11 Thread Lianhui Wang (JIRA)
Lianhui Wang created SPARK-5759: --- Summary: ExecutorRunnable should catch YarnException while NMClient start container Key: SPARK-5759 URL: https://issues.apache.org/jira/browse/SPARK-5759 Project: Spark

[jira] [Created] (SPARK-5758) Use LongType as the default type for integers in JSON schema inference.

2015-02-11 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5758: --- Summary: Use LongType as the default type for integers in JSON schema inference. Key: SPARK-5758 URL: https://issues.apache.org/jira/browse/SPARK-5758 Project: Spark

[jira] [Created] (SPARK-5757) Use json4s instead of DataFrame.toJSON in model export

2015-02-11 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5757: Summary: Use json4s instead of DataFrame.toJSON in model export Key: SPARK-5757 URL: https://issues.apache.org/jira/browse/SPARK-5757 Project: Spark Issue Ty

[jira] [Updated] (SPARK-5756) Analyzer should not throw scala.NotImplementedError for illegitimate sql

2015-02-11 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangfei updated SPARK-5756: --- Summary: Analyzer should not throw scala.NotImplementedError for illegitimate sql (was: Analyzer should not t

[jira] [Commented] (SPARK-5756) Analyzer should not throw scala.NotImplementedError for legitimate sql

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317655#comment-14317655 ] Apache Spark commented on SPARK-5756: - User 'scwf' has created a pull request for this

[jira] [Created] (SPARK-5756) Analyzer should not throw scala.NotImplementedError for legitimate sql

2015-02-11 Thread wangfei (JIRA)
wangfei created SPARK-5756: -- Summary: Analyzer should not throw scala.NotImplementedError for legitimate sql Key: SPARK-5756 URL: https://issues.apache.org/jira/browse/SPARK-5756 Project: Spark Iss

[jira] [Commented] (SPARK-5755) remove unnecessary Add for unary plus sign

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317584#comment-14317584 ] Apache Spark commented on SPARK-5755: - User 'adrian-wang' has created a pull request f

[jira] [Updated] (SPARK-5606) Support plus sign in HiveContext

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5606: --- Assignee: Yadong Qi > Support plus sign in HiveContext > > >

[jira] [Updated] (SPARK-5135) Add support for describe [extended] table to DDL in SQLContext

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5135: --- Assignee: Li Sheng > Add support for describe [extended] table to DDL in SQLContext >

[jira] [Updated] (SPARK-5509) EqualTo operator doesn't handle binary type properly

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5509: --- Assignee: Cheng Lian > EqualTo operator doesn't handle binary type properly >

[jira] [Updated] (SPARK-5528) Support schema merging while reading Parquet files

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5528: --- Assignee: Cheng Lian > Support schema merging while reading Parquet files > --

[jira] [Updated] (SPARK-5380) There will be an ArrayIndexOutOfBoundsException if the format of the source file is wrong

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5380: --- Assignee: Leo_lh > There will be an ArrayIndexOutOfBoundsException if the format of the source

[jira] [Updated] (SPARK-5619) Support 'show roles' in HiveContext

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5619: --- Assignee: Yadong Qi > Support 'show roles' in HiveContext > --

[jira] [Updated] (SPARK-5640) org.apache.spark.sql.catalyst.ScalaReflection is not thread safe

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5640: --- Assignee: Tobias Schlatter > org.apache.spark.sql.catalyst.ScalaReflection is not thread safe

[jira] [Updated] (SPARK-5650) Optional 'FROM' clause in HiveQl

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5650: --- Assignee: Liang-Chi Hsieh > Optional 'FROM' clause in HiveQl > ---

[jira] [Updated] (SPARK-5603) Preinsert casting and renaming rule is needed in the Analyzer

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5603: --- Assignee: Yin Huai > Preinsert casting and renaming rule is needed in the Analyzer > -

[jira] [Created] (SPARK-5755) remove unnecessary Add for unary plus sign

2015-02-11 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-5755: -- Summary: remove unnecessary Add for unary plus sign Key: SPARK-5755 URL: https://issues.apache.org/jira/browse/SPARK-5755 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-5667) Remove version from spark-ec2 example.

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5667: --- Assignee: Miguel Peralvo > Remove version from spark-ec2 example. > --

[jira] [Updated] (SPARK-5595) In memory data cache should be invalidated after insert into/overwrite

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5595: --- Assignee: Yin Huai > In memory data cache should be invalidated after insert into/overwrite >

[jira] [Updated] (SPARK-5278) check ambiguous reference to fields in Spark SQL is incompleted

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5278: --- Assignee: Wenchen Fan > check ambiguous reference to fields in Spark SQL is incompleted >

[jira] [Updated] (SPARK-5324) Results of describe can't be queried

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5324: --- Assignee: Li Sheng > Results of describe can't be queried > --

[jira] [Updated] (SPARK-5366) check for mode of private key file

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5366: --- Assignee: liu chang > check for mode of private key file > --

[jira] [Updated] (SPARK-5656) NegativeArraySizeException in EigenValueDecomposition.symmetricEigs for large n and/or large k

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5656: --- Assignee: Mark Bittmann > NegativeArraySizeException in EigenValueDecomposition.symmetricEigs

[jira] [Updated] (SPARK-5611) Allow spark-ec2 repo to be specified in CLI of spark_ec2.py

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5611: --- Assignee: Florian Verhein > Allow spark-ec2 repo to be specified in CLI of spark_ec2.py >

[jira] [Updated] (SPARK-5614) Predicate pushdown through Generate

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5614: --- Assignee: Lu Yan > Predicate pushdown through Generate > --- >

[jira] [Updated] (SPARK-5664) Restore stty settings when exiting for launching spark-shell from SBT

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5664: --- Assignee: Liang-Chi Hsieh > Restore stty settings when exiting for launching spark-shell from

[jira] [Updated] (SPARK-5648) support "alter ... unset tblproperties("key")"

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5648: --- Assignee: DoingDone9 > support "alter ... unset tblproperties("key")" > -

[jira] [Updated] (SPARK-5568) Python API for the write support of the data source API

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5568: --- Assignee: Yin Huai > Python API for the write support of the data source API > ---

[jira] [Updated] (SPARK-5454) [SQL] Self join with ArrayType columns problems

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5454: --- Assignee: Michael Armbrust > [SQL] Self join with ArrayType columns problems > ---

[jira] [Updated] (SPARK-5716) Support TOK_CHARSETLITERAL in HiveQl

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5716: --- Assignee: Adrian Wang > Support TOK_CHARSETLITERAL in HiveQl > ---

[jira] [Updated] (SPARK-5668) spark_ec2.py region parameter could be either mandatory or its value displayed

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5668: --- Assignee: Miguel Peralvo > spark_ec2.py region parameter could be either mandatory or its valu

[jira] [Updated] (SPARK-5686) Support `show current roles`

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5686: --- Assignee: Li Sheng > Support `show current roles` > > >

[jira] [Updated] (SPARK-5658) Finalize DDL and write support APIs

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5658: --- Assignee: Yin Huai > Finalize DDL and write support APIs > ---

[jira] [Updated] (SPARK-5343) ShortestPaths traverses backwards

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5343: --- Assignee: Brennon York > ShortestPaths traverses backwards > -

[jira] [Updated] (SPARK-5683) Improve the json serialization for DataFrame API

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5683: --- Assignee: Cheng Hao > Improve the json serialization for DataFrame API > -

[jira] [Updated] (SPARK-5709) Add "EXPLAIN" support for DataFrame API for debugging purpose

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5709: --- Assignee: Cheng Hao > Add "EXPLAIN" support for DataFrame API for debugging purpose >

[jira] [Updated] (SPARK-5704) createDataFrame replace applySchema/inferSchema

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5704: --- Assignee: Davies Liu > createDataFrame replace applySchema/inferSchema > -

[jira] [Updated] (SPARK-5733) Error Link in Pagination of HistroyPage when showing Incomplete Applications

2015-02-11 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5733: --- Assignee: Liangliang Gu > Error Link in Pagination of HistroyPage when showing Incomplete Appl

[jira] [Updated] (SPARK-5754) Spark AM not launching on Windows

2015-02-11 Thread Inigo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Inigo updated SPARK-5754: - Description: I'm trying to run Spark Pi on a YARN cluster running on Windows and the AM container fails to start.

[jira] [Updated] (SPARK-5754) Spark AM not launching on Windows

2015-02-11 Thread Inigo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Inigo updated SPARK-5754: - Description: I'm trying to run Spark Pi on a YARN cluster running on Windows and the AM container fails to start.

[jira] [Created] (SPARK-5754) Spark AM not launching on Windows

2015-02-11 Thread Inigo (JIRA)
Inigo created SPARK-5754: Summary: Spark AM not launching on Windows Key: SPARK-5754 URL: https://issues.apache.org/jira/browse/SPARK-5754 Project: Spark Issue Type: Bug Components: Windows

[jira] [Updated] (SPARK-5754) Spark AM not launching on Windows

2015-02-11 Thread Inigo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Inigo updated SPARK-5754: - Environment: Windows Server 2012, Hadoop 2.4.1. (was: Windows Server 2012) > Spark AM not launching on Windows >

[jira] [Comment Edited] (SPARK-5159) Thrift server does not respect hive.server2.enable.doAs=true

2015-02-11 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14315951#comment-14315951 ] Tao Wang edited comment on SPARK-5159 at 2/12/15 4:15 AM: -- I have

[jira] [Commented] (SPARK-3570) Shuffle write time does not include time to open shuffle files

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317495#comment-14317495 ] Apache Spark commented on SPARK-3570: - User 'kayousterhout' has created a pull request

[jira] [Comment Edited] (SPARK-4423) Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior

2015-02-11 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317334#comment-14317334 ] Ilya Ganelin edited comment on SPARK-4423 at 2/12/15 2:40 AM: --

[jira] [Commented] (SPARK-5753) add basic support to JDBCRDD for postgresql types: uuid, hstore, and array

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317464#comment-14317464 ] Apache Spark commented on SPARK-5753: - User 'lepfhty' has created a pull request for t

[jira] [Comment Edited] (SPARK-4423) Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior

2015-02-11 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317334#comment-14317334 ] Ilya Ganelin edited comment on SPARK-4423 at 2/12/15 2:39 AM: --

[jira] [Created] (SPARK-5753) add basic support to JDBCRDD for postgresql types: uuid, hstore, and array

2015-02-11 Thread Ricky Nguyen (JIRA)
Ricky Nguyen created SPARK-5753: --- Summary: add basic support to JDBCRDD for postgresql types: uuid, hstore, and array Key: SPARK-5753 URL: https://issues.apache.org/jira/browse/SPARK-5753 Project: Spark

[jira] [Updated] (SPARK-5752) Don't implicitly convert RDDs directly to DataFrames

2015-02-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5752: --- Assignee: Reynold Xin > Don't implicitly convert RDDs directly to DataFrames > ---

[jira] [Updated] (SPARK-5752) Don't implicitly convert RDDs directly to DataFrames

2015-02-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5752: --- Target Version/s: 1.3.0 > Don't implicitly convert RDDs directly to DataFrames > -

[jira] [Created] (SPARK-5752) Don't implicitly convert RDDs directly to DataFrames

2015-02-11 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5752: -- Summary: Don't implicitly convert RDDs directly to DataFrames Key: SPARK-5752 URL: https://issues.apache.org/jira/browse/SPARK-5752 Project: Spark Issue Type: Su

[jira] [Commented] (SPARK-3299) [SQL] Public API in SQLContext to list tables

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317400#comment-14317400 ] Apache Spark commented on SPARK-3299: - User 'yhuai' has created a pull request for thi

[jira] [Commented] (SPARK-5739) Size exceeds Integer.MAX_VALUE in File Map

2015-02-11 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317384#comment-14317384 ] DjvuLee commented on SPARK-5739: Yes, I do not explain cleanly. What I mean is that we can

[jira] [Comment Edited] (SPARK-4423) Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior

2015-02-11 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317334#comment-14317334 ] Ilya Ganelin edited comment on SPARK-4423 at 2/12/15 1:46 AM: --

[jira] [Comment Edited] (SPARK-4423) Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior

2015-02-11 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317334#comment-14317334 ] Ilya Ganelin edited comment on SPARK-4423 at 2/12/15 1:46 AM: --

[jira] [Comment Edited] (SPARK-4423) Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior

2015-02-11 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317334#comment-14317334 ] Ilya Ganelin edited comment on SPARK-4423 at 2/12/15 1:43 AM: --

[jira] [Commented] (SPARK-5573) Support explode in DataFrame DSL

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317369#comment-14317369 ] Apache Spark commented on SPARK-5573: - User 'marmbrus' has created a pull request for

[jira] [Created] (SPARK-5751) First test case of HiveThriftServer2Suite sometimes timeouts

2015-02-11 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5751: - Summary: First test case of HiveThriftServer2Suite sometimes timeouts Key: SPARK-5751 URL: https://issues.apache.org/jira/browse/SPARK-5751 Project: Spark Issue T

[jira] [Commented] (SPARK-4423) Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior

2015-02-11 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317334#comment-14317334 ] Ilya Ganelin commented on SPARK-4423: - Hi [~pwendell] and [~joshrosen], how do you guy

[jira] [Commented] (SPARK-2808) update kafka to version 0.8.2

2015-02-11 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317304#comment-14317304 ] Saisai Shao commented on SPARK-2808: I'd like to upgrade Kafka to 0.8.2, currently in

[jira] [Updated] (SPARK-5740) Change comment default value from empty string to "null" in DescribeCommand

2015-02-11 Thread Li Sheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Sheng updated SPARK-5740: Description: Change comment default value from empty string to "null" in DescribeCommand (was: Change defau

[jira] [Updated] (SPARK-5740) Change comment default value from empty string to "null" in DescribeCommand

2015-02-11 Thread Li Sheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Sheng updated SPARK-5740: Summary: Change comment default value from empty string to "null" in DescribeCommand (was: Change default v

[jira] [Updated] (SPARK-5750) Document that ordering of elements in shuffled partitions is not deterministic across runs

2015-02-11 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5750: -- Summary: Document that ordering of elements in shuffled partitions is not deterministic across runs (wa

[jira] [Created] (SPARK-5750) Document that ordering of elements in post-shuffle partitions is not deterministic across runs

2015-02-11 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-5750: - Summary: Document that ordering of elements in post-shuffle partitions is not deterministic across runs Key: SPARK-5750 URL: https://issues.apache.org/jira/browse/SPARK-5750

[jira] [Updated] (SPARK-5748) Improve Vectors.sqdist implementation

2015-02-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5748: - Description: Saw some regression of k-means in 1.3 performance tests. I think the problem is the s

[jira] [Comment Edited] (SPARK-2808) update kafka to version 0.8.2

2015-02-11 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317240#comment-14317240 ] koert kuipers edited comment on SPARK-2808 at 2/12/15 12:05 AM:

[jira] [Commented] (SPARK-2808) update kafka to version 0.8.2

2015-02-11 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317240#comment-14317240 ] koert kuipers commented on SPARK-2808: -- scala 2.11, thats good point, i didnt think a

[jira] [Updated] (SPARK-5749) Fix Bash word splitting bugs in compute-classpath.sh

2015-02-11 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5749: Issue Type: Bug (was: Sub-task) Parent: (was: SPARK-5747) > Fix Bash word split

[jira] [Created] (SPARK-5749) Fix Bash word splitting bugs in compute-classpath.sh

2015-02-11 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5749: --- Summary: Fix Bash word splitting bugs in compute-classpath.sh Key: SPARK-5749 URL: https://issues.apache.org/jira/browse/SPARK-5749 Project: Spark Issu

[jira] [Updated] (SPARK-5747) Review all Bash scripts for word splitting bugs

2015-02-11 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5747: Description: Triggered by [this discussion|http://apache-spark-developers-list.1001551.n3.n

[jira] [Created] (SPARK-5748) Improve Vectors.sqdist implementation

2015-02-11 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5748: Summary: Improve Vectors.sqdist implementation Key: SPARK-5748 URL: https://issues.apache.org/jira/browse/SPARK-5748 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-5746) INSERT OVERWRITE throws FileNotFoundException when the source and destination point to the same table.

2015-02-11 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5746: Priority: Blocker (was: Major) > INSERT OVERWRITE throws FileNotFoundException when the source and destinat

[jira] [Updated] (SPARK-3688) LogicalPlan can't resolve column correctlly

2015-02-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-3688: --- Assignee: Yi Tian > LogicalPlan can't resolve column correctlly >

[jira] [Created] (SPARK-5747) Review all Bash scripts for word splitting bugs

2015-02-11 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5747: --- Summary: Review all Bash scripts for word splitting bugs Key: SPARK-5747 URL: https://issues.apache.org/jira/browse/SPARK-5747 Project: Spark Issue Typ

[jira] [Commented] (SPARK-3688) LogicalPlan can't resolve column correctlly

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317073#comment-14317073 ] Apache Spark commented on SPARK-3688: - User 'rxin' has created a pull request for this

[jira] [Resolved] (SPARK-1302) httpd doesn't start in spark-ec2 (cc2.8xlarge)

2015-02-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1302. -- Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Shivaram Venkataraman > httpd doesn't s

[jira] [Commented] (SPARK-1302) httpd doesn't start in spark-ec2 (cc2.8xlarge)

2015-02-11 Thread Greg Temchenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317063#comment-14317063 ] Greg Temchenko commented on SPARK-1302: --- Indeed, I used 1.2.0. Sorry for the false a

[jira] [Resolved] (SPARK-4648) Support COALESCE function in Spark SQL and HiveQL

2015-02-11 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-4648. - Resolution: Duplicate It has been resolved by https://github.com/apache/spark/pull/4057/ (the PR of SPARK

[jira] [Resolved] (SPARK-5736) Add executor log url to Executors page on Yarn

2015-02-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5736. -- Resolution: Duplicate > Add executor log url to Executors page on Yarn > ---

[jira] [Resolved] (SPARK-3688) LogicalPlan can't resolve column correctlly

2015-02-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3688. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4524 [https:/

[jira] [Commented] (SPARK-5736) Add executor log url to Executors page on Yarn

2015-02-11 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14316949#comment-14316949 ] Sandy Ryza commented on SPARK-5736: --- Is this the same as SPARK-2450? > Add executor log

[jira] [Commented] (SPARK-5722) Infer_schema_type incorrect for Integers in pyspark

2015-02-11 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14316940#comment-14316940 ] Don Drake commented on SPARK-5722: -- Hi, I've submitted 2 pull requests for branch-1.2 and

[jira] [Commented] (SPARK-2808) update kafka to version 0.8.2

2015-02-11 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14316937#comment-14316937 ] Cody Koeninger commented on SPARK-2808: --- I'm also kind of curious what the motivatio

[jira] [Commented] (SPARK-5722) Infer_schema_type incorrect for Integers in pyspark

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14316936#comment-14316936 ] Apache Spark commented on SPARK-5722: - User 'dondrake' has created a pull request for

[jira] [Commented] (SPARK-2808) update kafka to version 0.8.2

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14316933#comment-14316933 ] Apache Spark commented on SPARK-2808: - User 'koeninger' has created a pull request for

[jira] [Resolved] (SPARK-5454) [SQL] Self join with ArrayType columns problems

2015-02-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5454. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4520 [https:/

[jira] [Resolved] (SPARK-5677) Python DataFrame API remaining tasks

2015-02-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5677. Resolution: Fixed Fix Version/s: 1.3.0 > Python DataFrame API remaining tasks > -

[jira] [Resolved] (SPARK-5734) Allow creating a DataFrame from local Python data

2015-02-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5734. Resolution: Fixed Fix Version/s: 1.3.0 > Allow creating a DataFrame from local Python data >

[jira] [Commented] (SPARK-5746) INSERT OVERWRITE throws FileNotFoundException when the source and destination point to the same table.

2015-02-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14316891#comment-14316891 ] Cheng Lian commented on SPARK-5746: --- cc [~yhuai] > INSERT OVERWRITE throws FileNotFound

[jira] [Created] (SPARK-5746) INSERT OVERWRITE throws FileNotFoundException when the source and destination point to the same table.

2015-02-11 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5746: - Summary: INSERT OVERWRITE throws FileNotFoundException when the source and destination point to the same table. Key: SPARK-5746 URL: https://issues.apache.org/jira/browse/SPARK-5746

[jira] [Updated] (SPARK-5745) Allow to use custom TaskMetrics implementation

2015-02-11 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Lewandowski updated SPARK-5745: - Description: There can be various RDDs implemented and the {{TaskMetrics}} provides a grea

[jira] [Commented] (SPARK-5502) User guide for isotonic regression

2015-02-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14316861#comment-14316861 ] Apache Spark commented on SPARK-5502: - User 'zapletal-martin' has created a pull reque

[jira] [Created] (SPARK-5745) Allow to use custom TaskMetrics implementation

2015-02-11 Thread Jacek Lewandowski (JIRA)
Jacek Lewandowski created SPARK-5745: Summary: Allow to use custom TaskMetrics implementation Key: SPARK-5745 URL: https://issues.apache.org/jira/browse/SPARK-5745 Project: Spark Issue Ty

  1   2   >