[jira] [Updated] (SPARK-7186) Decouple internal Row from external Row

2015-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7186: --- Description: Currently, we use o.a.s.sql.Row both internally and externally. The external interface i

[jira] [Updated] (SPARK-7186) Decouple internal Row from external Row

2015-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7186: --- Priority: Blocker (was: Major) > Decouple internal Row from external Row > --

[jira] [Updated] (SPARK-7186) Decouple internal Row from external Row

2015-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7186: --- Description: Currently, we use o.a.s.sql.Row both internally and externally. The external interface i

[jira] [Commented] (SPARK-7081) Faster sort-based shuffle path using binary processing cache-aware sort

2015-05-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562386#comment-14562386 ] Rui Li commented on SPARK-7081: --- Hi [~joshrosen], requiring the dependency having no aggrega

[jira] [Commented] (SPARK-7782) A small problem on history server webpage

2015-05-27 Thread Xia Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562375#comment-14562375 ] Xia Hu commented on SPARK-7782: --- Yeah, ok , I will use a more detail title next time. And I

[jira] [Updated] (SPARK-4176) Support decimals with precision > 18 in Parquet

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4176: Target Version/s: (was: 1.4.0) > Support decimals with precision > 18 in Parquet > ---

[jira] [Commented] (SPARK-4176) Support decimals with precision > 18 in Parquet

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562362#comment-14562362 ] Yin Huai commented on SPARK-4176: - Seems our Decimal is using a BigDecimal when we cannot

[jira] [Updated] (SPARK-6591) Python data source load options should auto convert common types into strings

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6591: Target Version/s: 1.5.0 (was: 1.4.0) > Python data source load options should auto convert common types int

[jira] [Updated] (SPARK-7659) Sort by attributes that are not present in the SELECT clause when there is windowfunction analysis error

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7659: Target Version/s: 1.5.0 (was: 1.4.0) > Sort by attributes that are not present in the SELECT clause when th

[jira] [Updated] (SPARK-6467) Override QueryPlan.missingInput when necessary and rely on it CheckAnalysis

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6467: Target Version/s: 1.5.0 (was: 1.4.0) > Override QueryPlan.missingInput when necessary and rely on it CheckA

[jira] [Updated] (SPARK-7163) minor refactory for HiveQl

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7163: Target Version/s: (was: 1.4.0) > minor refactory for HiveQl > -- > >

[jira] [Updated] (SPARK-6419) GenerateOrdering does not support BinaryType and complex types.

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6419: Target Version/s: 1.5.0 (was: 1.4.0) > GenerateOrdering does not support BinaryType and complex types. > --

[jira] [Updated] (SPARK-6212) The EXPLAIN output of CTAS only shows the analyzed plan

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6212: Target Version/s: 1.5.0 (was: 1.4.0) > The EXPLAIN output of CTAS only shows the analyzed plan > --

[jira] [Updated] (SPARK-4561) PySparkSQL's Row.asDict() should convert nested rows to dictionaries

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4561: Target Version/s: 1.5.0 (was: 1.4.0) > PySparkSQL's Row.asDict() should convert nested rows to dictionaries

[jira] [Updated] (SPARK-4944) Table Not Found exception in "Create Table Like registered RDD table"

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4944: Target Version/s: 1.5.0 (was: 1.4.0) > Table Not Found exception in "Create Table Like registered RDD table

[jira] [Updated] (SPARK-4944) Table Not Found exception in "Create Table Like registered RDD table"

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4944: Target Version/s: (was: 1.5.0) > Table Not Found exception in "Create Table Like registered RDD table" > -

[jira] [Updated] (SPARK-6377) Set the number of shuffle partitions for Exchange operator automatically based on the size of input tables and the reduce-side operation.

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6377: Target Version/s: 1.5.0 (was: 1.4.0) > Set the number of shuffle partitions for Exchange operator automatic

[jira] [Updated] (SPARK-4559) Adding support for ucase and lcase

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4559: Target Version/s: (was: 1.4.0) > Adding support for ucase and lcase > -- >

[jira] [Resolved] (SPARK-5229) Use tableIdentifier as the reference of a table

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-5229. - Resolution: Duplicate Target Version/s: (was: 1.4.0) > Use tableIdentifier as the reference of

[jira] [Updated] (SPARK-5251) Using `tableIdentifier` in hive metastore

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5251: Target Version/s: 1.5.0 (was: 1.4.0) > Using `tableIdentifier` in hive metastore > ---

[jira] [Updated] (SPARK-5288) Stabilize Spark SQL data type API followup

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5288: Target Version/s: 1.5.0 (was: 1.4.0) > Stabilize Spark SQL data type API followup > --

[jira] [Commented] (SPARK-6624) Convert filters into CNF for data sources

2015-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562349#comment-14562349 ] Reynold Xin commented on SPARK-6624: [~cloud_fan] interested in doing this one also?

[jira] [Resolved] (SPARK-7782) A small problem on history server webpage

2015-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7782. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Shekhar Bansal > A small problem on

[jira] [Updated] (SPARK-6319) DISTINCT doesn't work for binary type

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6319: Target Version/s: 1.5.0 (was: 1.4.0) > DISTINCT doesn't work for binary type >

[jira] [Resolved] (SPARK-4892) java.io.FileNotFound exceptions when creating EXTERNAL hive tables

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-4892. - Resolution: Won't Fix Since it is a Hive bug (https://issues.apache.org/jira/browse/HIVE-7633), I am reso

[jira] [Updated] (SPARK-4892) java.io.FileNotFound exceptions when creating EXTERNAL hive tables

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4892: Target Version/s: (was: 1.4.0) > java.io.FileNotFound exceptions when creating EXTERNAL hive tables >

[jira] [Updated] (SPARK-4892) java.io.FileNotFound exceptions when creating EXTERNAL hive tables

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4892: Labels: (was: starter) > java.io.FileNotFound exceptions when creating EXTERNAL hive tables >

[jira] [Resolved] (SPARK-4854) Custom UDTF with Lateral View throws ClassNotFound exception in Spark SQL CLI

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-4854. - Resolution: Duplicate Seems SPARK-6708 has resolved this issue. > Custom UDTF with Lateral View throws Cl

[jira] [Updated] (SPARK-4576) Add concatenation operator

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4576: Target Version/s: (was: 1.4.0) > Add concatenation operator > -- > >

[jira] [Updated] (SPARK-2863) Emulate Hive type coercion in native reimplementations of Hive functions

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2863: Target Version/s: (was: 1.4.0) > Emulate Hive type coercion in native reimplementations of Hive functions

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562331#comment-14562331 ] Yin Huai commented on SPARK-5327: - Since we only support a single hive version in the exec

[jira] [Updated] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5327: Target Version/s: (was: 1.4.0) > HiveCompatibilitySuite fails when executed against Hive 0.12.0 >

[jira] [Updated] (SPARK-6583) Support aggregated function in order by

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6583: Target Version/s: 1.5.0 (was: 1.4.0) > Support aggregated function in order by > --

[jira] [Updated] (SPARK-6964) Support Cancellation in the Thrift Server

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6964: Target Version/s: 1.5.0 (was: 1.4.0, 1.5.0) > Support Cancellation in the Thrift Server > -

[jira] [Updated] (SPARK-7505) Update PySpark DataFrame docs: encourage __getitem__, mark as experimental, etc.

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7505: Target Version/s: 1.5.0 (was: 1.4.0) > Update PySpark DataFrame docs: encourage __getitem__, mark as experi

[jira] [Resolved] (SPARK-7832) Always run SQL tests in master build.

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-7832. - Resolution: Fixed Fix Version/s: 1.4.0 It has been resolved by https://github.com/apache/spark/pull

[jira] [Resolved] (SPARK-7896) IndexOutOfBoundsException in ChainedBuffer

2015-05-27 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7896. Resolution: Fixed Fix Version/s: 1.4.0 > IndexOutOfBoundsException in ChainedBuffer >

[jira] [Commented] (SPARK-6548) Adding stddev to DataFrame functions

2015-05-27 Thread Jihong MA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562298#comment-14562298 ] Jihong MA commented on SPARK-6548: -- Hi sdfox, I thought you are no longer working on thi

[jira] [Created] (SPARK-7912) Make OneHotEncoder more optimistic about the input

2015-05-27 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7912: Summary: Make OneHotEncoder more optimistic about the input Key: SPARK-7912 URL: https://issues.apache.org/jira/browse/SPARK-7912 Project: Spark Issue Type:

[jira] [Commented] (SPARK-7198) VectorAssembler should carry ML metadata

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562275#comment-14562275 ] Apache Spark commented on SPARK-7198: - User 'mengxr' has created a pull request for th

[jira] [Assigned] (SPARK-7577) User guide update for Bucketizer

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7577: --- Assignee: Apache Spark (was: Xusen Yin) > User guide update for Bucketizer > ---

[jira] [Commented] (SPARK-7577) User guide update for Bucketizer

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562252#comment-14562252 ] Apache Spark commented on SPARK-7577: - User 'yinxusen' has created a pull request for

[jira] [Assigned] (SPARK-7577) User guide update for Bucketizer

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7577: --- Assignee: Xusen Yin (was: Apache Spark) > User guide update for Bucketizer > ---

[jira] [Comment Edited] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562242#comment-14562242 ] Joseph K. Bradley edited comment on SPARK-7529 at 5/28/15 3:50 AM: -

[jira] [Comment Edited] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562199#comment-14562199 ] Joseph K. Bradley edited comment on SPARK-7529 at 5/28/15 3:49 AM: -

[jira] [Comment Edited] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562199#comment-14562199 ] Joseph K. Bradley edited comment on SPARK-7529 at 5/28/15 3:46 AM: -

[jira] [Commented] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562242#comment-14562242 ] Joseph K. Bradley commented on SPARK-7529: -- I've finished a pass. I mainly check

[jira] [Comment Edited] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562199#comment-14562199 ] Joseph K. Bradley edited comment on SPARK-7529 at 5/28/15 3:37 AM: -

[jira] [Issue Comment Deleted] (SPARK-7903) PythonUDT shouldn't get serialized on the Scala side

2015-05-27 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7903: - Comment: was deleted (was: User 'mengxr' has created a pull request for this issue: https://github

[jira] [Resolved] (SPARK-7873) Serializer re-use + Kryo autoReset disabled leads to ArrayIndexOutOfBounds exception in sort-shuffle bypassMergeSort path

2015-05-27 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7873. Resolution: Fixed Fix Version/s: 1.4.0 > Serializer re-use + Kryo autoReset disabled

[jira] [Updated] (SPARK-6294) PySpark task may hang while call take() on in Java/Scala

2015-05-27 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6294: --- Fix Version/s: 1.3.1 1.4.0 > PySpark task may hang while call take() on in

[jira] [Assigned] (SPARK-7911) A workaround for VectorUDT serialize (or deserialize) being called multiple times

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7911: --- Assignee: Xiangrui Meng (was: Apache Spark) > A workaround for VectorUDT serialize (or deser

[jira] [Assigned] (SPARK-7911) A workaround for VectorUDT serialize (or deserialize) being called multiple times

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7911: --- Assignee: Apache Spark (was: Xiangrui Meng) > A workaround for VectorUDT serialize (or deser

[jira] [Assigned] (SPARK-7911) A workaround for VectorUDT serialize (or deserialize) being called multiple times

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7911: --- Assignee: Xiangrui Meng (was: Apache Spark) > A workaround for VectorUDT serialize (or deser

[jira] [Commented] (SPARK-7911) A workaround for VectorUDT serialize (or deserialize) being called multiple times

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562233#comment-14562233 ] Apache Spark commented on SPARK-7911: - User 'mengxr' has created a pull request for th

[jira] [Created] (SPARK-7911) A workaround for VectorUDT serialize (or deserialize) being called multiple times

2015-05-27 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7911: Summary: A workaround for VectorUDT serialize (or deserialize) being called multiple times Key: SPARK-7911 URL: https://issues.apache.org/jira/browse/SPARK-7911 Proje

[jira] [Comment Edited] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562199#comment-14562199 ] Joseph K. Bradley edited comment on SPARK-7529 at 5/28/15 3:05 AM: -

[jira] [Resolved] (SPARK-7907) Rename ThriftServer tab in Spark UI

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-7907. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6448 [https://github.com/apac

[jira] [Comment Edited] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562199#comment-14562199 ] Joseph K. Bradley edited comment on SPARK-7529 at 5/28/15 2:53 AM: -

[jira] [Commented] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562199#comment-14562199 ] Joseph K. Bradley commented on SPARK-7529: -- *spark.mllib: Issues found in a pass

[jira] [Commented] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562187#comment-14562187 ] holdenk commented on SPARK-7888: Sounds good, I can go do some reading before I bug you ab

[jira] [Updated] (SPARK-7910) Expose partitioner information in Java & Python APIs.

2015-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-7910: --- Component/s: (was: PySpark) (was: Spark Core) > Expose partitioner information in Jav

[jira] [Updated] (SPARK-7910) Expose partitioner information in JavaRDD

2015-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-7910: --- Summary: Expose partitioner information in JavaRDD (was: Expose partitioner information in Java & Python APIs

[jira] [Created] (SPARK-7910) Expose partitioner information in Java & Python APIs.

2015-05-27 Thread holdenk (JIRA)
holdenk created SPARK-7910: -- Summary: Expose partitioner information in Java & Python APIs. Key: SPARK-7910 URL: https://issues.apache.org/jira/browse/SPARK-7910 Project: Spark Issue Type: Improveme

[jira] [Resolved] (SPARK-7897) Column with an unsigned bigint should be treated as DecimalType in JDBCRDD

2015-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7897. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Liang-Chi Hsieh > Column with an un

[jira] [Commented] (SPARK-7909) spark-ec2 and associated tools not py3 ready

2015-05-27 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562161#comment-14562161 ] Shivaram Venkataraman commented on SPARK-7909: -- Could you check the worker lo

[jira] [Updated] (SPARK-7852) Set the initial weights based on the previous when GLMs are run with multiple regParams

2015-05-27 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7852: --- Issue Type: New Feature (was: Bug) > Set the initial weights based on the previous when GLMs are run with mul

[jira] [Commented] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-05-27 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562148#comment-14562148 ] DB Tsai commented on SPARK-7888: Sounds great. This requires some math to understand how R

[jira] [Commented] (SPARK-7806) spark-ec2 launch script fails for Python3

2015-05-27 Thread Matthew Goodman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562132#comment-14562132 ] Matthew Goodman commented on SPARK-7806: I started one here. https://issues.apach

[jira] [Commented] (SPARK-7529) Java compatibility check for MLlib 1.4

2015-05-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562133#comment-14562133 ] Joseph K. Bradley commented on SPARK-7529: -- *spark.ml: Here is a list of issues f

[jira] [Commented] (SPARK-7909) spark-ec2 and associated tools not py3 ready

2015-05-27 Thread Matthew Goodman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562121#comment-14562121 ] Matthew Goodman commented on SPARK-7909: I have almost everything working, but I a

[jira] [Commented] (SPARK-6987) Node Locality is determined with String Matching instead of Inet Comparison

2015-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562120#comment-14562120 ] holdenk commented on SPARK-6987: What do you mean by inet comparison? If the problem is th

[jira] [Commented] (SPARK-7908) PySpark Streaming tests are flaky.

2015-05-27 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562112#comment-14562112 ] Hari Shreedharan commented on SPARK-7908: - Note that {{test_window}} also failed i

[jira] [Resolved] (SPARK-7908) PySpark Streaming tests are flaky.

2015-05-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-7908. --- Resolution: Duplicate > PySpark Streaming tests are flaky. > -- > >

[jira] [Assigned] (SPARK-6112) Provide external block store support through HDFS RAM_DISK

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6112: --- Assignee: Apache Spark > Provide external block store support through HDFS RAM_DISK > ---

[jira] [Commented] (SPARK-6112) Provide external block store support through HDFS RAM_DISK

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562027#comment-14562027 ] Apache Spark commented on SPARK-6112: - User 'zhzhan' has created a pull request for th

[jira] [Assigned] (SPARK-6112) Provide external block store support through HDFS RAM_DISK

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6112: --- Assignee: (was: Apache Spark) > Provide external block store support through HDFS RAM_DIS

[jira] [Updated] (SPARK-6112) Provide external block store support through HDFS RAM_DISK

2015-05-27 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-6112: -- Summary: Provide external block store support through HDFS RAM_DISK (was: Provide OffHeap support throu

[jira] [Created] (SPARK-7909) spark-ec2 and associated tools not py3 ready

2015-05-27 Thread Matthew Goodman (JIRA)
Matthew Goodman created SPARK-7909: -- Summary: spark-ec2 and associated tools not py3 ready Key: SPARK-7909 URL: https://issues.apache.org/jira/browse/SPARK-7909 Project: Spark Issue Type: Im

[jira] [Updated] (SPARK-4629) Spark SQL uses Hadoop Configuration in a thread-unsafe manner when writing Parquet files

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4629: Target Version/s: (was: 1.4.0) > Spark SQL uses Hadoop Configuration in a thread-unsafe manner when writin

[jira] [Assigned] (SPARK-7907) Rename ThriftServer tab in Spark UI

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7907: --- Assignee: Apache Spark (was: Yin Huai) > Rename ThriftServer tab in Spark UI > -

[jira] [Assigned] (SPARK-7907) Rename ThriftServer tab in Spark UI

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7907: --- Assignee: Yin Huai (was: Apache Spark) > Rename ThriftServer tab in Spark UI > -

[jira] [Commented] (SPARK-7907) Rename ThriftServer tab in Spark UI

2015-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562005#comment-14562005 ] Apache Spark commented on SPARK-7907: - User 'yhuai' has created a pull request for thi

[jira] [Commented] (SPARK-4629) Spark SQL uses Hadoop Configuration in a thread-unsafe manner when writing Parquet files

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562000#comment-14562000 ] Yin Huai commented on SPARK-4629: - [~michael] Can you try our 1.4 branch and see if it is

[jira] [Updated] (SPARK-7852) Set the initial weights based on the previous when GLMs are run with multiple regParams

2015-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-7852: --- Summary: Set the initial weights based on the previous when GLMs are run with multiple regParams (was: Use co

[jira] [Commented] (SPARK-7908) PySpark Streaming tests are flaky.

2015-05-27 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14561972#comment-14561972 ] Hari Shreedharan commented on SPARK-7908: - /cc [~tdas], [~davies] > PySpark Strea

[jira] [Updated] (SPARK-7908) PySpark Streaming tests are flaky.

2015-05-27 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hari Shreedharan updated SPARK-7908: Priority: Critical (was: Major) > PySpark Streaming tests are flaky. >

[jira] [Created] (SPARK-7908) PySpark Streaming tests are flaky.

2015-05-27 Thread Hari Shreedharan (JIRA)
Hari Shreedharan created SPARK-7908: --- Summary: PySpark Streaming tests are flaky. Key: SPARK-7908 URL: https://issues.apache.org/jira/browse/SPARK-7908 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-7907) Rename ThriftServer tab in Spark UI

2015-05-27 Thread Yin Huai (JIRA)
Yin Huai created SPARK-7907: --- Summary: Rename ThriftServer tab in Spark UI Key: SPARK-7907 URL: https://issues.apache.org/jira/browse/SPARK-7907 Project: Spark Issue Type: Bug Components:

[jira] [Updated] (SPARK-2205) Unnecessary exchange operators in a join on multiple tables with the same join key.

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2205: Target Version/s: 1.5.0 (was: 1.4.0) > Unnecessary exchange operators in a join on multiple tables with the

[jira] [Updated] (SPARK-6570) Spark SQL arrays: "explode()" fails and cannot save array type to Parquet

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6570: Target Version/s: (was: 1.4.0) > Spark SQL arrays: "explode()" fails and cannot save array type to Parquet

[jira] [Commented] (SPARK-6570) Spark SQL arrays: "explode()" fails and cannot save array type to Parquet

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14561907#comment-14561907 ] Yin Huai commented on SPARK-6570: - [~jonchase] Can you test our Spark 1.4 branch? I think

[jira] [Updated] (SPARK-4497) HiveThriftServer2 does not exit properly on failure

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4497: Target Version/s: (was: 1.4.0) > HiveThriftServer2 does not exit properly on failure > ---

[jira] [Updated] (SPARK-6675) HiveContext setConf is not stable

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6675: Target Version/s: 1.5.0 (was: 1.4.0) > HiveContext setConf is not stable >

[jira] [Updated] (SPARK-6482) Remove synchronization of Hive Native commands

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6482: Target Version/s: 1.5.0 (was: 1.4.0) > Remove synchronization of Hive Native commands > ---

[jira] [Updated] (SPARK-6749) Make metastore client robust to underlying socket connection loss

2015-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6749: Target Version/s: 1.5.0 (was: 1.4.0) > Make metastore client robust to underlying socket connection loss >

[jira] [Commented] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-05-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14561864#comment-14561864 ] holdenk commented on SPARK-7888: I could do this since I'm sort of poking around at this c
