[jira] [Commented] (SPARK-6859) Parquet File Binary column statistics error when reuse byte[] among rows

2015-04-12 Thread Yijie Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491820#comment-14491820 ] Yijie Shen commented on SPARK-6859: --- I opened a JIRA ticket in Parquet:

[jira] [Updated] (SPARK-6199) Support CTE

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6199: --- Assignee: (was: Cheng Hao) Support CTE --- Key: SPARK-6199

[jira] [Created] (SPARK-6875) Add support for Joda-time types

2015-04-12 Thread Patrick Grandjean (JIRA)
Patrick Grandjean created SPARK-6875: Summary: Add support for Joda-time types Key: SPARK-6875 URL: https://issues.apache.org/jira/browse/SPARK-6875 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6849) The constructor of GradientDescent should be public

2015-04-12 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491861#comment-14491861 ] Guoqiang Li commented on SPARK-6849: [~srowen] https://github.com/cloudml/zen The

[jira] [Commented] (SPARK-6849) The constructor of GradientDescent should be public

2015-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491887#comment-14491887 ] Joseph K. Bradley commented on SPARK-6849: -- It would be great to open up the

[jira] [Commented] (SPARK-6545) Minor changes for CompactBuffer

2015-04-12 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491914#comment-14491914 ] Cheng Hao commented on SPARK-6545: -- Thank you [~srowen], we should close this for now, I

[jira] [Updated] (SPARK-6643) Python API for StandardScalerModel

2015-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6643: - Assignee: Kai Sasaki Python API for StandardScalerModel --

[jira] [Resolved] (SPARK-6643) Python API for StandardScalerModel

2015-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6643. -- Resolution: Fixed Issue resolved by pull request 5310

[jira] [Commented] (SPARK-765) Test suite should run Spark example programs

2015-04-12 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491941#comment-14491941 ] Yu Ishikawa commented on SPARK-765: --- [~joshrosen] sorry, one more thing. Are we allowed

[jira] [Commented] (SPARK-6765) Turn scalastyle on for test code

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491765#comment-14491765 ] Apache Spark commented on SPARK-6765: - User 'rxin' has created a pull request for this

[jira] [Comment Edited] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-04-12 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491864#comment-14491864 ] Yi Zhou edited comment on SPARK-5791 at 4/13/15 2:57 AM: - We

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-04-12 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491864#comment-14491864 ] Yi Zhou commented on SPARK-5791: We changed file format from ORC to Parquet. Got the

[jira] [Commented] (SPARK-6847) Stack overflow on updateStateByKey which followed by a dstream with checkpoint set

2015-04-12 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491873#comment-14491873 ] Jack Hu commented on SPARK-6847: Hi, [~sowen] I tested more cases: # only change the

[jira] [Comment Edited] (SPARK-6847) Stack overflow on updateStateByKey which followed by a dstream with checkpoint set

2015-04-12 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491873#comment-14491873 ] Jack Hu edited comment on SPARK-6847 at 4/13/15 3:34 AM: - Hi,

[jira] [Commented] (SPARK-6151) schemaRDD to parquetfile with saveAsParquetFile control the HDFS block size

2015-04-12 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491881#comment-14491881 ] Littlestar commented on SPARK-6151: --- The HDFS Block Size is set once when you first

[jira] [Commented] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-12 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491905#comment-14491905 ] Yu Ishikawa commented on SPARK-6682: [~josephkb] sounds great. As you're suggesting,

[jira] [Resolved] (SPARK-6562) DataFrame.na.replace value support

2015-04-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6562. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Reynold Xin DataFrame.na.replace

[jira] [Updated] (SPARK-6858) Register Java HashMap for SparkSqlSerializer

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6858: --- Assignee: Liang-Chi Hsieh Register Java HashMap for SparkSqlSerializer

[jira] [Resolved] (SPARK-4760) ANALYZE TABLE table COMPUTE STATISTICS noscan failed estimating table size for tables created from Parquet files

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4760. Resolution: Not A Problem ANALYZE TABLE table COMPUTE STATISTICS noscan failed estimating

[jira] [Updated] (SPARK-6611) Add support for INTEGER as synonym of INT to DDLParser

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6611: --- Assignee: Santiago M. Mola Add support for INTEGER as synonym of INT to DDLParser

[jira] [Commented] (SPARK-1529) Support setting spark.local.dirs to a hadoop FileSystem

2015-04-12 Thread Kannan Rajah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491834#comment-14491834 ] Kannan Rajah commented on SPARK-1529: - Thanks. FYI, I have pushed few more commits to

[jira] [Commented] (SPARK-1227) Diagnostics for ClassificationRegression

2015-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491895#comment-14491895 ] Joseph K. Bradley commented on SPARK-1227: -- I agree it will be nice to provide

[jira] [Commented] (SPARK-3727) DecisionTree, RandomForest: More prediction functionality

2015-04-12 Thread Michael Kuhlen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491919#comment-14491919 ] Michael Kuhlen commented on SPARK-3727: --- Hello! I've implemented

[jira] [Commented] (SPARK-1529) Support setting spark.local.dirs to a hadoop FileSystem

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491863#comment-14491863 ] Patrick Wendell commented on SPARK-1529: Hey Kannan, We originally considered

[jira] [Resolved] (SPARK-4081) Categorical feature indexing

2015-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4081. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 3000

[jira] [Updated] (SPARK-6869) Pass PYTHONPATH to executor, so that executor can read pyspark file from local file system on executor node

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6869: - Component/s: PySpark Pass PYTHONPATH to executor, so that executor can read pyspark file from local

[jira] [Updated] (SPARK-6870) Catch InterruptedException when yarn application state monitor thread been interrupted

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6870: - Component/s: YARN Catch InterruptedException when yarn application state monitor thread been

[jira] [Commented] (SPARK-1529) Support setting spark.local.dirs to a hadoop FileSystem

2015-04-12 Thread Kannan Rajah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491869#comment-14491869 ] Kannan Rajah commented on SPARK-1529: - [~pwendell] The default code path still uses

[jira] [Commented] (SPARK-6765) Turn scalastyle on for test code

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491893#comment-14491893 ] Apache Spark commented on SPARK-6765: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491891#comment-14491891 ] Joseph K. Bradley commented on SPARK-5256: -- Added link to [SPARK-1227], which

[jira] [Commented] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-12 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491906#comment-14491906 ] Yu Ishikawa commented on SPARK-6682: [~avulanov] thank you for your answer. And I

[jira] [Reopened] (SPARK-4760) ANALYZE TABLE table COMPUTE STATISTICS noscan failed estimating table size for tables created from Parquet files

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-4760: ANALYZE TABLE table COMPUTE STATISTICS noscan failed estimating table size for tables

[jira] [Updated] (SPARK-6179) Support SHOW PRINCIPALS role_name;

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6179: --- Assignee: Zhongshuai Pei Support SHOW PRINCIPALS role_name;

[jira] [Updated] (SPARK-6199) Support CTE

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6199: --- Assignee: Cheng Hao Support CTE --- Key: SPARK-6199

[jira] [Commented] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-12 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491877#comment-14491877 ] Ilya Ganelin commented on SPARK-6703: - Patrick - I can look into this. Thank you.

[jira] [Updated] (SPARK-6865) Decide on semantics for string identifiers in DataFrame API

2015-04-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6865: --- Summary: Decide on semantics for string identifiers in DataFrame API (was: Decide on semantics for

[jira] [Created] (SPARK-6876) DataFrame.na.replace value support for Python

2015-04-12 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-6876: -- Summary: DataFrame.na.replace value support for Python Key: SPARK-6876 URL: https://issues.apache.org/jira/browse/SPARK-6876 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-6863) Formatted list broken on Hive compatibility section of SQL programming guide

2015-04-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6863: --- Assignee: Santiago M. Mola Formatted list broken on Hive compatibility section of SQL

[jira] [Commented] (SPARK-3937) Unsafe memory access inside of Snappy library

2015-04-12 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491857#comment-14491857 ] Guoqiang Li commented on SPARK-3937: Get data: {code:none}wget

[jira] [Commented] (SPARK-6823) Add a model.matrix like capability to DataFrames (modelDataFrame)

2015-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491885#comment-14491885 ] Joseph K. Bradley commented on SPARK-6823: -- This sounds like it would be covered

[jira] [Resolved] (SPARK-5885) Add VectorAssembler

2015-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5885. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5196

[jira] [Resolved] (SPARK-5886) Add LabelIndexer

2015-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5886. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4735

[jira] [Issue Comment Deleted] (SPARK-6868) Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY

2015-04-12 Thread Dean Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dean Chen updated SPARK-6868: - Comment: was deleted (was: https://github.com/apache/spark/pull/5477) Container link broken on Spark UI

[jira] [Created] (SPARK-6868) Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY

2015-04-12 Thread Dean Chen (JIRA)
Dean Chen created SPARK-6868: Summary: Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY Key: SPARK-6868 URL: https://issues.apache.org/jira/browse/SPARK-6868 Project: Spark

[jira] [Assigned] (SPARK-6869) Pass PYTHONPATH to executor, so that executor can read pyspark file from local file system on executor node

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6869: --- Assignee: Apache Spark Pass PYTHONPATH to executor, so that executor can read pyspark file

[jira] [Commented] (SPARK-6869) Pass PYTHONPATH to executor, so that executor can read pyspark file from local file system on executor node

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491373#comment-14491373 ] Apache Spark commented on SPARK-6869: - User 'Sephiroth-Lin' has created a pull request

[jira] [Assigned] (SPARK-6869) Pass PYTHONPATH to executor, so that executor can read pyspark file from local file system on executor node

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6869: --- Assignee: (was: Apache Spark) Pass PYTHONPATH to executor, so that executor can read

[jira] [Created] (SPARK-6870) Catch InterruptedException when yarn application state monitor thread been interrupted

2015-04-12 Thread Weizhong (JIRA)
Weizhong created SPARK-6870: --- Summary: Catch InterruptedException when yarn application state monitor thread been interrupted Key: SPARK-6870 URL: https://issues.apache.org/jira/browse/SPARK-6870 Project:

[jira] [Updated] (SPARK-6868) Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY

2015-04-12 Thread Dean Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dean Chen updated SPARK-6868: - Attachment: Screen Shot 2015-04-11 at 11.49.21 PM.png Container link broken on Spark UI Executors page

[jira] [Assigned] (SPARK-6868) Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6868: --- Assignee: Apache Spark Container link broken on Spark UI Executors page when YARN is set to

[jira] [Commented] (SPARK-6868) Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY

2015-04-12 Thread Dean Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491354#comment-14491354 ] Dean Chen commented on SPARK-6868: -- https://github.com/apache/spark/pull/5477 Container

[jira] [Assigned] (SPARK-6868) Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6868: --- Assignee: (was: Apache Spark) Container link broken on Spark UI Executors page when

[jira] [Commented] (SPARK-6868) Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491355#comment-14491355 ] Apache Spark commented on SPARK-6868: - User 'deanchen' has created a pull request for

[jira] [Updated] (SPARK-6868) Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY

2015-04-12 Thread Dean Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dean Chen updated SPARK-6868: - Component/s: (was: Spark Core) YARN Container link broken on Spark UI Executors

[jira] [Created] (SPARK-6869) Pass PYTHONPATH to executor, so that executor can read pyspark file from local file system on executor node

2015-04-12 Thread Weizhong (JIRA)
Weizhong created SPARK-6869: --- Summary: Pass PYTHONPATH to executor, so that executor can read pyspark file from local file system on executor node Key: SPARK-6869 URL: https://issues.apache.org/jira/browse/SPARK-6869

[jira] [Assigned] (SPARK-6870) Catch InterruptedException when yarn application state monitor thread been interrupted

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6870: --- Assignee: Apache Spark Catch InterruptedException when yarn application state monitor

[jira] [Commented] (SPARK-6870) Catch InterruptedException when yarn application state monitor thread been interrupted

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491380#comment-14491380 ] Apache Spark commented on SPARK-6870: - User 'Sephiroth-Lin' has created a pull request

[jira] [Assigned] (SPARK-6870) Catch InterruptedException when yarn application state monitor thread been interrupted

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6870: --- Assignee: (was: Apache Spark) Catch InterruptedException when yarn application state

[jira] [Updated] (SPARK-6866) Cleanup duplicated dependency in pom.xml

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6866: - Due Date: (was: 15/Apr/15) Priority: Trivial (was: Minor) Assignee: Guancheng Chen Cleanup

[jira] [Resolved] (SPARK-6866) Cleanup duplicated dependency in pom.xml

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6866. -- Resolution: Fixed Issue resolved by pull request 5476 [https://github.com/apache/spark/pull/5476]

[jira] [Commented] (SPARK-761) Print a nicer error message when incompatible Spark binaries try to talk

2015-04-12 Thread Harsh Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491420#comment-14491420 ] Harsh Gupta commented on SPARK-761: --- [~aash] How do I do a compatibility check on API on

[jira] [Closed] (SPARK-6842) mvn -DskipTests clean package fails

2015-04-12 Thread Sree Vaddi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sree Vaddi closed SPARK-6842. - build successful. mvn -DskipTests clean package fails ---

[jira] [Created] (SPARK-6871) WITH clause in CTE can not following another WITH clause

2015-04-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6871: -- Summary: WITH clause in CTE can not following another WITH clause Key: SPARK-6871 URL: https://issues.apache.org/jira/browse/SPARK-6871 Project: Spark

[jira] [Commented] (SPARK-6871) WITH clause in CTE can not following another WITH clause

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491401#comment-14491401 ] Apache Spark commented on SPARK-6871: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-6871) WITH clause in CTE can not following another WITH clause

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6871: --- Assignee: (was: Apache Spark) WITH clause in CTE can not following another WITH clause

[jira] [Resolved] (SPARK-6545) Minor changes for CompactBuffer

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6545. -- Resolution: Won't Fix I think this is WontFix given https://github.com/apache/spark/pull/5199 but

[jira] [Resolved] (SPARK-1303) Added discretization capability to MLlib.

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1303. -- Resolution: Won't Fix Sounds like this should start outside MLlib:

[jira] [Commented] (SPARK-6864) Spark's Multilabel Classifier runs out of memory on small datasets

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491437#comment-14491437 ] Sean Owen commented on SPARK-6864: -- I believe this is the *driver* process running out of

[jira] [Assigned] (SPARK-6871) WITH clause in CTE can not following another WITH clause

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6871: --- Assignee: Apache Spark WITH clause in CTE can not following another WITH clause

[jira] [Commented] (SPARK-6677) pyspark.sql nondeterministic issue with row fields

2015-04-12 Thread Stefano Parmesan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491459#comment-14491459 ] Stefano Parmesan commented on SPARK-6677: - glad it helped! we're very eager to try

[jira] [Updated] (SPARK-6867) Dropout regularization

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6867: - Target Version/s: (was: 1.4.0) Dropout regularization -- Key:

[jira] [Resolved] (SPARK-6843) Potential visibility problem for the state of Executor

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6843. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5448

[jira] [Updated] (SPARK-6843) Potential visibility problem for the state of Executor

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6843: - Priority: Trivial (was: Minor) Assignee: zhichao-li Potential visibility problem for the state of

[jira] [Updated] (SPARK-6842) mvn -DskipTests clean package fails

2015-04-12 Thread Sree Vaddi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sree Vaddi updated SPARK-6842: -- Attachment: mvn.clean.package.log mvn package is successful on my machine, now. previously, i was

[jira] [Commented] (SPARK-6151) schemaRDD to parquetfile with saveAsParquetFile control the HDFS block size

2015-04-12 Thread Sree Vaddi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491502#comment-14491502 ] Sree Vaddi commented on SPARK-6151: --- [~cnstar9988] The HDFS Block Size is set once when

[jira] [Created] (SPARK-6872) external sort need to copy

2015-04-12 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-6872: -- Summary: external sort need to copy Key: SPARK-6872 URL: https://issues.apache.org/jira/browse/SPARK-6872 Project: Spark Issue Type: Bug Components:

[jira] [Created] (SPARK-6873) Some Hive-Catalyst comparison tests fail due to unimportant order of some printed elements

2015-04-12 Thread Sean Owen (JIRA)
Sean Owen created SPARK-6873: Summary: Some Hive-Catalyst comparison tests fail due to unimportant order of some printed elements Key: SPARK-6873 URL: https://issues.apache.org/jira/browse/SPARK-6873

[jira] [Commented] (SPARK-6859) Parquet File Binary column statistics error when reuse byte[] among rows

2015-04-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491575#comment-14491575 ] Cheng Lian commented on SPARK-6859: --- A better way can be defensive copy while inserting

[jira] [Commented] (SPARK-6872) external sort need to copy

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491509#comment-14491509 ] Apache Spark commented on SPARK-6872: - User 'adrian-wang' has created a pull request

[jira] [Assigned] (SPARK-6872) external sort need to copy

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6872: --- Assignee: (was: Apache Spark) external sort need to copy --

[jira] [Assigned] (SPARK-6872) external sort need to copy

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6872: --- Assignee: Apache Spark external sort need to copy --

[jira] [Resolved] (SPARK-6431) Couldn't find leader offsets exception when creating KafkaDirectStream

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6431. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5454

[jira] [Commented] (SPARK-5107) A trick log info for the start of Receiver

2015-04-12 Thread Sree Vaddi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491626#comment-14491626 ] Sree Vaddi commented on SPARK-5107: --- [~srowen] This may be closed. I could do, but I do

[jira] [Commented] (SPARK-5364) HiveQL transform doesn't support the non output clause

2015-04-12 Thread Sree Vaddi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491625#comment-14491625 ] Sree Vaddi commented on SPARK-5364: --- [~srowen] This may be closed. I could do, but I do

[jira] [Resolved] (SPARK-5364) HiveQL transform doesn't support the non output clause

2015-04-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5364. Resolution: Duplicate Fix Version/s: (was: 1.3.1) 1.3.0 HiveQL

[jira] [Commented] (SPARK-6859) Parquet File Binary column statistics error when reuse byte[] among rows

2015-04-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491558#comment-14491558 ] Cheng Lian commented on SPARK-6859: --- For 1.3 and prior versions, this issue isn't that

[jira] [Commented] (SPARK-6873) Some Hive-Catalyst comparison tests fail due to unimportant order of some printed elements

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491567#comment-14491567 ] Sean Owen commented on SPARK-6873: -- CC [~lian cheng] [~marmbrus] as I bet this would be

[jira] [Updated] (SPARK-6431) Couldn't find leader offsets exception when creating KafkaDirectStream

2015-04-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6431: - Assignee: Cody Koeninger Couldn't find leader offsets exception when creating KafkaDirectStream

[jira] [Assigned] (SPARK-6874) Add support for SQL:2003 array type declaration syntax

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6874: --- Assignee: (was: Apache Spark) Add support for SQL:2003 array type declaration syntax

[jira] [Commented] (SPARK-6874) Add support for SQL:2003 array type declaration syntax

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491573#comment-14491573 ] Apache Spark commented on SPARK-6874: - User 'smola' has created a pull request for

[jira] [Assigned] (SPARK-6874) Add support for SQL:2003 array type declaration syntax

2015-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6874: --- Assignee: Apache Spark Add support for SQL:2003 array type declaration syntax

[jira] [Commented] (SPARK-6859) Parquet File Binary column statistics error when reuse byte[] among rows

2015-04-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491548#comment-14491548 ] Cheng Lian commented on SPARK-6859: --- [~yijieshen] Thanks for reporting! And yes, please

[jira] [Created] (SPARK-6874) Add support for SQL:2003 array type declaration syntax

2015-04-12 Thread Santiago M. Mola (JIRA)
Santiago M. Mola created SPARK-6874: --- Summary: Add support for SQL:2003 array type declaration syntax Key: SPARK-6874 URL: https://issues.apache.org/jira/browse/SPARK-6874 Project: Spark

[jira] [Resolved] (SPARK-5364) HiveQL transform doesn't support the non output clause

2015-04-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5364. Resolution: Fixed Fix Version/s: 1.3.1 HiveQL transform doesn't support the non output

[jira] [Reopened] (SPARK-5364) HiveQL transform doesn't support the non output clause

2015-04-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reopened SPARK-5364: HiveQL transform doesn't support the non output clause

[jira] [Resolved] (SPARK-4801) Add CTE capability to HiveContext

2015-04-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4801. - Resolution: Duplicate Fix Version/s: 1.4.0 Add CTE capability to HiveContext

[jira] [Closed] (SPARK-5364) HiveQL transform doesn't support the non output clause

2015-04-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian closed SPARK-5364. - Assignee: Liang-Chi Hsieh HiveQL transform doesn't support the non output clause

[jira] [Resolved] (SPARK-4760) ANALYZE TABLE table COMPUTE STATISTICS noscan failed estimating table size for tables created from Parquet files

2015-04-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4760. - Resolution: Fixed Fix Version/s: 1.3.0 The native parquet support (which is used

[jira] [Updated] (SPARK-1412) Disable partial aggregation automatically when reduction factor is low

2015-04-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1412: Summary: Disable partial aggregation automatically when reduction factor is low (was:

[jira] [Updated] (SPARK-1412) [SQL] Disable partial aggregation automatically when reduction factor is low

2015-04-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-1412: Assignee: (was: Michael Armbrust) [SQL] Disable partial aggregation automatically when

  1   2   >