[jira] [Updated] (SPARK-7171) Allow for more flexible use of metric sources

2015-04-30 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Lewandowski updated SPARK-7171: - Description: With the current API, the user is allowed to add a custom metric source by p

[jira] [Commented] (SPARK-5938) Generate row from json efficiently

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521054#comment-14521054 ] Apache Spark commented on SPARK-5938: - User 'NathanHowell' has created a pull request

[jira] [Commented] (SPARK-7196) decimal precision lost when loading DataFrame from JDBC

2015-04-30 Thread Ken Geis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521067#comment-14521067 ] Ken Geis commented on SPARK-7196: - This does not fix my issue. {noformat} scala> val amou

[jira] [Commented] (SPARK-7111) Add a tracker to track the direct (receiver-less) streams

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521078#comment-14521078 ] Tathagata Das commented on SPARK-7111: -- [~jerryshao] Since you are implementing this

[jira] [Updated] (SPARK-7111) Exposing of input data rates of non-receiver streams like Kafka Direct stream

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7111: - Summary: Exposing of input data rates of non-receiver streams like Kafka Direct stream (was: Add

[jira] [Updated] (SPARK-7112) Add InputInfoTracker to have a generic way to track input data rates for all input streams.

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7112: - Summary: Add InputInfoTracker to have a generic way to track input data rates for all input stream

[jira] [Updated] (SPARK-7112) Add InputInfoTracker to have a generic way to track input data rates for all input streams.

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7112: - Assignee: Saisai Shao > Add InputInfoTracker to have a generic way to track input data rates for a

[jira] [Updated] (SPARK-7112) Add InputInfoTracker to have a generic way to track input data rates for all input streams.

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7112: - Description: Non-receiver streams like Kafka Direct stream should be able to report input data ra

[jira] [Updated] (SPARK-7111) Exposing of input data rates of non-receiver streams like Kafka Direct stream

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7111: - Assignee: Saisai Shao > Exposing of input data rates of non-receiver streams like Kafka Direct str

[jira] [Updated] (SPARK-7111) Exposing of input data rates of non-receiver streams like Kafka Direct stream

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7111: - Target Version/s: 1.4.0 > Exposing of input data rates of non-receiver streams like Kafka Direct s

[jira] [Updated] (SPARK-7113) Add the direct stream related information to the streaming listener and web UI

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7113: - Fix Version/s: 1.4.0 > Add the direct stream related information to the streaming listener and web

[jira] [Updated] (SPARK-7112) Add InputInfoTracker to have a generic way to track input data rates for all input streams.

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7112: - Fix Version/s: 1.4.0 > Add InputInfoTracker to have a generic way to track input data rates for al

[jira] [Updated] (SPARK-7111) Exposing of input data rates of non-receiver streams like Kafka Direct stream

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7111: - Priority: Blocker (was: Major) > Exposing of input data rates of non-receiver streams like Kafka

[jira] [Updated] (SPARK-7224) Mock repositories for testing with --packages

2015-04-30 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7224: --- Target Version/s: 1.3.2, 1.4.0 Note that I've disabled the test in master and 1.3 branches. >

[jira] [Created] (SPARK-7274) Create Column expression for array/struct creation

2015-04-30 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7274: -- Summary: Create Column expression for array/struct creation Key: SPARK-7274 URL: https://issues.apache.org/jira/browse/SPARK-7274 Project: Spark Issue Type: Sub-

[jira] [Updated] (SPARK-7139) Allow received block metadata to be saved to WAL and recovered on driver failure

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7139: - Priority: Blocker (was: Critical) > Allow received block metadata to be saved to WAL and recovere

[jira] [Commented] (SPARK-7274) Create Column expression for array/struct creation

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521088#comment-14521088 ] Apache Spark commented on SPARK-7274: - User 'rxin' has created a pull request for this

[jira] [Updated] (SPARK-6755) Throw exception if the user tries to concurrently start multiple StreamingContexts in same JVM.

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6755: - Priority: Critical (was: Major) > Throw exception if the user tries to concurrently start multipl

[jira] [Assigned] (SPARK-7274) Create Column expression for array/struct creation

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7274: --- Assignee: Reynold Xin (was: Apache Spark) > Create Column expression for array/struct creati

[jira] [Assigned] (SPARK-7274) Create Column expression for array/struct creation

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7274: --- Assignee: Apache Spark (was: Reynold Xin) > Create Column expression for array/struct creati

[jira] [Updated] (SPARK-5959) Create a ReliableKinesisReceiver similar to the ReliableKafkaReceiver

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-5959: - Target Version/s: 1.5.0 (was: 1.4.0) > Create a ReliableKinesisReceiver similar to the ReliableKa

[jira] [Closed] (SPARK-5959) Create a ReliableKinesisReceiver similar to the ReliableKafkaReceiver

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das closed SPARK-5959. Resolution: Duplicate > Create a ReliableKinesisReceiver similar to the ReliableKafkaReceiver >

[jira] [Commented] (SPARK-5155) Python API for MQTT streaming

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521090#comment-14521090 ] Tathagata Das commented on SPARK-5155: -- This is being bumped to 1.5.0 because we don't

[jira] [Updated] (SPARK-5155) Python API for MQTT streaming

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-5155: - Target Version/s: 1.5.0 (was: 1.4.0) > Python API for MQTT streaming > --

[jira] [Commented] (SPARK-5443) jsonRDD with schema should ignore sub-objects that are omitted in schema

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521098#comment-14521098 ] Apache Spark commented on SPARK-5443: - User 'NathanHowell' has created a pull request

[jira] [Assigned] (SPARK-5443) jsonRDD with schema should ignore sub-objects that are omitted in schema

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5443: --- Assignee: (was: Apache Spark) > jsonRDD with schema should ignore sub-objects that are om

[jira] [Assigned] (SPARK-5456) Decimal Type comparison issue

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5456: --- Assignee: Apache Spark > Decimal Type comparison issue > - > >

[jira] [Assigned] (SPARK-5456) Decimal Type comparison issue

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5456: --- Assignee: (was: Apache Spark) > Decimal Type comparison issue > -

[jira] [Updated] (SPARK-5046) Update KinesisReceiver to use updated Receiver API

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-5046: - Target Version/s: (was: 1.4.0) > Update KinesisReceiver to use updated Receiver API > --

[jira] [Updated] (SPARK-5048) Add Flume to the Python Streaming API

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-5048: - Target Version/s: 1.5.0 (was: 1.4.0) > Add Flume to the Python Streaming API > --

[jira] [Commented] (SPARK-5456) Decimal Type comparison issue

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521100#comment-14521100 ] Apache Spark commented on SPARK-5456: - User 'adrian-wang' has created a pull request f

[jira] [Updated] (SPARK-6654) Update Kinesis Streaming impls (both KCL-based and Direct) to use latest aws-java-sdk and kinesis-client-library

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6654: - Priority: Blocker (was: Major) > Update Kinesis Streaming impls (both KCL-based and Direct) to us

[jira] [Closed] (SPARK-6249) Get Kafka offsets from consumer group in ZK when using direct stream

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das closed SPARK-6249. Resolution: Won't Fix > Get Kafka offsets from consumer group in ZK when using direct stream > -

[jira] [Updated] (SPARK-6702) Update the Streaming Tab in Spark UI to show more batch information

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6702: - Priority: Blocker (was: Critical) > Update the Streaming Tab in Spark UI to show more batch infor

[jira] [Updated] (SPARK-5960) Allow AWS credentials to be passed to KinesisUtils.createStream()

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-5960: - Priority: Blocker (was: Major) > Allow AWS credentials to be passed to KinesisUtils.createStream(

[jira] [Updated] (SPARK-6702) Update the Streaming Tab in Spark UI to show more batch information

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6702: - Priority: Critical (was: Major) > Update the Streaming Tab in Spark UI to show more batch informa

[jira] [Updated] (SPARK-6599) Improve reliability and usability of Kinesis-based Spark Streaming

2015-04-30 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6599: - Target Version/s: 1.5.0 (was: 1.4.0) > Improve reliability and usability of Kinesis-based Spark S

[jira] [Assigned] (SPARK-5443) jsonRDD with schema should ignore sub-objects that are omitted in schema

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5443: --- Assignee: Apache Spark > jsonRDD with schema should ignore sub-objects that are omitted in sc

[jira] [Commented] (SPARK-5456) Decimal Type comparison issue

2015-04-30 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521109#comment-14521109 ] Adrian Wang commented on SPARK-5456: I created a pull request for [~zhangyi] > Decima

[jira] [Commented] (SPARK-6999) infinite recursion with createDataFrame(JavaRDD[Row], java.util.List[String])

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521121#comment-14521121 ] Apache Spark commented on SPARK-6999: - User 'chenghao-intel' has created a pull reques

[jira] [Assigned] (SPARK-6999) infinite recursion with createDataFrame(JavaRDD[Row], java.util.List[String])

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6999: --- Assignee: (was: Apache Spark) > infinite recursion with createDataFrame(JavaRDD[Row], jav

[jira] [Assigned] (SPARK-6999) infinite recursion with createDataFrame(JavaRDD[Row], java.util.List[String])

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6999: --- Assignee: Apache Spark > infinite recursion with createDataFrame(JavaRDD[Row], java.util.List

[jira] [Created] (SPARK-7275) Make LogicalRelation public

2015-04-30 Thread Santiago M. Mola (JIRA)
Santiago M. Mola created SPARK-7275: --- Summary: Make LogicalRelation public Key: SPARK-7275 URL: https://issues.apache.org/jira/browse/SPARK-7275 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-7171) Allow for more flexible use of metric sources

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521130#comment-14521130 ] Apache Spark commented on SPARK-7171: - User 'jacek-lewandowski' has created a pull req

[jira] [Assigned] (SPARK-7171) Allow for more flexible use of metric sources

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7171: --- Assignee: (was: Apache Spark) > Allow for more flexible use of metric sources > -

[jira] [Commented] (SPARK-7171) Allow for more flexible use of metric sources

2015-04-30 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521131#comment-14521131 ] Jacek Lewandowski commented on SPARK-7171: -- https://github.com/apache/spark/pull/

[jira] [Assigned] (SPARK-7171) Allow for more flexible use of metric sources

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7171: --- Assignee: Apache Spark > Allow for more flexible use of metric sources >

[jira] [Commented] (SPARK-7233) ClosureCleaner#clean blocks concurrent job submitter threads

2015-04-30 Thread Oleksii Kostyliev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521134#comment-14521134 ] Oleksii Kostyliev commented on SPARK-7233: -- [~andrewor14] With pleasure but afrai

[jira] [Commented] (SPARK-4629) Spark SQL uses Hadoop Configuration in a thread-unsafe manner when writing Parquet files

2015-04-30 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521153#comment-14521153 ] Cheng Hao commented on SPARK-4629: -- Are they related issues? > Spark SQL uses Hadoop Conf

[jira] [Commented] (SPARK-7233) ClosureCleaner#clean blocks concurrent job submitter threads

2015-04-30 Thread Oleksii Kostyliev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521154#comment-14521154 ] Oleksii Kostyliev commented on SPARK-7233: -- [~andrewor14] This got priority on ou

[jira] [Comment Edited] (SPARK-4629) Spark SQL uses Hadoop Configuration in a thread-unsafe manner when writing Parquet files

2015-04-30 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521153#comment-14521153 ] Cheng Hao edited comment on SPARK-4629 at 4/30/15 8:54 AM: --- Are

[jira] [Commented] (SPARK-4629) Spark SQL uses Hadoop Configuration in a thread-unsafe manner when writing Parquet files

2015-04-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521157#comment-14521157 ] Cheng Lian commented on SPARK-4629: --- I don't think so. SPARK-4229 is about configuration

[jira] [Commented] (SPARK-4699) Make caseSensitive configurable in Analyzer.scala

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521163#comment-14521163 ] Apache Spark commented on SPARK-4699: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-7196) decimal precision lost when loading DataFrame from JDBC

2015-04-30 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521240#comment-14521240 ] Liang-Chi Hsieh commented on SPARK-7196: [~kgeis] I can't reproduce your problem.

[jira] [Created] (SPARK-7276) withColumn is very slow on dataframe with large number of columns

2015-04-30 Thread Alexandre CLEMENT (JIRA)
Alexandre CLEMENT created SPARK-7276: Summary: withColumn is very slow on dataframe with large number of columns Key: SPARK-7276 URL: https://issues.apache.org/jira/browse/SPARK-7276 Project: Spar

[jira] [Commented] (SPARK-6923) Spark SQL CLI does not read Data Source schema correctly

2015-04-30 Thread pin_zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521277#comment-14521277 ] pin_zhang commented on SPARK-6923: -- Hi, Cheng Hao Thanks for your reply! Do you mea

[jira] [Commented] (SPARK-6923) Spark SQL CLI does not read Data Source schema correctly

2015-04-30 Thread pin_zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521280#comment-14521280 ] pin_zhang commented on SPARK-6923: -- Hi, Cheng Hao Thanks for your reply! Do you mea

[jira] [Commented] (SPARK-6923) Spark SQL CLI does not read Data Source schema correctly

2015-04-30 Thread pin_zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521278#comment-14521278 ] pin_zhang commented on SPARK-6923: -- Hi, Cheng Hao Thanks for your reply! Do you mea

[jira] [Issue Comment Deleted] (SPARK-6923) Spark SQL CLI does not read Data Source schema correctly

2015-04-30 Thread pin_zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pin_zhang updated SPARK-6923: - Comment: was deleted (was: Hi, Cheng Hao Thanks for your reply! Do you mean if provide a wrapper for

[jira] [Issue Comment Deleted] (SPARK-6923) Spark SQL CLI does not read Data Source schema correctly

2015-04-30 Thread pin_zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pin_zhang updated SPARK-6923: - Comment: was deleted (was: Hi, Cheng Hao Thanks for your reply! Do you mean if provide a wrapper for

[jira] [Commented] (SPARK-6257) Python MLlib API missing items: Recommendation

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521351#comment-14521351 ] Apache Spark commented on SPARK-6257: - User 'MechCoder' has created a pull request for

[jira] [Assigned] (SPARK-6257) Python MLlib API missing items: Recommendation

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6257: --- Assignee: (was: Apache Spark) > Python MLlib API missing items: Recommendation >

[jira] [Assigned] (SPARK-6257) Python MLlib API missing items: Recommendation

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6257: --- Assignee: Apache Spark > Python MLlib API missing items: Recommendation > ---

[jira] [Created] (SPARK-7277) property mapred.reduce.task replaced by spark.sql.shuffle.partitions

2015-04-30 Thread Sebastian (JIRA)
Sebastian created SPARK-7277: Summary: property mapred.reduce.task replaced by spark.sql.shuffle.partitions Key: SPARK-7277 URL: https://issues.apache.org/jira/browse/SPARK-7277 Project: Spark I

[jira] [Updated] (SPARK-7251) Perform sequential scan when iterating over entries in BytesToBytesMap

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7251: - Component/s: Spark Core > Perform sequential scan when iterating over entries in BytesToBytesMap > ---

[jira] [Updated] (SPARK-7190) UTF8String backed by binary data

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7190: - Component/s: Spark Core > UTF8String backed by binary data > > >

[jira] [Updated] (SPARK-7277) property mapred.reduce.task replaced by spark.sql.shuffle.partitions

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7277: - Component/s: SQL (Set Components please) > property mapred.reduce.task replaced by spark.sql.shuffle.part

[jira] [Updated] (SPARK-7177) Create standard way to wrap Spark CLI scripts for external projects

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7177: - Component/s: Spark Shell PySpark > Create standard way to wrap Spark CLI scripts for exte

[jira] [Updated] (SPARK-4989) wrong application configuration cause cluster down in standalone mode

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4989: - Labels: (was: backport-needed) > wrong application configuration cause cluster down in standalone mode >

[jira] [Resolved] (SPARK-4989) wrong application configuration cause cluster down in standalone mode

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4989. -- Resolution: Fixed Target Version/s: (was: 1.0.3) > wrong application configuration cause clu

[jira] [Updated] (SPARK-4159) Maven build doesn't run JUnit test suites

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4159: - Labels: (was: backport-needed) > Maven build doesn't run JUnit test suites > ---

[jira] [Resolved] (SPARK-4159) Maven build doesn't run JUnit test suites

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4159. -- Resolution: Fixed Target Version/s: (was: 1.0.3, 1.1.1, 1.2.1) I suggest we leave it at this

[jira] [Updated] (SPARK-6505) Remove the reflection call in HiveFunctionWrapper

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6505: - Assignee: Jeff Harrison > Remove the reflection call in HiveFunctionWrapper >

[jira] [Updated] (SPARK-6856) Make RDD information more useful in SparkR

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6856: - Assignee: Jeff Harrison > Make RDD information more useful in SparkR > ---

[jira] [Updated] (SPARK-6505) Remove the reflection call in HiveFunctionWrapper

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6505: - Assignee: (was: Jeff Harrison) > Remove the reflection call in HiveFunctionWrapper > -

[jira] [Commented] (SPARK-7277) property mapred.reduce.task replaced by spark.sql.shuffle.partitions

2015-04-30 Thread Sebastian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521462#comment-14521462 ] Sebastian commented on SPARK-7277: -- Apologies, I wasn't sure which component of Spark is

[jira] [Commented] (SPARK-7272) User guide for PMML model export

2015-04-30 Thread Vincenzo Selvaggio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521464#comment-14521464 ] Vincenzo Selvaggio commented on SPARK-7272: --- [~mengxr] Where should be this user

[jira] [Updated] (SPARK-5361) Multiple Java RDD <-> Python RDD conversions not working correctly

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5361: - Assignee: Winston Chen > Multiple Java RDD <-> Python RDD conversions not working correctly >

[jira] [Updated] (SPARK-5361) Multiple Java RDD <-> Python RDD conversions not working correctly

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5361: - Labels: (was: backport-needed) > Multiple Java RDD <-> Python RDD conversions not working correctly > --

[jira] [Commented] (SPARK-7250) computeInverse for RowMatrix

2015-04-30 Thread Stephanie Rivera (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521510#comment-14521510 ] Stephanie Rivera commented on SPARK-7250: - OK > computeInverse for RowMatrix > --

[jira] [Closed] (SPARK-7250) computeInverse for RowMatrix

2015-04-30 Thread Stephanie Rivera (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephanie Rivera closed SPARK-7250. --- Resolution: Not A Problem > computeInverse for RowMatrix > > >

[jira] [Commented] (SPARK-6069) Deserialization Error ClassNotFoundException with Kryo, Guava 14

2015-04-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521542#comment-14521542 ] ASF GitHub Bot commented on SPARK-6069: --- Github user pferrel commented on the pull r

[jira] [Commented] (SPARK-7196) decimal precision lost when loading DataFrame from JDBC

2015-04-30 Thread Ken Geis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521590#comment-14521590 ] Ken Geis commented on SPARK-7196: - Sorry, I thought I checked out the jdbc_precision branc

[jira] [Commented] (SPARK-6069) Deserialization Error ClassNotFoundException with Kryo, Guava 14

2015-04-30 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521664#comment-14521664 ] Pat Ferrel commented on SPARK-6069: --- didn't mean for those comments to be cross-posted.

[jira] [Created] (SPARK-7278) Inconsistent handling of dates in PySparks Row object

2015-04-30 Thread Kalle Jepsen (JIRA)
Kalle Jepsen created SPARK-7278: --- Summary: Inconsistent handling of dates in PySparks Row object Key: SPARK-7278 URL: https://issues.apache.org/jira/browse/SPARK-7278 Project: Spark Issue Type:

[jira] [Updated] (SPARK-984) SPARK_TOOLS_JAR not set if multiple tools jars exists

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-984: Assignee: Marcelo Vanzin > SPARK_TOOLS_JAR not set if multiple tools jars exists > --

[jira] [Assigned] (SPARK-4044) Thriftserver fails to start when JAVA_HOME points to JRE instead of JDK

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-4044: Assignee: Sean Owen > Thriftserver fails to start when JAVA_HOME points to JRE instead of JDK > ---

[jira] [Updated] (SPARK-4631) Add real unit test for MQTT

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4631: - Assignee: Iulian Dragos > Add real unit test for MQTT > > >

[jira] [Updated] (SPARK-4986) Graceful shutdown for Spark Streaming does not work in Standalone cluster mode

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4986: - Assignee: Jesper Lundgren > Graceful shutdown for Spark Streaming does not work in Standalone cluster mode

[jira] [Updated] (SPARK-5153) Flaky test: o.a.s.streaming.kafka.ReliableKafkaStreamSuite

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5153: - Assignee: Tathagata Das > Flaky test: o.a.s.streaming.kafka.ReliableKafkaStreamSuite > ---

[jira] [Updated] (SPARK-6180) Error logged into log4j when use the HiveMetastoreCatalog::tableExists

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6180: - Assignee: Fei Wang > Error logged into log4j when use the HiveMetastoreCatalog::tableExists >

[jira] [Commented] (SPARK-7263) Add new shuffle manager which stores shuffle blocks in Parquet

2015-04-30 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521729#comment-14521729 ] Andrew Ash commented on SPARK-7263: --- [~massie] this is really exciting work! Thinking t

[jira] [Updated] (SPARK-5529) BlockManager heartbeat expiration does not kill executor

2015-04-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5529: - Fix Version/s: 1.3.2 Backported to 1.3 in https://github.com/apache/spark/pull/5793 > BlockManager heartb

[jira] [Commented] (SPARK-7278) Inconsistent handling of dates in PySparks Row object

2015-04-30 Thread Kalle Jepsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521753#comment-14521753 ] Kalle Jepsen commented on SPARK-7278: - This is probably a duplicate of [this issue|ht

[jira] [Comment Edited] (SPARK-7196) decimal precision lost when loading DataFrame from JDBC

2015-04-30 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521776#comment-14521776 ] Liang-Chi Hsieh edited comment on SPARK-7196 at 4/30/15 4:31 PM: ---

[jira] [Commented] (SPARK-7196) decimal precision lost when loading DataFrame from JDBC

2015-04-30 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521776#comment-14521776 ] Liang-Chi Hsieh commented on SPARK-7196: [~kgeis] thanks. I think I know where is

[jira] [Assigned] (SPARK-7255) spark.streaming.kafka.maxRetries not documented

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7255: --- Assignee: Apache Spark > spark.streaming.kafka.maxRetries not documented > --

[jira] [Assigned] (SPARK-7255) spark.streaming.kafka.maxRetries not documented

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7255: --- Assignee: (was: Apache Spark) > spark.streaming.kafka.maxRetries not documented > ---

[jira] [Commented] (SPARK-7255) spark.streaming.kafka.maxRetries not documented

2015-04-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521812#comment-14521812 ] Apache Spark commented on SPARK-7255: - User 'BenFradet' has created a pull request for

[jira] [Commented] (SPARK-7203) Python API for local linear algebra

2015-04-30 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14521821#comment-14521821 ] Manoj Kumar commented on SPARK-7203: [~josephkb] I just copy pasted your comment. I ad
