[jira] [Assigned] (SPARK-6640) Executor may connect to HeartbeartReceiver before it's setup in the driver side

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6640: --- Assignee: (was: Apache Spark) > Executor may connect to HeartbeartReceiver before it's se

[jira] [Assigned] (SPARK-6640) Executor may connect to HeartbeartReceiver before it's setup in the driver side

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6640: --- Assignee: Apache Spark > Executor may connect to HeartbeartReceiver before it's setup in the

[jira] [Commented] (SPARK-6640) Executor may connect to HeartbeartReceiver before it's setup in the driver side

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14390090#comment-14390090 ] Apache Spark commented on SPARK-6640: - User 'zsxwing' has created a pull request for t

[jira] [Commented] (SPARK-3596) Support changing the yarn client monitor interval

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14390087#comment-14390087 ] Apache Spark commented on SPARK-3596: - User 'Sephiroth-Lin' has created a pull request

[jira] [Commented] (SPARK-4346) YarnClientSchedulerBack.asyncMonitorApplication should be common with Client.monitorApplication

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14390086#comment-14390086 ] Apache Spark commented on SPARK-4346: - User 'Sephiroth-Lin' has created a pull request

[jira] [Assigned] (SPARK-4346) YarnClientSchedulerBack.asyncMonitorApplication should be common with Client.monitorApplication

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4346: --- Assignee: Apache Spark > YarnClientSchedulerBack.asyncMonitorApplication should be common wit

[jira] [Assigned] (SPARK-4346) YarnClientSchedulerBack.asyncMonitorApplication should be common with Client.monitorApplication

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4346: --- Assignee: (was: Apache Spark) > YarnClientSchedulerBack.asyncMonitorApplication should be

[jira] [Updated] (SPARK-6646) Spark 2.0: Rearchitecting Spark for Mobile Platforms

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6646: --- Description: Mobile computing is quickly rising to dominance, and by the end of 2017, it is estimated

[jira] [Updated] (SPARK-6646) Spark 2.0: Rearchitecting Spark for Mobile Platforms

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6646: --- Description: Mobile computing is quickly rising to dominance, and by the end of 2017, it is estimated

[jira] [Updated] (SPARK-6646) Spark 2.0: Rearchitecting Spark for Mobile Platforms

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6646: --- Attachment: Spark on Mobile - Design Doc - v1.pdf > Spark 2.0: Rearchitecting Spark for Mobile Platfor

[jira] [Updated] (SPARK-6644) After adding new columns to a partitioned table and inserting data to an old partition, data of newly added columns are all NULL

2015-03-31 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6644: -- Summary: After adding new columns to a partitioned table and inserting data to an old partition, data of

[jira] [Updated] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

2015-03-31 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6644: -- Description: In Hive, the schema of a partition may differ from the table schema. For example, we may a

[jira] [Updated] (SPARK-6106) Support user group mapping and groups in view, modify and admin acls

2015-03-31 Thread Colin Ma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Colin Ma updated SPARK-6106: Attachment: SPARK-6106.001.patch Hi, I prepared the initial patch for this improvement. > Support user grou

[jira] [Updated] (SPARK-6646) Spark 2.0: Rearchitecting Spark for Mobile Platforms

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6646: --- Component/s: Project Infra > Spark 2.0: Rearchitecting Spark for Mobile Platforms > --

[jira] [Updated] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

2015-03-31 Thread dongxu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dongxu updated SPARK-6644: -- Description: In hive,the schema of partition may be difference from the table schema. For example, we add new c

[jira] [Created] (SPARK-6646) Spark 2.0: Rearchitecting Spark for Mobile Platforms

2015-03-31 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-6646: -- Summary: Spark 2.0: Rearchitecting Spark for Mobile Platforms Key: SPARK-6646 URL: https://issues.apache.org/jira/browse/SPARK-6646 Project: Spark Issue Type: Im

[jira] [Updated] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

2015-03-31 Thread dongxu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dongxu updated SPARK-6644: -- Description: In hive,the schema of partition may be difference from the table schema. For example, we add new c

[jira] [Updated] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

2015-03-31 Thread dongxu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dongxu updated SPARK-6644: -- Description: In hive,the schema of partition may be difference from the table schema. For example, we add new c

[jira] [Updated] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

2015-03-31 Thread dongxu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dongxu updated SPARK-6644: -- Description: In hive,the schema of partition may be difference from the table schema. For example, we add new c

[jira] [Created] (SPARK-6645) StructField/StructType and related classes are not in the Scaladoc

2015-03-31 Thread Aaron Defazio (JIRA)
Aaron Defazio created SPARK-6645: Summary: StructField/StructType and related classes are not in the Scaladoc Key: SPARK-6645 URL: https://issues.apache.org/jira/browse/SPARK-6645 Project: Spark

[jira] [Updated] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

2015-03-31 Thread dongxu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dongxu updated SPARK-6644: -- Description: In hive,the schema of partition may be difference from the table schema. For example, we add new c

[jira] [Updated] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

2015-03-31 Thread dongxu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dongxu updated SPARK-6644: -- Description: In hive,the schema of partition may be difference from the table schema. For example, we add new c

[jira] [Updated] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

2015-03-31 Thread dongxu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dongxu updated SPARK-6644: -- Summary: [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column value is NULL

[jira] [Created] (SPARK-6644) [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column is NULL

2015-03-31 Thread dongxu (JIRA)
dongxu created SPARK-6644: - Summary: [SPARK-SQL]when the partition schema does not match table schema(ADD COLUMN), new column is NULL Key: SPARK-6644 URL: https://issues.apache.org/jira/browse/SPARK-6644 Proj

[jira] [Assigned] (SPARK-3872) Rewrite the test for ActorInputStream.

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3872: --- Assignee: Apache Spark (was: Prashant Sharma) > Rewrite the test for ActorInputStream. > --

[jira] [Assigned] (SPARK-3872) Rewrite the test for ActorInputStream.

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3872: --- Assignee: Prashant Sharma (was: Apache Spark) > Rewrite the test for ActorInputStream. > --

[jira] [Comment Edited] (SPARK-3066) Support recommendAll in matrix factorization model

2015-03-31 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389973#comment-14389973 ] Debasish Das edited comment on SPARK-3066 at 4/1/15 4:28 AM: -

[jira] [Commented] (SPARK-3066) Support recommendAll in matrix factorization model

2015-03-31 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389973#comment-14389973 ] Debasish Das commented on SPARK-3066: - Also unless the raw flow runs there is no way t

[jira] [Created] (SPARK-6643) Python API for StandardScalerModel

2015-03-31 Thread Kai Sasaki (JIRA)
Kai Sasaki created SPARK-6643: - Summary: Python API for StandardScalerModel Key: SPARK-6643 URL: https://issues.apache.org/jira/browse/SPARK-6643 Project: Spark Issue Type: Task Compone

[jira] [Assigned] (SPARK-5360) For CoGroupedRDD, rdds for narrow dependencies and shuffle handles are included twice in serialized task

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5360: --- Assignee: Apache Spark (was: Kay Ousterhout) > For CoGroupedRDD, rdds for narrow dependencie

[jira] [Created] (SPARK-6642) Change the lambda weight to number of explicit ratings in implicit ALS

2015-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6642: Summary: Change the lambda weight to number of explicit ratings in implicit ALS Key: SPARK-6642 URL: https://issues.apache.org/jira/browse/SPARK-6642 Project: Spark

[jira] [Assigned] (SPARK-5360) For CoGroupedRDD, rdds for narrow dependencies and shuffle handles are included twice in serialized task

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5360: --- Assignee: Kay Ousterhout (was: Apache Spark) > For CoGroupedRDD, rdds for narrow dependencie

[jira] [Assigned] (SPARK-2808) update kafka to version 0.8.2

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2808: --- Assignee: Apache Spark > update kafka to version 0.8.2 > - > >

[jira] [Assigned] (SPARK-2808) update kafka to version 0.8.2

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2808: --- Assignee: (was: Apache Spark) > update kafka to version 0.8.2 > -

[jira] [Updated] (SPARK-6641) Add config or control of accumulator on python

2015-03-31 Thread Weizhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weizhong updated SPARK-6641: Description: Now if we init SparkContext of Python, then will create a single Accumulator in Java and start

[jira] [Created] (SPARK-6641) Add config or control of accumulator on python

2015-03-31 Thread Weizhong (JIRA)
Weizhong created SPARK-6641: --- Summary: Add config or control of accumulator on python Key: SPARK-6641 URL: https://issues.apache.org/jira/browse/SPARK-6641 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-6640) Executor may connect to HeartbeartReceiver before it's setup in the driver side

2015-03-31 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-6640: --- Summary: Executor may connect to HeartbeartReceiver before it's setup in the driver side Key: SPARK-6640 URL: https://issues.apache.org/jira/browse/SPARK-6640 Project:

[jira] [Commented] (SPARK-799) Windows versions of the deploy scripts

2015-03-31 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389853#comment-14389853 ] Masayoshi TSUZUKI commented on SPARK-799: - I tend not to think it is a good idea th

[jira] [Assigned] (SPARK-6626) TwitterUtils.createStream documentation error

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6626: --- Assignee: (was: Apache Spark) > TwitterUtils.createStream documentation error > -

[jira] [Assigned] (SPARK-6626) TwitterUtils.createStream documentation error

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6626: --- Assignee: Apache Spark > TwitterUtils.createStream documentation error >

[jira] [Commented] (SPARK-6626) TwitterUtils.createStream documentation error

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389852#comment-14389852 ] Apache Spark commented on SPARK-6626: - User 'JaysonSunshine' has created a pull reques

[jira] [Created] (SPARK-6639) Create a new script to start multiple masters

2015-03-31 Thread Tao Wang (JIRA)
Tao Wang created SPARK-6639: --- Summary: Create a new script to start multiple masters Key: SPARK-6639 URL: https://issues.apache.org/jira/browse/SPARK-6639 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6435) spark-shell --jars option does not add all jars to classpath

2015-03-31 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389829#comment-14389829 ] Masayoshi TSUZUKI commented on SPARK-6435: -- [~srowen] OK, thank you! Then I'm goi

[jira] [Assigned] (SPARK-6638) optimize StringType in SQL

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6638: --- Assignee: Davies Liu (was: Apache Spark) > optimize StringType in SQL >

[jira] [Commented] (SPARK-6638) optimize StringType in SQL

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389804#comment-14389804 ] Apache Spark commented on SPARK-6638: - User 'davies' has created a pull request for th

[jira] [Assigned] (SPARK-6638) optimize StringType in SQL

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6638: --- Assignee: Apache Spark (was: Davies Liu) > optimize StringType in SQL >

[jira] [Updated] (SPARK-6573) Convert inbound NaN values as null

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6573: --- Summary: Convert inbound NaN values as null (was: expect pandas null values as numpy.nan (not only as

[jira] [Updated] (SPARK-6573) Convert inbound NaN values as null

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6573: --- Target Version/s: 1.4.0 > Convert inbound NaN values as null > -- > >

[jira] [Created] (SPARK-6638) optimize StringType in SQL

2015-03-31 Thread Davies Liu (JIRA)
Davies Liu created SPARK-6638: - Summary: optimize StringType in SQL Key: SPARK-6638 URL: https://issues.apache.org/jira/browse/SPARK-6638 Project: Spark Issue Type: Improvement Componen

[jira] [Created] (SPARK-6637) Test lambda weighting in implicit ALS

2015-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6637: Summary: Test lambda weighting in implicit ALS Key: SPARK-6637 URL: https://issues.apache.org/jira/browse/SPARK-6637 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-6620) Speed up toDF() and rdd() functions by constructing converters in ScalaReflection

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6620: Target Version/s: 1.4.0 > Speed up toDF() and rdd() functions by constructing converters in

[jira] [Updated] (SPARK-6620) Speed up toDF() and rdd() functions by constructing converters in ScalaReflection

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6620: Assignee: Volodymyr Lyubinets > Speed up toDF() and rdd() functions by constructing converte

[jira] [Assigned] (SPARK-3066) Support recommendAll in matrix factorization model

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3066: --- Assignee: Debasish Das (was: Apache Spark) > Support recommendAll in matrix factorization mo

[jira] [Assigned] (SPARK-3066) Support recommendAll in matrix factorization model

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3066: --- Assignee: Apache Spark (was: Debasish Das) > Support recommendAll in matrix factorization mo

[jira] [Commented] (SPARK-5295) Stabilize data types

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389722#comment-14389722 ] Apache Spark commented on SPARK-5295: - User 'JaysonSunshine' has created a pull reques

[jira] [Updated] (SPARK-6578) Outbound channel in network library is not thread-safe, can lead to fetch failures

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6578: --- Priority: Blocker (was: Critical) > Outbound channel in network library is not thread-safe, can lead

[jira] [Updated] (SPARK-5508) Arrays and Maps stored with Hive Parquet Serde may not be able to read by the Parquet support in the Data Souce API

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5508: Target Version/s: 1.4.0, 1.3.2 (was: 1.4.0) > Arrays and Maps stored with Hive Parquet Serd

[jira] [Updated] (SPARK-5508) Arrays and Maps stored with Hive Parquet Serde may not be able to read by the Parquet support in the Data Souce API

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5508: Target Version/s: 1.4.0 (was: 1.3.1) > Arrays and Maps stored with Hive Parquet Serde may n

[jira] [Resolved] (SPARK-6614) OutputCommitCoordinator should clear authorized committers only after authorized committer fails, not after any failure

2015-03-31 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6614. --- Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 > OutputCommitCoordinator shoul

[jira] [Resolved] (SPARK-5692) Model import/export for Word2Vec

2015-03-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5692. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5291 [https://githu

[jira] [Issue Comment Deleted] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-03-31 Thread Matt Aasted (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Aasted updated SPARK-6636: --- Comment: was deleted (was: Pull request: https://github.com/apache/spark/pull/5302) > Use public DNS

[jira] [Assigned] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6636: --- Assignee: Apache Spark > Use public DNS hostname everywhere in spark_ec2.py > ---

[jira] [Assigned] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6636: --- Assignee: (was: Apache Spark) > Use public DNS hostname everywhere in spark_ec2.py >

[jira] [Commented] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-03-31 Thread Matt Aasted (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389535#comment-14389535 ] Matt Aasted commented on SPARK-6636: Pull request: https://github.com/apache/spark/pul

[jira] [Commented] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389537#comment-14389537 ] Apache Spark commented on SPARK-6636: - User 'aasted' has created a pull request for th

[jira] [Created] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-03-31 Thread Matt Aasted (JIRA)
Matt Aasted created SPARK-6636: -- Summary: Use public DNS hostname everywhere in spark_ec2.py Key: SPARK-6636 URL: https://issues.apache.org/jira/browse/SPARK-6636 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6573) expect pandas null values as numpy.nan (not only as None)

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389386#comment-14389386 ] Reynold Xin commented on SPARK-6573: I just tested. NaN in Python is turned into NaN i

[jira] [Resolved] (SPARK-6633) Should be "Contains" instead of "EndsWith" when constructing sources.StringContains

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6633. Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 > Should be "Contains" instea

[jira] [Updated] (SPARK-6633) Should be "Contains" instead of "EndsWith" when constructing sources.StringContains

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6633: --- Priority: Critical (was: Minor) > Should be "Contains" instead of "EndsWith" when constructing > sou

[jira] [Updated] (SPARK-6633) Should be "Contains" instead of "EndsWith" when constructing sources.StringContains

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6633: --- Assignee: Liang-Chi Hsieh > Should be "Contains" instead of "EndsWith" when constructing > sources.St

[jira] [Commented] (SPARK-6065) Optimize word2vec.findSynonyms speed

2015-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389281#comment-14389281 ] Joseph K. Bradley commented on SPARK-6065: -- I'm not sure what the best solution w

[jira] [Comment Edited] (SPARK-6065) Optimize word2vec.findSynonyms speed

2015-03-31 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389244#comment-14389244 ] Manoj Kumar edited comment on SPARK-6065 at 3/31/15 7:31 PM: -

[jira] [Commented] (SPARK-6065) Optimize word2vec.findSynonyms speed

2015-03-31 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389244#comment-14389244 ] Manoj Kumar commented on SPARK-6065: [~josephkb] I would like to work on this. Does t

[jira] [Resolved] (SPARK-5371) Failure to analyze query with UNION ALL and double aggregation

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5371. - Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Issue resolved by p

[jira] [Commented] (SPARK-6631) I am unable to get the Maven Build file in Example 2.13 to build anything but an empty file

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389141#comment-14389141 ] Sean Owen commented on SPARK-6631: -- If it's about the book, I don't think the general Spa

[jira] [Resolved] (SPARK-6145) ORDER BY fails to resolve nested fields

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6145. - Resolution: Fixed Fix Version/s: (was: 1.3.0) 1.4.0

[jira] [Resolved] (SPARK-6255) Python MLlib API missing items: Classification

2015-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-6255. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5137 [https

[jira] [Updated] (SPARK-6598) Python API for IDFModel

2015-03-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6598: - Assignee: Kai Sasaki > Python API for IDFModel > --- > > Key:

[jira] [Updated] (SPARK-6612) Python KMeans parity

2015-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6612: - Assignee: Hrishikesh > Python KMeans parity > > > Key

[jira] [Resolved] (SPARK-6598) Python API for IDFModel

2015-03-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6598. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5264 [https://githu

[jira] [Updated] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4894: - Assignee: Leah McGuire (was: RJ Nowling) > Add Bernoulli-variant of Naive Bayes > ---

[jira] [Resolved] (SPARK-6575) Add configuration to disable schema merging while converting metastore Parquet tables

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6575. - Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Issue resolved by p

[jira] [Resolved] (SPARK-6555) Override equals and hashCode in MetastoreRelation

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6555. - Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Issue resolved by p

[jira] [Updated] (SPARK-6555) Override equals and hashCode in MetastoreRelation

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6555: Target Version/s: 1.3.1, 1.4.0 (was: 1.4.0) > Override equals and hashCode in MetastoreRela

[jira] [Resolved] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-03-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-4894. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4087 [https

[jira] [Updated] (SPARK-6633) Should be "Contains" instead of "EndsWith" when constructing sources.StringContains

2015-03-31 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6633: Target Version/s: 1.3.1 > Should be "Contains" instead of "EndsWith" when constructing > so

[jira] [Assigned] (SPARK-5854) Implement Personalized PageRank with GraphX

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5854: --- Assignee: (was: Apache Spark) > Implement Personalized PageRank with GraphX > ---

[jira] [Assigned] (SPARK-5854) Implement Personalized PageRank with GraphX

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5854: --- Assignee: Apache Spark > Implement Personalized PageRank with GraphX > --

[jira] [Created] (SPARK-6635) DataFrame.withColumn can create columns with identical names

2015-03-31 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6635: Summary: DataFrame.withColumn can create columns with identical names Key: SPARK-6635 URL: https://issues.apache.org/jira/browse/SPARK-6635 Project: Spark

[jira] [Assigned] (SPARK-6617) Word2Vec is nondeterministic

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6617: --- Assignee: Apache Spark > Word2Vec is nondeterministic > > >

[jira] [Assigned] (SPARK-6617) Word2Vec is nondeterministic

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6617: --- Assignee: (was: Apache Spark) > Word2Vec is nondeterministic > --

[jira] [Commented] (SPARK-6617) Word2Vec is nondeterministic

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388872#comment-14388872 ] Apache Spark commented on SPARK-6617: - User 'ilganeli' has created a pull request for

[jira] [Created] (SPARK-6634) Allow replacing columns in Transformers

2015-03-31 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6634: Summary: Allow replacing columns in Transformers Key: SPARK-6634 URL: https://issues.apache.org/jira/browse/SPARK-6634 Project: Spark Issue Type: Imp

[jira] [Assigned] (SPARK-6633) Should be "Contains" instead of "EndsWith" when constructing sources.StringContains

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6633: --- Assignee: Apache Spark > Should be "Contains" instead of "EndsWith" when constructing > sour

[jira] [Assigned] (SPARK-6633) Should be "Contains" instead of "EndsWith" when constructing sources.StringContains

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6633: --- Assignee: (was: Apache Spark) > Should be "Contains" instead of "EndsWith" when construct

[jira] [Commented] (SPARK-6633) Should be "Contains" instead of "EndsWith" when constructing sources.StringContains

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388740#comment-14388740 ] Apache Spark commented on SPARK-6633: - User 'viirya' has created a pull request for th

[jira] [Created] (SPARK-6633) Should be "Contains" instead of "EndsWith" when constructing sources.StringContains

2015-03-31 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6633: -- Summary: Should be "Contains" instead of "EndsWith" when constructing sources.StringContains Key: SPARK-6633 URL: https://issues.apache.org/jira/browse/SPARK-6633

[jira] [Comment Edited] (SPARK-2629) Improve performance of DStream.updateStateByKey

2015-03-31 Thread Vinoth Chandar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388681#comment-14388681 ] Vinoth Chandar edited comment on SPARK-2629 at 3/31/15 3:34 PM:

[jira] [Comment Edited] (SPARK-5004) PySpark does not handle SOCKS proxy

2015-03-31 Thread Eric O. LEBIGOT (EOL) (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388680#comment-14388680 ] Eric O. LEBIGOT (EOL) edited comment on SPARK-5004 at 3/31/15 3:14 PM: -

[jira] [Commented] (SPARK-2629) Improve performance of DStream.updateStateByKey

2015-03-31 Thread Vinoth Chandar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388681#comment-14388681 ] Vinoth Chandar commented on SPARK-2629: --- [~tdas] are you guys thinking along the lin

  1   2   >