[jira] [Resolved] (SPARK-2510) word2vec: Distributed Representation of Words

2014-08-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2510. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1719 [https://gith

[jira] [Created] (SPARK-2823) GraphX jobs throw IllegalArgumentException

2014-08-03 Thread Lu Lu (JIRA)
Lu Lu created SPARK-2823: Summary: GraphX jobs throw IllegalArgumentException Key: SPARK-2823 URL: https://issues.apache.org/jira/browse/SPARK-2823 Project: Spark Issue Type: Bug Components

[jira] [Issue Comment Deleted] (SPARK-2820) Group by query not returning random values

2014-08-03 Thread Athira Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Athira Das updated SPARK-2820: -- Comment: was deleted (was: sqlContext.sql("SELECT id, month, AVG(marks) FROM data WHERE marks>25 GROUP

[jira] [Reopened] (SPARK-2820) Group by query not returning random values

2014-08-03 Thread Athira Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Athira Das reopened SPARK-2820: --- > Group by query not returning random values > -- > >

[jira] [Commented] (SPARK-2820) Group by query not returning random values

2014-08-03 Thread Athira Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084365#comment-14084365 ] Athira Das commented on SPARK-2820: --- sqlContext.sql("SELECT id, month, AVG(marks) FROM d

[jira] [Updated] (SPARK-2820) Group by query not returning random values

2014-08-03 Thread Athira Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Athira Das updated SPARK-2820: -- Description: sqlContext.sql("SELECT id, month, AVG(marks) FROM data WHERE marks>25 GROUP BY id, month"

[jira] [Created] (SPARK-2822) Group by returning random values in SparkSQL

2014-08-03 Thread Athira Das (JIRA)
Athira Das created SPARK-2822: - Summary: Group by returning random values in SparkSQL Key: SPARK-2822 URL: https://issues.apache.org/jira/browse/SPARK-2822 Project: Spark Issue Type: Question

[jira] [Created] (SPARK-2821) Group by returning random values in Spark SQL. While running the query sqlContext.sql("SELECT id, month, AVG(marks) FROM data WHERE marks>25 GROUP BY id, month")

2014-08-03 Thread Athira Das (JIRA)
Athira Das created SPARK-2821: - Summary: Group by returning random values in Spark SQL. While running the query sqlContext.sql("SELECT id, month, AVG(marks) FROM data WHERE marks>25 GROUP BY id, month") Key: SPARK-2821

[jira] [Closed] (SPARK-2820) Group by query not returning random values

2014-08-03 Thread Athira Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Athira Das closed SPARK-2820. - Resolution: Fixed > Group by query not returning random values >

[jira] [Comment Edited] (SPARK-2812) convert maven to archetype based build

2014-08-03 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084347#comment-14084347 ] Prashant Sharma edited comment on SPARK-2812 at 8/4/14 6:17 AM:

[jira] [Created] (SPARK-2820) Group by query not returning random values

2014-08-03 Thread Athira Das (JIRA)
Athira Das created SPARK-2820: - Summary: Group by query not returning random values Key: SPARK-2820 URL: https://issues.apache.org/jira/browse/SPARK-2820 Project: Spark Issue Type: Question

[jira] [Commented] (SPARK-2812) convert maven to archetype based build

2014-08-03 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084347#comment-14084347 ] Prashant Sharma commented on SPARK-2812: What do you mean by archetype based build

[jira] [Updated] (SPARK-2818) Improve joinning RDDs that transformed from the same cached RDD

2014-08-03 Thread Lu Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lu Lu updated SPARK-2818: - Component/s: Spark Core Description: if the joinning RDDs are originating from a same cached RDD a, the DAGSc

[jira] [Created] (SPARK-2819) Difficult to turn on intercept with linear models

2014-08-03 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-2819: - Summary: Difficult to turn on intercept with linear models Key: SPARK-2819 URL: https://issues.apache.org/jira/browse/SPARK-2819 Project: Spark Issue Type: Improve

[jira] [Created] (SPARK-2818) Improve joinning RDDs that transformed from the same cached RDD

2014-08-03 Thread Lu Lu (JIRA)
Lu Lu created SPARK-2818: Summary: Improve joinning RDDs that transformed from the same cached RDD Key: SPARK-2818 URL: https://issues.apache.org/jira/browse/SPARK-2818 Project: Spark Issue Type: Im

[jira] [Commented] (SPARK-2817) add "show create table" support

2014-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084308#comment-14084308 ] Apache Spark commented on SPARK-2817: - User 'tianyi' has created a pull request for th

[jira] [Resolved] (SPARK-2272) Feature scaling which standardizes the range of independent variables or features of data.

2014-08-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2272. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1207 [https://gith

[jira] [Commented] (SPARK-2815) Compilation failed upon the hadoop version 2.0.0-cdh4.5.0

2014-08-03 Thread pengyanhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084287#comment-14084287 ] pengyanhong commented on SPARK-2815: I changed the YarnAllocationHandler.scala file as

[jira] [Created] (SPARK-2817) add "show create table" support

2014-08-03 Thread Yi Tian (JIRA)
Yi Tian created SPARK-2817: -- Summary: add "show create table" support Key: SPARK-2817 URL: https://issues.apache.org/jira/browse/SPARK-2817 Project: Spark Issue Type: Bug Components: SQL

[jira] [Commented] (SPARK-2816) Type-safe SQL queries

2014-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084257#comment-14084257 ] Apache Spark commented on SPARK-2816: - User 'marmbrus' has created a pull request for

[jira] [Created] (SPARK-2816) Type-safe SQL queries

2014-08-03 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-2816: --- Summary: Type-safe SQL queries Key: SPARK-2816 URL: https://issues.apache.org/jira/browse/SPARK-2816 Project: Spark Issue Type: New Feature C

[jira] [Closed] (SPARK-2744) The configuration "spark.history.retainedApplications" is invalid

2014-08-03 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] meiyoula closed SPARK-2744. --- Resolution: Not a Problem > The configuration "spark.history.retainedApplications" is invalid > -

[jira] [Commented] (SPARK-2583) ConnectionManager cannot distinguish whether error occurred or not

2014-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084195#comment-14084195 ] Apache Spark commented on SPARK-2583: - User 'JoshRosen' has created a pull request for

[jira] [Resolved] (SPARK-2810) update scala-maven-plugin to version 3.2.0

2014-08-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2810. Resolution: Fixed Fix Version/s: 1.1.0 Target Version/s: 1.1.0 Fixed by: h

[jira] [Updated] (SPARK-2810) update scala-maven-plugin to version 3.2.0

2014-08-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2810: --- Assignee: Anand Avati > update scala-maven-plugin to version 3.2.0 >

[jira] [Commented] (SPARK-1981) Add AWS Kinesis streaming support

2014-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084164#comment-14084164 ] Apache Spark commented on SPARK-1981: - User 'cfregly' has created a pull request for t

[jira] [Resolved] (SPARK-1740) Pyspark cancellation kills unrelated pyspark workers

2014-08-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-1740. --- Resolution: Fixed Fix Version/s: 1.1.0 > Pyspark cancellation kills unrelated pyspark workers

[jira] [Updated] (SPARK-2360) CSV import to SchemaRDDs

2014-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2360: Target Version/s: 1.2.0 (was: 1.1.0) > CSV import to SchemaRDDs >

[jira] [Resolved] (SPARK-2783) Basic support for analyze in HiveContext

2014-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2783. - Resolution: Fixed Fix Version/s: 1.1.0 > Basic support for analyze in HiveContext

[jira] [Updated] (SPARK-2360) CSV import to SchemaRDDs

2014-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2360: Priority: Major (was: Minor) > CSV import to SchemaRDDs > > >

[jira] [Resolved] (SPARK-2752) spark sql cli should not exit when get a exception

2014-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2752. - Resolution: Fixed Target Version/s: 1.1.0 > spark sql cli should not exit when

[jira] [Resolved] (SPARK-2784) Make language configurable using SQLConf instead of hql/sql functions

2014-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2784. - Resolution: Fixed Fix Version/s: 1.1.0 > Make language configurable using SQLConf

[jira] [Resolved] (SPARK-2814) HiveThriftServer throws NPE when executing native commands

2014-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2814. - Resolution: Fixed Fix Version/s: 1.1.0 Assignee: Cheng Lian > HiveThriftS

[jira] [Commented] (SPARK-1997) Update breeze to version 0.8.1

2014-08-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084057#comment-14084057 ] Xiangrui Meng commented on SPARK-1997: -- It's fine within Spark. If we add breeze-0.8.

[jira] [Resolved] (SPARK-2197) Spark invoke DecisionTree by Java

2014-08-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2197. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1740 [https://gith

[jira] [Resolved] (SPARK-2246) Add user-data option to EC2 scripts

2014-08-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2246. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1186 [https://

[jira] [Updated] (SPARK-2246) Add user-data option to EC2 scripts

2014-08-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2246: --- Assignee: Allan Douglas R. de Oliveira > Add user-data option to EC2 scripts > --

[jira] [Resolved] (SPARK-2712) Add a small note that mvn "package" must happen before "test"

2014-08-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2712. Resolution: Fixed Issue resolved by pull request 1615 [https://github.com/apache/spark/pull

[jira] [Commented] (SPARK-2815) Compilation failed upon the hadoop version 2.0.0-cdh4.5.0

2014-08-03 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084034#comment-14084034 ] Guoqiang Li commented on SPARK-2815: Currently {{yarn-alpha}} does not support version

[jira] [Commented] (SPARK-1335) Also increase perm gen / code cache for scalatest when invoked via Maven build

2014-08-03 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084026#comment-14084026 ] Guoqiang Li commented on SPARK-1335: The problem also appeared in branch 1.1. The foll

[jira] [Commented] (SPARK-1981) Add AWS Kinesis streaming support

2014-08-03 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084025#comment-14084025 ] Nicholas Chammas commented on SPARK-1981: - Word. Thanks for the clarification! >

[jira] [Commented] (SPARK-2815) Compilation failed upon the hadoop version 2.0.0-cdh4.5.0

2014-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084009#comment-14084009 ] Sean Owen commented on SPARK-2815: -- Your build command is out of date. SPARK_HADOOP_VERSI

[jira] [Comment Edited] (SPARK-2815) Compilation failed upon the hadoop version 2.0.0-cdh4.5.0

2014-08-03 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084006#comment-14084006 ] Guoqiang Li edited comment on SPARK-2815 at 8/3/14 3:10 PM: [~

[jira] [Commented] (SPARK-2815) Compilation failed upon the hadoop version 2.0.0-cdh4.5.0

2014-08-03 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084006#comment-14084006 ] Guoqiang Li commented on SPARK-2815: [~pengyanhong] You can try this first {{./sbt/sbt

[jira] [Commented] (SPARK-2815) Compilation failed upon the hadoop version 2.0.0-cdh4.5.0

2014-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14084000#comment-14084000 ] Apache Spark commented on SPARK-2815: - User 'witgo' has created a pull request for thi

[jira] [Commented] (SPARK-2815) Compilation failed upon the hadoop version 2.0.0-cdh4.5.0

2014-08-03 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083998#comment-14083998 ] Guoqiang Li commented on SPARK-2815: I also encountered this bug. PRed: https://githu

[jira] [Created] (SPARK-2815) Compilation failed upon the hadoop version 2.0.0-cdh4.5.0

2014-08-03 Thread pengyanhong (JIRA)
pengyanhong created SPARK-2815: -- Summary: Compilation failed upon the hadoop version 2.0.0-cdh4.5.0 Key: SPARK-2815 URL: https://issues.apache.org/jira/browse/SPARK-2815 Project: Spark Issue Typ

[jira] [Updated] (SPARK-2814) HiveThriftServer throws NPE when executing native commands

2014-08-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-2814: -- Description: After [PR #1686|https://github.com/apache/spark/pull/1686], {{HiveThriftServer2}} throws

[jira] [Commented] (SPARK-2814) HiveThriftServer throws NPE when executing native commands

2014-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083974#comment-14083974 ] Apache Spark commented on SPARK-2814: - User 'liancheng' has created a pull request for

[jira] [Created] (SPARK-2814) HiveThriftServer throws NPE when executing native commands

2014-08-03 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-2814: - Summary: HiveThriftServer throws NPE when executing native commands Key: SPARK-2814 URL: https://issues.apache.org/jira/browse/SPARK-2814 Project: Spark Issue Type

[jira] [Commented] (SPARK-2803) add Kafka stream feature for fetch messages from specified starting offset position

2014-08-03 Thread pengyanhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083973#comment-14083973 ] pengyanhong commented on SPARK-2803: resolved this issue in the pull request #1602 >

[jira] [Commented] (SPARK-1449) Please delete old releases from mirroring system

2014-08-03 Thread Sebb (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083942#comment-14083942 ] Sebb commented on SPARK-1449: - No need to check out the directory tree (which is large), you c

[jira] [Commented] (SPARK-1449) Please delete old releases from mirroring system

2014-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083933#comment-14083933 ] Sean Owen commented on SPARK-1449: -- Sebb, is this just a matter of "svn co https://dist.

[jira] [Commented] (SPARK-1997) Update breeze to version 0.8.1

2014-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083929#comment-14083929 ] Sean Owen commented on SPARK-1997: -- Was scalalogging a problem per se? the issue was that

[jira] [Commented] (SPARK-1022) Add unit tests for kafka streaming

2014-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083928#comment-14083928 ] Apache Spark commented on SPARK-1022: - User 'jerryshao' has created a pull request for