[jira] [Comment Edited] (SPARK-14520) ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true

2016-04-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234496#comment-15234496 ] Liang-Chi Hsieh edited comment on SPARK-14520 at 4/11/16 5:09 AM: -- Hi

[jira] [Commented] (SPARK-14520) ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true

2016-04-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234496#comment-15234496 ] Liang-Chi Hsieh commented on SPARK-14520: - Hi [~Rajesh Balamohan], I submitted a PR for this

[jira] [Assigned] (SPARK-14520) ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14520: Assignee: Apache Spark > ClasscastException thrown with

[jira] [Commented] (SPARK-14520) ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234495#comment-15234495 ] Apache Spark commented on SPARK-14520: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14520) ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14520: Assignee: (was: Apache Spark) > ClasscastException thrown with

[jira] [Comment Edited] (SPARK-13352) BlockFetch does not scale well on large block

2016-04-10 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234488#comment-15234488 ] Zhang, Liye edited comment on SPARK-13352 at 4/11/16 5:02 AM: -- Hi [~davies],

[jira] [Commented] (SPARK-14253) Avoid registering temporary functions in Hive

2016-04-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234492#comment-15234492 ] Liang-Chi Hsieh commented on SPARK-14253: - This can be closed now. > Avoid registering temporary

[jira] [Commented] (SPARK-13352) BlockFetch does not scale well on large block

2016-04-10 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234488#comment-15234488 ] Zhang, Liye commented on SPARK-13352: - Hi [~davies], I think this JIRA is related with

[jira] [Created] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-10 Thread Justin Pihony (JIRA)
Justin Pihony created SPARK-14525: - Summary: DataFrameWriter's save method should delegate to jdbc for jdbc datasource Key: SPARK-14525 URL: https://issues.apache.org/jira/browse/SPARK-14525 Project:

[jira] [Updated] (SPARK-14486) For partition table, the dag occurs oom because of too many same rdds

2016-04-10 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] meiyoula updated SPARK-14486: - Description: For partition table, when partition rdds do some maps, the rdd number will multiple grow.

[jira] [Updated] (SPARK-14486) For partition table, the dag occurs oom because of too many same rdds

2016-04-10 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] meiyoula updated SPARK-14486: - Attachment: screenshot-1.png > For partition table, the dag occurs oom because of too many same rdds >

[jira] [Created] (SPARK-14524) In SparkSQL, it can't be select column of String type because of UTF8String when setting more than 32G for executors.

2016-04-10 Thread Deng Changchun (JIRA)
Deng Changchun created SPARK-14524: -- Summary: In SparkSQL, it can't be select column of String type because of UTF8String when setting more than 32G for executors. Key: SPARK-14524 URL:

[jira] [Created] (SPARK-14523) Feature parity for Statistics ML with MLlib

2016-04-10 Thread yuhao yang (JIRA)
yuhao yang created SPARK-14523: -- Summary: Feature parity for Statistics ML with MLlib Key: SPARK-14523 URL: https://issues.apache.org/jira/browse/SPARK-14523 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-14522) Getting an error of BoneCP specified but not present in CLASSPATH

2016-04-10 Thread Niranjan Molkeri` (JIRA)
Niranjan Molkeri` created SPARK-14522: - Summary: Getting an error of BoneCP specified but not present in CLASSPATH Key: SPARK-14522 URL: https://issues.apache.org/jira/browse/SPARK-14522 Project:

[jira] [Commented] (SPARK-14521) StackOverflowError in Kryo when executing TPC-DS Query27

2016-04-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234380#comment-15234380 ] Rajesh Balamohan commented on SPARK-14521: -- Build with commit

[jira] [Updated] (SPARK-14521) StackOverflowError in Kryo when executing TPC-DS Query27

2016-04-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated SPARK-14521: - Summary: StackOverflowError in Kryo when executing TPC-DS Query27 (was:

[jira] [Created] (SPARK-14521) StackOverflowError when executing TPC-DS Query27

2016-04-10 Thread Rajesh Balamohan (JIRA)
Rajesh Balamohan created SPARK-14521: Summary: StackOverflowError when executing TPC-DS Query27 Key: SPARK-14521 URL: https://issues.apache.org/jira/browse/SPARK-14521 Project: Spark

[jira] [Updated] (SPARK-14520) ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true

2016-04-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated SPARK-14520: - Description: Build details: Spark build from master branch (Apr-10) TPC-DS at 200 GB

[jira] [Updated] (SPARK-14520) ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true

2016-04-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated SPARK-14520: - Description: Build details: Spark build from master branch (Apr-10) TPC-DS at 200 GB

[jira] [Created] (SPARK-14520) ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true

2016-04-10 Thread Rajesh Balamohan (JIRA)
Rajesh Balamohan created SPARK-14520: Summary: ClasscastException thrown with spark.sql.parquet.enableVectorizedReader=true Key: SPARK-14520 URL: https://issues.apache.org/jira/browse/SPARK-14520

[jira] [Commented] (SPARK-14419) Improve the HashedRelation for key fit within Long

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234353#comment-15234353 ] Apache Spark commented on SPARK-14419: -- User 'davies' has created a pull request for this issue:

[jira] [Commented] (SPARK-14289) Support multiple eviction strategies for cached RDD partitions

2016-04-10 Thread Ben Manes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234267#comment-15234267 ] Ben Manes commented on SPARK-14289: --- How about

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-04-10 Thread John Berryman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234264#comment-15234264 ] John Berryman commented on SPARK-13587: --- At my work we're using devpi and a homecooked proxy server

[jira] [Created] (SPARK-14519) Cross-publish Kafka for Scala 2.12.0-M4

2016-04-10 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-14519: -- Summary: Cross-publish Kafka for Scala 2.12.0-M4 Key: SPARK-14519 URL: https://issues.apache.org/jira/browse/SPARK-14519 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-14415) All functions should show usages by command `DESC FUNCTION`

2016-04-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-14415: - Assignee: Dongjoon Hyun > All functions should show usages by command `DESC FUNCTION` >

[jira] [Resolved] (SPARK-14415) All functions should show usages by command `DESC FUNCTION`

2016-04-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14415. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12185

[jira] [Assigned] (SPARK-14518) Support Comment in CREATE VIEW

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14518: Assignee: Apache Spark > Support Comment in CREATE VIEW > --

[jira] [Commented] (SPARK-14518) Support Comment in CREATE VIEW

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234160#comment-15234160 ] Apache Spark commented on SPARK-14518: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14518) Support Comment in CREATE VIEW

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14518: Assignee: (was: Apache Spark) > Support Comment in CREATE VIEW >

[jira] [Updated] (SPARK-14518) Support Comment in CREATE VIEW

2016-04-10 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-14518: Description: {noformat} CREATE VIEW [IF NOT EXISTS] [db_name.]view_name [(column_name [COMMENT

[jira] [Created] (SPARK-14518) Support Comment in CREATE VIEW

2016-04-10 Thread Xiao Li (JIRA)
Xiao Li created SPARK-14518: --- Summary: Support Comment in CREATE VIEW Key: SPARK-14518 URL: https://issues.apache.org/jira/browse/SPARK-14518 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-14505) Creating two SparkContext Object in the same jvm, the first one will can not run any tasks!

2016-04-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234155#comment-15234155 ] Sean Owen commented on SPARK-14505: --- I'd say the solution is clearly to not make a second context, but,

[jira] [Commented] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234153#comment-15234153 ] Sean Owen commented on SPARK-14022: --- It's pretty simple to implement even without library support. It

[jira] [Updated] (SPARK-14516) What about adding general clustering metrics?

2016-04-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14516: -- Priority: Minor (was: Major) I personally think silhouette could be worth adding. The supervised

[jira] [Comment Edited] (SPARK-14479) GLM supports output link prediction

2016-04-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234124#comment-15234124 ] Yanbo Liang edited comment on SPARK-14479 at 4/10/16 1:59 PM: -- Had offline

[jira] [Updated] (SPARK-14479) GLM supports output link prediction

2016-04-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-14479: Summary: GLM supports output link prediction (was: GLM predict type should be link or response?)

[jira] [Updated] (SPARK-14479) GLM predict type should be link or response?

2016-04-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-14479: Issue Type: Improvement (was: Question) > GLM predict type should be link or response? >

[jira] [Commented] (SPARK-14479) GLM predict type should be link or response?

2016-04-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234124#comment-15234124 ] Yanbo Liang commented on SPARK-14479: - Had offline discussion with [~mengxr], we can output 2

[jira] [Comment Edited] (SPARK-14479) GLM predict type should be link or response?

2016-04-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234124#comment-15234124 ] Yanbo Liang edited comment on SPARK-14479 at 4/10/16 1:52 PM: -- Had offline

[jira] [Assigned] (SPARK-14479) GLM predict type should be link or response?

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14479: Assignee: Apache Spark > GLM predict type should be link or response? >

[jira] [Commented] (SPARK-14479) GLM predict type should be link or response?

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234121#comment-15234121 ] Apache Spark commented on SPARK-14479: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14479) GLM predict type should be link or response?

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14479: Assignee: (was: Apache Spark) > GLM predict type should be link or response? >

[jira] [Closed] (SPARK-14517) GLM should support predict link

2016-04-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang closed SPARK-14517. --- Resolution: Duplicate > GLM should support predict link > --- > >

[jira] [Comment Edited] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-04-10 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234078#comment-15234078 ] Nick Pentreath edited comment on SPARK-13944 at 4/10/16 12:15 PM: -- What

[jira] [Commented] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-04-10 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234078#comment-15234078 ] Nick Pentreath commented on SPARK-13944: What about the case of {{dataFrame.rdd.map { case Row(v:

[jira] [Commented] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-04-10 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234077#comment-15234077 ] Nick Pentreath commented on SPARK-13944: Type alias is a better solution if we aim to break

[jira] [Updated] (SPARK-14517) GLM should support predict link

2016-04-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-14517: Issue Type: Improvement (was: Question) > GLM should support predict link >

[jira] [Updated] (SPARK-14517) GLM should support predict link

2016-04-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-14517: Summary: GLM should support predict link (was: CLONE - GLM predict type should be link or

[jira] [Created] (SPARK-14517) CLONE - GLM predict type should be link or response?

2016-04-10 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-14517: --- Summary: CLONE - GLM predict type should be link or response? Key: SPARK-14517 URL: https://issues.apache.org/jira/browse/SPARK-14517 Project: Spark Issue

[jira] [Closed] (SPARK-9882) Priority-based scheduling for Spark applications

2016-04-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-9882. -- Resolution: Won't Fix > Priority-based scheduling for Spark applications >

[jira] [Comment Edited] (SPARK-9882) Priority-based scheduling for Spark applications

2016-04-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234028#comment-15234028 ] Liang-Chi Hsieh edited comment on SPARK-9882 at 4/10/16 9:54 AM: - This PR

[jira] [Commented] (SPARK-9882) Priority-based scheduling for Spark applications

2016-04-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234028#comment-15234028 ] Liang-Chi Hsieh commented on SPARK-9882: This PR stays for a while. As the PR doesn't get the

[jira] [Commented] (SPARK-9882) Priority-based scheduling for Spark applications

2016-04-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234027#comment-15234027 ] Liang-Chi Hsieh commented on SPARK-9882: I've updated the description. Thanks! > Priority-based

[jira] [Updated] (SPARK-9882) Priority-based scheduling for Spark applications

2016-04-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-9882: --- Description: We implement this patch because in our daily usage of Spark we found that

[jira] [Comment Edited] (SPARK-9882) Priority-based scheduling for Spark applications

2016-04-10 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234024#comment-15234024 ] Mark Hamstra edited comment on SPARK-9882 at 4/10/16 9:49 AM: -- This isn't a

[jira] [Commented] (SPARK-9882) Priority-based scheduling for Spark applications

2016-04-10 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15234024#comment-15234024 ] Mark Hamstra commented on SPARK-9882: - This isn't a very well written JIRA. You are just duplicating

[jira] [Commented] (SPARK-14298) LDA should support disable checkpoint

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233996#comment-15233996 ] Apache Spark commented on SPARK-14298: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Updated] (SPARK-14497) Use top instead of sortBy() to get top N frequent words as dict in ConutVectorizer

2016-04-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14497: -- Assignee: Feng Wang > Use top instead of sortBy() to get top N frequent words as dict in >

[jira] [Resolved] (SPARK-14497) Use top instead of sortBy() to get top N frequent words as dict in ConutVectorizer

2016-04-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14497. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12265

[jira] [Commented] (SPARK-14363) Executor OOM due to a memory leak in Sorter

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233978#comment-15233978 ] Apache Spark commented on SPARK-14363: -- User 'sitalkedia' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14363) Executor OOM due to a memory leak in Sorter

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14363: Assignee: (was: Apache Spark) > Executor OOM due to a memory leak in Sorter >

[jira] [Assigned] (SPARK-14363) Executor OOM due to a memory leak in Sorter

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14363: Assignee: Apache Spark > Executor OOM due to a memory leak in Sorter >

[jira] [Updated] (SPARK-14363) Executor OOM due to a memory leak in Sorter

2016-04-10 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sital Kedia updated SPARK-14363: Description: While running a Spark job, we see that the job fails because of executor OOM with

[jira] [Updated] (SPARK-14363) Executor OOM due to a memory leak in Sorter

2016-04-10 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sital Kedia updated SPARK-14363: Description: While running a Spark job, we see that the job fails because of executor OOM with

[jira] [Updated] (SPARK-14363) Executor OOM due to a memory leak in Sorter

2016-04-10 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sital Kedia updated SPARK-14363: Summary: Executor OOM due to a memory leak in Sorter (was: Executor OOM while trying to acquire

[jira] [Comment Edited] (SPARK-12922) Implement gapply() on DataFrame in SparkR

2016-04-10 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233886#comment-15233886 ] Narine Kokhlikyan edited comment on SPARK-12922 at 4/10/16 7:23 AM:

[jira] [Commented] (SPARK-14516) What about adding general clustering metrics?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233946#comment-15233946 ] zhengruifeng commented on SPARK-14516: -- cc [~mengxr] [~josephkb] [~yanboliang] > What about adding

[jira] [Created] (SPARK-14516) What about adding general clustering metrics?

2016-04-10 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14516: Summary: What about adding general clustering metrics? Key: SPARK-14516 URL: https://issues.apache.org/jira/browse/SPARK-14516 Project: Spark Issue Type:

[jira] [Commented] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233941#comment-15233941 ] zhengruifeng commented on SPARK-14022: -- cc [~yanboliang] [~mengxr] [~josephkb] > What about adding

[jira] [Commented] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233937#comment-15233937 ] zhengruifeng commented on SPARK-14022: -- Ok, I change the Type from Question to Brainstroming. I

[jira] [Reopened] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reopened SPARK-14022: -- There may need some discuss on whether to add RandomProjection or Not. > What about adding

[jira] [Updated] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-14022: - Issue Type: Brainstorming (was: Question) > What about adding RandomProjection to ML/MLLIB as a

[jira] [Resolved] (SPARK-14357) Tasks that fail due to CommitDeniedException (a side-effect of speculation) can cause job failure

2016-04-10 Thread Jason Moore (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Moore resolved SPARK-14357. - Resolution: Fixed Issue resolved by pull request 12228

[jira] [Updated] (SPARK-14357) Tasks that fail due to CommitDeniedException (a side-effect of speculation) can cause job failure

2016-04-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14357: -- Assignee: Jason Moore > Tasks that fail due to CommitDeniedException (a side-effect of speculation) >

[jira] [Updated] (SPARK-14357) Tasks that fail due to CommitDeniedException (a side-effect of speculation) can cause job failure

2016-04-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14357: -- Target Version/s: 1.5.2, 1.6.2, 2.0.0 Fix Version/s: 1.5.2 2.0.0

[jira] [Resolved] (SPARK-14455) ReceiverTracker#allocatedExecutors throw NPE for receiver-less streaming application

2016-04-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-14455. --- Resolution: Fixed Assignee: Saisai Shao Fix Version/s: 2.0.0 Target

[jira] [Resolved] (SPARK-14506) HiveClientImpl's toHiveTable misses a table property for external tables

2016-04-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-14506. --- Resolution: Fixed Assignee: Yin Huai Fix Version/s: 2.0.0 > HiveClientImpl's

[jira] [Commented] (SPARK-14406) Drop Table

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233908#comment-15233908 ] Apache Spark commented on SPARK-14406: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-14362) DDL Native Support: Drop View

2016-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233907#comment-15233907 ] Apache Spark commented on SPARK-14362: -- User 'gatorsmile' has created a pull request for this issue: