[jira] [Commented] (SPARK-10276) Add @since annotation to pyspark.mllib.recommendation

2015-09-09 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736346#comment-14736346 ] Yu Ishikawa commented on SPARK-10276: - [~mengxr] should we add `@since` to the clas

[jira] [Created] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Yu Ishikawa (JIRA)
Yu Ishikawa created SPARK-10512: --- Summary: Fix @since when a function doesn't have doc Key: SPARK-10512 URL: https://issues.apache.org/jira/browse/SPARK-10512 Project: Spark Issue Type: Improve

[jira] [Commented] (SPARK-10507) timestamp - timestamp

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736385#comment-14736385 ] Sean Owen commented on SPARK-10507: --- (Can you improve the title and description please?

[jira] [Updated] (SPARK-10507) timestamp - timestamp

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10507: -- Priority: Minor (was: Major) > timestamp - timestamp > -- > > Key

[jira] [Updated] (SPARK-10502) tidy up the exception message text to be less verbose/"User friendly"

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10502: -- Issue Type: Improvement (was: Bug) > tidy up the exception message text to be less verbose/"User frien

[jira] [Resolved] (SPARK-10111) StringIndexerModel lacks of method "labels"

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10111. --- Resolution: Duplicate > StringIndexerModel lacks of method "labels" > ---

[jira] [Updated] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Ishikawa updated SPARK-10512: Description: When I tried to add @since to a function which doesn't have doc, @since didn't go wel

[jira] [Updated] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Ishikawa updated SPARK-10512: Description: When I tried to add @since to a function which doesn't have doc, @since didn't go wel

[jira] [Assigned] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10512: Assignee: (was: Apache Spark) > Fix @since when a function doesn't have doc >

[jira] [Assigned] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10512: Assignee: Apache Spark > Fix @since when a function doesn't have doc > ---

[jira] [Commented] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736389#comment-14736389 ] Apache Spark commented on SPARK-10512: -- User 'yu-iskw' has created a pull request fo

[jira] [Commented] (SPARK-10444) Remove duplication in Mesos schedulers

2015-09-09 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736434#comment-14736434 ] Iulian Dragos commented on SPARK-10444: --- Another example of duplicated logic: https

[jira] [Updated] (SPARK-7825) Poor performance in Cross Product due to no combine operations for small files.

2015-09-09 Thread Tang Yan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tang Yan updated SPARK-7825: Affects Version/s: (was: 1.3.1) (was: 1.2.2) (was:

[jira] [Resolved] (SPARK-10227) sbt build on Scala 2.11 fails

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10227. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8433 [https://github.com

[jira] [Updated] (SPARK-10227) sbt build on Scala 2.11 fails

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10227: -- Assignee: Luc Bourlier > sbt build on Scala 2.11 fails > - > >

[jira] [Updated] (SPARK-10316) respect non-deterministic expressions in PhysicalOperation

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10316: -- Assignee: Wenchen Fan > respect non-deterministic expressions in PhysicalOperation > --

[jira] [Updated] (SPARK-4752) Classifier based on artificial neural network

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4752: - Assignee: Alexander Ulanov > Classifier based on artificial neural network > -

[jira] [Updated] (SPARK-10327) Cache Table is not working while subquery has alias in its project list

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10327: -- Assignee: Cheng Hao > Cache Table is not working while subquery has alias in its project list > ---

[jira] [Updated] (SPARK-10441) Cannot write timestamp to JSON

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10441: -- Assignee: Yin Huai > Cannot write timestamp to JSON > -- > >

[jira] [Updated] (SPARK-10501) support UUID as an atomic type

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10501: -- Priority: Minor (was: Major) Component/s: SQL Issue Type: Improvement (was: Bug) > suppor

[jira] [Commented] (SPARK-9564) Spark 1.5.0 Testing Plan

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736594#comment-14736594 ] Sean Owen commented on SPARK-9564: -- Now that 1.5.0 is released, can this be closed? Or e

[jira] [Created] (SPARK-10513) Springleaf Marketing Response

2015-09-09 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10513: --- Summary: Springleaf Marketing Response Key: SPARK-10513 URL: https://issues.apache.org/jira/browse/SPARK-10513 Project: Spark Issue Type: Sub-task Co

[jira] [Commented] (SPARK-10513) Springleaf Marketing Response

2015-09-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736648#comment-14736648 ] Yanbo Liang commented on SPARK-10513: - I will work on this dataset. > Springleaf Mar

[jira] [Commented] (SPARK-9578) Stemmer feature transformer

2015-09-09 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736695#comment-14736695 ] yuhao yang commented on SPARK-9578: --- A better choice for LDA seems to be lemmatization.

[jira] [Created] (SPARK-10514) Minimum ratio of registered resources [ spark.scheduler.minRegisteredResourcesRatio] is not enabled for Mesos Coarse Grained mode

2015-09-09 Thread Akash Mishra (JIRA)
Akash Mishra created SPARK-10514: Summary: Minimum ratio of registered resources [ spark.scheduler.minRegisteredResourcesRatio] is not enabled for Mesos Coarse Grained mode Key: SPARK-10514 URL: https://issues.ap

[jira] [Updated] (SPARK-10507) reject temporal expressions such as timestamp - timestamp at parse time

2015-09-09 Thread N Campbell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated SPARK-10507: --- Summary: reject temporal expressions such as timestamp - timestamp at parse time (was: timestamp -

[jira] [Updated] (SPARK-10507) reject temporal expressions such as timestamp - timestamp at parse time

2015-09-09 Thread N Campbell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated SPARK-10507: --- Description: TIMESTAMP - TIMESTAMP in ISO-SQL should return an interval type which SPARK does not su

[jira] [Created] (SPARK-10515) When kill executor, there is no need to seed RequestExecutors to AM

2015-09-09 Thread KaiXinXIaoLei (JIRA)
KaiXinXIaoLei created SPARK-10515: - Summary: When kill executor, there is no need to seed RequestExecutors to AM Key: SPARK-10515 URL: https://issues.apache.org/jira/browse/SPARK-10515 Project: Spark

[jira] [Assigned] (SPARK-10515) When kill executor, there is no need to seed RequestExecutors to AM

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10515: Assignee: (was: Apache Spark) > When kill executor, there is no need to seed RequestEx

[jira] [Commented] (SPARK-10515) When kill executor, there is no need to seed RequestExecutors to AM

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736853#comment-14736853 ] Apache Spark commented on SPARK-10515: -- User 'KaiXinXiaoLei' has created a pull requ

[jira] [Assigned] (SPARK-10515) When kill executor, there is no need to seed RequestExecutors to AM

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10515: Assignee: Apache Spark > When kill executor, there is no need to seed RequestExecutors to

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736869#comment-14736869 ] Glenn Strycker commented on SPARK-10493: The RDD I am using has the form ((String

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736879#comment-14736879 ] Sean Owen commented on SPARK-10493: --- That much should be OK. zipPartitions only makes

[jira] [Resolved] (SPARK-8793) error/warning with pyspark WholeTextFiles.first

2015-09-09 Thread Diana Carroll (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diana Carroll resolved SPARK-8793. -- Resolution: Not A Problem. This is no longer occurring. > error/warning with pyspark WholeTextFi

[jira] [Commented] (SPARK-2960) Spark executables fail to start via symlinks

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736923#comment-14736923 ] Apache Spark commented on SPARK-2960: - User 'jerryshao' has created a pull request for

[jira] [Commented] (SPARK-10428) Struct fields read from parquet are mis-aligned

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736949#comment-14736949 ] Apache Spark commented on SPARK-10428: -- User 'liancheng' has created a pull request

[jira] [Commented] (SPARK-10301) For struct type, if parquet's global schema has less fields than a file's schema, data reading will fail

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736947#comment-14736947 ] Apache Spark commented on SPARK-10301: -- User 'liancheng' has created a pull request

[jira] [Commented] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736961#comment-14736961 ] Davies Liu commented on SPARK-10512: As we discussed here https://github.com/apache/

[jira] [Closed] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-10512. -- Resolution: Won't Fix > Fix @since when a function doesn't have doc > -

[jira] [Commented] (SPARK-10512) Fix @since when a function doesn't have doc

2015-09-09 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736973#comment-14736973 ] Yu Ishikawa commented on SPARK-10512: - [~davies] oh, I see. Thank you for letting me

[jira] [Assigned] (SPARK-7874) Add a global setting for the fine-grained mesos scheduler that limits the number of concurrent tasks of a job

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7874: --- Assignee: Apache Spark > Add a global setting for the fine-grained mesos scheduler that limit

[jira] [Assigned] (SPARK-7874) Add a global setting for the fine-grained mesos scheduler that limits the number of concurrent tasks of a job

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7874: --- Assignee: (was: Apache Spark) > Add a global setting for the fine-grained mesos scheduler

[jira] [Commented] (SPARK-10441) Cannot write timestamp to JSON

2015-09-09 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736986#comment-14736986 ] Don Drake commented on SPARK-10441: --- Got it, thanks for the clarification. > Cannot wr

[jira] [Commented] (SPARK-7874) Add a global setting for the fine-grained mesos scheduler that limits the number of concurrent tasks of a job

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736984#comment-14736984 ] Apache Spark commented on SPARK-7874: - User 'dragos' has created a pull request for th

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737001#comment-14737001 ] Glenn Strycker commented on SPARK-10493: In this example, our RDDs are partitione

[jira] [Created] (SPARK-10516) Add values as a property to DenseVector in PySpark

2015-09-09 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10516: - Summary: Add values as a property to DenseVector in PySpark Key: SPARK-10516 URL: https://issues.apache.org/jira/browse/SPARK-10516 Project: Spark Issue Ty

[jira] [Created] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-09-09 Thread Maciej Bryński (JIRA)
Maciej Bryński created SPARK-10517: -- Summary: Console "Output" field is empty when using DataFrameWriter.json Key: SPARK-10517 URL: https://issues.apache.org/jira/browse/SPARK-10517 Project: Spark

[jira] [Updated] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-09-09 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-10517: --- Attachment: screenshot-1.png > Console "Output" field is empty when using DataFrameWriter.jso

[jira] [Updated] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-09-09 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-10517: --- Description: On HTTP application UI "Output" field is empty when using DataFrameWriter.json.

[jira] [Updated] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-09-09 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-10517: --- Attachment: (was: screenshot-1.png) > Console "Output" field is empty when using DataFram

[jira] [Updated] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-09-09 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-10517: --- Attachment: screenshot-1.png > Console "Output" field is empty when using DataFrameWriter.jso

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737051#comment-14737051 ] Sean Owen commented on SPARK-10493: --- I think you still have the same issue with zipPart

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737050#comment-14737050 ] Sean Owen commented on SPARK-10493: --- I think you still have the same issue with zipPart

[jira] [Comment Edited] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737055#comment-14737055 ] Glenn Strycker edited comment on SPARK-10493 at 9/9/15 3:40 PM: ---

[jira] [Updated] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glenn Strycker updated SPARK-10493: --- Attachment: reduceByKey_example_001.scala I'm still working on checking unit tests and exampl

[jira] [Commented] (SPARK-10514) Minimum ratio of registered resources [ spark.scheduler.minRegisteredResourcesRatio] is not enabled for Mesos Coarse Grained mode

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737095#comment-14737095 ] Apache Spark commented on SPARK-10514: -- User 'SleepyThread' has created a pull reque

[jira] [Assigned] (SPARK-10514) Minimum ratio of registered resources [ spark.scheduler.minRegisteredResourcesRatio] is not enabled for Mesos Coarse Grained mode

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10514: Assignee: (was: Apache Spark) > Minimum ratio of registered resources [ > spark.sched

[jira] [Assigned] (SPARK-10514) Minimum ratio of registered resources [ spark.scheduler.minRegisteredResourcesRatio] is not enabled for Mesos Coarse Grained mode

2015-09-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10514: Assignee: Apache Spark > Minimum ratio of registered resources [ > spark.scheduler.minReg

[jira] [Commented] (SPARK-10514) Minimum ratio of registered resources [ spark.scheduler.minRegisteredResourcesRatio] is not enabled for Mesos Coarse Grained mode

2015-09-09 Thread Akash Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737103#comment-14737103 ] Akash Mishra commented on SPARK-10514: -- Created a pull request https://github.com/ap

[jira] [Resolved] (SPARK-10117) Implement SQL data source API for reading LIBSVM data

2015-09-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10117. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8537 [https://gi

[jira] [Comment Edited] (SPARK-10495) For json data source, date values are saved as int strings

2015-09-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735964#comment-14735964 ] Yin Huai edited comment on SPARK-10495 at 9/9/15 4:40 PM: -- The b

[jira] [Commented] (SPARK-10495) For json data source, date values are saved as int strings

2015-09-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737161#comment-14737161 ] Yin Huai commented on SPARK-10495: -- Since we shipped Spark 1.5.0 with this issue, it wil

[jira] [Comment Edited] (SPARK-10309) Some tasks failed with Unable to acquire memory

2015-09-09 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737185#comment-14737185 ] Davies Liu edited comment on SPARK-10309 at 9/9/15 4:53 PM: [

[jira] [Commented] (SPARK-10309) Some tasks failed with Unable to acquire memory

2015-09-09 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737185#comment-14737185 ] Davies Liu commented on SPARK-10309: [~nadenf] Thanks for letting us know, just reali

[jira] [Created] (SPARK-10518) Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-09 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10518: - Summary: Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils Key: SPARK-10518 URL: https://issues.apache.org/jira/browse/SPARK-10518

[jira] [Updated] (SPARK-10495) For json data source, date values are saved as int strings

2015-09-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10495: - Target Version/s: 1.6.0, 1.5.1 (was: 1.5.1) > For json data source, date values are saved as int strings

[jira] [Updated] (SPARK-10495) For json data source, date values are saved as int strings

2015-09-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10495: - Target Version/s: 1.5.1 Priority: Blocker (was: Critical) > For json data source, date value

[jira] [Resolved] (SPARK-10481) SPARK_PREPEND_CLASSES make spark-yarn related jar could not be found

2015-09-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-10481. Resolution: Fixed Assignee: Jeff Zhang Fix Version/s: 1.6.0 > SPARK_PREPEND

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737252#comment-14737252 ] Sean Owen commented on SPARK-10493: --- What do you mean that it's not collapsing key pair

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737296#comment-14737296 ] Glenn Strycker commented on SPARK-10493: [~srowen], the code I attached did run c

[jira] [Resolved] (SPARK-10461) make sure `input.primitive` is always variable name not code at GenerateUnsafeProjection

2015-09-09 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10461. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8613 [https://github.c

[jira] [Created] (SPARK-10519) Investigate if we should encode timezone information to a timestamp value stored in JSON

2015-09-09 Thread Yin Huai (JIRA)
Yin Huai created SPARK-10519: Summary: Investigate if we should encode timezone information to a timestamp value stored in JSON Key: SPARK-10519 URL: https://issues.apache.org/jira/browse/SPARK-10519 Proj

[jira] [Commented] (SPARK-10519) Investigate if we should encode timezone information to a timestamp value stored in JSON

2015-09-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737311#comment-14737311 ] Yin Huai commented on SPARK-10519: -- cc [~davies] I feel that option 3 is better. > Inv

[jira] [Updated] (SPARK-10519) Investigate if we should encode timezone information to a timestamp value stored in JSON

2015-09-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10519: - Target Version/s: 1.6.0 > Investigate if we should encode timezone information to a timestamp value > st

[jira] [Commented] (SPARK-10519) Investigate if we should encode timezone information to a timestamp value stored in JSON

2015-09-09 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737375#comment-14737375 ] Davies Liu commented on SPARK-10519: +1 for 3; users have the ability to control timez

[jira] [Updated] (SPARK-10474) Aggregation failed with unable to acquire memory

2015-09-09 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-10474: --- Target Version/s: 1.6.0, 1.5.1 Priority: Blocker (was: Critical) > Aggregation failed wi

[jira] [Commented] (SPARK-9924) checkForLogs and cleanLogs are scheduled at fixed rate and can get piled up

2015-09-09 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737416#comment-14737416 ] Thomas Graves commented on SPARK-9924: -- [~vanzin] Any reason this wasn't picked back

[jira] [Commented] (SPARK-9924) checkForLogs and cleanLogs are scheduled at fixed rate and can get piled up

2015-09-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737452#comment-14737452 ] Marcelo Vanzin commented on SPARK-9924: --- Timing, I guess (it went in around code fre

[jira] [Commented] (SPARK-9503) Mesos dispatcher NullPointerException (MesosClusterScheduler)

2015-09-09 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737453#comment-14737453 ] Timothy Chen commented on SPARK-9503: - Sorry this is indeed a bug and a fix is already

[jira] [Commented] (SPARK-9924) checkForLogs and cleanLogs are scheduled at fixed rate and can get piled up

2015-09-09 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737478#comment-14737478 ] Thomas Graves commented on SPARK-9924: -- Ok, thanks. wanted to make sure no known issu

[jira] [Created] (SPARK-10520) dates cannot be summarised in SparkR

2015-09-09 Thread Vincent Warmerdam (JIRA)
Vincent Warmerdam created SPARK-10520: - Summary: dates cannot be summarised in SparkR Key: SPARK-10520 URL: https://issues.apache.org/jira/browse/SPARK-10520 Project: Spark Issue Type: Bu

[jira] [Updated] (SPARK-10520) dates cannot be summarised in SparkR

2015-09-09 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-10520: -- Component/s: SQL > dates cannot be summarised in SparkR > -

[jira] [Commented] (SPARK-10520) dates cannot be summarised in SparkR

2015-09-09 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737486#comment-14737486 ] Shivaram Venkataraman commented on SPARK-10520: --- Thanks for the report -- I

[jira] [Updated] (SPARK-10520) dates cannot be summarised in SparkR

2015-09-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10520: Description: I create a simple dataframe in R and call the summary function on it (standard R, not

[jira] [Updated] (SPARK-10520) dates cannot be summarised in SparkR

2015-09-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10520: Description: I create a simple dataframe in R and call the summary function on it (standard R, not

[jira] [Commented] (SPARK-10520) dates cannot be summarised in SparkR

2015-09-09 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737520#comment-14737520 ] Vincent Warmerdam commented on SPARK-10520: --- Thought something similar, it seem

[jira] [Comment Edited] (SPARK-10520) dates cannot be summarised in SparkR

2015-09-09 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737520#comment-14737520 ] Vincent Warmerdam edited comment on SPARK-10520 at 9/9/15 8:24 PM:

[jira] [Commented] (SPARK-10436) spark-submit overwrites spark.files defaults with the job script filename

2015-09-09 Thread Sanket Reddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737544#comment-14737544 ] Sanket Reddy commented on SPARK-10436: -- I am a newbie and interested in it, I will t

[jira] [Commented] (SPARK-7442) Spark 1.3.1 / Hadoop 2.6 prebuilt package has broken S3 filesystem access

2015-09-09 Thread William Cox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737580#comment-14737580 ] William Cox commented on SPARK-7442: Between this issue with the Hadoop 2.6 deploy and

[jira] [Created] (SPARK-10521) Utilize Docker to test DB2 JDBC Dialect support

2015-09-09 Thread Luciano Resende (JIRA)
Luciano Resende created SPARK-10521: --- Summary: Utilize Docker to test DB2 JDBC Dialect support Key: SPARK-10521 URL: https://issues.apache.org/jira/browse/SPARK-10521 Project: Spark Issue T

[jira] [Commented] (SPARK-1169) Add countApproxDistinctByKey to PySpark

2015-09-09 Thread William Cox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737608#comment-14737608 ] William Cox commented on SPARK-1169: I would like this feature. > Add countApproxDis

[jira] [Commented] (SPARK-10519) Investigate if we should encode timezone information to a timestamp value stored in JSON

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737612#comment-14737612 ] Sean Owen commented on SPARK-10519: --- I always feel nervous when storing human readable

[jira] [Commented] (SPARK-10521) Utilize Docker to test DB2 JDBC Dialect support

2015-09-09 Thread Luciano Resende (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737635#comment-14737635 ] Luciano Resende commented on SPARK-10521: - I'll be submitting a PR for this short

[jira] [Commented] (SPARK-10439) Catalyst should check for overflow / underflow of date and timestamp values

2015-09-09 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737644#comment-14737644 ] Davies Liu commented on SPARK-10439: There are many places there could be overflow, e

[jira] [Commented] (SPARK-4036) Add Conditional Random Fields (CRF) algorithm to Spark MLlib

2015-09-09 Thread Xin Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737650#comment-14737650 ] Xin Jin commented on SPARK-4036: Are we still actively working on this task? I have some w

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737681#comment-14737681 ] Sean Owen commented on SPARK-10493: --- If the RDD is a result of reduceByKey, I agree tha

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737727#comment-14737727 ] Glenn Strycker commented on SPARK-10493: I already have that added in my code tha

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14737730#comment-14737730 ] Sean Owen commented on SPARK-10493: --- checkpoint doesn't materialize the RDD, which is w

[jira] [Updated] (SPARK-9996) Create local nested loop join operator

2015-09-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-9996: - Assignee: Shixiong Zhu > Create local nested loop join operator > -- >

[jira] [Updated] (SPARK-9997) Create local Expand operator

2015-09-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-9997: - Assignee: Shixiong Zhu > Create local Expand operator > > > K
