[jira] [Commented] (SPARK-10264) Add @Since annotation to ml.recommendation

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095861#comment-15095861 ] Sean Owen commented on SPARK-10264: --- Yes, go ahead. The PR appears abandoned. > Add @Since annotation

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095860#comment-15095860 ] Sun Rui commented on SPARK-6817: I agree R's efficiency comes from vectorization. Here UDF is a function

[jira] [Commented] (SPARK-12801) The DataFrame.rdd not return same result

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095889#comment-15095889 ] Sean Owen commented on SPARK-12801: --- You might want to start as a question on user@. I can't reproduce

[jira] [Updated] (SPARK-12798) Broadcast hash join

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12798: -- Description: [~davies] it's minor but you rarely set component on your JIRAs. I think it helps. Just

[jira] [Comment Edited] (SPARK-11857) Remove Mesos fine-grained mode subject to discussions

2016-01-13 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095975#comment-15095975 ] Iulian Dragos edited comment on SPARK-11857 at 1/13/16 10:35 AM: - There's

[jira] [Resolved] (SPARK-6664) Split Ordered RDD into multiple RDDs by keys (boundaries or intervals)

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6664. -- Resolution: Won't Fix > Split Ordered RDD into multiple RDDs by keys (boundaries or intervals) >

[jira] [Updated] (SPARK-12692) Scala style: check no white space before comma

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12692: Summary: Scala style: check no white space before comma (was: Scala style: check no white space

[jira] [Updated] (SPARK-9843) Catalyst: Allow adding custom optimizers

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9843: - Assignee: Robert Kruszewski > Catalyst: Allow adding custom optimizers >

[jira] [Resolved] (SPARK-4165) Using Companion Objects throws ambiguous reference error in REPL when an instance of Class is initialized

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4165. -- Resolution: Won't Fix > Using Companion Objects throws ambiguous reference error in REPL when an >

[jira] [Updated] (SPARK-12797) Aggregation without grouping keys

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12797: -- Component/s: SQL > Aggregation without grouping keys > - > >

[jira] [Updated] (SPARK-12800) Subtle bug on Spark Yarn Client under Kerberos Security Mode

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12800: -- Component/s: YARN > Subtle bug on Spark Yarn Client under Kerberos Security Mode >

[jira] [Updated] (SPARK-12758) Add note to Spark SQL Migration section about SPARK-11724

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12758: -- Assignee: Brandon Bradley > Add note to Spark SQL Migration section about SPARK-11724 >

[jira] [Updated] (SPARK-12795) Whole stage codegen

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12795: -- Component/s: SQL > Whole stage codegen > --- > > Key: SPARK-12795 >

[jira] [Updated] (SPARK-12787) Dataset to support custom encoder

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12787: -- Component/s: SQL > Dataset to support custom encoder > - > >

[jira] [Updated] (SPARK-4438) Add HistoryServer RESTful API

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4438: - Assignee: Josh Rosen > Add HistoryServer RESTful API > - > >

[jira] [Resolved] (SPARK-12802) The DataFrame.rdd not return same result

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12802. --- Resolution: Duplicate You opened this twice > The DataFrame.rdd not return same result >

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2016-01-13 Thread Olivier Girardot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095921#comment-15095921 ] Olivier Girardot commented on SPARK-2984: - Same error - Spark 1.5.1 no speculation enabled seen

[jira] [Updated] (SPARK-12655) GraphX does not unpersist RDDs

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12655: -- Priority: Minor (was: Major) > GraphX does not unpersist RDDs > -- > >

[jira] [Resolved] (SPARK-1603) Flaky test: o.a.s.streaming.StreamingContextSuite

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1603. -- Resolution: Cannot Reproduce > Flaky test: o.a.s.streaming.StreamingContextSuite >

[jira] [Resolved] (SPARK-12692) Scala style: check no white space before comma

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12692. - Resolution: Fixed Fix Version/s: 2.0.0 > Scala style: check no white space before comma >

[jira] [Commented] (SPARK-12692) Scala style: check no white space before comma

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095836#comment-15095836 ] Reynold Xin commented on SPARK-12692: - Some discussion on one of the pull requests: @sarutak I

[jira] [Created] (SPARK-12802) The DataFrame.rdd not return same result

2016-01-13 Thread Joseph Sun (JIRA)
Joseph Sun created SPARK-12802: -- Summary: The DataFrame.rdd not return same result Key: SPARK-12802 URL: https://issues.apache.org/jira/browse/SPARK-12802 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-12801) The DataFrame.rdd not return same result

2016-01-13 Thread Joseph Sun (JIRA)
Joseph Sun created SPARK-12801: -- Summary: The DataFrame.rdd not return same result Key: SPARK-12801 URL: https://issues.apache.org/jira/browse/SPARK-12801 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-12800) Subtle bug on Spark Yarn Client under Kerberos Security Mode

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12800: -- Description: Version used: Spark 1.5.1 (1.5.2-SNAPSHOT) Deployment Mode: Yarn-Cluster Problem

[jira] [Commented] (SPARK-12800) Subtle bug on Spark Yarn Client under Kerberos Security Mode

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095910#comment-15095910 ] Sean Owen commented on SPARK-12800: --- You can propose a PR with your change, to test it and make clear

[jira] [Updated] (SPARK-12796) initial prototype: projection/filter/range

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12796: -- Component/s: SQL > initial prototype: projection/filter/range >

[jira] [Updated] (SPARK-12782) reindex() columns in DataFrame

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12782: -- Component/s: SQL > reindex() columns in DataFrame > -- > >

[jira] [Updated] (SPARK-12777) Dataset fields can't be Scala tuples

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12777: -- Component/s: SQL > Dataset fields can't be Scala tuples > > >

[jira] [Commented] (SPARK-12635) More efficient (column batch) serialization for Python/R

2016-01-13 Thread Dmitriy Selivanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095955#comment-15095955 ] Dmitriy Selivanov commented on SPARK-12635: --- Thanks for clarification! I want to make a try,

[jira] [Resolved] (SPARK-595) Document "local-cluster" mode

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-595. - Resolution: Won't Fix > Document "local-cluster" mode > - > >

[jira] [Resolved] (SPARK-6665) Randomly Shuffle an RDD

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6665. -- Resolution: Won't Fix > Randomly Shuffle an RDD > > > Key:

[jira] [Commented] (SPARK-11857) Remove Mesos fine-grained mode subject to discussions

2016-01-13 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095975#comment-15095975 ] Iulian Dragos commented on SPARK-11857: --- There's one pending issue in the email thread, regarding a

[jira] [Resolved] (SPARK-4325) Improve spark-ec2 cluster launch times

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4325. -- Resolution: Done > Improve spark-ec2 cluster launch times > -- > >

[jira] [Closed] (SPARK-10697) Lift Calculation in Association Rule mining

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-10697. - > Lift Calculation in Association Rule mining > --- > >

[jira] [Resolved] (SPARK-2709) Add a tool for certifying Spark API compatiblity

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2709. -- Resolution: Won't Fix It sounds like this won't happen / didn't happen, but reopen if there's an

[jira] [Commented] (SPARK-10264) Add @Since annotation to ml.recommendation

2016-01-13 Thread Tommy Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095844#comment-15095844 ] Tommy Yu commented on SPARK-10264: -- It's long time no update for origin PR for this defect, can let me

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095868#comment-15095868 ] Sun Rui commented on SPARK-6817: If we think that column-oriented UDF is more important, I can do it with

[jira] [Resolved] (SPARK-10066) Can't create HiveContext with spark-shell or spark-sql on snapshot

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10066. --- Resolution: Duplicate > Can't create HiveContext with spark-shell or spark-sql on snapshot >

[jira] [Commented] (SPARK-5493) Support proxy users under kerberos

2016-01-13 Thread Hemshankar Sahu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15095938#comment-15095938 ] Hemshankar Sahu commented on SPARK-5493: I have few doubts about running in client mode and

[jira] [Resolved] (SPARK-10697) Lift Calculation in Association Rule mining

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10697. --- Resolution: Won't Fix Target Version/s: (was: 2.0.0) This still doesn't contain any

[jira] [Resolved] (SPARK-4317) Error querying Avro files imported by Sqoop: org.apache.spark.sql.catalyst.errors.package$TreeNodeException: Unresolved attributes

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4317. -- Resolution: Not A Problem No follow up in a long time so assuming this is not a problem > Error

[jira] [Resolved] (SPARK-4640) FixedRangePartitioner for partitioning items with a known range

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4640. -- Resolution: Won't Fix It's not a bad idea but given lack of response I think this should be closed. It

[jira] [Updated] (SPARK-12780) Inconsistency returning value of ML python models' properties

2016-01-13 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-12780: -- Description: In spark/python/pyspark/ml/feature.py, StringIndexerModel has a property method named

[jira] [Updated] (SPARK-12780) Inconsistency returning value of ML python models' properties

2016-01-13 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-12780: -- Description: In spark/python/pyspark/ml/feature.py, StringIndexerModel has a property method named

[jira] [Created] (SPARK-12804) ml.classification.LogisticRegression fails when FitIntercept with same-label dataset

2016-01-13 Thread Feynman Liang (JIRA)
Feynman Liang created SPARK-12804: - Summary: ml.classification.LogisticRegression fails when FitIntercept with same-label dataset Key: SPARK-12804 URL: https://issues.apache.org/jira/browse/SPARK-12804

[jira] [Commented] (SPARK-12804) ml.classification.LogisticRegression fails when FitIntercept with same-label dataset

2016-01-13 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096064#comment-15096064 ] Feynman Liang commented on SPARK-12804: --- Please assign to me >

[jira] [Resolved] (SPARK-7064) Adding binary sparse vector support

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7064. -- Resolution: Won't Fix > Adding binary sparse vector support > --- > >

[jira] [Resolved] (SPARK-2199) Distributed probabilistic latent semantic analysis in MLlib

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2199. -- Resolution: Won't Fix > Distributed probabilistic latent semantic analysis in MLlib >

[jira] [Resolved] (SPARK-2788) Add location filtering to Twitter streams

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2788. -- Resolution: Won't Fix > Add location filtering to Twitter streams >

[jira] [Commented] (SPARK-12786) Actor demo does not demonstrate usable code

2016-01-13 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096187#comment-15096187 ] Nan Zhu commented on SPARK-12786: - the only place it relies on AkkaUtil is to create an ActorSystem,

[jira] [Commented] (SPARK-10264) Add @Since annotation to ml.recommendation

2016-01-13 Thread Tommy Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096218#comment-15096218 ] Tommy Yu commented on SPARK-10264: -- thanks, I will work on this. > Add @Since annotation to

[jira] [Commented] (SPARK-12803) Consider adding ability to profile specific instances of executors in spark

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096058#comment-15096058 ] Sean Owen commented on SPARK-12803: --- What do you mean here by profiling -- generating heap dumps or

[jira] [Resolved] (SPARK-1882) Support dynamic memory sharing in Mesos

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1882. -- Resolution: Won't Fix I think this is mostly subsumed by the idea of dynamic allocation now > Support

[jira] [Commented] (SPARK-12746) ArrayType(_, true) should also accept ArrayType(_, false)

2016-01-13 Thread Earthson Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096120#comment-15096120 ] Earthson Lu commented on SPARK-12746: - I was just wondering if you could do a review:) On Tue, Jan

[jira] [Created] (SPARK-12803) Consider adding ability to profile specific instances of executors in spark

2016-01-13 Thread Rajesh Balamohan (JIRA)
Rajesh Balamohan created SPARK-12803: Summary: Consider adding ability to profile specific instances of executors in spark Key: SPARK-12803 URL: https://issues.apache.org/jira/browse/SPARK-12803

[jira] [Updated] (SPARK-12803) Consider adding ability to profile specific instances of executors in spark

2016-01-13 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated SPARK-12803: - Issue Type: Improvement (was: Bug) > Consider adding ability to profile specific

[jira] [Updated] (SPARK-12804) ml.classification.LogisticRegression fails when FitIntercept with same-label dataset

2016-01-13 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-12804: -- Description: When training LogisticRegression on a dataset where the label is all 0 or all 1,

[jira] [Updated] (SPARK-12804) ml.classification.LogisticRegression fails when FitIntercept with same-label dataset

2016-01-13 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-12804: -- Description: When training LogisticRegression on a dataset where the label is all 0 or all 1,

[jira] [Commented] (SPARK-12804) ml.classification.LogisticRegression fails when FitIntercept with same-label dataset

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096081#comment-15096081 ] Sean Owen commented on SPARK-12804: --- (I'll do so but you don't really need it assigned to work on it)

[jira] [Updated] (SPARK-12804) ml.classification.LogisticRegression fails when FitIntercept with same-label dataset

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12804: -- Assignee: Feynman Liang > ml.classification.LogisticRegression fails when FitIntercept with same-label

[jira] [Resolved] (SPARK-3561) Allow for pluggable execution contexts in Spark

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3561. -- Resolution: Won't Fix > Allow for pluggable execution contexts in Spark >

[jira] [Resolved] (SPARK-7065) Clear the cached locations mapping after every stage to avoid inconsistent status

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7065. -- Resolution: Won't Fix > Clear the cached locations mapping after every stage to avoid inconsistent >

[jira] [Issue Comment Deleted] (SPARK-12746) ArrayType(_, true) should also accept ArrayType(_, false)

2016-01-13 Thread Earthson Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Earthson Lu updated SPARK-12746: Comment: was deleted (was: I was just wondering if you could do a review:) On Tue, Jan 12, 2016

[jira] [Resolved] (SPARK-1622) Expose input split(s) accessed by a task in UI or logs

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1622. -- Resolution: Won't Fix I assume this is WontFix given the severity, and lack of activity in over a year

[jira] [Comment Edited] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096711#comment-15096711 ] Maciej Bryński edited comment on SPARK-12807 at 1/13/16 6:13 PM: - I think

[jira] [Commented] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096711#comment-15096711 ] Maciej Bryński commented on SPARK-12807: I think there will be a big problem with Spark 2.0.0. As

[jira] [Comment Edited] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096711#comment-15096711 ] Maciej Bryński edited comment on SPARK-12807 at 1/13/16 6:14 PM: - I think

[jira] [Commented] (SPARK-12269) Update aws-java-sdk version

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096729#comment-15096729 ] Maciej Bryński commented on SPARK-12269: [~srowen] Please look at:

[jira] [Comment Edited] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096711#comment-15096711 ] Maciej Bryński edited comment on SPARK-12807 at 1/13/16 6:24 PM: - I think

[jira] [Commented] (SPARK-12269) Update aws-java-sdk version

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096731#comment-15096731 ] Sean Owen commented on SPARK-12269: --- I see it, but what's the issue? Spark has never been on Jackson

[jira] [Commented] (SPARK-12269) Update aws-java-sdk version

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096733#comment-15096733 ] Maciej Bryński commented on SPARK-12269: Yes. But it's possible to compile it with 2.2. Will

[jira] [Commented] (SPARK-11857) Remove Mesos fine-grained mode subject to discussions

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096734#comment-15096734 ] Reynold Xin commented on SPARK-11857: - That sounds good to me. At the very least we should switch the

[jira] [Resolved] (SPARK-9297) covar_pop and covar_samp aggregate functions

2016-01-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-9297. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10029

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096738#comment-15096738 ] Antonio Piccolboni commented on SPARK-6817: --- So I am not sure row-orientation means anything

[jira] [Commented] (SPARK-10528) spark-shell throws java.lang.RuntimeException: The root scratch dir: /tmp/hive on HDFS should be writable.

2016-01-13 Thread Amir Gur (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096742#comment-15096742 ] Amir Gur commented on SPARK-10528: -- Not solving it for me. (To reproduce etting the same with either

[jira] [Commented] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096779#comment-15096779 ] Sean Owen commented on SPARK-12807: --- Why? Spark is already not on Jackson 2.2. > Spark External

[jira] [Commented] (SPARK-12566) GLM model family, link function support

2016-01-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096819#comment-15096819 ] Xiangrui Meng commented on SPARK-12566: --- After we have IRLS (SPARK-9835), it should be able to

[jira] [Updated] (SPARK-12566) GLM model family, link function support in SparkR:::glm

2016-01-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12566: -- Summary: GLM model family, link function support in SparkR:::glm (was: GLM model family, link

[jira] [Resolved] (SPARK-12805) Outdated details in doc related to Mesos run modes

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12805. - Resolution: Fixed Assignee: Luc Bourlier Fix Version/s: 2.0.0

[jira] [Updated] (SPARK-12685) word2vec trainWordsCount gets overflow

2016-01-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12685: -- Fix Version/s: 2.0.0 > word2vec trainWordsCount gets overflow >

[jira] [Updated] (SPARK-12690) NullPointerException in UnsafeInMemorySorter.free()

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-12690: --- Assignee: Carson Wang > NullPointerException in UnsafeInMemorySorter.free() >

[jira] [Commented] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-01-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096826#comment-15096826 ] Steve Loughran commented on SPARK-12807: Jackson versioning is really a symptom of a greater

[jira] [Created] (SPARK-12811) Estimator interface for generalized linear models (GLMs)

2016-01-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-12811: - Summary: Estimator interface for generalized linear models (GLMs) Key: SPARK-12811 URL: https://issues.apache.org/jira/browse/SPARK-12811 Project: Spark

[jira] [Resolved] (SPARK-9383) Merge script should reset back to previous ref instead of detached commit

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-9383. --- Resolution: Fixed Fix Version/s: 2.0.0 > Merge script should reset back to previous ref

[jira] [Commented] (SPARK-5159) Thrift server does not respect hive.server2.enable.doAs=true

2016-01-13 Thread Luciano Resende (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096893#comment-15096893 ] Luciano Resende commented on SPARK-5159: [~saurfang] Did you find out more about the item you

[jira] [Updated] (SPARK-12268) pyspark shell uses execfile which breaks python3 compatibility

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-12268: --- Affects Version/s: (was: 1.6.1) 1.6.0 > pyspark shell uses execfile which

[jira] [Updated] (SPARK-12268) pyspark shell uses execfile which breaks python3 compatibility

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-12268: --- Assignee: Erik Selin (was: Apache Spark) > pyspark shell uses execfile which breaks python3

[jira] [Resolved] (SPARK-12268) pyspark shell uses execfile which breaks python3 compatibility

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-12268. Resolution: Fixed Fix Version/s: 1.6.1 2.0.0 Issue resolved by pull

[jira] [Resolved] (SPARK-12791) Simplify CaseWhen by breaking "branches" into "conditions" and "values"

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12791. - Resolution: Fixed Fix Version/s: 2.0.0 > Simplify CaseWhen by breaking "branches" into

[jira] [Comment Edited] (SPARK-9850) Adaptive execution in Spark

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096985#comment-15096985 ] Maciej Bryński edited comment on SPARK-9850 at 1/13/16 9:13 PM: [~matei]

[jira] [Commented] (SPARK-12269) Update aws-java-sdk version

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096825#comment-15096825 ] Maciej Bryński commented on SPARK-12269: OK. We'll see. Thanks for the answer. > Update

[jira] [Comment Edited] (SPARK-12646) Support _HOST in kerberos principal for connecting to secure cluster

2016-01-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15092460#comment-15092460 ] Marcelo Vanzin edited comment on SPARK-12646 at 1/13/16 7:36 PM: - Can you

[jira] [Updated] (SPARK-12430) Temporary folders do not get deleted after Task completes causing problems with disk space.

2016-01-13 Thread Fede Bar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fede Bar updated SPARK-12430: - Affects Version/s: 1.6.0 > Temporary folders do not get deleted after Task completes causing problems >

[jira] [Commented] (SPARK-12804) ml.classification.LogisticRegression fails when FitIntercept with same-label dataset

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096840#comment-15096840 ] Apache Spark commented on SPARK-12804: -- User 'feynmanliang' has created a pull request for this

[jira] [Comment Edited] (SPARK-12566) GLM model family, link function support

2016-01-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096819#comment-15096819 ] Xiangrui Meng edited comment on SPARK-12566 at 1/13/16 7:24 PM: After we

[jira] [Commented] (SPARK-12430) Temporary folders do not get deleted after Task completes causing problems with disk space.

2016-01-13 Thread Jean-Baptiste Onofré (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096868#comment-15096868 ] Jean-Baptiste Onofré commented on SPARK-12430: -- Thanks for the update, it's what I see too.

[jira] [Updated] (SPARK-12725) SQL generation suffers from name conficts introduced by some analysis rules

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12725: Affects Version/s: (was: 2.0.0) > SQL generation suffers from name conficts introduced by some

[jira] [Resolved] (SPARK-12642) improve the hash expression to be decoupled from unsafe row

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12642. - Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.0.0 > improve the hash

[jira] [Commented] (SPARK-9850) Adaptive execution in Spark

2016-01-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096985#comment-15096985 ] Maciej Bryński commented on SPARK-9850: --- [~matei] Hi, I'm not sure if my issue is related to this

[jira] [Commented] (SPARK-12269) Update aws-java-sdk version

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096777#comment-15096777 ] Sean Owen commented on SPARK-12269: --- I don't think Spark has changed to use newer Jackson APIs itself,
