[
https://issues.apache.org/jira/browse/SPARK-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shawn Guo updated SPARK-3581:
-
Affects Version/s: 1.0.0
1.0.2
RDD API(distinct/subtract) does not work for RDD
[
https://issues.apache.org/jira/browse/SPARK-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138569#comment-14138569
]
Shawn Guo commented on SPARK-3321:
--
No idea yet, I use --py-files Null.py instead. it
ShiShu created SPARK-3583:
-
Summary: Spark run slow after unexpected repartition
Key: SPARK-3583
URL: https://issues.apache.org/jira/browse/SPARK-3583
Project: Spark
Issue Type: Bug
Affects
[
https://issues.apache.org/jira/browse/SPARK-3583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ShiShu updated SPARK-3583:
--
Attachment: spark_q_006.jpg
spark_q_005.jpg
spark_q_004.jpg
[
https://issues.apache.org/jira/browse/SPARK-3578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138638#comment-14138638
]
Ankur Dave commented on SPARK-3578:
---
[~pwendell] Sorry, I forgot to do that this time.
[
https://issues.apache.org/jira/browse/SPARK-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin resolved SPARK-1353.
Resolution: Duplicate
IllegalArgumentException when writing to disk
[
https://issues.apache.org/jira/browse/SPARK-3525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138674#comment-14138674
]
Egor Pakhomov commented on SPARK-3525:
--
https://github.com/apache/spark/pull/2394
Kousuke Saruta created SPARK-3584:
-
Summary: sbin/slaves doesn't work when we use password
authentication for SSH
Key: SPARK-3584
URL: https://issues.apache.org/jira/browse/SPARK-3584
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-3584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138716#comment-14138716
]
Apache Spark commented on SPARK-3584:
-
User 'sarutak' has created a pull request for
Tamilselvan Palani created SPARK-3585:
-
Summary: Probability Values
Key: SPARK-3585
URL: https://issues.apache.org/jira/browse/SPARK-3585
Project: Spark
Issue Type: Question
[
https://issues.apache.org/jira/browse/SPARK-3585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tamilselvan Palani updated SPARK-3585:
--
Summary: Probability Values in Logistic Regression/Decision Tree output
(was:
wangxj created SPARK-3586:
-
Summary: spark streaming
Key: SPARK-3586
URL: https://issues.apache.org/jira/browse/SPARK-3586
Project: Spark
Issue Type: Bug
Components: Streaming
Affects
caoli created SPARK-3587:
Summary: Spark SQL can't support lead() over() window function
Key: SPARK-3587
URL: https://issues.apache.org/jira/browse/SPARK-3587
Project: Spark
Issue Type: Bug
Meethu Mathew created SPARK-3588:
Summary: Gaussian Mixture Model clustering
Key: SPARK-3588
URL: https://issues.apache.org/jira/browse/SPARK-3588
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-3588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Meethu Mathew updated SPARK-3588:
-
Description:
Gaussian Mixture Models (GMM) is a popular technique for soft clustering. GMM
[
https://issues.apache.org/jira/browse/SPARK-3588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Meethu Mathew updated SPARK-3588:
-
Attachment: GMMSpark.py
Gaussian Mixture Model clustering
-
[
https://issues.apache.org/jira/browse/SPARK-3588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138782#comment-14138782
]
Meethu Mathew commented on SPARK-3588:
--
We are interested in contributing this
[
https://issues.apache.org/jira/browse/SPARK-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138803#comment-14138803
]
Philip Wills commented on SPARK-2175:
-
Whilst the workaround for this is trivial,
WangTaoTheTonic created SPARK-3589:
--
Summary: [Minor]Remove redundant code in deploy module
Key: SPARK-3589
URL: https://issues.apache.org/jira/browse/SPARK-3589
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-3589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138821#comment-14138821
]
Apache Spark commented on SPARK-3589:
-
User 'WangTaoTheTonic' has created a pull
[
https://issues.apache.org/jira/browse/SPARK-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138829#comment-14138829
]
Alexander Ulanov commented on SPARK-3403:
-
Thank you, your answers are really
[
https://issues.apache.org/jira/browse/SPARK-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138867#comment-14138867
]
Matthew Farrellee commented on SPARK-3321:
--
[~guoxu1231] i think so too. ok if i
[
https://issues.apache.org/jira/browse/SPARK-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138873#comment-14138873
]
mohan gaddam commented on SPARK-3447:
-
I am also facing the same issue with spark
[
https://issues.apache.org/jira/browse/SPARK-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138884#comment-14138884
]
Yin Huai commented on SPARK-3447:
-
[~mohan.gadm] From the trace, seems the NPE was caused
[
https://issues.apache.org/jira/browse/SPARK-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138899#comment-14138899
]
mohan gaddam commented on SPARK-3447:
-
sorry for the mistake, those are the project
[
https://issues.apache.org/jira/browse/SPARK-2593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138906#comment-14138906
]
Helena Edelson commented on SPARK-2593:
---
[~matei] +1 for spark streaming, that is a
[
https://issues.apache.org/jira/browse/SPARK-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138911#comment-14138911
]
mohan gaddam commented on SPARK-3447:
-
record KeyValueObject {
[
https://issues.apache.org/jira/browse/SPARK-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138873#comment-14138873
]
mohan gaddam edited comment on SPARK-3447 at 9/18/14 1:21 PM:
--
[
https://issues.apache.org/jira/browse/SPARK-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138970#comment-14138970
]
Apache Spark commented on SPARK-1987:
-
User 'larryxiao' has created a pull request for
[
https://issues.apache.org/jira/browse/SPARK-3557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Thomas Graves resolved SPARK-3557.
--
Resolution: Duplicate
Yarn client config prioritization is backwards
[
https://issues.apache.org/jira/browse/SPARK-3557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138993#comment-14138993
]
Thomas Graves commented on SPARK-3557:
--
This is a dup of SPARK-2872, although this
[
https://issues.apache.org/jira/browse/SPARK-2872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138996#comment-14138996
]
Thomas Graves commented on SPARK-2872:
--
adding description from spark-3557 as it
[
https://issues.apache.org/jira/browse/SPARK-3389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139010#comment-14139010
]
Apache Spark commented on SPARK-3389:
-
User 'patmcdonough' has created a pull request
[
https://issues.apache.org/jira/browse/SPARK-3580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139043#comment-14139043
]
Matthew Farrellee commented on SPARK-3580:
--
what do you think about going the
[
https://issues.apache.org/jira/browse/SPARK-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139061#comment-14139061
]
Gino Bustelo commented on SPARK-2892:
-
Any update on this? Will it get fixed for 1.0.3
[
https://issues.apache.org/jira/browse/SPARK-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139061#comment-14139061
]
Gino Bustelo edited comment on SPARK-2892 at 9/18/14 3:30 PM:
--
[
https://issues.apache.org/jira/browse/SPARK-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3270:
-
Issue Type: New Feature (was: Improvement)
Spark API for Application Extensions
[
https://issues.apache.org/jira/browse/SPARK-1576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marcelo Vanzin resolved SPARK-1576.
---
Resolution: Not a Problem
spark-submit already supports this with existing options.
Passing
[
https://issues.apache.org/jira/browse/SPARK-2593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139163#comment-14139163
]
Matei Zaharia commented on SPARK-2593:
--
Sure, it would be great to do this for
[
https://issues.apache.org/jira/browse/SPARK-3547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell resolved SPARK-3547.
Resolution: Fixed
Fix Version/s: 1.2.0
Assignee: WangTaoTheTonic
Resolved
[
https://issues.apache.org/jira/browse/SPARK-3579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell resolved SPARK-3579.
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2443
[
https://issues.apache.org/jira/browse/SPARK-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell resolved SPARK-1477.
Resolution: Won't Fix
Unless we are planning to interact with these components in a generic
Paul Magid created SPARK-3593:
-
Summary: Support Sorting of Binary Type Data
Key: SPARK-3593
URL: https://issues.apache.org/jira/browse/SPARK-3593
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139218#comment-14139218
]
Patrick Wendell edited comment on SPARK-1477 at 9/18/14 5:34 PM:
[
https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139233#comment-14139233
]
Li Pu commented on SPARK-3530:
--
Nice design doc! I had some experiences on the parameter
[
https://issues.apache.org/jira/browse/SPARK-3592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139299#comment-14139299
]
Apache Spark commented on SPARK-3592:
-
User 'davies' has created a pull request for
[
https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139339#comment-14139339
]
Apache Spark commented on SPARK-3560:
-
User 'Victsm' has created a pull request for
[
https://issues.apache.org/jira/browse/SPARK-3566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell resolved SPARK-3566.
Resolution: Fixed
Fix Version/s: 1.2.0
Assignee: Kousuke Saruta
[
https://issues.apache.org/jira/browse/SPARK-3589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell resolved SPARK-3589.
Resolution: Fixed
Fix Version/s: 1.2.0
1.1.1
Assignee:
[
https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139396#comment-14139396
]
Zhan Zhang commented on SPARK-1537:
---
Do you have any update on this, or any schedule in
[
https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-3560:
--
Summary: In yarn-cluster mode, the same jars are distributed through
multiple mechanisms. (was: In
Ian Hummel created SPARK-3595:
-
Summary: Spark should respect configured OutputCommitter when
using saveAsHadoopFile
Key: SPARK-3595
URL: https://issues.apache.org/jira/browse/SPARK-3595
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139461#comment-14139461
]
Xiangrui Meng commented on SPARK-3403:
--
Sorry, it should be netlib-java, but the real
Thomas Graves created SPARK-3596:
Summary: Support changing the yarn client monitor interval
Key: SPARK-3596
URL: https://issues.apache.org/jira/browse/SPARK-3596
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-3595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139509#comment-14139509
]
Apache Spark commented on SPARK-3595:
-
User 'themodernlife' has created a pull request
[
https://issues.apache.org/jira/browse/SPARK-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139549#comment-14139549
]
Apache Spark commented on SPARK-1486:
-
User 'brkyvz' has created a pull request for
[
https://issues.apache.org/jira/browse/SPARK-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Or updated SPARK-3340:
-
Assignee: (was: Andrew Or)
Deprecate ADD_JARS and ADD_FILES
[
https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139600#comment-14139600
]
Xiangrui Meng commented on SPARK-3530:
--
[~eustache] The default implementation of
[
https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139600#comment-14139600
]
Xiangrui Meng edited comment on SPARK-3530 at 9/18/14 10:06 PM:
[
https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3573:
-
Description:
This JIRA is for discussion of ML dataset, essentially a SchemaRDD with extra
[
https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Or closed SPARK-3560.
Resolution: Fixed
Fix Version/s: 1.2.0
1.1.1
Fixed by
[
https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Or reopened SPARK-3560:
--
Assignee: Min Shen
Reopening just to reassign. Closing right afterwards, please disregard.
In
[
https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Or closed SPARK-3560.
Resolution: Fixed
In yarn-cluster mode, the same jars are distributed through multiple
mechanisms.
[
https://issues.apache.org/jira/browse/SPARK-3587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-3587:
---
Labels: (was: features)
Spark SQL can't support lead() over() window function
[
https://issues.apache.org/jira/browse/SPARK-3574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-3574:
---
Component/s: Spark Core
Shuffle finish time always reported as -1
[
https://issues.apache.org/jira/browse/SPARK-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2672:
---
Summary: Support compression in wholeFile() (was: support compressed file
in wholeFile())
[
https://issues.apache.org/jira/browse/SPARK-2761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2761:
---
Component/s: Spark Core
Merge similar code paths in ExternalSorter and EAOM
[
https://issues.apache.org/jira/browse/SPARK-3535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Or updated SPARK-3535:
-
Target Version/s: 1.1.1, 1.2.0 (was: 1.1.1)
Spark on Mesos not correctly setting heap overhead
[
https://issues.apache.org/jira/browse/SPARK-3535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139763#comment-14139763
]
Brenden Matthews edited comment on SPARK-3535 at 9/18/14 11:58 PM:
[
https://issues.apache.org/jira/browse/SPARK-3535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139763#comment-14139763
]
Brenden Matthews commented on SPARK-3535:
-
After some even futher digging, I
[
https://issues.apache.org/jira/browse/SPARK-3562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139786#comment-14139786
]
Matthew Farrellee commented on SPARK-3562:
--
is logrotate an option for you?
[
https://issues.apache.org/jira/browse/SPARK-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Josh Rosen resolved SPARK-3554.
---
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2417
[
https://issues.apache.org/jira/browse/SPARK-3535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139845#comment-14139845
]
Vinod Kone commented on SPARK-3535:
---
This can happen if the spark executor doesn't use
Brenden Matthews created SPARK-3597:
---
Summary: MesosSchedulerBackend does not implement `killTask`
Key: SPARK-3597
URL: https://issues.apache.org/jira/browse/SPARK-3597
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139902#comment-14139902
]
Shawn Guo commented on SPARK-3581:
--
Yes, please. Thanks for clarification.
RDD
Adrian Wang created SPARK-3598:
--
Summary: cast to timestamp should be the same as hive
Key: SPARK-3598
URL: https://issues.apache.org/jira/browse/SPARK-3598
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139903#comment-14139903
]
Shawn Guo commented on SPARK-3321:
--
Yes please, thanks for clarification.
Defining a
WangTaoTheTonic created SPARK-3599:
--
Summary: Avoid loading and printing properties file content
frequently
Key: SPARK-3599
URL: https://issues.apache.org/jira/browse/SPARK-3599
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matthew Farrellee closed SPARK-3581.
Resolution: Not a Problem
RDD API(distinct/subtract) does not work for RDD of Dictionaries
[
https://issues.apache.org/jira/browse/SPARK-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matthew Farrellee closed SPARK-3321.
Resolution: Not a Problem
Defining a class within python main script
[
https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139958#comment-14139958
]
Sandy Ryza commented on SPARK-3573:
---
Currently SchemaRDD lives inside SQL. Would we
[
https://issues.apache.org/jira/browse/SPARK-3250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139965#comment-14139965
]
Erik Erlandson commented on SPARK-3250:
---
PR:
[
https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140017#comment-14140017
]
Patrick Wendell commented on SPARK-3573:
[~sandyr] This is a good question I'm not
[
https://issues.apache.org/jira/browse/SPARK-2058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140023#comment-14140023
]
David Rosenstrauch commented on SPARK-2058:
---
I'm wondering the same: has this
[
https://issues.apache.org/jira/browse/SPARK-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140043#comment-14140043
]
Patrick Wendell commented on SPARK-3270:
Hey There,
For the particular use case
85 matches
Mail list logo