[
https://issues.apache.org/jira/browse/SPARK-13200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-13200:
Issue Type: Sub-task (was: Improvement)
Parent: SPARK-13175
> Investigate math.round on integer nu
[
https://issues.apache.org/jira/browse/SPARK-13154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13154:
Assignee: Apache Spark
> Add pydoc lint for docs
> ---
>
>
[
https://issues.apache.org/jira/browse/SPARK-13154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136170#comment-15136170
]
Apache Spark commented on SPARK-13154:
--
User 'holdenk' has created a pull request fo
[
https://issues.apache.org/jira/browse/SPARK-13154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13154:
Assignee: (was: Apache Spark)
> Add pydoc lint for docs
> ---
>
>
[
https://issues.apache.org/jira/browse/SPARK-5865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-5865:
-
Assignee: Tommy Yu (was: Nicholas Chammas)
> Add doc warnings for methods that return local data structur
[
https://issues.apache.org/jira/browse/SPARK-13103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-13103.
---
Resolution: Cannot Reproduce
I can't reproduce this:
{code}
>>> from pyspark.mllib.feature import Ha
[
https://issues.apache.org/jira/browse/SPARK-13013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136167#comment-15136167
]
Xin Ren commented on SPARK-13013:
-
I'm working on this now, thanks :)
> Replace example
[
https://issues.apache.org/jira/browse/SPARK-13019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136166#comment-15136166
]
Apache Spark commented on SPARK-13019:
--
User 'keypointt' has created a pull request
[
https://issues.apache.org/jira/browse/SPARK-13019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13019:
Assignee: (was: Apache Spark)
> Replace example code in mllib-statistics.md using incl
[
https://issues.apache.org/jira/browse/SPARK-13019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13019:
Assignee: Apache Spark
> Replace example code in mllib-statistics.md using include_example
[
https://issues.apache.org/jira/browse/SPARK-13227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13227:
Assignee: Apache Spark
> Risky apply() in OpenHashMap
>
>
>
[
https://issues.apache.org/jira/browse/SPARK-13227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136160#comment-15136160
]
Apache Spark commented on SPARK-13227:
--
User 'CodingCat' has created a pull request
[
https://issues.apache.org/jira/browse/SPARK-13227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13227:
Assignee: (was: Apache Spark)
> Risky apply() in OpenHashMap
> ---
[
https://issues.apache.org/jira/browse/SPARK-12154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136159#comment-15136159
]
Milad Khajavi commented on SPARK-12154:
---
Hmm, I can work on this issue, if some exp
Nan Zhu created SPARK-13227:
---
Summary: Risky apply() in OpenHashMap
Key: SPARK-13227
URL: https://issues.apache.org/jira/browse/SPARK-13227
Project: Spark
Issue Type: Bug
Components: Spar
[
https://issues.apache.org/jira/browse/SPARK-11472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136127#comment-15136127
]
Atkins edited comment on SPARK-11472 at 2/7/16 4:15 AM:
Reproduce
[
https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136132#comment-15136132
]
Xusen Yin commented on SPARK-13178:
---
Cheers for the good news! :)
> RRDD faces with co
[
https://issues.apache.org/jira/browse/SPARK-11472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136127#comment-15136127
]
Atkins commented on SPARK-11472:
Reproduced in kerberized hadoop cluster with Spark 1.6.0
[
https://issues.apache.org/jira/browse/SPARK-13226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-13226:
Comment: was deleted
(was: Further poking shows that they did both depending on the kmeans algorithm
used,
[
https://issues.apache.org/jira/browse/SPARK-13226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-13226:
Description:
The current MLLib PowerIteration clustering implementation sets the number of
parallel runs i
[
https://issues.apache.org/jira/browse/SPARK-13201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-13201:
Comment: was deleted
(was: Looking at the only use inside of the MLLib API, I'm pretty convinced its
being
[
https://issues.apache.org/jira/browse/SPARK-13226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136118#comment-15136118
]
holdenk commented on SPARK-13226:
-
Further poking shows that they did both depending on t
[
https://issues.apache.org/jira/browse/SPARK-13226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136113#comment-15136113
]
holdenk commented on SPARK-13226:
-
Also update the links since the PDF linked to in the m
[
https://issues.apache.org/jira/browse/SPARK-13201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136112#comment-15136112
]
holdenk edited comment on SPARK-13201 at 2/7/16 2:56 AM:
-
Looking
[
https://issues.apache.org/jira/browse/SPARK-13201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136112#comment-15136112
]
holdenk commented on SPARK-13201:
-
Looking at the only use inside of the MLLib API, I'm p
[
https://issues.apache.org/jira/browse/SPARK-13201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-13201:
Component/s: (was: ML)
> Make a private non-deprecated version of setRuns
> ---
[
https://issues.apache.org/jira/browse/SPARK-13201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-13201:
Description: Make a private non-deprecated version of setRuns API so that
we can call it from the PythonAPI
holdenk created SPARK-13226:
---
Summary: MLLib PowerIteration Clustering depends on deprecated
KMeans setRuns API
Key: SPARK-13226
URL: https://issues.apache.org/jira/browse/SPARK-13226
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-13225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13225:
Assignee: (was: Apache Spark)
> [SQL] Support Intersect All/Distinct
> ---
[
https://issues.apache.org/jira/browse/SPARK-13225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136093#comment-15136093
]
Apache Spark commented on SPARK-13225:
--
User 'gatorsmile' has created a pull request
[
https://issues.apache.org/jira/browse/SPARK-13225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13225:
Assignee: Apache Spark
> [SQL] Support Intersect All/Distinct
> --
Xiao Li created SPARK-13225:
---
Summary: [SQL] Support Intersect All/Distinct
Key: SPARK-13225
URL: https://issues.apache.org/jira/browse/SPARK-13225
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-12469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136080#comment-15136080
]
Apache Spark commented on SPARK-12469:
--
User 'holdenk' has created a pull request fo
[
https://issues.apache.org/jira/browse/SPARK-13209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Placek updated SPARK-13209:
---
Description:
When I run the following loop the join gets slower and slower regardless of
caching. If I chang
[
https://issues.apache.org/jira/browse/SPARK-10935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136053#comment-15136053
]
Ruslan Dautkhanov edited comment on SPARK-10935 at 2/6/16 11:16 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-10935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136053#comment-15136053
]
Ruslan Dautkhanov commented on SPARK-10935:
---
I noticed outer joins. Spark befor
[
https://issues.apache.org/jira/browse/SPARK-13186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13186:
Assignee: Apache Spark
> Migrate away from SynchronizedMap which is derpecated
> -
[
https://issues.apache.org/jira/browse/SPARK-13186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136052#comment-15136052
]
Apache Spark commented on SPARK-13186:
--
User 'huaxingao' has created a pull request
[
https://issues.apache.org/jira/browse/SPARK-13186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13186:
Assignee: (was: Apache Spark)
> Migrate away from SynchronizedMap which is derpecated
[
https://issues.apache.org/jira/browse/SPARK-13171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136006#comment-15136006
]
Ted Yu commented on SPARK-13171:
https://amplab.cs.berkeley.edu/jenkins/job/spark-master-
[
https://issues.apache.org/jira/browse/SPARK-13171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136004#comment-15136004
]
Ted Yu commented on SPARK-13171:
No, I have not.
bq. Could it be that Hive does some fun
[
https://issues.apache.org/jira/browse/SPARK-13171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135979#comment-15135979
]
Jakob Odersky commented on SPARK-13171:
---
[~tedyu]
On GitHub, [~smilegator] wrote:
[
https://issues.apache.org/jira/browse/SPARK-13171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135976#comment-15135976
]
Jakob Odersky commented on SPARK-13171:
---
I can't see any reason why the above chang
[
https://issues.apache.org/jira/browse/SPARK-13171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135972#comment-15135972
]
Ted Yu commented on SPARK-13171:
The above Jenkins jobs were running Scala 2.11
> Update
[
https://issues.apache.org/jira/browse/SPARK-13171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135966#comment-15135966
]
Jakob Odersky commented on SPARK-13171:
---
Thats strange, considering that the underl
[
https://issues.apache.org/jira/browse/SPARK-13216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-13216:
--
Priority: Minor (was: Major)
I think checkpointing is for restarting a failed application in as exactl
[
https://issues.apache.org/jira/browse/SPARK-13062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135932#comment-15135932
]
Sean Owen commented on SPARK-13062:
---
I'm sure there could be a better error, sure. But
[
https://issues.apache.org/jira/browse/SPARK-12861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135927#comment-15135927
]
Sean Owen commented on SPARK-12861:
---
I don't think that's the eventual topic of SPARK-4
[
https://issues.apache.org/jira/browse/SPARK-9307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135922#comment-15135922
]
Apache Spark commented on SPARK-9307:
-
User 'srowen' has created a pull request for th
[
https://issues.apache.org/jira/browse/SPARK-9307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-9307:
---
Assignee: (was: Apache Spark)
> Logging: Make it either stable or private[spark]
> --
[
https://issues.apache.org/jira/browse/SPARK-9307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-9307:
---
Assignee: Apache Spark
> Logging: Make it either stable or private[spark]
> -
[
https://issues.apache.org/jira/browse/SPARK-13199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-13199.
---
Resolution: Duplicate
Ted this is more of what I mean -- this was a duplicate. Please search first
[
https://issues.apache.org/jira/browse/SPARK-4878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-4878.
--
Resolution: Not A Problem
I'm guessing 'not a problem' in that there's no Akka involved anymore
> drive
[
https://issues.apache.org/jira/browse/SPARK-12892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-12892.
---
Resolution: Duplicate
> Support plugging in Spark scheduler
>
>
[
https://issues.apache.org/jira/browse/SPARK-13218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135898#comment-15135898
]
leo wu commented on SPARK-13218:
If I understand it correctly, sparkConf is only used to
[
https://issues.apache.org/jira/browse/SPARK-13218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
leo wu updated SPARK-13218:
---
Summary: Executor failed after SparkContext stop and start again (was:
Executor failed after SparkContext
[
https://issues.apache.org/jira/browse/SPARK-5865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-5865:
-
Assignee: Nicholas Chammas
> Add doc warnings for methods that return local data structures
>
[
https://issues.apache.org/jira/browse/SPARK-5865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-5865.
--
Resolution: Fixed
Fix Version/s: 2.0.0
Issue resolved by pull request 10874
[https://github.com/a
[
https://issues.apache.org/jira/browse/SPARK-13218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen closed SPARK-13218.
-
> Executor failed after SparkContext start and start again
> --
[
https://issues.apache.org/jira/browse/SPARK-13218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-13218.
---
Resolution: Not A Problem
I don't think it's ever been possible to change masters in one app.
> Exec
[
https://issues.apache.org/jira/browse/SPARK-13198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135889#comment-15135889
]
Sean Owen commented on SPARK-13198:
---
You've described a different problem, which you op
[
https://issues.apache.org/jira/browse/SPARK-13224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-13224.
---
Resolution: Invalid
> I use SparkR(Spark version 1.6), but its not working in parallel all
> cluster
Karen created SPARK-13224:
-
Summary: I use SparkR(Spark version 1.6), but its not working in
parallel all clusters. The flow is as follows. model <-
SparkR:::lapply(workList,worker) Can you help why it is so.
Key: SPARK-13224
[
https://issues.apache.org/jira/browse/SPARK-13171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135784#comment-15135784
]
Ted Yu commented on SPARK-13171:
Turns out not that trivial.
For build against hadoop 2.
[
https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135690#comment-15135690
]
Hitoshi Ozawa edited comment on SPARK-11416 at 2/6/16 8:53 AM:
[
https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135690#comment-15135690
]
Hitoshi Ozawa commented on SPARK-11416:
---
I've used Kryo3Upgrade branch in https://g
[
https://issues.apache.org/jira/browse/SPARK-13223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13223:
Assignee: Apache Spark
> Add stratified sampling to ML feature engineering
> -
[
https://issues.apache.org/jira/browse/SPARK-13223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135689#comment-15135689
]
Apache Spark commented on SPARK-13223:
--
User 'hhbyyh' has created a pull request for
[
https://issues.apache.org/jira/browse/SPARK-13223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-13223:
Assignee: (was: Apache Spark)
> Add stratified sampling to ML feature engineering
> --
yuhao yang created SPARK-13223:
--
Summary: Add stratified sampling to ML feature engineering
Key: SPARK-13223
URL: https://issues.apache.org/jira/browse/SPARK-13223
Project: Spark
Issue Type: New
[
https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15135671#comment-15135671
]
Sun Rui commented on SPARK-13178:
-
The root cause is that RRDD.compute() uses some instan
71 matches
Mail list logo