Andrew Ray created SPARK-39897:
--
Summary: StackOverflowError in TaskMemoryManager
Key: SPARK-39897
URL: https://issues.apache.org/jira/browse/SPARK-39897
Project: Spark
Issue Type: Bug
Andrew Ray created SPARK-39883:
--
Summary: Add DataFrame function parity check
Key: SPARK-39883
URL: https://issues.apache.org/jira/browse/SPARK-39883
Project: Spark
Issue Type: Improvement
Andrew Ray created SPARK-39734:
--
Summary: Add call_udf to pyspark.sql.functions
Key: SPARK-39734
URL: https://issues.apache.org/jira/browse/SPARK-39734
Project: Spark
Issue Type: Improvement
Andrew Ray created SPARK-39733:
--
Summary: Add map_contains_key to pyspark.sql.functions
Key: SPARK-39733
URL: https://issues.apache.org/jira/browse/SPARK-39733
Project: Spark
Issue Type: Improve
[
https://issues.apache.org/jira/browse/SPARK-39728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Ray updated SPARK-39728:
---
Priority: Minor (was: Major)
> Test for parity of SQL functions between Python and JVM DataFrame AP
Andrew Ray created SPARK-39728:
--
Summary: Test for parity of SQL functions between Python and JVM
DataFrame API's
Key: SPARK-39728
URL: https://issues.apache.org/jira/browse/SPARK-39728
Project: Spark
Andrew Ray created SPARK-21628:
--
Summary: Explicitly specify Java version in maven compiler plugin
so IntelliJ imports project correctly
Key: SPARK-21628
URL: https://issues.apache.org/jira/browse/SPARK-21628
[
https://issues.apache.org/jira/browse/SPARK-21034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111454#comment-16111454
]
Andrew Ray commented on SPARK-21034:
Yes a=1 is the filter to be pushed down. It is n
[
https://issues.apache.org/jira/browse/SPARK-21034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1631#comment-1631
]
Andrew Ray commented on SPARK-21034:
{{first}} is not a deterministic function and th
[
https://issues.apache.org/jira/browse/SPARK-21110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111093#comment-16111093
]
Andrew Ray commented on SPARK-21110:
https://github.com/apache/spark/pull/18818
> St
[
https://issues.apache.org/jira/browse/SPARK-21110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16109734#comment-16109734
]
Andrew Ray commented on SPARK-21110:
I'm working on this
> Structs should be usable
[
https://issues.apache.org/jira/browse/SPARK-21330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16109568#comment-16109568
]
Andrew Ray commented on SPARK-21330:
https://github.com/apache/spark/pull/18800
> Ba
[
https://issues.apache.org/jira/browse/SPARK-21565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16108006#comment-16108006
]
Andrew Ray commented on SPARK-21565:
No nothing like the limitations of microbatches.
[
https://issues.apache.org/jira/browse/SPARK-21565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16107933#comment-16107933
]
Andrew Ray commented on SPARK-21565:
I believe you need to use a window to group by y
[
https://issues.apache.org/jira/browse/SPARK-21584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Ray updated SPARK-21584:
---
Component/s: SQL
> Update R method for summary to call new implementation
> -
Andrew Ray created SPARK-21584:
--
Summary: Update R method for summary to call new implementation
Key: SPARK-21584
URL: https://issues.apache.org/jira/browse/SPARK-21584
Project: Spark
Issue Type
Andrew Ray created SPARK-21566:
--
Summary: Python method for summary
Key: SPARK-21566
URL: https://issues.apache.org/jira/browse/SPARK-21566
Project: Spark
Issue Type: Improvement
Compo
[
https://issues.apache.org/jira/browse/SPARK-21100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Ray updated SPARK-21100:
---
Summary: Add summary method as alternative to describe that gives quartiles
similar to Pandas (was:
[
https://issues.apache.org/jira/browse/SPARK-21184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16067167#comment-16067167
]
Andrew Ray commented on SPARK-21184:
Also the lookup queries are just wrong
{code}
s
Andrew Ray created SPARK-21184:
--
Summary: QuantileSummaries implementation is wrong and
QuantileSummariesSuite fails with larger n
Key: SPARK-21184
URL: https://issues.apache.org/jira/browse/SPARK-21184
Andrew Ray created SPARK-21100:
--
Summary: describe should give quartiles similar to Pandas
Key: SPARK-21100
URL: https://issues.apache.org/jira/browse/SPARK-21100
Project: Spark
Issue Type: Impr
[
https://issues.apache.org/jira/browse/SPARK-20839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Ray resolved SPARK-20839.
Resolution: Not A Problem
> Incorrect Dynamic PageRank calculation
> --
[
https://issues.apache.org/jira/browse/SPARK-20839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16049214#comment-16049214
]
Andrew Ray commented on SPARK-20839:
1 & 2 work together to do the algorithm properly
Andrew Ray created SPARK-20769:
--
Summary: Incorrect documentation for using Jupyter notebook
Key: SPARK-20769
URL: https://issues.apache.org/jira/browse/SPARK-20769
Project: Spark
Issue Type: Bu
[
https://issues.apache.org/jira/browse/SPARK-20429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15991099#comment-15991099
]
Andrew Ray commented on SPARK-20429:
Can you retest your example with Spark 2.2/maste
[
https://issues.apache.org/jira/browse/SPARK-19136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Ray resolved SPARK-19136.
Resolution: Not A Bug
> Aggregator with case class as output type fails with ClassCastException
> -
[
https://issues.apache.org/jira/browse/SPARK-16683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15831878#comment-15831878
]
Andrew Ray commented on SPARK-16683:
I'm working on a solution for this
> Group by d
[
https://issues.apache.org/jira/browse/SPARK-18568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15822397#comment-15822397
]
Andrew Ray commented on SPARK-18568:
RDD's have the same problem for cached collectio
[
https://issues.apache.org/jira/browse/SPARK-19116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15821914#comment-15821914
]
Andrew Ray commented on SPARK-19116:
The 2318 number is the size of the parquet files
[
https://issues.apache.org/jira/browse/SPARK-19136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15821868#comment-15821868
]
Andrew Ray commented on SPARK-19136:
I forgot you can also just do:
{code}
ds.select(
[
https://issues.apache.org/jira/browse/SPARK-8853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15816049#comment-15816049
]
Andrew Ray commented on SPARK-8853:
---
But there is no reason to directly create a {{FPGro
[
https://issues.apache.org/jira/browse/SPARK-19136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15815460#comment-15815460
]
Andrew Ray commented on SPARK-19136:
You did not to a _typed_ aggregation so your res
[
https://issues.apache.org/jira/browse/SPARK-18393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15781473#comment-15781473
]
Andrew Ray commented on SPARK-18393:
It wouldn't hurt to backport to 2.0, its a prett
[
https://issues.apache.org/jira/browse/SPARK-18847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746395#comment-15746395
]
Andrew Ray commented on SPARK-18847:
I have and have not found any relevant. I'm curr
[
https://issues.apache.org/jira/browse/SPARK-18845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746385#comment-15746385
]
Andrew Ray commented on SPARK-18845:
[~srowen] No that's a different thing just wheth
Andrew Ray created SPARK-18848:
--
Summary: PageRank gives incorrect results for graphs with sinks
Key: SPARK-18848
URL: https://issues.apache.org/jira/browse/SPARK-18848
Project: Spark
Issue Type
Andrew Ray created SPARK-18847:
--
Summary: PageRank gives incorrect results for graphs with sinks
Key: SPARK-18847
URL: https://issues.apache.org/jira/browse/SPARK-18847
Project: Spark
Issue Type
Andrew Ray created SPARK-18845:
--
Summary: PageRank has incorrect initialization value that leads to
slow convergence
Key: SPARK-18845
URL: https://issues.apache.org/jira/browse/SPARK-18845
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-17859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15733247#comment-15733247
]
Andrew Ray commented on SPARK-17859:
this appears to be fixed in 2.0.2
{code}
scala>
[
https://issues.apache.org/jira/browse/SPARK-18717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15723527#comment-15723527
]
Andrew Ray commented on SPARK-18717:
I have a fix for this, will make a PR in a bit
[
https://issues.apache.org/jira/browse/SPARK-18717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15723499#comment-15723499
]
Andrew Ray commented on SPARK-18717:
Use `scala.collection.Map` as the type in your c
[
https://issues.apache.org/jira/browse/SPARK-11705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15715675#comment-15715675
]
Andrew Ray commented on SPARK-11705:
Above example does not have a cartesian product
[
https://issues.apache.org/jira/browse/SPARK-17896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15703320#comment-15703320
]
Andrew Ray commented on SPARK-17896:
The given code seems to work in 2.0.2
> Dataset
Andrew Ray created SPARK-18457:
--
Summary: ORC and other columnar formats using HiveShim read all
columns when doing a simple count
Key: SPARK-18457
URL: https://issues.apache.org/jira/browse/SPARK-18457
[
https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15494591#comment-15494591
]
Andrew Ray commented on SPARK-17458:
[~hvanhovell]: My JIRA username is a1ray.
> Ali
[
https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Ray updated SPARK-17458:
---
Comment: was deleted
(was: [~hvanhovell] It's a1ray)
> Alias specified for aggregates in a pivot are
[
https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15494361#comment-15494361
]
Andrew Ray edited comment on SPARK-17458 at 9/15/16 8:09 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15494361#comment-15494361
]
Andrew Ray commented on SPARK-17458:
It's a1ray
> Alias specified for aggregates in
Andrew Ray created SPARK-13749:
--
Summary: Faster pivot implementation for many distinct values with
two phase aggregation
Key: SPARK-13749
URL: https://issues.apache.org/jira/browse/SPARK-13749
Project:
[
https://issues.apache.org/jira/browse/SPARK-12911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15109269#comment-15109269
]
Andrew Ray commented on SPARK-12911:
In the current master this happens even without
[
https://issues.apache.org/jira/browse/SPARK-9042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15060208#comment-15060208
]
Andrew Ray commented on SPARK-9042:
---
Sean, I think there are a couple issues going on he
Andrew Ray created SPARK-12211:
--
Summary: Incorrect version number in graphx doc for migration from
1.1
Key: SPARK-12211
URL: https://issues.apache.org/jira/browse/SPARK-12211
Project: Spark
Is
Andrew Ray created SPARK-12205:
--
Summary: Pivot fails Analysis when aggregate is UnresolvedFunction
Key: SPARK-12205
URL: https://issues.apache.org/jira/browse/SPARK-12205
Project: Spark
Issue T
Andrew Ray created SPARK-12184:
--
Summary: Make python api doc for pivot consistant with scala doc
Key: SPARK-12184
URL: https://issues.apache.org/jira/browse/SPARK-12184
Project: Spark
Issue Typ
Andrew Ray created SPARK-11690:
--
Summary: Add pivot to python api
Key: SPARK-11690
URL: https://issues.apache.org/jira/browse/SPARK-11690
Project: Spark
Issue Type: Improvement
Compone
[
https://issues.apache.org/jira/browse/SPARK-11275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14981338#comment-14981338
]
Andrew Ray commented on SPARK-11275:
I think that I understand what is happening here
Andrew Ray created SPARK-8718:
-
Summary: Improve EdgePartition2D for non perfect square number of
partitions
Key: SPARK-8718
URL: https://issues.apache.org/jira/browse/SPARK-8718
Project: Spark
Andrew Ray created SPARK-5159:
-
Summary: Thrift server does not respect
hive.server2.enable.doAs=true
Key: SPARK-5159
URL: https://issues.apache.org/jira/browse/SPARK-5159
Project: Spark
Issue T
58 matches
Mail list logo