[
https://issues.apache.org/jira/browse/SPARK-47063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17820877#comment-17820877
]
Robert Joseph Evans commented on SPARK-47063:
-
[~planga82] I was not planning on putting up
Robert Joseph Evans created SPARK-47063:
---
Summary: CAST long to timestamp has different behavior for codegen
vs interpreted
Key: SPARK-47063
URL: https://issues.apache.org/jira/browse/SPARK-47063
Robert Joseph Evans created SPARK-46778:
---
Summary: get_json_object flattens wildcard queries that match a
single value
Key: SPARK-46778
URL: https://issues.apache.org/jira/browse/SPARK-46778
Robert Joseph Evans created SPARK-46761:
---
Summary: quoted strings in a JSON path should support ? characters
Key: SPARK-46761
URL: https://issues.apache.org/jira/browse/SPARK-46761
Project:
[
https://issues.apache.org/jira/browse/SPARK-45879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-45879:
Affects Version/s: 3.4.1
3.2.3
> Number check for
[
https://issues.apache.org/jira/browse/SPARK-45599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-45599:
Priority: Blocker (was: Major)
> Percentile can produce a wrong answer if -0.0
[
https://issues.apache.org/jira/browse/SPARK-45599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-45599:
Labels: data-corruption (was: )
> Percentile can produce a wrong answer if -0.0
Robert Joseph Evans created SPARK-45599:
---
Summary: Percentile can produce a wrong answer if -0.0 and 0.0 are
mixed in the dataset
Key: SPARK-45599
URL: https://issues.apache.org/jira/browse/SPARK-45599
Robert Joseph Evans created SPARK-45243:
---
Summary: RADIX sort is not stable and can produce different
results for first/collect_list aggs
Key: SPARK-45243
URL:
Robert Joseph Evans created SPARK-44500:
---
Summary: parse_url treats key as regular expression
Key: SPARK-44500
URL: https://issues.apache.org/jira/browse/SPARK-44500
Project: Spark
Robert Joseph Evans created SPARK-42898:
---
Summary: Cast from string to date and date to string say timezone
is needed, but it is not used
Key: SPARK-42898
URL:
Robert Joseph Evans created SPARK-41218:
---
Summary: ParquetTable reports is supports negative scale decimal
values
Key: SPARK-41218
URL: https://issues.apache.org/jira/browse/SPARK-41218
Robert Joseph Evans created SPARK-40280:
---
Summary: Failure to create parquet predicate push down for ints
and longs on some valid files
Key: SPARK-40280
URL:
Robert Joseph Evans created SPARK-40129:
---
Summary: Decimal multiply can produce the wrong answer because it
rounds twice
Key: SPARK-40129
URL: https://issues.apache.org/jira/browse/SPARK-40129
[
https://issues.apache.org/jira/browse/SPARK-40089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580434#comment-17580434
]
Robert Joseph Evans commented on SPARK-40089:
-
I put up a PR
[
https://issues.apache.org/jira/browse/SPARK-40089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-40089:
Summary: Sorting of at least Decimal(20, 2) fails for some values near the
max.
[
https://issues.apache.org/jira/browse/SPARK-40089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580377#comment-17580377
]
Robert Joseph Evans commented on SPARK-40089:
-
Never mind I figured out that there is a
[
https://issues.apache.org/jira/browse/SPARK-40089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580360#comment-17580360
]
Robert Joseph Evans commented on SPARK-40089:
-
I have been trying to come up with a patch,
[
https://issues.apache.org/jira/browse/SPARK-40089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17579897#comment-17579897
]
Robert Joseph Evans commented on SPARK-40089:
-
Looking at the code it appears that the
[
https://issues.apache.org/jira/browse/SPARK-40089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17579892#comment-17579892
]
Robert Joseph Evans commented on SPARK-40089:
-
It sure looks like it is related to the
[
https://issues.apache.org/jira/browse/SPARK-40089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17579887#comment-17579887
]
Robert Joseph Evans commented on SPARK-40089:
-
I have been trying to debug this and it does
[
https://issues.apache.org/jira/browse/SPARK-40089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-40089:
Attachment: input.parquet
> Doring of at least Decimal(20, 2) fails for some
Robert Joseph Evans created SPARK-40089:
---
Summary: Doring of at least Decimal(20, 2) fails for some values
near the max.
Key: SPARK-40089
URL: https://issues.apache.org/jira/browse/SPARK-40089
Robert Joseph Evans created SPARK-39031:
---
Summary: NaN != NaN in pivot
Key: SPARK-39031
URL: https://issues.apache.org/jira/browse/SPARK-39031
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-38955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525310#comment-17525310
]
Robert Joseph Evans commented on SPARK-38955:
-
Conceptually I am fine if we want to remove
Robert Joseph Evans created SPARK-38955:
---
Summary: from_csv can corrupt surrounding lines if a lineSep is in
the data
Key: SPARK-38955
URL: https://issues.apache.org/jira/browse/SPARK-38955
[
https://issues.apache.org/jira/browse/SPARK-38604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17509265#comment-17509265
]
Robert Joseph Evans commented on SPARK-38604:
-
I marked this as a critical because it
[
https://issues.apache.org/jira/browse/SPARK-38604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-38604:
Priority: Blocker (was: Major)
> ceil and floor return different types when
[
https://issues.apache.org/jira/browse/SPARK-38604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-38604:
Priority: Critical (was: Blocker)
> ceil and floor return different types when
Robert Joseph Evans created SPARK-38604:
---
Summary: ceil and floor return different types when called from
scala than sql
Key: SPARK-38604
URL: https://issues.apache.org/jira/browse/SPARK-38604
[
https://issues.apache.org/jira/browse/SPARK-38577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17508269#comment-17508269
]
Robert Joseph Evans commented on SPARK-38577:
-
This is especially problematic because it is
Robert Joseph Evans created SPARK-37024:
---
Summary: Even more decimal overflow issues in average
Key: SPARK-37024
URL: https://issues.apache.org/jira/browse/SPARK-37024
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-35563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17368098#comment-17368098
]
Robert Joseph Evans commented on SPARK-35563:
-
Or just do the overflow check on the int. I
[
https://issues.apache.org/jira/browse/SPARK-35563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17368096#comment-17368096
]
Robert Joseph Evans commented on SPARK-35563:
-
Yes, technically if we switch it from an int
[
https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17364334#comment-17364334
]
Robert Joseph Evans edited comment on SPARK-35089 at 6/16/21, 2:54 PM:
[
https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17364334#comment-17364334
]
Robert Joseph Evans commented on SPARK-35089:
-
{quote}I understand ordering data, but I
[
https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17364277#comment-17364277
]
Robert Joseph Evans commented on SPARK-35089:
-
[~Tonzetic], I don't know what you mean by an
[
https://issues.apache.org/jira/browse/SPARK-35563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17362910#comment-17362910
]
Robert Joseph Evans commented on SPARK-35563:
-
[~dc-heros] Thanks for looking into this. I
[
https://issues.apache.org/jira/browse/SPARK-35563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-35563:
Labels: data-loss (was: )
> [SQL] Window operations with over Int.MaxValue + 1
[
https://issues.apache.org/jira/browse/SPARK-35563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-35563:
Priority: Blocker (was: Major)
> [SQL] Window operations with over Int.MaxValue
[
https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17355088#comment-17355088
]
Robert Joseph Evans commented on SPARK-35089:
-
I should add that the above "solution" is
[
https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17355083#comment-17355083
]
Robert Joseph Evans commented on SPARK-35089:
-
[~Tonzetic] to be clear my point was just to
[
https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17353778#comment-17353778
]
Robert Joseph Evans commented on SPARK-35089:
-
On window functions if the {{order by}}
[
https://issues.apache.org/jira/browse/SPARK-35563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-35563:
Priority: Blocker (was: Major)
> [SQL] Window operations with over Int.MaxValue
Robert Joseph Evans created SPARK-35563:
---
Summary: [SQL] Window operations with over Int.MaxValue + 1 rows
can silently drop rows
Key: SPARK-35563
URL: https://issues.apache.org/jira/browse/SPARK-35563
[
https://issues.apache.org/jira/browse/SPARK-35108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17338948#comment-17338948
]
Robert Joseph Evans edited comment on SPARK-35108 at 5/4/21, 12:34 PM:
[
https://issues.apache.org/jira/browse/SPARK-35108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17338948#comment-17338948
]
Robert Joseph Evans commented on SPARK-35108:
-
Looks good. Thanks for the fix.
> Pickle
[
https://issues.apache.org/jira/browse/SPARK-35108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17324088#comment-17324088
]
Robert Joseph Evans commented on SPARK-35108:
-
If you have SPARK_HOME set when you run
[
https://issues.apache.org/jira/browse/SPARK-35108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-35108:
Attachment: test.sh
test.py
> Pickle produces incorrect key
Robert Joseph Evans created SPARK-35108:
---
Summary: Pickle produces incorrect key labels for
GenericRowWithSchema (data corruption)
Key: SPARK-35108
URL: https://issues.apache.org/jira/browse/SPARK-35108
[
https://issues.apache.org/jira/browse/SPARK-32110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17247362#comment-17247362
]
Robert Joseph Evans commented on SPARK-32110:
-
Thanks [~tanelk] then resolving this is fine.
[
https://issues.apache.org/jira/browse/SPARK-32110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17247305#comment-17247305
]
Robert Joseph Evans commented on SPARK-32110:
-
I have not tried this again on the latest
[
https://issues.apache.org/jira/browse/SPARK-32110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17246585#comment-17246585
]
Robert Joseph Evans commented on SPARK-32110:
-
Are we sure that this issue should be closed?
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182345#comment-17182345
]
Robert Joseph Evans commented on SPARK-32672:
-
Honestly, it is not a big deal what happened.
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17181873#comment-17181873
]
Robert Joseph Evans commented on SPARK-32672:
-
OK reading through the code I understand what
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-32672:
Attachment: small_bad.snappy.parquet
> Data corruption in some cached compressed
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17181868#comment-17181868
]
Robert Joseph Evans commented on SPARK-32672:
-
So I am able to reduce the corruption down to
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-32672:
Affects Version/s: 3.1.0
> Data corruption in some cached compressed boolean
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17181486#comment-17181486
]
Robert Joseph Evans commented on SPARK-32672:
-
I added some debugging to the compression
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17181478#comment-17181478
]
Robert Joseph Evans commented on SPARK-32672:
-
I did a little debugging and found that
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17181466#comment-17181466
]
Robert Joseph Evans commented on SPARK-32672:
-
I verified that this is still happening on
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17181459#comment-17181459
]
Robert Joseph Evans commented on SPARK-32672:
-
I verified that this is still happening on
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-32672:
Affects Version/s: 2.4.6
> Data corruption in some cached compressed boolean
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-32672:
Summary: Data corruption in some cached compressed boolean columns (was:
Daat
[
https://issues.apache.org/jira/browse/SPARK-32672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-32672:
Attachment: bad_order.snappy.parquet
> Data corruption in some cached compressed
Robert Joseph Evans created SPARK-32672:
---
Summary: Daat corruption in some cached compressed boolean columns
Key: SPARK-32672
URL: https://issues.apache.org/jira/browse/SPARK-32672
Project:
[
https://issues.apache.org/jira/browse/SPARK-32612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178927#comment-17178927
]
Robert Joseph Evans commented on SPARK-32612:
-
This is just one example that shows what can
Robert Joseph Evans created SPARK-32612:
---
Summary: int columns produce inconsistent results on pandas UDFs
Key: SPARK-32612
URL: https://issues.apache.org/jira/browse/SPARK-32612
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-32334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17163576#comment-17163576
]
Robert Joseph Evans commented on SPARK-32334:
-
Row to columnar and columnar to row is mostly
[
https://issues.apache.org/jira/browse/SPARK-32334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17162147#comment-17162147
]
Robert Joseph Evans commented on SPARK-32334:
-
I think I can get the conversation started
[
https://issues.apache.org/jira/browse/SPARK-32274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17155660#comment-17155660
]
Robert Joseph Evans commented on SPARK-32274:
-
I filed
[
https://issues.apache.org/jira/browse/SPARK-32274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-32274:
Description:
Caching a dataset or dataframe can be a very expensive operation,
[
https://issues.apache.org/jira/browse/SPARK-32274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17155654#comment-17155654
]
Robert Joseph Evans commented on SPARK-32274:
-
If someone could assign this to me that would
Robert Joseph Evans created SPARK-32274:
---
Summary: Add in the ability for a user to replace the
serialization format of the cache
Key: SPARK-32274
URL: https://issues.apache.org/jira/browse/SPARK-32274
Robert Joseph Evans created SPARK-32110:
---
Summary: -0.0 vs 0.0 is inconsistent
Key: SPARK-32110
URL: https://issues.apache.org/jira/browse/SPARK-32110
Project: Spark
Issue Type: Bug
Robert Joseph Evans created SPARK-28774:
---
Summary: ReusedExchangeExec cannot be columnar
Key: SPARK-28774
URL: https://issues.apache.org/jira/browse/SPARK-28774
Project: Spark
Issue
Robert Joseph Evans created SPARK-28213:
---
Summary: Remove duplication between columnar and ColumnarBatchScan
Key: SPARK-28213
URL: https://issues.apache.org/jira/browse/SPARK-28213
Project:
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-27396:
Epic Name: Public APIs for extended Columnar Processing Support
> SPIP: Public
Robert Joseph Evans created SPARK-27945:
---
Summary: Make minimal changes to support columnar processing
Key: SPARK-27945
URL: https://issues.apache.org/jira/browse/SPARK-27945
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-27396:
Issue Type: Epic (was: Improvement)
> SPIP: Public APIs for extended Columnar
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16832776#comment-16832776
]
Robert Joseph Evans commented on SPARK-27396:
-
[~bryanc]
The nice to have arrow formatting
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16829631#comment-16829631
]
Robert Joseph Evans commented on SPARK-27396:
-
I have updated this SPIP to clarify some
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-27396:
Description:
*SPIP: Columnar Processing Without Arrow Formatting Guarantees.*
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822411#comment-16822411
]
Robert Joseph Evans commented on SPARK-27396:
-
[~mengxr],
My goal is to provide a framework
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16819592#comment-16819592
]
Robert Joseph Evans commented on SPARK-27396:
-
This SPIP is to put a framework in place to
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16819125#comment-16819125
]
Robert Joseph Evans commented on SPARK-27396:
-
[~bryanc],
I see your point that if this is
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16817990#comment-16817990
]
Robert Joseph Evans commented on SPARK-27396:
-
There are actually a few public facing APIs I
[
https://issues.apache.org/jira/browse/SPARK-26413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16817081#comment-16817081
]
Robert Joseph Evans commented on SPARK-26413:
-
SPARK-27396 covers this, but with a slightly
[
https://issues.apache.org/jira/browse/SPARK-24579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16817080#comment-16817080
]
Robert Joseph Evans commented on SPARK-24579:
-
This SPIP SPARK-27396 covers a superset of
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815846#comment-16815846
]
Robert Joseph Evans commented on SPARK-27396:
-
[~kiszk],
The exact detail of some of the
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815426#comment-16815426
]
Robert Joseph Evans commented on SPARK-27396:
-
Thanks [~tgraves] I updated the JIRA with you
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Joseph Evans updated SPARK-27396:
Shepherd: Thomas Graves
> SPIP: Public APIs for extended Columnar Processing
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16814883#comment-16814883
]
Robert Joseph Evans commented on SPARK-27396:
-
This SPIP has been up for 5 days and I see 10
[
https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16811183#comment-16811183
]
Robert Joseph Evans commented on SPARK-27396:
-
I have kept this at a high level just
Robert Joseph Evans created SPARK-27396:
---
Summary: SPIP: Public APIs for extended Columnar Processing Support
Key: SPARK-27396
URL: https://issues.apache.org/jira/browse/SPARK-27396
Project:
95 matches
Mail list logo