[
https://issues.apache.org/jira/browse/SPARK-22017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tathagata Das resolved SPARK-22017.
---
Resolution: Fixed
Fix Version/s: 3.0.0
Issue resolved by pull request 19239
AnChe Kuo created SPARK-22034:
-
Summary: CrossValidator's training and testing set with different
set of labels, resulting in encoder transform error
Key: SPARK-22034
URL:
[
https://issues.apache.org/jira/browse/SPARK-22033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168650#comment-16168650
]
Sean Owen commented on SPARK-22033:
---
Hm, good point. There may be other similar issues throughout the
[
https://issues.apache.org/jira/browse/SPARK-22033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168612#comment-16168612
]
Vadim Semenov commented on SPARK-22033:
---
Leaving traces for others if they happen to hit the same
Vadim Semenov created SPARK-22033:
-
Summary: BufferHolder size checks should account for the specific
VM array size limitations
Key: SPARK-22033
URL: https://issues.apache.org/jira/browse/SPARK-22033
[
https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-12297:
Assignee: (was: Apache Spark)
> Add work-around for Parquet/Hive int96 timestamp bug.
[
https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168410#comment-16168410
]
Apache Spark commented on SPARK-12297:
--
User 'squito' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-12297:
Assignee: Apache Spark
> Add work-around for Parquet/Hive int96 timestamp bug.
>
[
https://issues.apache.org/jira/browse/SPARK-21842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168278#comment-16168278
]
Arthur Rand commented on SPARK-21842:
-
Hey [~kalvinnchau]
I'm currently of the mind that using the
[
https://issues.apache.org/jira/browse/SPARK-22032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22032:
Assignee: Apache Spark
> Speed up StructType.fromInternal
>
[
https://issues.apache.org/jira/browse/SPARK-22032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22032:
Assignee: (was: Apache Spark)
> Speed up StructType.fromInternal
>
[
https://issues.apache.org/jira/browse/SPARK-22032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168273#comment-16168273
]
Apache Spark commented on SPARK-22032:
--
User 'maver1ck' has created a pull request for this issue:
Maciej Bryński created SPARK-22032:
--
Summary: Speed up StructType.fromInternal
Key: SPARK-22032
URL: https://issues.apache.org/jira/browse/SPARK-22032
Project: Spark
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/SPARK-22031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-22031:
--
Target Version/s: (was: 2.3.0)
Labels: (was: newbie)
Priority: Minor
[
https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168038#comment-16168038
]
Nicholas Chammas commented on SPARK-17025:
--
I take that back. I won't be able to test this for
[
https://issues.apache.org/jira/browse/SPARK-22030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22030:
Assignee: (was: Apache Spark)
> GraphiteSink fails to re-connect to Graphite
[
https://issues.apache.org/jira/browse/SPARK-22030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168030#comment-16168030
]
Apache Spark commented on SPARK-22030:
--
User 'alexmnyc' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-22030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22030:
Assignee: Apache Spark
> GraphiteSink fails to re-connect to Graphite instances behind an
[
https://issues.apache.org/jira/browse/SPARK-22031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Laurent Valdes updated SPARK-22031:
---
Summary: KMeans - Compute cost for a single vector (was: Compute cost for
a single vector)
Laurent Valdes created SPARK-22031:
--
Summary: Compute cost for a single vector
Key: SPARK-22031
URL: https://issues.apache.org/jira/browse/SPARK-22031
Project: Spark
Issue Type: Improvement
Alex Mikhailau created SPARK-22030:
--
Summary: GraphiteSink fails to re-connect to Graphite instances
behind an ELB or any other auto-scaled LB
Key: SPARK-22030
URL:
[
https://issues.apache.org/jira/browse/SPARK-22029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168017#comment-16168017
]
Maciej Bryński commented on SPARK-22029:
I did a proof of concept with functools.lru_cache.
But
Maciej Bryński created SPARK-22029:
--
Summary: Cache of _parse_datatype_json_string function
Key: SPARK-22029
URL: https://issues.apache.org/jira/browse/SPARK-22029
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-22024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Bryński updated SPARK-22024:
---
Summary: [pySpark] Speeding up internal conversion for Spark SQL (was:
[pySpark] Speeding
[
https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168009#comment-16168009
]
Sean Owen commented on SPARK-22028:
---
But that's a Java error or limit, even. What would Spark do? You
[
https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Franz Wimmer reopened SPARK-22028:
--
Sorry - the Error regarding the Hadoop binaries is normal for this system - I'm
asking because of
[
https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Franz Wimmer updated SPARK-22028:
-
Comment: was deleted
(was: The Error with the Hadoop binaries is normal for this system - I'm
[
https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Franz Wimmer updated SPARK-22028:
-
Description:
I have a strange environment variable in my Windows operating system:
{code:none}
[
https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167970#comment-16167970
]
Franz Wimmer commented on SPARK-22028:
--
The Error with the Hadoop binaries is normal for this system
[
https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-22028.
---
Resolution: Not A Problem
No, it indicates you don't have the win Hadoop binaries available. See the
Franz Wimmer created SPARK-22028:
Summary: spark-submit trips over environment variables
Key: SPARK-22028
URL: https://issues.apache.org/jira/browse/SPARK-22028
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-22027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kazunori Sakamoto updated SPARK-22027:
--
Labels: documentation (was: )
> Explanation of default value of GBTRegressor's
[
https://issues.apache.org/jira/browse/SPARK-22027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22027:
Assignee: (was: Apache Spark)
> Explanation of default value of GBTRegressor's
[
https://issues.apache.org/jira/browse/SPARK-22027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22027:
Assignee: Apache Spark
> Explanation of default value of GBTRegressor's maxIter is
[
https://issues.apache.org/jira/browse/SPARK-22027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167944#comment-16167944
]
Apache Spark commented on SPARK-22027:
--
User 'exKAZUu' has created a pull request for this issue:
Kazunori Sakamoto created SPARK-22027:
-
Summary: Explanation of default value of GBTRegressor's maxIter is
missing in API doc
Key: SPARK-22027
URL: https://issues.apache.org/jira/browse/SPARK-22027
[
https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167941#comment-16167941
]
Xiayun Sun edited comment on SPARK-21996 at 9/15/17 2:30 PM:
-
I can reproduce
[
https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167941#comment-16167941
]
Xiayun Sun commented on SPARK-21996:
I can reproduce this issue for master branch, and found out it
[
https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167927#comment-16167927
]
Apache Spark commented on SPARK-21996:
--
User 'xysun' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21996:
Assignee: Apache Spark
> Streaming ignores files with spaces in the file names
>
[
https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21996:
Assignee: (was: Apache Spark)
> Streaming ignores files with spaces in the file names
Wenchen Fan created SPARK-22026:
---
Summary: data source v2 write path
Key: SPARK-22026
URL: https://issues.apache.org/jira/browse/SPARK-22026
Project: Spark
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/SPARK-22024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Bryński updated SPARK-22024:
---
Description:
fromInternal methods of pySpark datatypes are bottleneck when using pySpark.
[
https://issues.apache.org/jira/browse/SPARK-21958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath reassigned SPARK-21958:
--
Assignee: Travis Hegner
> Attempting to save large Word2Vec model hangs driver in
[
https://issues.apache.org/jira/browse/SPARK-21958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath resolved SPARK-21958.
Resolution: Fixed
Fix Version/s: 2.3.0
Issue resolved by pull request 19191
[
https://issues.apache.org/jira/browse/SPARK-22025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22025:
Assignee: (was: Apache Spark)
> Speeding up fromInternal for StructField
>
[
https://issues.apache.org/jira/browse/SPARK-22025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22025:
Assignee: Apache Spark
> Speeding up fromInternal for StructField
>
[
https://issues.apache.org/jira/browse/SPARK-22025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167846#comment-16167846
]
Apache Spark commented on SPARK-22025:
--
User 'maver1ck' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-22012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karan Singh updated SPARK-22012:
Description:
My Spark Streaming duration is 5 seconds (5000) and kafka is all at its default
[
https://issues.apache.org/jira/browse/SPARK-19275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16166112#comment-16166112
]
Karan Singh edited comment on SPARK-19275 at 9/15/17 1:02 PM:
--
Hi Team ,
My
Maciej Bryński created SPARK-22025:
--
Summary: Speeding up fromInternal for StructField
Key: SPARK-22025
URL: https://issues.apache.org/jira/browse/SPARK-22025
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-22024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Bryński updated SPARK-22024:
---
Summary: [pySpark] Speeding up fromInternal methods (was: Speeding up
fromInternal methods)
[
https://issues.apache.org/jira/browse/SPARK-22010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Bryński updated SPARK-22010:
---
Issue Type: Sub-task (was: Improvement)
Parent: SPARK-22024
> Slow fromInternal
Maciej Bryński created SPARK-22024:
--
Summary: Speeding up fromInternal methods
Key: SPARK-22024
URL: https://issues.apache.org/jira/browse/SPARK-22024
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-7276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167814#comment-16167814
]
Barry Becker commented on SPARK-7276:
-
Isn't there still a problem with withColumn performance in
[
https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167806#comment-16167806
]
Nick Pentreath commented on SPARK-22021:
Why a JavaScript function? I think this is not a good
[
https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167788#comment-16167788
]
Jurgis Pods edited comment on SPARK-21994 at 9/15/17 12:13 PM:
---
I have
[
https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167788#comment-16167788
]
Jurgis Pods commented on SPARK-21994:
-
I have updated to CDH 5.12.1 and the problem persists. There
[
https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jurgis Pods updated SPARK-21994:
Description:
This seems to be a new bug introduced in Spark 2.2, since it did not occur
under
[
https://issues.apache.org/jira/browse/SPARK-22023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Oli Hall updated SPARK-22023:
-
Description:
I've been testing some existing PySpark code after migrating to Python3, and
there seems
Oli Hall created SPARK-22023:
Summary: Multi-column Spark SQL UDFs broken in Python 3
Key: SPARK-22023
URL: https://issues.apache.org/jira/browse/SPARK-22023
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22021:
Assignee: (was: Apache Spark)
> Add a feature transformation to accept a function and
[
https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167753#comment-16167753
]
Apache Spark commented on SPARK-22021:
--
User 'narahari92' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-22021:
Assignee: Apache Spark
> Add a feature transformation to accept a function and apply it
[
https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167734#comment-16167734
]
Jen-Ming Chung edited comment on SPARK-22019 at 9/15/17 11:29 AM:
--
The
[
https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167736#comment-16167736
]
Hosur Narahari commented on SPARK-22021:
If I just apply this function, I can't use it in spark's
[
https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167734#comment-16167734
]
Jen-Ming Chung edited comment on SPARK-22019 at 9/15/17 11:28 AM:
--
The
[
https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167734#comment-16167734
]
Jen-Ming Chung commented on SPARK-22019:
The alternative is giving the explicit schema instead
[
https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167725#comment-16167725
]
Jen-Ming Chung edited comment on SPARK-22019 at 9/15/17 11:18 AM:
--
Hi
[
https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167725#comment-16167725
]
Jen-Ming Chung commented on SPARK-22019:
Hi [~client.test],
The schema inferred after
Maciej Bryński created SPARK-22022:
--
Summary: Unable to use Python Profiler with SparkSession
Key: SPARK-22022
URL: https://issues.apache.org/jira/browse/SPARK-22022
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167722#comment-16167722
]
Sean Owen commented on SPARK-22021:
---
Why can't you just apply this function? Or implement Transformer.
Hosur Narahari created SPARK-22021:
--
Summary: Add a feature transformation to accept a function and
apply it on all rows of dataframe
Key: SPARK-22021
URL: https://issues.apache.org/jira/browse/SPARK-22021
[
https://issues.apache.org/jira/browse/SPARK-21780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21780:
Assignee: Apache Spark
> Simpler Dataset.sample API in R
>
[
https://issues.apache.org/jira/browse/SPARK-21780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167571#comment-16167571
]
Apache Spark commented on SPARK-21780:
--
User 'HyukjinKwon' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-21780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21780:
Assignee: (was: Apache Spark)
> Simpler Dataset.sample API in R
>
[
https://issues.apache.org/jira/browse/SPARK-22020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-22020.
---
Resolution: Duplicate
> Support session local timezone
> --
>
>
[
https://issues.apache.org/jira/browse/SPARK-20921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-20921.
---
Resolution: Duplicate
> While reading from oracle database, it converts to wrong type.
>
[
https://issues.apache.org/jira/browse/SPARK-21713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167506#comment-16167506
]
Apache Spark commented on SPARK-21713:
--
User 'joseph-torres' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-21987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiao Li resolved SPARK-21987.
-
Resolution: Fixed
Assignee: Wenchen Fan
Fix Version/s: 2.3.0
> Spark 2.3 cannot read 2.2
[
https://issues.apache.org/jira/browse/SPARK-22002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiao Li resolved SPARK-22002.
-
Resolution: Fixed
Assignee: Yuming Wang
Fix Version/s: 2.3.0
> Read JDBC table use
[
https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167407#comment-16167407
]
Jurgis Pods commented on SPARK-21994:
-
Thank you for testing. Which version of Hive are you using? It
Navya Krishnappa created SPARK-22020:
Summary: Support session local timezone
Key: SPARK-22020
URL: https://issues.apache.org/jira/browse/SPARK-22020
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-21902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Saisai Shao updated SPARK-21902:
Priority: Trivial (was: Major)
> BlockManager.doPut will hide actually exception when exception
[
https://issues.apache.org/jira/browse/SPARK-21902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Saisai Shao updated SPARK-21902:
Issue Type: Improvement (was: Wish)
> BlockManager.doPut will hide actually exception when
[
https://issues.apache.org/jira/browse/SPARK-21902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Saisai Shao reassigned SPARK-21902:
---
Assignee: zhoukang
> BlockManager.doPut will hide actually exception when exception thrown
[
https://issues.apache.org/jira/browse/SPARK-21902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Saisai Shao resolved SPARK-21902.
-
Resolution: Fixed
Fix Version/s: 2.3.0
Issue resolved by pull request 19171
[
https://issues.apache.org/jira/browse/SPARK-20921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167373#comment-16167373
]
Yuming Wang commented on SPARK-20921:
-
Fixed by https://github.com/apache/spark/pull/18266.
> While
88 matches
Mail list logo