[jira] [Commented] (SPARK-48123) Provide a constant table schema for querying structured logs

2024-05-07 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844245#comment-17844245 ] Steve Loughran commented on SPARK-48123: this doesn't handle nested stack traces. I seem to have

[jira] [Commented] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2024-03-21 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829521#comment-17829521 ] Steve Loughran commented on SPARK-38330: [~jpanda] a bit late but your problem is the WONTFIX

[jira] [Commented] (SPARK-41392) spark builds against hadoop trunk/3.4.0-SNAPSHOT fail in scala-maven plugin

2024-03-01 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17822572#comment-17822572 ] Steve Loughran commented on SPARK-41392: expect an official release this week; this pr will

[jira] [Updated] (SPARK-41392) spark builds against hadoop trunk/3.4.0-SNAPSHOT fail in scala-maven plugin

2024-02-28 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-41392: --- Priority: Major (was: Minor) > spark builds against hadoop trunk/3.4.0-SNAPSHOT fail in

[jira] [Commented] (SPARK-41392) spark builds against hadoop trunk/3.4.0-SNAPSHOT fail in scala-maven plugin

2024-02-28 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17821694#comment-17821694 ] Steve Loughran commented on SPARK-41392: Hadoop 3.4.0 RC2 exhibits this; spark needs its patches

[jira] [Created] (SPARK-47008) Spark to support S3 Express One Zone Storage

2024-02-08 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-47008: -- Summary: Spark to support S3 Express One Zone Storage Key: SPARK-47008 URL: https://issues.apache.org/jira/browse/SPARK-47008 Project: Spark Issue Type:

[jira] [Commented] (SPARK-45404) Support AWS_ENDPOINT_URL env variable

2024-01-22 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17809536#comment-17809536 ] Steve Loughran commented on SPARK-45404: Just saw this while working on SPARK-35878. If you

[jira] [Updated] (SPARK-46793) Revert S3A endpoint fixup logic of SPARK-35878

2024-01-22 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-46793: --- Summary: Revert S3A endpoint fixup logic of SPARK-35878 (was: Revert region fixup logic of

[jira] [Created] (SPARK-46793) Revert region fixup logic of SPARK-35878

2024-01-22 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-46793: -- Summary: Revert region fixup logic of SPARK-35878 Key: SPARK-46793 URL: https://issues.apache.org/jira/browse/SPARK-46793 Project: Spark Issue Type:

[jira] [Commented] (SPARK-46247) Invalid bucket file error when reading from bucketed table created with PathOutputCommitProtocol

2024-01-18 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17808227#comment-17808227 ] Steve Loughran commented on SPARK-46247: why is the file invalid? any more stack trace? # try

[jira] [Commented] (SPARK-44124) Upgrade AWS SDK to v2

2023-10-25 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17779460#comment-17779460 ] Steve Loughran commented on SPARK-44124: good document # I think you could consider cutting the

[jira] [Commented] (SPARK-38958) Override S3 Client in Spark Write/Read calls

2023-09-11 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17763725#comment-17763725 ] Steve Loughran commented on SPARK-38958: [~hershalb] hadoop trunk is now on v2 sdk, but we are

[jira] [Commented] (SPARK-44884) Spark doesn't create SUCCESS file in Spark 3.3.0+ when partitionOverwriteMode is dynamic

2023-08-25 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17759063#comment-17759063 ] Steve Loughran commented on SPARK-44884: so using insert overwrite. yes, what happens there is

[jira] [Commented] (SPARK-44884) Spark doesn't create SUCCESS file when external path is passed

2023-08-23 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17758167#comment-17758167 ] Steve Loughran commented on SPARK-44884: i'm not trying to replicate it; i have too many other

[jira] [Commented] (SPARK-44884) Spark doesn't create SUCCESS file when external path is passed

2023-08-22 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17757547#comment-17757547 ] Steve Loughran commented on SPARK-44884: [~dipayandev] i don't think think anyone has disabled

[jira] [Commented] (SPARK-38958) Override S3 Client in Spark Write/Read calls

2023-08-22 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17757364#comment-17757364 ] Steve Loughran commented on SPARK-38958: [~hershalb] we are about to merge the v2 sdk feature

[jira] [Commented] (SPARK-44884) Spark doesn't create SUCCESS file when external path is passed

2023-08-21 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17757036#comment-17757036 ] Steve Loughran commented on SPARK-44884: this is created in the committer; for hadoop-mapreduce

[jira] [Resolved] (SPARK-44883) Spark insertInto with location GCS bucket root causes NPE

2023-08-21 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran resolved SPARK-44883. Resolution: Duplicate > Spark insertInto with location GCS bucket root causes NPE >

[jira] [Commented] (SPARK-44124) Upgrade AWS SDK to v2

2023-08-15 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754698#comment-17754698 ] Steve Loughran commented on SPARK-44124: +will need to make sure any classloaders set up to pass

[jira] [Commented] (SPARK-44116) Utilize Hadoop vectorized APIs

2023-07-31 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17749266#comment-17749266 ] Steve Loughran commented on SPARK-44116: If this gets into the libraries, you don't need

[jira] [Commented] (SPARK-44124) Upgrade AWS SDK to v2

2023-07-31 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17749264#comment-17749264 ] Steve Loughran commented on SPARK-44124: we are soon to move hadoop trunk up to SDK v2,

[jira] [Commented] (SPARK-44042) SPIP: PySpark Test Framework

2023-06-21 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17735646#comment-17735646 ] Steve Loughran commented on SPARK-44042: * you can create an independent git repo for this (ASF

[jira] [Commented] (SPARK-41599) Memory leak in FileSystem.CACHE when submitting apps to secure cluster using InProcessLauncher

2023-06-16 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17733454#comment-17733454 ] Steve Loughran commented on SPARK-41599: correct. remember, all the source of hadoop is there

[jira] [Commented] (SPARK-43170) The spark sql like statement is pushed down to parquet for execution, but the data cannot be queried

2023-04-28 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17717611#comment-17717611 ] Steve Loughran commented on SPARK-43170: FWIW, using S3 URLs

[jira] [Commented] (SPARK-40034) PathOutputCommitters to work with dynamic partition overwrite

2023-03-01 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17694969#comment-17694969 ] Steve Loughran commented on SPARK-40034: thanks for the update. I will get that new pr done

[jira] [Commented] (SPARK-42537) Remove obsolete/superfluous imports in spark-hadoop-cloud module

2023-02-23 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17692617#comment-17692617 ] Steve Loughran commented on SPARK-42537: FYI +[~dannycjones]. I'm getting build issues related

[jira] [Created] (SPARK-42537) Remove obsolete/superfluous imports in spark-hadoop-cloud module

2023-02-23 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-42537: -- Summary: Remove obsolete/superfluous imports in spark-hadoop-cloud module Key: SPARK-42537 URL: https://issues.apache.org/jira/browse/SPARK-42537 Project: Spark

[jira] [Commented] (SPARK-40034) PathOutputCommitters to work with dynamic partition overwrite

2023-01-19 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17678814#comment-17678814 ] Steve Loughran commented on SPARK-40034: Note that these changes aren't sufficient. The hadoop

[jira] [Commented] (SPARK-41599) Memory leak in FileSystem.CACHE when submitting apps to secure cluster using InProcessLauncher

2023-01-18 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17678189#comment-17678189 ] Steve Loughran commented on SPARK-41599: well, the challenge there becomes "not changing that

[jira] [Commented] (SPARK-41599) Memory leak in FileSystem.CACHE when submitting apps to secure cluster using InProcessLauncher

2022-12-23 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17651672#comment-17651672 ] Steve Loughran commented on SPARK-41599: apps can callĀ  FileSystem.closeAllForUGI() to remove

[jira] [Commented] (SPARK-41599) Memory leak in FileSystem.CACHE when submitting apps to secure cluster using InProcessLauncher

2022-12-23 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17651614#comment-17651614 ] Steve Loughran commented on SPARK-41599: 1. try explicitly disabling the cache for that fs

[jira] [Commented] (SPARK-41551) Improve/complete PathOutputCommitProtocol support for dynamic partitioning

2022-12-22 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17651384#comment-17651384 ] Steve Loughran commented on SPARK-41551: PR up. PathOutputCommitProtocol stops anyone trying to

[jira] [Commented] (SPARK-41599) Memory leak in FileSystem.CACHE when submitting apps to secure cluster using InProcessLauncher

2022-12-22 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17651201#comment-17651201 ] Steve Loughran commented on SPARK-41599: either the fs is being created by

[jira] [Commented] (SPARK-41551) Improve/complete PathOutputCommitProtocol support for dynamic partitioning

2022-12-20 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17649775#comment-17649775 ] Steve Loughran commented on SPARK-41551: So there's an interesting little "feature" of

[jira] [Created] (SPARK-41551) Improve/complete PathOutputCommitProtocol support for dynamic partitioning

2022-12-16 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-41551: -- Summary: Improve/complete PathOutputCommitProtocol support for dynamic partitioning Key: SPARK-41551 URL: https://issues.apache.org/jira/browse/SPARK-41551

[jira] [Commented] (SPARK-41392) spark builds against hadoop trunk/3.4.0-SNAPSHOT fail in scala-maven plugin

2022-12-06 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17643774#comment-17643774 ] Steve Loughran commented on SPARK-41392: may relate to the bouncy castle 1.68 update of

[jira] [Commented] (SPARK-41392) spark builds against hadoop trunk/3.4.0-SNAPSHOT fail in scala-maven plugin

2022-12-05 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17643492#comment-17643492 ] Steve Loughran commented on SPARK-41392: MBP m1 with {code} uname -a Darwin stevel-MBP16

[jira] [Created] (SPARK-41392) spark builds against hadoop trunk/3.4.0-SNAPSHOT fail in scala-maven plugin

2022-12-05 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-41392: -- Summary: spark builds against hadoop trunk/3.4.0-SNAPSHOT fail in scala-maven plugin Key: SPARK-41392 URL: https://issues.apache.org/jira/browse/SPARK-41392

[jira] [Commented] (SPARK-38934) Provider TemporaryAWSCredentialsProvider has no credentials

2022-10-21 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17622378#comment-17622378 ] Steve Loughran commented on SPARK-38934: sounds like there is a race condition, which surfaces

[jira] [Updated] (SPARK-29729) Upgrade ASM to 7.2

2022-10-17 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-29729: --- Description: this patch is required for spark to build with any version of bouncy castle

[jira] [Created] (SPARK-40640) SparkHadoopUtil to set origin of hadoop/hive config options

2022-10-03 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-40640: -- Summary: SparkHadoopUtil to set origin of hadoop/hive config options Key: SPARK-40640 URL: https://issues.apache.org/jira/browse/SPARK-40640 Project: Spark

[jira] [Created] (SPARK-40567) SharedState to redact secrets when propagating them to HadoopConf

2022-09-26 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-40567: -- Summary: SharedState to redact secrets when propagating them to HadoopConf Key: SPARK-40567 URL: https://issues.apache.org/jira/browse/SPARK-40567 Project: Spark

[jira] [Commented] (SPARK-40286) Load Data from S3 deletes data source file

2022-09-01 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17598837#comment-17598837 ] Steve Loughran commented on SPARK-40286: this is EMR. can you repliacate in an ASF spark release

[jira] [Commented] (SPARK-40287) Load Data using Spark by a single partition moves entire dataset under same location in S3

2022-09-01 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17598835#comment-17598835 ] Steve Loughran commented on SPARK-40287: does this happen when # you switch to an ASF spark

[jira] [Commented] (SPARK-38934) Provider TemporaryAWSCredentialsProvider has no credentials

2022-08-26 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17585275#comment-17585275 ] Steve Loughran commented on SPARK-38934: [~graceee318] try explicitly setting the aws secrets as

[jira] [Reopened] (SPARK-38934) Provider TemporaryAWSCredentialsProvider has no credentials

2022-08-26 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran reopened SPARK-38934: > Provider TemporaryAWSCredentialsProvider has no credentials >

[jira] [Commented] (SPARK-38934) Provider TemporaryAWSCredentialsProvider has no credentials

2022-08-26 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17585268#comment-17585268 ] Steve Loughran commented on SPARK-38934: staring at this some more, as there's enough

[jira] [Commented] (SPARK-38954) Implement sharing of cloud credentials among driver and executors

2022-08-17 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580711#comment-17580711 ] Steve Loughran commented on SPARK-38954: any plans to put the PR up? i'm curious about what

[jira] [Commented] (SPARK-38445) Are hadoop committers used in Structured Streaming?

2022-08-17 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580710#comment-17580710 ] Steve Loughran commented on SPARK-38445: SPARK-40039 might address this > Are hadoop committers

[jira] [Comment Edited] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-08-17 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580707#comment-17580707 ] Steve Loughran edited comment on SPARK-38330 at 8/17/22 9:46 AM: - bq. Is

[jira] [Comment Edited] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-08-17 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580707#comment-17580707 ] Steve Loughran edited comment on SPARK-38330 at 8/17/22 9:45 AM: - bq. Is

[jira] [Commented] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-08-17 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580707#comment-17580707 ] Steve Loughran commented on SPARK-38330: remove all jars with cos in the title from your

[jira] [Commented] (SPARK-40039) Introducing a streaming checkpoint file manager based on Hadoop's Abortable interface

2022-08-11 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578334#comment-17578334 ] Steve Loughran commented on SPARK-40039: doesn't actualy use MPU; if you haven't uploaded any

[jira] [Updated] (SPARK-40034) PathOutputCommitters to work with dynamic partition overwrite

2022-08-10 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-40034: --- Summary: PathOutputCommitters to work with dynamic partition overwrite (was:

[jira] [Created] (SPARK-40034) PathOutputCommitters to work with dynamic partition overwrite -if they support it

2022-08-10 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-40034: -- Summary: PathOutputCommitters to work with dynamic partition overwrite -if they support it Key: SPARK-40034 URL: https://issues.apache.org/jira/browse/SPARK-40034

[jira] [Commented] (SPARK-39969) Spark AWS SDK and kinesis dependencies lagging hadoop-aws and s3a

2022-08-09 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17577389#comment-17577389 ] Steve Loughran commented on SPARK-39969: there's an AWS SDK CVE which is fixed with

[jira] [Commented] (SPARK-39863) Upgrade Hadoop to 3.3.4

2022-08-03 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17574722#comment-17574722 ] Steve Loughran commented on SPARK-39863: probably should follow this with an upgrade of the aws

[jira] [Commented] (SPARK-39969) Spark AWS SDK and kinesis dependencies lagging hadoop-aws and s3a

2022-08-03 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17574720#comment-17574720 ] Steve Loughran commented on SPARK-39969: note: although the latest release fixes the latest set

[jira] [Created] (SPARK-39969) Spark AWS SDK and kinesis dependencies lagging hadoop-aws and s3a

2022-08-03 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-39969: -- Summary: Spark AWS SDK and kinesis dependencies lagging hadoop-aws and s3a Key: SPARK-39969 URL: https://issues.apache.org/jira/browse/SPARK-39969 Project: Spark

[jira] [Updated] (SPARK-39969) Spark AWS SDK and kinesis dependencies lagging hadoop-aws and s3a

2022-08-03 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-39969: --- Priority: Minor (was: Major) > Spark AWS SDK and kinesis dependencies lagging hadoop-aws

[jira] [Commented] (SPARK-38934) Provider TemporaryAWSCredentialsProvider has no credentials

2022-08-01 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17573653#comment-17573653 ] Steve Loughran commented on SPARK-38934: bq. our system set the provider as

[jira] [Commented] (SPARK-38934) Provider TemporaryAWSCredentialsProvider has no credentials

2022-08-01 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17573652#comment-17573652 ] Steve Loughran commented on SPARK-38934: because its your deployment setup, not anybody's code

[jira] [Commented] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-07-29 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17573141#comment-17573141 ] Steve Loughran commented on SPARK-38330: the hadoop 3.3.4 rC0 will fix this with that cut of the

[jira] [Commented] (SPARK-38934) Provider TemporaryAWSCredentialsProvider has no credentials

2022-07-29 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17573138#comment-17573138 ] Steve Loughran commented on SPARK-38934: so that's a config problem? not a bug? closing >

[jira] [Resolved] (SPARK-38934) Provider TemporaryAWSCredentialsProvider has no credentials

2022-07-29 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran resolved SPARK-38934. Resolution: Invalid > Provider TemporaryAWSCredentialsProvider has no credentials >

[jira] [Commented] (SPARK-38958) Override S3 Client in Spark Write/Read calls

2022-07-29 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17573137#comment-17573137 ] Steve Loughran commented on SPARK-38958: #. api is public, but we have changed the api

[jira] [Commented] (SPARK-33088) Enhance ExecutorPlugin API to include methods for task start and end events

2022-07-25 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17570936#comment-17570936 ] Steve Loughran commented on SPARK-33088: i;m playing with this and IOStatistics collection in

[jira] [Commented] (SPARK-29250) Upgrade to Hadoop 3.3.1

2022-06-13 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17553744#comment-17553744 ] Steve Loughran commented on SPARK-29250: use whatever version the spark release was built with

[jira] [Commented] (SPARK-38954) Implement sharing of cloud credentials among driver and executors

2022-05-23 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17541002#comment-17541002 ] Steve Loughran commented on SPARK-38954: what is the strategy for having the workers get the

[jira] [Commented] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-04-21 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525719#comment-17525719 ] Steve Loughran commented on SPARK-38330: aws sdk does its own thing sometimes, from what we see.

[jira] [Commented] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-04-18 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17523891#comment-17523891 ] Steve Loughran commented on SPARK-38330: FWIW I'm not 100% sure this is fixed, as we've had

[jira] [Commented] (SPARK-38445) Are hadoop committers used in Structured Streaming?

2022-04-05 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17517556#comment-17517556 ] Steve Loughran commented on SPARK-38445: not suppoorted unless you provide the PR for a new

[jira] [Commented] (SPARK-38652) K8S IT Test DepsTestsSuite blocks with PathIOException in hadoop-aws-3.3.2

2022-03-25 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17512458#comment-17512458 ] Steve Loughran commented on SPARK-38652: have you tried running the same suite against an aws s3

[jira] [Commented] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-03-22 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17510830#comment-17510830 ] Steve Loughran commented on SPARK-38330: the hadoop fix is in, but it will take a while. note

[jira] [Commented] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-03-16 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17507557#comment-17507557 ] Steve Loughran commented on SPARK-38330: sorry about that. try enabling path style access and

[jira] [Commented] (SPARK-38330) Certificate doesn't match any of the subject alternative names: [*.s3.amazonaws.com, s3.amazonaws.com]

2022-03-10 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17504286#comment-17504286 ] Steve Loughran commented on SPARK-38330: this is a hadoop issue -create a Jira there and file as

[jira] [Resolved] (SPARK-31911) Using S3A staging committer, pending uploads are committed more than once and listed incorrectly in _SUCCESS data

2022-03-10 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran resolved SPARK-31911. Fix Version/s: 3.0.1 2.4.7 Resolution: Fixed > Using S3A

[jira] [Commented] (SPARK-31911) Using S3A staging committer, pending uploads are committed more than once and listed incorrectly in _SUCCESS data

2022-03-10 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17504273#comment-17504273 ] Steve Loughran commented on SPARK-31911: I'm going to close as fixed now; the spark changes will

[jira] [Created] (SPARK-38394) build of spark sql against hadoop-3.4.0-snapshot failing with bouncycastle classpath error

2022-03-02 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-38394: -- Summary: build of spark sql against hadoop-3.4.0-snapshot failing with bouncycastle classpath error Key: SPARK-38394 URL: https://issues.apache.org/jira/browse/SPARK-38394

[jira] [Commented] (SPARK-38115) No spark conf to control the path of _temporary when writing to target filesystem

2022-02-22 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17496022#comment-17496022 ] Steve Loughran commented on SPARK-38115: bq. Is there any config as such to stop using

[jira] [Commented] (SPARK-38115) No spark conf to control the path of _temporary when writing to target filesystem

2022-02-15 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17492810#comment-17492810 ] Steve Loughran commented on SPARK-38115: * stop using the classic FileOutputCommitter for your

[jira] [Comment Edited] (SPARK-37814) Migrating from log4j 1 to log4j 2

2022-02-08 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17488804#comment-17488804 ] Steve Loughran edited comment on SPARK-37814 at 2/8/22, 12:04 PM: --

[jira] [Commented] (SPARK-37814) Migrating from log4j 1 to log4j 2

2022-02-08 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17488804#comment-17488804 ] Steve Loughran commented on SPARK-37814: everyone is aware of the log4j issues, but they are

[jira] [Commented] (SPARK-37771) Race condition in withHiveState and limited logic in IsolatedClientLoader result in ClassNotFoundException

2022-02-02 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17485973#comment-17485973 ] Steve Loughran commented on SPARK-37771: [~ivan.sadikov] -any update here? > Race condition in

[jira] [Commented] (SPARK-37771) Race condition in withHiveState and limited logic in IsolatedClientLoader result in ClassNotFoundException

2022-01-07 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17470842#comment-17470842 ] Steve Loughran commented on SPARK-37771: probably related to HADOOP-17372, which makes sure the

[jira] [Commented] (SPARK-37814) Migrating from log4j 1 to log4j 2

2022-01-05 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17469178#comment-17469178 ] Steve Loughran commented on SPARK-37814: be good to link to all issues related to this, e.g test

[jira] [Comment Edited] (SPARK-6305) Add support for log4j 2.x to Spark

2021-12-30 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-6305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466876#comment-17466876 ] Steve Loughran edited comment on SPARK-6305 at 12/30/21, 6:44 PM: -- If

[jira] [Commented] (SPARK-37630) Security issue from Log4j 1.X exploit

2021-12-30 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466918#comment-17466918 ] Steve Loughran commented on SPARK-37630: nobody does. you can find a patched jar at

[jira] [Commented] (SPARK-6305) Add support for log4j 2.x to Spark

2021-12-30 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-6305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466876#comment-17466876 ] Steve Loughran commented on SPARK-6305: --- If anyone wants a version of a log4j 1.17 without the

[jira] [Commented] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2021-11-02 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437259#comment-17437259 ] Steve Loughran commented on SPARK-23977: [~gumartinm] can I draw your attention to Apache

[jira] [Commented] (SPARK-36024) Switch the datasource example due to the depreciation of the dataset

2021-10-08 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17426166#comment-17426166 ] Steve Loughran commented on SPARK-36024: Amazon are being very nice here and keeping the landsat

[jira] [Commented] (SPARK-36761) spark-examples_2.12-3.0.2.jar DFSReadWriteTest S3A Implementation

2021-10-08 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17426163#comment-17426163 ] Steve Loughran commented on SPARK-36761: something in the code has got the default cluster FS

[jira] [Commented] (SPARK-35428) Spark history Server to S3 doesn't show incomplete applications

2021-10-08 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17426159#comment-17426159 ] Steve Loughran commented on SPARK-35428: # please stop using s3n; that connector is unsupported

[jira] [Commented] (SPARK-36529) Decouple CPU with IO work in vectorized Parquet reader

2021-10-08 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17426158#comment-17426158 ] Steve Loughran commented on SPARK-36529: If you look at HADOOP-11867 /

[jira] [Commented] (SPARK-36766) Spark SQL DDL does not recognize fs.s3.impl implied filesystem in LOCATION tag

2021-10-08 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17426152#comment-17426152 ] Steve Loughran commented on SPARK-36766: I can see why you'd want to do this (consistent URLs on

[jira] [Commented] (SPARK-36599) ExecutorClassLoader no longer works with Http based Class Servers

2021-08-27 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17405917#comment-17405917 ] Steve Loughran commented on SPARK-36599: I thought things had been fixed up so Hadoop's HTTP

[jira] [Commented] (SPARK-36024) Switch the datasource example due to the depreciation of the dataset

2021-07-28 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17388969#comment-17388969 ] Steve Loughran commented on SPARK-36024: yes, you can change the example. For hadoop we're

[jira] [Commented] (SPARK-36024) Switch the datasource example due to the depreciation of the dataset

2021-07-06 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17375400#comment-17375400 ] Steve Loughran commented on SPARK-36024: similar to HADOOP-17784 I'm "in discussions" with

[jira] [Comment Edited] (SPARK-36024) Switch the datasource example due to the depreciation of the dataset

2021-07-06 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17375400#comment-17375400 ] Steve Loughran edited comment on SPARK-36024 at 7/6/21, 9:32 AM: -

[jira] [Created] (SPARK-35878) add fs.s3a.endpoint if unset and fs.s3a.endpoint.region is null

2021-06-24 Thread Steve Loughran (Jira)
Steve Loughran created SPARK-35878: -- Summary: add fs.s3a.endpoint if unset and fs.s3a.endpoint.region is null Key: SPARK-35878 URL: https://issues.apache.org/jira/browse/SPARK-35878 Project: Spark

  1   2   3   4   5   6   7   8   9   >