Re: hadoop-2 profile to be removed in 3.5.0

2023-04-15 Thread yangjie01
Thanks Chao ~ Yang Jie 发件人: Dongjoon Hyun 日期: 2023年4月16日 星期日 00:08 收件人: Chao Sun 抄送: dev 主题: Re: hadoop-2 profile to be removed in 3.5.0 Thank you so much for head-ups, Chao! Dongjoon. On Fri, Apr 14, 2023 at 6:33 PM Chao Sun mailto:sunc...@apache.org>> wrote: Hi all, Just a heads up

Re: Parametrisable output metadata path

2023-04-15 Thread Jungtaek Lim
Hi, We have been indicated with lots of issues with the current FileStream sink. The effort to fix these issues are quite significant, and it ended up with derivation of "Data Lake" products. I'd recommend not to fix the issue but leave it as its limitation, and integrate your workload with Data

Parametrisable output metadata path

2023-04-15 Thread Wojciech Indyk
Hi! I raised a ticket on parametrisable output metadata path https://issues.apache.org/jira/browse/SPARK-43152. I am going to raise a PR against it and I realised, that this relatively simple change impacts on method hasMetadata(path), that would have a new meaning if we can define custom path for

Re: hadoop-2 profile to be removed in 3.5.0

2023-04-15 Thread Dongjoon Hyun
Thank you so much for head-ups, Chao! Dongjoon. On Fri, Apr 14, 2023 at 6:33 PM Chao Sun wrote: > Hi all, > > Just a heads up that `hadoop-2` profile is going to be removed in > Apache Spark 3.5.0. This has been discussed previously through this > email thread: >

Re: [ANNOUNCE] Apache Spark 3.4.0 released

2023-04-15 Thread Dongjoon Hyun
Nice catch, Xiao! All `latest` tags are updated to v3.4.0 now. https://hub.docker.com/r/apache/spark/tags https://hub.docker.com/r/apache/spark-py/tags https://hub.docker.com/r/apache/spark-r/tags Dongjoon. On Fri, Apr 14, 2023 at 8:38 PM Xiao Li wrote: > @Dongjoon Hyun Thank you! > >