Could you file an INFRA JIRA issue with the error message and context first, Wenchen?
As you know, when we see something like this, we should file a JIRA issue, because it may affect not only the Apache Spark project but all ASF projects.

Dongjoon.

On Thu, May 9, 2024 at 12:28 AM Wenchen Fan <[email protected]> wrote:

> UPDATE:
>
> After resolving a few issues in the release scripts, I can finally build the release packages. However, I can't upload them to the staging SVN repo due to a transmission error, which looks like a server-side limitation. I tried from both my local laptop and a remote AWS instance, but neither works. The package binaries are around 300-400 MB, and we did a release just last month, so I'm not sure whether this is a new limitation introduced to save costs.
>
> While I'm looking for help to get unblocked, I'm wondering if we can upload release packages to a public git repo instead, under the Apache account?
>
> On Thu, May 9, 2024 at 12:39 AM Holden Karau <[email protected]> wrote:
>
>> That looks cool, maybe let’s split off a thread on how to improve our release processes?
>>
>> Twitter: https://twitter.com/holdenkarau
>> Books (Learning Spark, High Performance Spark, etc.): https://amzn.to/2MaRAG9
>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>
>> On Wed, May 8, 2024 at 9:31 AM Erik Krogen <[email protected]> wrote:
>>
>>> On that note, GitHub recently released (public preview) a new feature called Artifact Attestations which may be relevant/useful here: Introducing Artifact Attestations–now in public beta - The GitHub Blog <https://github.blog/2024-05-02-introducing-artifact-attestations-now-in-public-beta/>
>>>
>>> On Wed, May 8, 2024 at 9:06 AM Nimrod Ofek <[email protected]> wrote:
>>>
>>>> I have no permissions, so I can't do it myself, but I'm happy to help (although I am more familiar with GitLab CI/CD than GitHub Actions).
>>>> Is there some point of contact who can provide me with the needed context and permissions?
>>>> I'd also love to see why the costs are high and see how we can reduce them...
>>>>
>>>> Thanks,
>>>> Nimrod
>>>>
>>>> On Wed, May 8, 2024 at 8:26 AM Holden Karau <[email protected]> wrote:
>>>>
>>>>> I think signing the artifacts produced from a secure CI sounds like a good idea. I know we’ve been asked to reduce our GitHub Actions usage but perhaps someone interested could volunteer to set that up.
>>>>>
>>>>> Twitter: https://twitter.com/holdenkarau
>>>>> Books (Learning Spark, High Performance Spark, etc.): https://amzn.to/2MaRAG9
>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>>
>>>>> On Tue, May 7, 2024 at 9:43 PM Nimrod Ofek <[email protected]> wrote:
>>>>>
>>>>>> Hi,
>>>>>> Thanks for the reply.
>>>>>>
>>>>>> From my experience, a build on a build server would be much more predictable and less error-prone than building on some laptop, and of course it would make it much faster to produce builds: snapshots, early preview releases, release candidates, or final releases.
>>>>>> It would enable us to have a preview version with the current changes (a snapshot version), either automatically every day or, if we need to save costs (although a build is really not expensive), with the click of a button.
>>>>>>
>>>>>> Regarding keys for signing: that's what vaults are for. All across the industry we use vaults (such as HashiCorp Vault). But if the build is automated and the only manual step is signing the release, for security reasons, that would be reasonable.
>>>>>>
>>>>>> Thanks,
>>>>>> Nimrod
>>>>>>
>>>>>> On Wed, May 8, 2024 at 00:54, Holden Karau <[email protected]> wrote:
>>>>>>
>>>>>>> Indeed.
>>>>>>> We could conceivably build the release in CI/CD but the final verification / signing should be done locally to keep the keys safe (there was some concern from earlier release processes).
>>>>>>>
>>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>>> Books (Learning Spark, High Performance Spark, etc.): https://amzn.to/2MaRAG9
>>>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>>>>
>>>>>>> On Tue, May 7, 2024 at 10:55 AM Nimrod Ofek <[email protected]> wrote:
>>>>>>>
>>>>>>>> Hi,
>>>>>>>>
>>>>>>>> Sorry for the novice question, Wenchen: is the release done manually from a laptop, rather than through a CI/CD process on a build server?
>>>>>>>>
>>>>>>>> Thanks,
>>>>>>>> Nimrod
>>>>>>>>
>>>>>>>> On Tue, May 7, 2024 at 8:50 PM Wenchen Fan <[email protected]> wrote:
>>>>>>>>
>>>>>>>>> UPDATE:
>>>>>>>>>
>>>>>>>>> Unfortunately, it took me quite some time to set up my laptop and get it ready for the release process (Docker Desktop doesn't work anymore, my PGP key is lost, etc.). I'll start the RC process tomorrow, my time. Thanks for your patience!
>>>>>>>>>
>>>>>>>>> Wenchen
>>>>>>>>>
>>>>>>>>> On Fri, May 3, 2024 at 7:47 AM yangjie01 <[email protected]> wrote:
>>>>>>>>>
>>>>>>>>>> +1
>>>>>>>>>>
>>>>>>>>>> *From:* Jungtaek Lim <[email protected]>
>>>>>>>>>> *Date:* Thursday, May 2, 2024 10:21
>>>>>>>>>> *To:* Holden Karau <[email protected]>
>>>>>>>>>> *Cc:* Chao Sun <[email protected]>, Xiao Li <[email protected]>, Tathagata Das <[email protected]>, Wenchen Fan <[email protected]>, Cheng Pan <[email protected]>, Nicholas Chammas <[email protected]>, Dongjoon Hyun <[email protected]>, Cheng Pan <[email protected]>, Spark dev list <[email protected]>, Anish Shrigondekar <[email protected]>
>>>>>>>>>> *Subject:* Re: [DISCUSS] Spark 4.0.0 release
>>>>>>>>>>
>>>>>>>>>> +1 love to see it!
>>>>>>>>>>
>>>>>>>>>> On Thu, May 2, 2024 at 10:08 AM Holden Karau <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>> +1 :) yay previews
>>>>>>>>>>
>>>>>>>>>> On Wed, May 1, 2024 at 5:36 PM Chao Sun <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>> +1
>>>>>>>>>>
>>>>>>>>>> On Wed, May 1, 2024 at 5:23 PM Xiao Li <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>> +1 for next Monday.
>>>>>>>>>>
>>>>>>>>>> We can do more previews when the other features are ready for preview.
>>>>>>>>>>
>>>>>>>>>> On Wed, May 1, 2024 at 08:46, Tathagata Das <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>> Next week sounds great! Thank you Wenchen!
>>>>>>>>>>
>>>>>>>>>> On Wed, May 1, 2024 at 11:16 AM Wenchen Fan <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>> Yea I think a preview release won't hurt (without a branch cut). We don't need to wait for all the ongoing projects to be ready.
>>>>>>>>>> How about we do a 4.0 preview release based on the current master branch next Monday?
>>>>>>>>>>
>>>>>>>>>> On Wed, May 1, 2024 at 11:06 PM Tathagata Das <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>> Hey all,
>>>>>>>>>>
>>>>>>>>>> Reviving this thread, but Spark master has already accumulated a huge amount of changes. As a downstream project maintainer, I really want to start testing the new features and other breaking changes, and it's hard to do that without a Preview release. So the sooner we make a Preview release, the faster we can start getting feedback for fixing things for a great Spark 4.0 final release.
>>>>>>>>>>
>>>>>>>>>> So I urge the community to produce a Spark 4.0 Preview soon even if certain features targeting the Delta 4.0 release are still incomplete.
>>>>>>>>>>
>>>>>>>>>> Thanks!
>>>>>>>>>>
>>>>>>>>>> On Wed, Apr 17, 2024 at 8:35 AM Wenchen Fan <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>> Thank you all for the replies!
>>>>>>>>>>
>>>>>>>>>> To @Nicholas Chammas <[email protected]>: Thanks for cleaning up the error terminology and documentation! I've merged the first PR and let's finish others before the 4.0 release.
>>>>>>>>>>
>>>>>>>>>> To @Dongjoon Hyun <[email protected]>: Thanks for driving the ANSI on by default effort! Now the vote has passed, let's flip the config and finish the DataFrame error context feature before 4.0.
>>>>>>>>>>
>>>>>>>>>> To @Jungtaek Lim <[email protected]>: Ack. We can treat the Streaming state store data source as completed for 4.0 then.
>>>>>>>>>>
>>>>>>>>>> To @Cheng Pan <[email protected]>: Yea we definitely should have a preview release. Let's collect more feedback on the ongoing projects and then we can propose a date for the preview release.
>>>>>>>>>>
>>>>>>>>>> On Wed, Apr 17, 2024 at 1:22 PM Cheng Pan <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>> Will we have a preview release for 4.0.0 like we did for 2.0.0 and 3.0.0?
>>>>>>>>>>
>>>>>>>>>> Thanks,
>>>>>>>>>> Cheng Pan
>>>>>>>>>>
>>>>>>>>>> > On Apr 15, 2024, at 09:58, Jungtaek Lim <[email protected]> wrote:
>>>>>>>>>> >
>>>>>>>>>> > W.r.t. state data source - reader (SPARK-45511), there are several follow-up tickets, but we don't plan to address them soon. The current implementation is the final shape for Spark 4.0.0, unless there is demand for the follow-up tickets.
>>>>>>>>>> >
>>>>>>>>>> > We may want to check the plan for transformWithState - my understanding is that we want to release the feature in 4.0.0, but several work items remain. While the tentative timeline for releasing is June 2024, what would be the tentative timeline for the RC cut?
>>>>>>>>>> > (cc.
>>>>>>>>>> > Anish to add more context on the plan for transformWithState)
>>>>>>>>>> >
>>>>>>>>>> > On Sat, Apr 13, 2024 at 3:15 AM Wenchen Fan <[email protected]> wrote:
>>>>>>>>>> > Hi all,
>>>>>>>>>> >
>>>>>>>>>> > It's close to the previously proposed 4.0.0 release date (June 2024), and I think it's time to prepare for it and discuss the ongoing projects:
>>>>>>>>>> > • ANSI by default
>>>>>>>>>> > • Spark Connect GA
>>>>>>>>>> > • Structured Logging
>>>>>>>>>> > • Streaming state store data source
>>>>>>>>>> > • new data type VARIANT
>>>>>>>>>> > • STRING collation support
>>>>>>>>>> > • Spark k8s operator versioning
>>>>>>>>>> > Please help to add more items to this list that are missed here. I would like to volunteer as the release manager for Apache Spark 4.0.0 if there is no objection. Thank you all for the great work that fills Spark 4.0!
>>>>>>>>>> >
>>>>>>>>>> > Wenchen Fan
>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>>>>>> Books (Learning Spark, High Performance Spark, etc.): https://amzn.to/2MaRAG9
>>>>>>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
