Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-08-13 Thread Marcelo Vanzin
On this topic... when I worked on 2.3.1 and caused this breakage by deleting and old release, I tried to write some code to make this more automatic: https://github.com/vanzin/spark/tree/SPARK-24532 I just found that the code was a little too large and hacky for what it does (find out the latest

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-19 Thread Mark Hamstra
> >>>> Also, this list needs to be updated every time we make a new release? >>>> >>>> Plus can we cache them on Jenkins, maybe we can avoid downloading the >>>> same thing from Apache archive every test run. >>>> >>>> >&

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-19 Thread Sean Owen
run. >>> >>> >>> ------ >>> *From:* Marco Gaido >>> *Sent:* Monday, July 16, 2018 11:12 PM >>> *To:* Hyukjin Kwon >>> *Cc:* Sean Owen; dev >>> *Subject:* Re: Cleaning Spark releases from mirrors, and the flak

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-19 Thread Mark Hamstra
ownloading the >> same thing from Apache archive every test run. >> >> >> -- >> *From:* Marco Gaido >> *Sent:* Monday, July 16, 2018 11:12 PM >> *To:* Hyukjin Kwon >> *Cc:* Sean Owen; dev >> *Subject:* Re: Cleaning Spark releases

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-19 Thread Sean Owen
thing from Apache archive every test run. > > > -- > *From:* Marco Gaido > *Sent:* Monday, July 16, 2018 11:12 PM > *To:* Hyukjin Kwon > *Cc:* Sean Owen; dev > *Subject:* Re: Cleaning Spark releases from mirrors, and the flakiness of > HiveExter

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-19 Thread Felix Cheung
, 2018 11:12 PM To: Hyukjin Kwon Cc: Sean Owen; dev Subject: Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite +1 too On Tue, 17 Jul 2018, 05:38 Hyukjin Kwon, mailto:gurwls...@gmail.com>> wrote: +1 2018년 7월 17일 (화) 오전 7:34, Sean Owen mail

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-17 Thread Marco Gaido
+1 too On Tue, 17 Jul 2018, 05:38 Hyukjin Kwon, wrote: > +1 > > 2018년 7월 17일 (화) 오전 7:34, Sean Owen 님이 작성: > >> Fix is committed to branches back through 2.2.x, where this test was >> added. >> >> There is still some issue; I'm seeing that archive.apache.org is >> rate-limiting downloads and

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-16 Thread Hyukjin Kwon
+1 2018년 7월 17일 (화) 오전 7:34, Sean Owen 님이 작성: > Fix is committed to branches back through 2.2.x, where this test was added. > > There is still some issue; I'm seeing that archive.apache.org is > rate-limiting downloads and frequently returning 503 errors. > > We can help, I guess, by avoiding

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-16 Thread Sean Owen
Fix is committed to branches back through 2.2.x, where this test was added. There is still some issue; I'm seeing that archive.apache.org is rate-limiting downloads and frequently returning 503 errors. We can help, I guess, by avoiding testing against non-current releases. Right now we should be

Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-15 Thread Reynold Xin
Makes sense. Thanks for looking into this. On Sun, Jul 15, 2018 at 1:51 PM Sean Owen wrote: > Yesterday I cleaned out old Spark releases from the mirror system -- we're > supposed to only keep the latest release from active branches out on > mirrors. (All releases are available from the Apache

Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite

2018-07-15 Thread Sean Owen
Yesterday I cleaned out old Spark releases from the mirror system -- we're supposed to only keep the latest release from active branches out on mirrors. (All releases are available from the Apache archive site.) Having done so I realized quickly that the HiveExternalCatalogVersionsSuite relies on