Re: SIGMOD System Award for Apache Spark

2022-05-12 Thread shane knapp
l real-world and research systems. This puts Spark in good company >>> with some very impressive previous recipients >>> <https://sigmod.org/sigmod-awards/sigmod-systems-award/>. This award is >>> really an achievement by the whole community, so I wanted to say congrat

Re: [Apache Spark Jenkins] build system shutting down Dec 23th, 2021

2021-12-27 Thread shane knapp
# sysctl stop jenkins #  goodbye jenkins!  On Mon, Dec 6, 2021 at 12:02 PM shane knapp ☠ wrote: > hey everyone! > > after a marathon run of nearly a decade, we're finally going to be > shutting down {amp|rise}lab jenkins at the end of this month... > > the earliest sna

Re: [Apache Spark Jenkins] build system shutting down Dec 23th, 2021

2021-12-07 Thread shane knapp
created an issue to track stuff: https://issues.apache.org/jira/browse/SPARK-37571 On Tue, Dec 7, 2021 at 8:25 AM shane knapp ☠ wrote: > Will you be nuking all the Jenkins-related code in the repo after the 23rd? >> >> probably not right away... but soon after jenkins is s

Re: [Apache Spark Jenkins] build system shutting down Dec 23th, 2021

2021-12-07 Thread shane knapp
> > Will you be nuking all the Jenkins-related code in the repo after the 23rd? > > probably not right away... but soon after jenkins is shut down. bits of the docs and spark website will need to be updated as well. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley E

[Apache Spark Jenkins] build system shutting down Dec 23th, 2021

2021-12-06 Thread shane knapp
y'alls can watch me type the final command: systemctl stop jenkins feeling bittersweet, shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [FYI] Build and run tests on Java 17 for Apache Spark 3.3

2021-11-12 Thread shane knapp
le Silicon natively, but some 3rd party libraries like > RocksDB/LevelDB are not ready yet. Since Mac is one of the popular dev > environments, we are going to keep monitoring and improving gradually for > Apache Spark 3.3. > > Please test Java 17 and let us know your feedback. >

Re: [build system] quick jenkins reboot

2021-10-22 Thread shane knapp
we've been back for about an hour. :) On Fri, Oct 22, 2021 at 1:52 PM shane knapp ☠ wrote: > system load on the primary is getting suspiciously high, and free ram has > mysteriously disappeared and we are rapidly approaching swap. whatever > could it be? > > java. > >

[build system] quick jenkins reboot

2021-10-22 Thread shane knapp
-- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] DNS outage @ uc berkeley, jenkins not available

2021-09-01 Thread shane knapp
this was resolved by campus IT around 930pm last night. On Tue, Aug 31, 2021 at 12:54 PM shane knapp ☠ wrote: > > we're having some DNS issues here in the EECS department, and our > crack team is working on getting it resolved asap. until then, > jenkins isn't visible to the o

[build system] DNS outage @ uc berkeley, jenkins not available

2021-08-31 Thread shane knapp
we're having some DNS issues here in the EECS department, and our crack team is working on getting it resolved asap. until then, jenkins isn't visible to the outside world. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

Re: [build system] quick jenkins restart

2021-08-25 Thread shane knapp
aaand we're back! On Wed, Aug 25, 2021 at 9:24 AM shane knapp ☠ wrote: > i'll be: > - upgrading jenkins to the latest LTS > - moving jenkins to java 11 (from java 8) > - rebooting everything > > sorry for the disruption... there aren't many builds running right now

[build system] quick jenkins restart

2021-08-25 Thread shane knapp
i'll be: - upgrading jenkins to the latest LTS - moving jenkins to java 11 (from java 8) - rebooting everything sorry for the disruption... there aren't many builds running right now so i'll just get this sorted. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research

Re: [build system] half of the jenkins workers are down

2021-08-09 Thread shane knapp
turns out that minikube/k8s and friends were being oom-killed and this was causing all sorts of weirdnesses. i've upped the ram limits on all of the k8s jobs to 8G (from 6G), and we'll keep an eye on things and see how they go. On Mon, Aug 9, 2021 at 12:02 PM shane knapp ☠ wrote: > as work

Re: [build system] half of the jenkins workers are down

2021-08-09 Thread shane knapp
as workers are continuing to fail, i've stopped jenkins from accepting new builds for the time being. more updates as they come. On Mon, Aug 9, 2021 at 9:17 AM shane knapp ☠ wrote: > happy monday! > > the server gods did not smile upon us this weekend, and 4 of the workers > are

[build system] half of the jenkins workers are down

2021-08-09 Thread shane knapp
happy monday! the server gods did not smile upon us this weekend, and 4 of the workers are down. we'll most likely need to head to our colo some time today and give them an in-person kick and see what's going on. i'll send an update when they're back up. shane -- Shane Knapp Computer Guy

[build system] jenkins "freeze" for remainder of 2021

2021-07-28 Thread shane knapp
jenkins. exceptions to this rule include new branches (spark 3.3, i'm looking at you!), and any major security or critical fixes required for builds. please let us know if you have any questions! thanks in advance, brian & shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Rese

Re: please read: current state and the future of the apache spark build system

2021-07-28 Thread shane knapp
has been pretty stable, the jenkins administrative GUI is still broken (but at least i can hack the xml on the bare metal), and we've got 8 workers up and running. i'll be sending out another email to this list soon regarding the impending jenkins 'freeze'. shane -- Shane Knapp Comput

Re: [build system] jenkins downtime today

2021-07-22 Thread shane knapp
that actually went much faster than anticipated, and we're already back up and building! On Thu, Jul 22, 2021 at 10:24 AM shane knapp ☠ wrote: > i'll be taking jenkins down for a couple of hours today to reboot/clean up > the workers and finish up the python package installs covered in >

[build system] jenkins downtime today

2021-07-22 Thread shane knapp
i'll be taking jenkins down for a couple of hours today to reboot/clean up the workers and finish up the python package installs covered in https://github.com/apache/spark/pull/33469/files shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical

Re: quick jenkins restart

2021-07-09 Thread shane knapp
we're back up! On Fri, Jul 9, 2021 at 10:23 AM shane knapp ☠ wrote: > the primary is running out of memory pretty quickly, and i'm going to > reboot the server quickly so that it doesn't crash over the weekend. > > we'll investigate a bit more next week. > > shane > -- >

quick jenkins restart

2021-07-09 Thread shane knapp
the primary is running out of memory pretty quickly, and i'm going to reboot the server quickly so that it doesn't crash over the weekend. we'll investigate a bit more next week. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

Re: How to think about SparkPullRequestBuilder-K8s?

2021-06-11 Thread shane knapp
we're back. On Fri, Jun 11, 2021 at 2:30 PM shane knapp ☠ wrote: > btw i just noticed jenkins was down, and i restarted the primary node. > > On Fri, Jun 11, 2021 at 12:09 PM Sean Owen wrote: > >> I find that somewhat often, the K8S PR builders will fail

Re: How to think about SparkPullRequestBuilder-K8s?

2021-06-11 Thread shane knapp
the PR seems totally unrelated to K8S. I've kind of learned to > ignore them in that case but that seems wrong. Are they just kind of flaky? > am I imagining things? Just trying to figure out how much they're > 'accurate' in catching real vs false failures. > -- Shane Knapp Computer

Re: [build system] jenkins down, working on it

2021-05-04 Thread shane knapp
we're back and building! On Tue, May 4, 2021 at 4:03 PM shane knapp ☠ wrote: > jenkins went down some time in the past few days, and i'm currently > investigating. > > if it's been down a while, i apologize as i've been dealing w/some health > issues. > > shane > -- >

[build system] jenkins down, working on it

2021-05-04 Thread shane knapp
jenkins went down some time in the past few days, and i'm currently investigating. if it's been down a while, i apologize as i've been dealing w/some health issues. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

Re: [SPARK-34738] issues w/k8s+minikube and PV tests

2021-04-16 Thread shane knapp
next week. On Thu, Apr 15, 2021 at 3:05 PM shane knapp ☠ wrote: > i'm all for that... and once they're turned off, we can finish the > minikube/k8s/move-to-docker project in a couple of hours max. > > On Thu, Apr 15, 2021 at 3:00 PM Holden Karau wrote: > >> What about if we

Re: [SPARK-34738] issues w/k8s+minikube and PV tests

2021-04-15 Thread shane knapp
that one > test fails because it relies on some minikube-specific functionality. That > test could be refactored because I think it’s just adding a minimal Ceph > cluster to the K8S cluster which can be done to any K8S cluster in principle > > > > > > > > Rob > >

Re: please read: current state and the future of the apache spark build system

2021-04-14 Thread shane knapp
ng this out today: https://github.com/apache/spark/pull/32178 shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [SPARK-34738] issues w/k8s+minikube and PV tests

2021-04-14 Thread shane knapp
On Wed, Apr 14, 2021 at 10:32 AM Frank Luo wrote: > Is there any hard dependency on minikube (i.e., GPU setting)? kind ( > https://kind.sigs.k8s.io/) is a more stable and simpler k8s cluster env on a > single machine (it only requires docker), and it has been widely used by k8s projects > for testing. > > there

[SPARK-34738] issues w/k8s+minikube and PV tests

2021-04-14 Thread shane knapp
virtualization layer. thanks in advance, shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: Increase the number of parallel jobs in GitHub Actions at ASF organization level

2021-04-08 Thread shane knapp
ferent now... - since there are no funds coming from research labs, i am unable to staff the build system past 2021 (tbh, even this year is a stretch) - the hardware is far past EOL and literally falling over - jenkins is, and always will be a PITA to run shane -- Shane Knapp Computer Guy / Voice

please read: current state and the future of the apache spark build system

2021-04-07 Thread shane knapp
, but some things might be flaky. but the biggest question is what you all need w/regards to build infrastructure... and who's going to be responsible for it. thanks for reading! :) shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

Re: [build system] short downtime today, new workers coming soon

2021-03-23 Thread shane knapp
we're back! On Tue, Mar 23, 2021 at 12:31 PM shane knapp ☠ wrote: > jenkins is acting up, and i'm going to take the opportunity to reboot the > primary and all the workers. > > sorry for the short notice, but on the bright side we have a bunch of > shiny new workers coming

[build system] short downtime today, new workers coming soon

2021-03-23 Thread shane knapp
jenkins is acting up, and i'm going to take the opportunity to reboot the primary and all the workers. sorry for the short notice, but on the bright side we have a bunch of shiny new workers coming soon! shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab

Re: [build system] github fetches timing out

2021-03-17 Thread shane knapp
it's been happening a lot again recently... i'm investigating. On Wed, Mar 10, 2021 at 10:23 AM Liang-Chi Hsieh wrote: > Thanks Shane for looking at it! > > > shane knapp ☠ wrote > > ...and just like that, overnight the builds started successfully git > > fetching! >

Re: [build system] github fetches timing out

2021-03-10 Thread shane knapp
...and just like that, overnight the builds started successfully git fetching! On Tue, Mar 9, 2021 at 12:31 PM shane knapp ☠ wrote: > it looks like over the past few days the master/branch builds have been > timing out... this hasn't happened in a few years, and honestly the last >

[build system] github fetches timing out

2021-03-09 Thread shane knapp
i had a more concrete answer or solution for what's going on... i'll continue to investigate as best i can today, and if this continues, i'll re-open my issue w/github and see if they can shed any light on the situation. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research

Re: minikube and kubernetes cluster versions for integration testing

2021-03-04 Thread shane knapp
park-developers-list.1001551.n3.nabble.com/ > > - > To unsubscribe e-mail: dev-unsubscr...@spark.apache.org > > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: minikube and kubernetes cluster versions for integration testing

2021-03-03 Thread shane knapp
n Mac but with a simple sed expression it can be tailored to >> linux too. >> >> >> >> *After all of this my questions:* >> *A) What about changing the required versions and suggesting >> kubernetes v1.17.3 and Minikube v1.7.3 and greater for integration testing?* >> >> I would choose v1.17.3 for the k8s cluster as that is the newest supported k8s >> version for Minikube v1.7.3 (hoping it will be good for us for a long >> time). >> If you agree with this suggestion I will go ahead and update the relevant >> documentation. >> >> >> >> *B) How about extending the integration test to check whether the >> Minikube version is sufficient? *By this we can provide a meaningful >> error when it is violated. >> >> Bests, >> Attila >> > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu
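Question B in the quoted message proposes a minimum-version check with a meaningful error. A minimal sketch of that idea follows, written in Python purely for illustration. The function names, the error wording, and the assumption that the version arrives as a string containing something like `v1.7.3` are all hypothetical; this is not code from the Spark integration tests.

```python
import re

# Minimum Minikube version proposed in the thread (v1.7.3).
MIN_MINIKUBE = (1, 7, 3)


def parse_version(text):
    """Extract (major, minor, patch) from text containing e.g. 'v1.7.3'."""
    match = re.search(r"v?(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"cannot find a version number in {text!r}")
    return tuple(int(part) for part in match.groups())


def check_minikube_version(version_text, minimum=MIN_MINIKUBE):
    """Return the parsed version, or raise a descriptive error if too old."""
    found = parse_version(version_text)
    # Tuples compare element by element, so (1, 18, 1) > (1, 7, 3).
    if found < minimum:
        wanted = ".".join(str(n) for n in minimum)
        got = ".".join(str(n) for n in found)
        raise RuntimeError(
            f"Minikube {got} is too old; integration tests need {wanted} or newer"
        )
    return found
```

Comparing integer tuples rather than raw strings avoids the usual trap where "1.18" sorts before "1.7" lexicographically.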

Re: [build system] jenkins wedged, going to restart after current builds finish

2021-02-23 Thread shane knapp
this was done about an hour ago... rebooted several of the workers to clear out lingering builds, and one worker had an SSD fail on boot and is currently offline. shane On Tue, Feb 23, 2021 at 10:13 AM shane knapp ☠ wrote: > EOM > > -- > Shane Knapp > Computer Guy / Voice

[build system] jenkins wedged, going to restart after current builds finish

2021-02-23 Thread shane knapp
EOM -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: K8s integration test failure ("credentials Jenkins is using is probably wrong...")

2021-02-23 Thread shane knapp
stupid bash variable assignment. i'm surprised this has lingered for as long as it had (3 years). it's fixed and shouldn't be an issue any more. On Tue, Feb 23, 2021 at 9:28 AM shane knapp ☠ wrote: > the AmplabJenks bot's github creds are out of date, which is causing that > non-fatal

Re: K8s integration test failure ("credentials Jenkins is using is probably wrong...")

2021-02-23 Thread shane knapp
>> probably wrong. Or the user account does not have write access to the repo. >> >> >> See >> https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39934/consoleFull >> >> Can anybody please advise? >> >> Thanks in advance. >> >> Phillip >> >> >> -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [FYI] CI Infra issues (in both GitHub Action and Jenkins)

2021-01-08 Thread shane knapp
kins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-sbt-hadoop-2.7/1887/console >> >> On Fri, Jan 8, 2021 at 2:13 PM shane knapp ☠ wrote: >> >>> 1. Jenkins machines start to fail with the following recently. >>>> (master branch) >>

Re: [FYI] CI Infra issues (in both GitHub Action and Jenkins)

2021-01-08 Thread shane knapp
QA%20Test%20(Dashboard)/job/spark-master-test-sbt-hadoop-3.2/1836/console > > https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-sbt-hadoop-2.7/1887/console > > On Fri, Jan 8, 2021 at 2:13 PM shane knapp ☠ wrote: > >> 1. Jenkins machines

Re: [FYI] CI Infra issues (in both GitHub Action and Jenkins)

2021-01-08 Thread shane knapp
> > 1. Jenkins machines start to fail with the following recently. > (master branch) > > Python versions prior to 3.6 are not supported. > Build step 'Execute shell' marked build as failure > > examples please? -- Shane Knapp Computer Guy / Voice of Reason U

[build system] jenkins downtime 01/02/2021 - 01/03/2020

2020-12-21 Thread shane knapp
spark jira. :) -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] WE'RE LIVE!

2020-12-04 Thread shane knapp
ok, it's broken on the new nodes, so i tied the project to ubuntu16. i'll create a jira and investigate further at a later date. On Fri, Dec 4, 2020 at 8:58 AM shane knapp ☠ wrote: > no, it isn't but i'll try and take a look at this later today. > > On Fri, Dec 4, 2020 at 7:12 AM T

Re: [build system] WE'RE LIVE!

2020-12-04 Thread shane knapp
c 2nd failed: > > https://amplab.cs.berkeley.edu/jenkins/view/Spark%20Packaging/job/spark-master-maven-snapshots/3186/ > > Not sure if this is result of upgrade? > > Thanks, > Tom > On Tuesday, December 1, 2020, 06:55:27 PM CST, shane knapp ☠ < > skn...@berkeley.edu> wrote:

[build system] WE'RE LIVE!

2020-12-01 Thread shane knapp
for his work on the project! we couldn't have done it w/o him. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] jenkins downtime today/tomorrow

2020-12-01 Thread shane knapp
and move on to fixing any lingering environment/system issues that pop up. shane On Mon, Nov 30, 2020 at 4:01 PM shane knapp ☠ wrote: > amplab jenkins is down. > > On Mon, Nov 30, 2020 at 3:25 PM shane knapp ☠ wrote: > >> old jenkins is getting shut down Real Soon Now[tm]! cros

Re: [build system] jenkins downtime today/tomorrow

2020-11-30 Thread shane knapp
amplab jenkins is down. On Mon, Nov 30, 2020 at 3:25 PM shane knapp ☠ wrote: > old jenkins is getting shut down Real Soon Now[tm]! crossing my fingers! > :) > > On Mon, Nov 30, 2020 at 10:05 AM shane knapp ☠ > wrote: > >> hey all! >> >> the Great Jen

Re: [build system] jenkins downtime today/tomorrow

2020-11-30 Thread shane knapp
old jenkins is getting shut down Real Soon Now[tm]! crossing my fingers! :) On Mon, Nov 30, 2020 at 10:05 AM shane knapp ☠ wrote: > hey all! > > the Great Jenkins Migration[tm] is well under way, and we will be > sunsetting the old amp-jenkins-master server and moving to a new o

[build system] jenkins downtime today/tomorrow

2020-11-30 Thread shane knapp
. shane/brian/jon -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] IMPORTANT UPDATE

2020-11-25 Thread shane knapp
On Wed, Nov 25, 2020 at 1:35 PM shane knapp ☠ wrote: > hey all, work is going quite well and smoothly for this project. > > today's update: > > we will experience significant downtime monday/tuesday as we spin up the > new primary jenkins node. until then, we'll be building

Re: [build system] IMPORTANT UPDATE

2020-11-25 Thread shane knapp
at 6:08 PM shane knapp ☠ wrote: > all spark builds have been ported and triggered: > > https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/ > > not shown are the regular and k8s PRB, which are also running. > > i think i've nailed down most of the stup

Re: [build system] IMPORTANT UPDATE

2020-11-24 Thread shane knapp
: rack rearrangement, cleaning up networking, fixing hardware, reimaging and generally kicking ass! have a great holiday! shane On Tue, Nov 24, 2020 at 2:24 PM shane knapp ☠ wrote: > our very first ubuntu-based PRB is running: > https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestB

Re: [build system] IMPORTANT UPDATE

2020-11-24 Thread shane knapp
our very first ubuntu-based PRB is running: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/131701/ crossing my fingers! :) On Tue, Nov 24, 2020 at 1:30 PM shane knapp ☠ wrote: > due to scheduling, upcoming holiday and in-the-colo work requirements, all > of the

Re: [build system] IMPORTANT UPDATE

2020-11-24 Thread shane knapp
! shane On Tue, Nov 24, 2020 at 11:24 AM shane knapp ☠ wrote: > this is a lengthy, but important read for everyone here. > > in the next few days, the remaining centos machines (PRB/SBT workers AND > primary) will have been reimaged from centos6.9 to ubuntu 20.04LTS. > > this mea

Re: jenkins downtime tomorrow evening/weekend

2020-11-24 Thread shane knapp
> > Please see https://issues.apache.org/jira/browse/SPARK-27177 for more > details. > > On Tue, Nov 24, 2020 at 8:23 AM shane knapp ☠ wrote: > >> it seems that the plugin upgrade went as smoothly as it could have... i >> still have a bunch of stack traces to filter th

[build system] IMPORTANT UPDATE

2020-11-24 Thread shane knapp
like to have helped find the build system a new home, and sunset jenkins. over the past 11 years (i think), this system has built spark. it's getting a little tired and needs a well deserved break. :) shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff

Re: jenkins downtime tomorrow evening/weekend

2020-11-23 Thread shane knapp
me here. also, my backlog of things i need to install will be addressed this week. the ansible is coming along nicely! On Mon, Nov 23, 2020 at 2:11 PM shane knapp ☠ wrote: > the third most terrifying event in the world, a massive jenkins plugin > update is happening in a couple of hours

Re: jenkins downtime tomorrow evening/weekend

2020-11-23 Thread shane knapp
. shane On Sat, Nov 21, 2020 at 4:23 PM shane knapp ☠ wrote: > somehow that went pretty smoothly, tho i've got a bunch of plugins to deal > with... we're back up and building w/a shiny new UI. :) > > On Sat, Nov 21, 2020 at 3:52 PM shane knapp ☠ wrote: > >> this is starting

Re: jenkins downtime tomorrow evening/weekend

2020-11-21 Thread shane knapp
somehow that went pretty smoothly, tho i've got a bunch of plugins to deal with... we're back up and building w/a shiny new UI. :) On Sat, Nov 21, 2020 at 3:52 PM shane knapp ☠ wrote: > this is starting now > > On Thu, Nov 19, 2020 at 4:34 PM shane knapp ☠ wrote: >

Re: jenkins downtime tomorrow evening/weekend

2020-11-21 Thread shane knapp
this is starting now On Thu, Nov 19, 2020 at 4:34 PM shane knapp ☠ wrote: > i'm going to be upgrading jenkins to something more reasonable, and there > will definitely be some downtime as i get things sorted. > > we should be back up and building by monday. > > shane

jenkins downtime tomorrow evening/weekend

2020-11-19 Thread shane knapp
i'm going to be upgrading jenkins to something more reasonable, and there will definitely be some downtime as i get things sorted. we should be back up and building by monday. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

[build system] IMPORTANT: builds will be impacted this month

2020-11-02 Thread shane knapp
things up to date while trying to remotely train up one of my sysadmins to take over some of my build system duties. -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] jenkins wedged again

2020-10-14 Thread shane knapp
everything's up and jenkins is slowly chewing through the queue! :) On Wed, Oct 14, 2020 at 12:00 PM Xiao Li wrote: > Thank you, Shane! > > Xiao > > On Wed, Oct 14, 2020 at 12:00 PM shane knapp ☠ > wrote: > >> we're mostly back up, and just waiting for a couple o

Re: [build system] jenkins wedged again

2020-10-14 Thread shane knapp
we're mostly back up, and just waiting for a couple of ubuntu boxes to finish booting... prb seem to be building now! On Wed, Oct 14, 2020 at 11:48 AM shane knapp ☠ wrote: > i'm going to reboot the primary and worker nodes, so it'll be a few > minutes before everything is back up. >

[build system] jenkins wedged again

2020-10-14 Thread shane knapp
i'm going to reboot the primary and worker nodes, so it'll be a few minutes before everything is back up. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: Running K8s integration tests for changes in core?

2020-09-24 Thread shane knapp
ads up. I hope you get some time to relax :) > > On Thu, Aug 20, 2020 at 2:26 PM shane knapp ☠ wrote: > >> fyi, i won't be making this change until the 1st week of september. i'll >> be out, off the grid all next week! :) >> >> i will send an announcement out tom

Re: [build system] downtime due to SSL cert errors

2020-09-24 Thread shane knapp
certs delivered and installed... we're back! On Wed, Sep 23, 2020 at 6:07 PM shane knapp ☠ wrote: > jenkins is up and building, but not reachable via https at the moment. > i'm working on getting this sorted ASAP. > > shane > -- > Shane Knapp > Computer Guy / Voice of Reas

[build system] downtime due to SSL cert errors

2020-09-23 Thread shane knapp
jenkins is up and building, but not reachable via https at the moment. i'm working on getting this sorted ASAP. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

[build system] shane out all next week (aug 22-29), support instructions

2020-08-20 Thread shane knapp
the number of tickets opened. :) if there are any other problems, file a JIRA and assign to me. i will look at it in early september. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: Running K8s integration tests for changes in core?

2020-08-20 Thread shane knapp
> > A presubmit (which includes K8s integration tests) build will be run, once > the PR receives LGTM from "Approved reviewers". This is one criterion that > comes to my mind, others may have better suggestions. > > On Thu, Aug 20, 2020 at 12:25 AM shane knapp ☠ > wrote

Re: Running K8s integration tests for changes in core?

2020-08-19 Thread shane knapp
> >> > > -- > Twitter: https://twitter.com/holdenkarau > Books (Learning Spark, High Performance Spark, etc.): > https://amzn.to/2MaRAG9 <https://amzn.to/2MaRAG9> > YouTube Live Streams: https://www.youtube.com/user/holdenkarau > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: Running K8s integration tests for changes in core?

2020-08-18 Thread shane knapp
er.com/holdenkarau > Books (Learning Spark, High Performance Spark, etc.): > https://amzn.to/2MaRAG9 <https://amzn.to/2MaRAG9> > YouTube Live Streams: https://www.youtube.com/user/holdenkarau > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

[build system] restarting jenkins now

2020-08-14 Thread shane knapp
there isn't much activity right now, and i'd like to restart jenkins quickly as it's consuming a lot of memory on the head node. shouldn't be more than a couple of minutes downtime... if something goes awry i'll send an email here. if you don't hear from me again, please carry on. :) -- Shane

Re: R installation broken on ubuntu workers, impacts K8s PRB builds

2020-07-17 Thread shane knapp
this is done, except for amp-jenkins-staging-worker-02 which is refusing to allow me to reinstall R... i marked that worker offline and will beat on it later today. On Fri, Jul 17, 2020 at 11:36 AM shane knapp ☠ wrote: > starting now... pausing jenkins so no new builds are launc

Re: R installation broken on ubuntu workers, impacts K8s PRB builds

2020-07-17 Thread shane knapp
starting now... pausing jenkins so no new builds are launched. On Thu, Jul 16, 2020 at 3:09 PM Holden Karau wrote: > Sounds good, thanks. No rush :) > > On Thu, Jul 16, 2020 at 3:03 PM shane knapp ☠ wrote: > >> i'll get to this tomorrow afternoon, and there will be a short

Re: R installation broken on ubuntu workers, impacts K8s PRB builds

2020-07-16 Thread shane knapp
-32326 > > On Wed, Jul 15, 2020 at 12:09 PM shane knapp ☠ > wrote: > >> i'm not entirely sure when the dep for R got bumped to 3.5+, but it's >> breaking the k8s builds. >> >> i'll need to purge these workers of all previous versions of R + >> packages, the

R installation broken on ubuntu workers, impacts K8s PRB builds

2020-07-15 Thread shane knapp
of downtime. i'll file a JIRA, and figure out when i will be able to get to this... possibly this afternoon. -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [DISCUSS] Drop Python 2, 3.4 and 3.5

2020-07-14 Thread shane knapp
tages by dropping them: >>>>>>>>> 1. It removes a bunch of hacks we added around 700 lines in >>>>>>>>> PySpark. >>>>>>>>> 2. PyPy2 has a critical bug that causes a flaky test, >>>>>>>>> https://issues.apache.org

Re: Welcoming some new Apache Spark committers

2020-07-14 Thread shane knapp
al > > All three of them contributed to Spark 3.0 and we’re excited to have them > join the project. > > Matei and the Spark PMC > - > To unsubscribe e-mail: dev-unsubscr...@spark.apache.org > > -- Sha

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-13 Thread shane knapp
feel we're out of the woods right now. :) shane On Fri, Jul 10, 2020 at 3:43 PM Frank Yin wrote: > Great. Thanks. > > On Fri, Jul 10, 2020 at 3:39 PM shane knapp ☠ wrote: > >> no, 8 hours is plenty. things will speed up soon once the backlog of >> builds work

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-10 Thread shane knapp
hanks. >> >> On Fri, Jul 10, 2020 at 12:43 PM shane knapp ☠ >> wrote: >> >>> only 125561, 125562 and 125564 were impacted by -9. >>> >>> 125565 exited w/a code of 15 (143 - 128), which means the process was >>> terminated for unknown rea
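The "15 (143 - 128)" arithmetic in this message follows the common shell convention that a process killed by signal N is reported with exit status 128 + N. A small sketch of that decoding is below, written in Python for illustration; the function name is hypothetical and the signal names come from the platform's `signal` module, so they assume a POSIX system.

```python
import signal


def describe_exit_status(status):
    """Map a shell-style exit status to (signal number, signal name).

    Statuses above 128 are treated as 128 + signal number; anything else
    is reported as not signal-related.
    """
    if status > 128:
        signum = status - 128
        try:
            name = signal.Signals(signum).name
        except ValueError:
            # Number does not correspond to a known signal on this platform.
            name = "unknown signal"
        return signum, name
    return None, "not terminated by a signal"
```

Under this convention, status 143 decodes to signal 15 (SIGTERM), and the "-9" builds mentioned in the same message would show up as status 137 (SIGKILL).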

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-10 Thread shane knapp
arkPullRequestBuilder/125563/console > > https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125562/console > > https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125561/console > > On Fri, Jul 10, 2020 at 9:35 AM shane knapp ☠ wrote: > >>

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-10 Thread shane knapp
> infrastructure? > > On Fri, Jul 10, 2020 at 8:19 AM shane knapp ☠ wrote: > >> yeah, i can't do much for flaky tests... just flaky infrastructure. >> >> >> On Fri, Jul 10, 2020 at 12:41 AM Hyukjin Kwon >> wrote: >> >>> Couple of flaky

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-10 Thread shane knapp
>> >> >> >> -- >> Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ >> >> ----- >> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org >> >> --

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-09 Thread shane knapp
i'm seeing green PRB builds now, so i feel that we've gotten things building again! :) On Thu, Jul 9, 2020 at 5:33 PM Hyukjin Kwon wrote: > Thank you Shane. > > 2020년 7월 10일 (금) 오전 2:35, shane knapp ☠ 님이 작성: > >> and -06 is back! i'll keep an eye on things today, but

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-09 Thread shane knapp
and -06 is back! i'll keep an eye on things today, but suffice to say on each worker i: 1) rebooted 2) cleaned ~/.ivy2, ~/.m2, and other associated caches we should be g2g! please reply here if you continue to see weirdness. On Thu, Jul 9, 2020 at 10:08 AM shane knapp ☠ wrote: >
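The per-worker cleanup described here (wiping `~/.ivy2`, `~/.m2`, and related caches) can be sketched as below. This is an illustrative Python helper, not the commands that were actually run on the workers. The function name is hypothetical, and only the two directories named in the message are included, since the "other associated caches" are not listed.

```python
import shutil
from pathlib import Path

# Only the two cache directories named in the message; the others are unknown.
CACHE_DIRS = (".ivy2", ".m2")


def clean_build_caches(home, cache_dirs=CACHE_DIRS):
    """Delete the given cache directories under `home`; return what was removed."""
    removed = []
    for name in cache_dirs:
        target = Path(home) / name
        if target.is_dir():
            shutil.rmtree(target)  # irreversible, like `rm -rf`
            removed.append(target)
    return removed
```

Taking the home directory as a parameter keeps the helper easy to try against a scratch directory before pointing it at a real build account.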

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-09 Thread shane knapp
ok, we're back up and building (just waiting for one worker, -06 to finish cleaning itself up). On Thu, Jul 9, 2020 at 9:30 AM shane knapp ☠ wrote: > this is happening now. > > On Wed, Jul 8, 2020 at 9:07 AM shane knapp ☠ wrote: > >> this will be happening tomorrow... tod

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-09 Thread shane knapp
this is happening now. On Wed, Jul 8, 2020 at 9:07 AM shane knapp ☠ wrote: > this will be happening tomorrow... today is Meeting Hell Day[tm]. > > On Tue, Jul 7, 2020 at 1:59 PM shane knapp ☠ wrote: > >> i wasn't able to get to it today, so i'm hoping to squeeze in a quick >

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-08 Thread shane knapp
this will be happening tomorrow... today is Meeting Hell Day[tm]. On Tue, Jul 7, 2020 at 1:59 PM shane knapp ☠ wrote: > i wasn't able to get to it today, so i'm hoping to squeeze in a quick trip > to the colo tomorrow morning. if not, then first thing thursday. > > -- > Shane K

restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-07 Thread shane knapp
i wasn't able to get to it today, so i'm hoping to squeeze in a quick trip to the colo tomorrow morning. if not, then first thing thursday. -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: m2 cache issues in Jenkins?

2020-07-06 Thread shane knapp
te: > >> Could this be a flaky or persistent issue? It failed with Scala gendoc >> but it didn't fail with the part the PR modified. It ran from worker-05. >> >> >> https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125121/consoleFull >> >

Re: m2 cache issues in Jenkins?

2020-07-06 Thread shane knapp
i killed and retriggered the PRB jobs on 04, and wiped that workers' m2 cache. On Mon, Jul 6, 2020 at 9:24 AM shane knapp ☠ wrote: > once the jobs running on that worker are finished, yes. > > On Sun, Jul 5, 2020 at 7:41 PM Hyukjin Kwon wrote: > >> Shane, can we remove .m2 i

Re: m2 cache issues in Jenkins?

2020-07-06 Thread shane knapp
>>>>> Huh interesting that it’s the same worker. Have you filed a ticket to >>>>>> Shane? >>>>>> >>>>>> On Wed, Jul 1, 2020 at 8:50 PM Hyukjin Kwon >>>>>> wrote: >>>>>> >>>>>

Re: Jenkins is down

2020-07-05 Thread shane knapp
> Hi all and Shane, >> >> Is there something wrong with the Jenkins machines? Seems they are down. >> > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: m2 cache issues in Jenkins?

2020-06-24 Thread shane knapp
done: -bash-4.1$ cd .m2 -bash-4.1$ ls repository -bash-4.1$ time rm -rf * real 17m4.607s user 0m0.950s sys 0m18.816s -bash-4.1$ On Wed, Jun 24, 2020 at 10:50 AM shane knapp ☠ wrote: > ok, i've taken that worker offline and once the job running on it > finishes, i'll wipe the
