1. Do the same people where you work operate the cluster and write the code to develop the application?

Mostly. Ops vs dev, although there's some overlap.

2. Do you have a metrics stack that allows you to see graphs of various metrics with all the nodes displayed together?

Yes, Prometheus + Grafana (currently a custom script reporting to Prometheus, but that needs revisiting).

3. Do you have a log stack that allows you to see the logs for all the nodes together?

Yep, Graylog.

4. Do you regularly repair your clusters - such as by using Reaper?

Yes, with Reaper. Every day or two, more or less. It would be almost constant if Reaper could work off queues with blacklisted time windows instead of a schedule.

5. Do you use artificial intelligence to help manage your clusters?

No.

On Thu, Mar 28, 2019 at 8:46 AM Tom van der Woerdt <tom.vanderwoe...@booking.com.invalid> wrote:

> 1. Do the same people where you work operate the cluster and write
> the code to develop the application?
>
> No, we have a small infrastructure team, and many people developing
> applications using Cassandra
>
> 2. Do you have a metrics stack that allows you to see graphs of
> various metrics with all the nodes displayed together?
>
> Yes, we use a re-implementation of Graphite, which we open-sourced and now
> lives at https://github.com/go-graphite
>
> 3. Do you have a log stack that allows you to see the logs for all
> the nodes together?
>
> Yes, although in practice we don't use it much for Cassandra
>
> 4. Do you regularly repair your clusters - such as by using Reaper?
>
> Yes, we have built our own tools for this
>
> 5. Do you use artificial intelligence to help manage your clusters?
>
> It's not "artificial intelligence" the way most people would describe it,
> but we certainly don't run our clusters manually
>
> Tom van der Woerdt
> Site Reliability Engineer
>
> Booking.com B.V.
> Vijzelstraat 66-80 Amsterdam 1017HL Netherlands
>
> On Thu, Mar 28, 2019 at 10:03 AM Kenneth Brotman
> <kenbrot...@yahoo.com.invalid> wrote:
>
>> I’m looking to get a better feel for how people use Cassandra in
>> practice. I thought others would benefit as well so may I ask you the
>> following five questions:
>>
>> 1. Do the same people where you work operate the cluster and write
>> the code to develop the application?
>>
>> 2. Do you have a metrics stack that allows you to see graphs of
>> various metrics with all the nodes displayed together?
>>
>> 3. Do you have a log stack that allows you to see the logs for all
>> the nodes together?
>>
>> 4. Do you regularly repair your clusters - such as by using Reaper?
>>
>> 5. Do you use artificial intelligence to help manage your clusters?
>>
>> Thank you for taking your time to share this information!
>>
>> Kenneth Brotman