Re: [openstack-dev] [magnum] supported OS images and magnum spawn failures for Swarm and Kubernetes

2018-08-04 Thread Joe Topjian
We recently deployed Magnum and I've been making my way through getting both Swarm and Kubernetes running. I also ran into some initial issues. These notes may or may not help, but thought I'd share them in case: * We're using Barbican for SSL. I have not tried with the internal x509keypair. * I

Re: [openstack-dev] [sahara] Anti-Affinity Broke

2018-08-01 Thread Joe Topjian
ERVER_GROUP_NAMES].insert(i, > server_group_resource) > > Thanks, > > > On Fri, Jun 22, 2018 at 5:03 AM Luigi Toscano wrote: > >> On Friday, 22 June 2018 05:00:16 CEST Joe Topjian wrote: >> > Hello, >> > >> > I originally posted this to the general

[openstack-dev] [sahara] Anti-Affinity Broke

2018-06-21 Thread Joe Topjian
Hello, I originally posted this to the general openstack list to get a sanity check on what I was seeing. Jeremy F reached out and confirmed that, so I'm going to re-post the details here to begin a discussion. >From what I can see, anti-affinity is not working at all in Sahara. I was able to

Re: [Openstack-operators] large high-performance ephemeral storage

2018-06-13 Thread Joe Topjian
of r/w mix (where write <=50% > to acknowledge this is ephemeral storage so hopefully something is using it > soon after storing). Thoughts? > > Cheers, > Blair > > On Thu., 14 Jun. 2018, 00:24 Joe Topjian, wrote: > >> Yes, you can! The kernel documentation for rea

Re: [Openstack-operators] large high-performance ephemeral storage

2018-06-13 Thread Joe Topjian
Yes, you can! The kernel documentation for read/write limits actually uses /dev/null in the examples :) But more seriously: while we have not architected specifically for high performance, for the past few years, we have used a zpool of cheap spindle disks and 1-2 SSD disks for caching. We have

[Openstack] [Sahara] Anti-Affinity Broke?

2018-06-12 Thread Joe Topjian
Hello, I was unable to get anti-affinity to work in Sahara. The logs were reporting a Heat error that u'' was not a valid UUID. Upon further investigation, I think I found several errors in the `sahara/service/heat/templates.py` file. I'm working with Sahara 7.0.0 but the master branch still

Re: [openstack-dev] [k8s][octavia][lbaas] Experiences on using the LB APIs with K8s

2018-03-15 Thread Joe Topjian
Hi Chris, I wear a number of hats related to this discussion, so I'll add a few points of view :) It turns out that with > Terraform, it's possible to tear down resources in a way that causes > Neutron to > leak administrator-privileged resources that can not be deleted by a > non-privileged

Re: [Openstack] [Openstack-operators] Certifying SDKs

2017-12-15 Thread Joe Topjian
Hi all, I've been meaning to reply to this thread. Volodymyr, your reply reminded me :) I agree with what you said that the SDK should support everything that the API supports. In that way, one could simply review the API reference docs and create a checklist for each possible action. I've often

Re: [Openstack-operators] [Openstack] Certifying SDKs

2017-12-15 Thread Joe Topjian
Hi all, I've been meaning to reply to this thread. Volodymyr, your reply reminded me :) I agree with what you said that the SDK should support everything that the API supports. In that way, one could simply review the API reference docs and create a checklist for each possible action. I've often

Re: [Openstack-operators] mitaka/xenial libvirt issues

2017-11-27 Thread Joe Topjian
We think we've pinned the qemu errors down to a mismatched group ID on a handful of compute nodes. The slow systemd/libvirt is still unsolved, but at the moment that does not actually be the cause of the qemu errors. On Mon, Nov 27, 2017 at 8:04 AM, Joe Topjian <j...@topjian.net> wrote:

Re: [Openstack-operators] mitaka/xenial libvirt issues

2017-11-27 Thread Joe Topjian
his: > > https://bugs.launchpad.net/ubuntu/+source/qemu/+bug/1647389 > > Thanks > > On Thu, Nov 23, 2017 at 6:20 PM, Joe Topjian <j...@topjian.net> wrote: > >> OK, thanks. We'll definitely look at downgrading in a test environment. >> >> To add som

Re: [Openstack-operators] mitaka/xenial libvirt issues

2017-11-23 Thread Joe Topjian
gt; Chris > > On Thu, 23 Nov 2017 at 15:33 Joe Topjian <j...@topjian.net> wrote: > >> Hi all, >> >> We're seeing some strange libvirt issues in an Ubuntu 16.04 environment. >> It's running Mitaka, but I don't think this is a problem with OpenStack >>

[Openstack-operators] mitaka/xenial libvirt issues

2017-11-23 Thread Joe Topjian
Hi all, We're seeing some strange libvirt issues in an Ubuntu 16.04 environment. It's running Mitaka, but I don't think this is a problem with OpenStack itself. We're in the process of upgrading this environment from Ubuntu 14.04 with the Mitaka cloud archive to 16.04. Instances are being live

Re: [openstack-dev] [all][api] POST /api-sig/news

2017-09-22 Thread Joe Topjian
On Fri, Sep 22, 2017 at 7:03 AM, Jay Pipes <jaypi...@gmail.com> wrote: > On 09/21/2017 11:37 PM, Joe Topjian wrote: > >> >> Microversions also reared their head in the form of a long >> discussion about how SDK developers and users are consuming >>

Re: [openstack-dev] [all][api] POST /api-sig/news

2017-09-21 Thread Joe Topjian
> Microversions also reared their head in the form of a long discussion > about how SDK developers and users are consuming microversions at a very > granular level. This discussion opened many surprised eyes as we learned > how different SDK platforms deal with microversions, and what exactly are

Re: [Openstack-operators] Successful nova-network to Neutron Migration

2017-05-20 Thread Joe Topjian
migration, > like us :) > > thanks, > Belmiro > > On Sat, May 20, 2017 at 6:51 PM, Joe Topjian <j...@topjian.net> wrote: > >> Hi all, >> >> There probably aren't a lot of people in this situation nowadays, but for >> those that are, I wanted to

Re: [openstack-dev] [horizon][api][docs] Feedback requested on proposed formatting change to API docs

2017-05-19 Thread Joe Topjian
On Fri, May 19, 2017 at 8:00 AM, Sean Dague <s...@dague.net> wrote: > On 05/19/2017 08:36 AM, Monty Taylor wrote: > > On 05/17/2017 10:14 AM, Joe Topjian wrote: > >> > >> > >> On Tue, May 16, 2017 at 4:13 PM, Monty Taylor <mord...@inaugust.

Re: [openstack-dev] [horizon][api][docs] Feedback requested on proposed formatting change to API docs

2017-05-17 Thread Joe Topjian
On Tue, May 16, 2017 at 4:13 PM, Monty Taylor wrote: > Hey all! > > I read the API docs A LOT. (thank you to all of you who have worked on > writing them) > > As I do, a gotcha I hit up against a non-zero amount is mapping the > descriptions of the response parameters to

Re: [Openstack-operators] [scientific] Resource reservation requirements (Blazar) - Forum session

2017-04-03 Thread Joe Topjian
On Mon, Apr 3, 2017 at 8:20 AM, Jay Pipes <jaypi...@gmail.com> wrote: > On 04/01/2017 08:32 PM, Joe Topjian wrote: > >> On Sat, Apr 1, 2017 at 5:21 PM, Matt Riedemann <mriede...@gmail.com >> <mailto:mriede...@gmail.com>> wrote: >> >>

Re: [Openstack-operators] [scientific] Resource reservation requirements (Blazar) - Forum session

2017-04-01 Thread Joe Topjian
On Sat, Apr 1, 2017 at 5:21 PM, Matt Riedemann wrote: > On 4/1/2017 8:36 AM, Blair Bethwaite wrote: > >> Hi all, >> >> The below was suggested for a Forum session but we don't yet have a >> submission or name to chair/moderate. I, for one, would certainly be >> interested in

Re: [Openstack] virtio-blk vs virtio-scsi performance

2017-03-31 Thread Joe Topjian
That sounds about right. There was some prior discussion about this on the openstack-operators group with similar results. We use virtio-scsi in one of our clouds because testing (and production) has shown that volumes attached via virtio-scsi are better able to participate in mdadm and zfs. For

Re: [Openstack-operators] backup to object store - tool recommendations

2017-03-27 Thread Joe Topjian
We use rclone quite a bit. It works great and has a wealth of features: http://rclone.org/ On Mon, Mar 27, 2017 at 7:50 AM, Nick Jones wrote: > On 27 Mar 2017, at 12:59, Marcus Furlong wrote: > > > On 27 March 2017 at 22:39, Blair Bethwaite

Re: [Openstack] Help with openstack single node deployment

2017-03-24 Thread Joe Topjian
I'm an absolute amateur at this -- no doubt OpenStack Infra has better tools -- but I have a suite of deployment configurations here: https://github.com/jtopjian/terraform-devstack The most up-to-date ones are the "packstack-standard" (which I'm now using Packer to create a standard image that

Re: [Openstack-operators] [neutron] Modify Default Quotas

2017-03-23 Thread Joe Topjian
We run a similar kind of script. I think in most cases, a Floating IP means a publicly routable IP, and those are now scarce resources. Because of that, I agree with what's been mentioned about a conservative floating IP quota. Since the other resource types aren't restricted by external

Re: [Openstack] nova-network -> neutron migration docs and stories?

2017-03-17 Thread Joe Topjian
Hi Andrew, NeCTAR published a suite of scripts for doing a nova-network to neutron migration: https://github.com/NeCTAR-RC/novanet2neutron IIRC, another organization reported success with these scripts a few months ago on the openstack-operators list. I'm currently doing some trial runs and all

Re: [Openstack-operators] Flavors

2017-03-15 Thread Joe Topjian
Mar 15, 2017 at 10:31 PM, Joe Topjian <j...@topjian.net> wrote: > Another benefit of flavors is that they provide ease of use. While there > are users who are confident enough to spec out each instance they launch, I > work with a lot of users who would feel overwhelmed if they had t

Re: [Openstack-operators] Flavors

2017-03-15 Thread Joe Topjian
Another benefit of flavors is that they provide ease of use. While there are users who are confident enough to spec out each instance they launch, I work with a lot of users who would feel overwhelmed if they had to do this. Providing a set of recommended instance specs can go a long way to

Re: [Openstack-operators] libvirt freezing when loading Nova instance nwfilters

2017-02-22 Thread Joe Topjian
We ran into the "virsh nwfilter-list hanging indefinitely" thing back in early January. I spent hours and I almost went insane trying to figure it out. We weren't upgrading nodes, though, it just sort of happened. I have no idea if the following was the correct way of handling this, but this

Re: [Openstack-operators] Encrypted Cinder Volume Deployment

2017-02-05 Thread Joe Topjian
an update, do post :) Thanks, Joe On Mon, Jan 23, 2017 at 8:58 PM, Joe Topjian <j...@topjian.net> wrote: > Hi Kris, > > I came across that as well and I believe it has been fixed and ensures > existing volumes are accessible: > > https://github.com/openstack/nova/blob/8c3f77

Re: [Openstack-operators] Encrypted Cinder Volume Deployment

2017-01-23 Thread Joe Topjian
hashed wrong and was using the badly hashed key/password vs’s > what was configured. > > > > > > ___ > > Kris Lindgren > > Senior Linux Systems Engineer > > GoDaddy > > > > *From:

[Openstack-operators] Encrypted Cinder Volume Deployment

2017-01-23 Thread Joe Topjian
Hi all, I'm investigating the options for configuring Cinder with encrypted volumes and have a few questions. The Cinder environment is currently running Kilo which will be upgraded to something between M-O later this year. The Kilo release supports the fixed_key setting. I see fixed_key is

Re: [Openstack-operators] VM monitoring suggestions

2016-11-17 Thread Joe Topjian
We have some custom scripts that run on the hypervisors which poll: virsh dominfo virsh domiflist etc The memory stats with "virsh dommemstat" are, AFAIK, not accurate since there's nothing triggering kvm / the vm to release unused memory. But all other virsh stuff works well for us. We don't

Re: [Openstack-operators] [nova] Does anyone use the os-diagnostics API?

2016-10-12 Thread Joe Topjian
Hi Matt, Tim, Thanks for asking. We’ve used the API in the past as a way of getting the > usage data out of Nova. We had problems running ceilometer at scale and > this was a way of retrieving the data for our accounting reports. We > created a special policy configuration to allow authorised

Re: [Openstack-operators] Murano in Production

2016-09-26 Thread Joe Topjian
Hi Serg, We were indeed hitting that bug, but the cert wasn't self-signed. It was easier for us to manually patch the Ubuntu Cloud package of Murano with the stable/mitaka fix linked in that bug report than trying to debug where OpenSSL/python/requests/etc was going awry. We might redeploy

Re: [Openstack-operators] Murano in Production

2016-09-23 Thread Joe Topjian
Hi Serg, Thank you for sharing this information :) If I'm understanding correctly, the main reason you're using a non-clustered / corosync setup is because that's how most other components in Mirantis OpenStack are configured? Is there anything to be aware of in how Murano communicates over the

Re: [Openstack-operators] Murano in Production

2016-09-18 Thread Joe Topjian
<sorri...@gmail.com> wrote: > You could also use https://www.rabbitmq.com/maxlength.html to mitigate > overflowing on the trove vhost side. > > > Sam > > > On 19 Sep 2016, at 1:07 PM, Joe Topjian <j...@topjian.net> wrote: > > Thanks for everyone's in

Re: [Openstack-operators] Murano in Production

2016-09-18 Thread Joe Topjian
than happy to report failure. :) On Sun, Sep 18, 2016 at 7:38 PM, Silence Dogood <m...@nycresistor.com> wrote: > I'd love to see your results on this . Very interesting stuff. > > On Sep 17, 2016 1:37 AM, "Joe Topjian" <j...@topjian.net> wrote: > >> Hi al

[Openstack-operators] Murano in Production

2016-09-16 Thread Joe Topjian
Hi all, We're planning to deploy Murano to one of our OpenStack clouds and I'm debating the RabbitMQ setup. For background: the Murano agent that runs on instances requires access to RabbitMQ. Murano is able to be configured with two RabbitMQ services: one for traditional OpenStack communication

Re: [Openstack-operators] cURL call at the end of provisioning and deprovisioning

2016-08-15 Thread Joe Topjian
Hi Will, What about notification events? I'm not sure what the best source of documentation is for events, but googling "OpenStack Notification Events" yields a bunch of information that should lead you in the right direction. With events, you would write a custom scripts / daemon that polls

Re: [Openstack-operators] PCI Passthrough issues

2016-07-19 Thread Joe Topjian
Hi Blair, We only updated qemu. We're running the version of libvirt from the Kilo cloudarchive. We've been in production with our K80s for around two weeks now and have had several users report success. Thanks, Joe On Tue, Jul 19, 2016 at 5:06 PM, Blair Bethwaite

Re: [Openstack-operators] How to create floating ip pool use nova network? thanks

2016-07-07 Thread Joe Topjian
In Kilo (I haven't verified Liberty or Mitaka), you can manage nova-network floating IP pools with: nova-manage floating --help nova-manage floating create --help Hope that helps! Joe On Wed, Jul 6, 2016 at 8:23 PM, 云淡风轻 <821696...@qq.com> wrote: > hi everyone, > > How to create floating ip

Re: [Openstack-operators] PCI Passthrough issues

2016-07-06 Thread Joe Topjian
Hi Jon, We were also running into issues with the K80s. For our GPU nodes, we've gone with a 4.2 or 4.4 kernel. PCI Passthrough works much better in those releases. (I ran into odd issues with 4.4 and NFS, downgraded to 4.2 after a few hours of banging my head, problems went away, not a

Re: [Openstack-operators] How are folks providing GPU instance types?

2016-05-11 Thread Joe Topjian
Just wanted to add a few notes (I apologize for the brevity): * The wiki page is indeed the best source of information to get started. * I found that I didn't have to use EFI-based images. I wonder why that is? * PCI devices and IDs can be found by running the following on a compute node: $

Re: [Openstack-operators] RAID / stripe block storage volumes

2016-03-07 Thread Joe Topjian
> 1: > https://support.rackspace.com/how-to/configuring-a-software-raid-on-a-linux-general-purpose-cloud-server/ > 2: https://support.rackspace.com/how-to/cloud-block-storage-faq/ > 3: > https://azure.microsoft.com/en-us/documentation/articles/virtual-machines-linux-configure-raid/ > 4: https://wiki.ope

Re: [Openstack-operators] RAID / stripe block storage volumes

2016-03-06 Thread Joe Topjian
-configure-raid/ 4: https://wiki.openstack.org/wiki/LibvirtVirtioScsi On Mon, Feb 8, 2016 at 7:18 PM, Joe Topjian <j...@topjian.net> wrote: > Yep. Don't get me wrong -- I agree 100% with everything you've said > throughout this thread. Applications that have native replication are > awesome.

Re: [Openstack-operators] [kolla] Question about how Operators deploy

2016-02-12 Thread Joe Topjian
2 VIPs as well. On Fri, Feb 12, 2016 at 8:27 AM, Matt Fischer wrote: > We also use 2 VIPs. public and internal, with admin being a CNAME for > internal. > > On Fri, Feb 12, 2016 at 7:28 AM, Fox, Kevin M wrote: > >> We usually use two vips. >> >>

Re: [Openstack-operators] User Survey - Deadline Feb 24th

2016-02-09 Thread Joe Topjian
Isn't it similar to asking if you use Fedora, CentOS, or RHEL? My understanding is that Juniper offers a paid/supported version of Contrail while OpenContrail is the open source version. On Tue, Feb 9, 2016 at 2:50 AM, Edgar Magana wrote: > Tom, > > For the "Which

Re: [Openstack] [Openstack-operators] User Survey - Deadline Feb 24th

2016-02-09 Thread Joe Topjian
Isn't it similar to asking if you use Fedora, CentOS, or RHEL? My understanding is that Juniper offers a paid/supported version of Contrail while OpenContrail is the open source version. On Tue, Feb 9, 2016 at 2:50 AM, Edgar Magana wrote: > Tom, > > For the "Which

Re: [Openstack-operators] User Survey - Deadline Feb 24th

2016-02-09 Thread Joe Topjian
the case. There is not Enterprise version of OpenContrail. > Therefore, this question could split the responses because it is confusing. > > Edgar > > From: Joe Topjian <j...@topjian.net> > Date: Tuesday, February 9, 2016 at 8:17 AM > To: Edgar Magana <edgar.m

Re: [Openstack] [Openstack-operators] User Survey - Deadline Feb 24th

2016-02-09 Thread Joe Topjian
the case. There is not Enterprise version of OpenContrail. > Therefore, this question could split the responses because it is confusing. > > Edgar > > From: Joe Topjian <j...@topjian.net> > Date: Tuesday, February 9, 2016 at 8:17 AM > To: Edgar Magana <edgar.m

Re: [Openstack-operators] RAID / stripe block storage volumes

2016-02-08 Thread Joe Topjian
s> wrote: > >> I have not run into anyone replicating volumes or creating redundancy at >> the VM level (beyond, as you point out, HDFS, etc.). >> >> R >> >> On Mon, Feb 8, 2016 at 6:54 PM, Joe Topjian <j...@topjian.net> wrote: >> >>> This

Re: [Openstack-operators] RAID / stripe block storage volumes

2016-02-08 Thread Joe Topjian
always recommended providing multiple underlying storage services to > provide this rather than adding the overhead to the VM. So, not in any of > my systems or any I've worked with. > > R > > > > On Fri, Feb 5, 2016 at 5:56 PM, Joe Topjian <j...@topjian.net> wrote:

Re: [Openstack-operators] RAID / stripe block storage volumes

2016-02-08 Thread Joe Topjian
anslate that same >>> idea to block storage. Rather than have a large HA storage cluster (whether >>> Ceph, Gluster, NetApp, etc), is it possible to offer simple single LVM >>> volume servers and push the failure handling on to the user? >>> >>>

[Openstack-operators] RAID / stripe block storage volumes

2016-02-05 Thread Joe Topjian
Hello, Does anyone have users RAID'ing or striping multiple block storage volumes from within an instance? If so, what was the experience? Good, bad, possible but with caveats? Thanks, Joe ___ OpenStack-operators mailing list

Re: [Openstack-operators] Storage backend for glance

2016-01-27 Thread Joe Topjian
Yup, it's definitely possible. All Glance nodes will need to share the same database as well as the same file system. Common ways of sharing the file system are to mount /var/lib/glance/images either from NFS (like you mentioned) or Gluster. I've done both in the past with no issues. The usual

Re: [Openstack-operators] Galera setup testing

2015-12-11 Thread Joe Topjian
We do something similar: Instead of McRouter, we use the repcached patches to replicate data between two memcached nodes. We then use HAProxy as a single entry point for memcached requests. We've been doing this for 6+ months and it's been working great. It's effectively solved the issue I

Re: [Openstack-operators] Cinder API with multiple regions not working.

2015-12-11 Thread Joe Topjian
fd74ec3b7fb6647ea661686 | regionOne | glance | image > | > > +--+---+--+--+ > > > > But different names are working perfectly fine with glance and nova. > > Anyways thanks for all your time and

Re: [Openstack-operators] Cinder API with multiple regions not working.

2015-12-11 Thread Joe Topjian
> > —— cinder.conf > [default] > os_region_name = regionOne > —— > > Regards.. > Salman. > > > > > > PhD, Scientific Computing > Researcher, IT Department, > Uppsala University. > Senior Cloud Architect, > SNIC. > Cloud Application Expert, > UPPMA

Re: [Openstack-operators] Cinder API with multiple regions not working.

2015-12-09 Thread Joe Topjian
Hi Salman, Someone mentioned this same issue yesterday in relation to Terraform (maybe a colleague of yours?), so given the two occurrences, I thought I'd look into this. I have a Liberty environment readily available, so I created a second set of volume and volumev2 endpoints for a fictional

Re: [Openstack-operators] Hypervisor Tuning Guide

2015-12-08 Thread Joe Topjian
ore formal schedule and would like to lead that effort, please get in contact with me. Thanks, Joe 1: https://wiki.openstack.org/wiki/Documentation/HypervisorTuningGuide 2: https://wiki.openstack.org/wiki/Documentation/HypervisorTuningGuide#How_to_Contribute On Tue, Oct 27, 2015 at 9:02 PM, Joe

[Openstack-operators] Horizon Kilo bug with nova-network and floating IPs

2015-11-27 Thread Joe Topjian
Hi all, I recently came across this bug and thought I'd share it for anyone else running a similar environment: https://bugs.launchpad.net/horizon/+bug/1520071 Joe ___ OpenStack-operators mailing list OpenStack-operators@lists.openstack.org

Re: [Openstack-operators] instances floating IPs not reachable while restarting nova-network

2015-11-26 Thread Joe Topjian
Yup, this is expected. It happens for both single-host and multi-host. With the former, we have an older environment where it takes around 10 minutes for all network access to resume. That's with a few hundred tenants, a few hundred vlans, and a few hundred floating IPs all on one host, though.

Re: [Openstack-operators] Router associated with multiple l3 agents

2015-11-24 Thread Joe Topjian
Hi Matt, > It's also weird that we've only seen this when the environment has been > built using terraform. This particular customer re-creates the issue every > time they rebuild. > I work on the OpenStack support for Terraform, so I might be able to help with this. Could you provide an example

Re: [Openstack-operators] [Nova] Question about starting nova as service versus directly

2015-11-20 Thread Joe Topjian
> Yes, most likely is related to permissions. Another good source of > information for troubleshooting is /var/log/upstart/nova-compute.log > Ah yes! Much easier. ___ OpenStack-operators mailing list OpenStack-operators@lists.openstack.org

Re: [Openstack-operators] [Nova] Question about starting nova as service versus directly

2015-11-19 Thread Joe Topjian
Hi Adam, I've seen this happen due to permission issues. Regardless of running with sudo, upstart is dropping to the "nova" user. I usually debug this by setting a shell on the nova user, sudoing/su'ing to nova, then running nova-compute from there. It should die with an error message of the

Re: [Openstack-operators] [openstack-operators][osops] tools-contrib is open for business!

2015-11-19 Thread Joe Topjian
Thanks, JJ! It looks like David Wahlstrom submitted a script and there's a question about license. https://review.openstack.org/#/c/247823/ Though contributions to contrib do not have to follow a certain coding style, can be very lax on error handling, etc, should they at least mention a

Re: [Openstack-operators] [openstack-operators][osops] tools-contrib is open for business!

2015-11-19 Thread Joe Topjian
:31 PM, "Matt Fischer" <m...@mattfischer.com> wrote: > >> Is there a reason why we can't license the entire repo with Apache2 and >> if you want to contribute you agree to that? Otherwise it might become a >> bit of a nightmare. Or maybe at least do "A

Re: [Openstack-operators] OPs Midcycle location discussion.

2015-11-16 Thread Joe Topjian
+1 Option 1 On Mon, Nov 16, 2015 at 10:01 AM, Jonathan Proulx wrote: > > Let me restate the question a bit as I think I'm hearing two different > responses that may be getting conflated. > > Option 1: There's a single Ops Midcycle that shifts around and we > look at ways to

Re: [Openstack-operators] OpenStack Tuning Guide

2015-11-04 Thread Joe Topjian
Hi Kevin, Oops, noticed I didn't reply to all the first time. I think it's great to see more people who want to collect and distil knowledge like this. :) Finally, I hate diverging resources, so if something like this already > exists please speak up so we can focus our efforts on making sure

Re: [Openstack-operators] Informal Ops Meetup?

2015-10-29 Thread Joe Topjian
We're currently in the Prince room by the projector. On Fri, Oct 30, 2015 at 10:07 AM, Edgar Magana wrote: > Where are you meeting? > > Edgar > > From: "Kris G. Lindgren" > Date: Thursday, October 29, 2015 at 6:37 AM > To: Sam Morrison

Re: [Openstack-operators] Informal Ops Meetup?

2015-10-29 Thread Joe Topjian
Hi Kris, I'll be around and am interested. On Thu, Oct 29, 2015 at 4:39 PM, Kris G. Lindgren wrote: > Hello all, > > I am not sure if you guys have looked at the schedule for Friday… but its > all working groups. I was talking with a few other operators and the idea >

[Openstack-operators] Hypervisor Tuning Guide

2015-10-27 Thread Joe Topjian
We had a great Hypervisor Tuning Guide session yesterday! We agreed on an initial structure to the guide that will include four core sections (CPU, Memory, Network, and Disk) and common subsections to each. The etherpad[1] has this structure defined and during the session, we went through and

Re: [Openstack-operators] [openstack-operators][osops] Something other than NOOP in our jenkins tests

2015-09-29 Thread Joe Topjian
quot;dumping ground" that people could share what they had. And a curated one, > where everything within the curated repo follows a standard set of > conventions/guidelines. > > _______ > Kris Lindgren > Senior Linux Sys

Re: [Openstack-operators] Tokyo Summit Ops Design Summit Tracks - Draft Agenda

2015-09-22 Thread Joe Topjian
Hi Tom, As luck would have it, the sole session I'm doing in the Main Conference clashes with the current time slot for Hypervisor Tuning. I see that Tim is also listed as a moderator for that session, so I'm more than happy to let him run it. However, if it's not asking much, I'd really like to

Re: [Openstack-operators] [openstack-operators] Announcement! We have everything ready to get to Stackforge!

2015-09-02 Thread Joe Topjian
Hi JJ, Thank you for putting all of this together! All, one decision that was made during the PAO Ops Meetup was to make the OSOps github repos "official" repos / projects. If you've contributed to the existing repos, you may have an interest in this. As well, if you

Re: [Openstack-operators] Draft Agenda for PAO Ops Meetup (August 18, 19)

2015-08-13 Thread Joe Topjian
Hi Tom, On Thu, Aug 13, 2015 at 2:08 AM, Tom Fifield t...@openstack.org wrote: Hi all, We're still lacking moderators for the following sessions - any takers? 1. Hypervisor Tuning - General Session (have a backup, but would like a primary) I'm happy to be the primary on this -- not a

Re: [Openstack-operators] Palo Alto Midcycle - agenda brainstorming

2015-07-18 Thread Joe Topjian
Hi Tom, The list of General Session ideas is definitely shorter than past meetups, but maybe that's a good sign! It could be that past burning topics have been acknowledged and handled. If that's the case, does anyone have thoughts about extending the length of Working Group sessions so there's

Re: [Openstack-operators] FAiled to create instance wiht openstack nova network

2015-07-13 Thread Joe Topjian
Hello, According to nova.conf, you're running nova-network in multi-host mode. Just to be verbose: if your OpenStack installation is an all-in-one or if you intend for all network traffic to go through the cloud controller, this setting should be changed to false. The error message is reporting:

Re: [Openstack-operators] Scaling the Ops Meetup

2015-06-30 Thread Joe Topjian
Hi Tom, I think this is a great problem to have. Difficult to solve, but it shows how popular / important these meetups are. I'm definitely in favor of a no booths type meetup. I feel if a company wants to sponsor, they're doing it out of good will and any recognition would come from that. I'd

Re: [Openstack-operators] Instance memory overhead

2015-06-23 Thread Joe Topjian
In addition to what Kris said, here are two other ways to see memory usage of qemu processes: The first is with nova diagnostics uuid. By default this is an admin-only command. The second is by running virsh dommemstat instance-id directly on the compute node. Note that it's possible for the

Re: [Openstack-operators] 100% CPU and hangs if syslog is restarted

2015-05-28 Thread Joe Topjian
Hello, Yeah, I ran into it last fall: http://www.gossamer-threads.com/lists/openstack/operators/41876 Good to know that this issue still exists in Juno (we're still on Icehouse). Thanks for the note. :) Joe On Thu, May 28, 2015 at 10:56 AM, George Shuklin george.shuk...@gmail.com wrote:

Re: [Openstack-operators] How do your end users use networking?

2015-05-22 Thread Joe Topjian
Hi Kris, Busy week! It was good seeing you in Vancouver - even if it was just in passing on the escalator ;) It is always nice to see that other people are doing the same things as you or see the same issues as you are and that you are not crazy. +100 Would it be accurate to say that

Re: [Openstack-operators] Venom vulnerability

2015-05-13 Thread Joe Topjian
Hello, Looking through the details of the Venom vulnerability, https://securityblog.redhat.com/2015/05/13/venom-dont-get-bitten/, it would appear that the QEMU processes need to be restarted. Our understanding is thus that a soft reboot of the VM is not sufficient but a hard one would be

Re: [Openstack-operators] Venom vulnerability

2015-05-13 Thread Joe Topjian
Looks like the updated Ubuntu packages are available: http://www.ubuntu.com/usn/usn-2608-1/ On Wed, May 13, 2015 at 10:44 AM, Matt Van Winkle mvanw...@rackspace.com wrote: Yeah, something like that would be handy. From: matt m...@nycresistor.com Date: Wednesday, May 13, 2015 10:29 AM

[Openstack-operators] Federation Ops Session at the Vancouver Summit

2015-05-12 Thread Joe Topjian
Hello, Following suit of the other posts, this is an announcement / reminder of the Federation Ops Session happening next Tuesday: http://sched.co/3BBs The etherpad for the session is here: https://etherpad.openstack.org/p/YVR-ops-federation I encourage everyone to add items they'd like to

Re: [Openstack] Routing from instances to floating ips in nova-network -- possible?

2015-05-07 Thread Joe Topjian
Hello, I'm glad to hear that it works for you! I just now tried setting force_snat_range for my floating IP range but I'm still not getting any pings. Strangely if I restart nova-network things work for a minute or two, then return to the status quo. That means that no matter what I

Re: [Openstack] HAProxy -MySQL Galera not connecting to keystone

2015-05-07 Thread Joe Topjian
Hello, Do you have HAProxy configured to round-robin the MySQL traffic? If so, change the balance config to something where the source connection will always hit the same mysql node. Or just make one of the mysql nodes a designated write node by setting the opposite node as backup. This will

Re: [Openstack] [openstack-dev] Question about multi-host mode while using nova-network

2015-05-07 Thread Joe Topjian
Hello, If the nova-network service is down, then only actions that would involve nova-network (creating and terminating instances for example) won't work. Instances that are already running will still be able to communicate with both the outside network and other instances in the cloud. You can

Re: [Openstack] [keystone] Multi-region with horizon

2015-05-04 Thread Joe Topjian
On Mon, May 4, 2015 at 12:56 PM, Adam Young ayo...@redhat.com wrote: On 05/04/2015 10:23 AM, rémi Le trocquer wrote: Hi, In multi-region configuration : multi keystone, multi database but with a common ldap. Is-it possible on Horizon to switch region without re-authenticate ? Horizon

Re: [Openstack] [nova] Cleaning up unused images in the cache

2015-04-28 Thread Joe Topjian
Hello, I've got a similar question about cache-manager and the presence of a shared filesystem for instances images. I'm currently reading the source code in order to find out how this is managed but before I would be curious how you achieve this on production servers. For example images

Re: [Openstack-operators] [Openstack] [nova] Cleaning up unused images in the cache

2015-04-28 Thread Joe Topjian
Hello, I've got a similar question about cache-manager and the presence of a shared filesystem for instances images. I'm currently reading the source code in order to find out how this is managed but before I would be curious how you achieve this on production servers. For example images

[Openstack-operators] Windows Instances and Volumes

2015-04-28 Thread Joe Topjian
Hello, I'm wondering if anyone has best practices for Windows-based instances that make heavy use of volumes? I have a user who was running SQL Server off of an iSCSI-based volume. We did a live-migration of the instance and that seemed to have caused Windows to drop the drive. Disk Manager

Re: [Openstack-operators] nova rescue

2015-03-29 Thread Joe Topjian
That's all very useful advise -- thank you. :) On Sun, Mar 29, 2015 at 8:39 PM, gustavo panizzo (gfa) g...@zumbi.com.ar wrote: On 03/29/2015 11:19 AM, Joe Topjian wrote: Hello, Without specifying a rescue image, Nova will use the image that the instance is based on when performing

[Openstack-operators] nova rescue

2015-03-28 Thread Joe Topjian
Hello, Without specifying a rescue image, Nova will use the image that the instance is based on when performing a rescue. I've noticed that this is problematic for cloud-friendly images such as the official Ubuntu images and the newer CentOS 7 images. I'm finding that /dev/vdb still ends up

Re: [Openstack-operators] FYI: Rabbit Heartbeat Patch Landed

2015-03-20 Thread Joe Topjian
We have other supporting services that require RabbitMQ and since they only accept a single host in their connection config, we need a more reliable way for them to connect. Those services work just fine with HAProxy/RabbitMQ. The OpenStack HA guide

[Openstack-operators] Live migration of instances with iscsi volumes

2015-03-19 Thread Joe Topjian
Hello, I just resolved an issue where migrating instances with iSCSI volumes would occasionally fail. There's a bug report here: https://bugs.launchpad.net/nova/+bug/1423772 The core cause ended up being libvirt transferring the volume paths verbatim. For example, take the situation where:

Re: [Openstack] Neutron vs. FlatDHCP -- what's the latest?

2015-03-19 Thread Joe Topjian
, that should be on par with nova-network's FlatDHCP. 1. https://review.openstack.org/#/c/163728/ On Fri, Jan 16, 2015 at 7:12 AM, Joe Topjian j...@topjian.net wrote: Tenants can launch on the shared network. The issue is with floating IP addresses: when the tenant goes to associate a floating IP

Re: [Openstack] iptables chain and instance id

2015-03-18 Thread Joe Topjian
The number is the ID of the instance in the nova.instances table: mysql select id from instances where uuid = '9927550c-5950-4daf-9f05-0530e51d36c7'; +---+ | id| +---+ | 19437 | +---+ $ iptables-save | grep 19437 :nova-compute-inst-19437 - [0:0] -A nova-compute-inst-19437 -m

Re: [Openstack] nova boot does not create nic into vm

2015-03-12 Thread Joe Topjian
Hello, I don't know the exact reason you're seeing this issue, but there are a few pointers/suggestions: * Try to set the nova.conf settings the same across both the controller and compute node. Although you're not using nova-network on your controller, doing this can't hurt. So for your

[Openstack-operators] Ops Meetup Monitoring/Tools Session

2015-03-04 Thread Joe Topjian
Hi all, I'll be moderating the Monitoring/Tools session at next week's Ops Meetup. The etherpage is here: https://etherpad.openstack.org/p/PHL-ops-tools-wg Please add items you'd like to see covered. So far, the general topics will be: * Discussion of Monasca, StackTach, and related tools.

  1   2   >