Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
On 27-07-15 11:52, Daan Hoogland wrote: gogogo The PR is out there: https://github.com/apache/cloudstack/pull/625 On Mon, Jul 27, 2015 at 10:28 AM, Wido den Hollander w...@widodh.nl wrote: Hi, It has been quite some time and this feature hasn't merged yet. I created a issue for this: https://issues.apache.org/jira/browse/CLOUDSTACK-8677 The code is ready to merge and is in the reporter [0] branch on the Git repo. Even if the VM isn't ready, I would like to merge this feature into 4.6 and get a VERY CLEAR doc into the Release Notes. Without Infra to send stats to, it seems like this feature is worthless, but it isn't. Since we got the code running, so as soon as we get call-home.cloudstack.org online we are good. Merging would just be sending a PR, before I do so, any objections? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter On 01-12-14 14:08, Wido den Hollander wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
Hi, It has been quite some time and this feature hasn't merged yet. I created a issue for this: https://issues.apache.org/jira/browse/CLOUDSTACK-8677 The code is ready to merge and is in the reporter [0] branch on the Git repo. Even if the VM isn't ready, I would like to merge this feature into 4.6 and get a VERY CLEAR doc into the Release Notes. Without Infra to send stats to, it seems like this feature is worthless, but it isn't. Since we got the code running, so as soon as we get call-home.cloudstack.org online we are good. Merging would just be sending a PR, before I do so, any objections? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter On 01-12-14 14:08, Wido den Hollander wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
gogogo On Mon, Jul 27, 2015 at 10:28 AM, Wido den Hollander w...@widodh.nl wrote: Hi, It has been quite some time and this feature hasn't merged yet. I created a issue for this: https://issues.apache.org/jira/browse/CLOUDSTACK-8677 The code is ready to merge and is in the reporter [0] branch on the Git repo. Even if the VM isn't ready, I would like to merge this feature into 4.6 and get a VERY CLEAR doc into the Release Notes. Without Infra to send stats to, it seems like this feature is worthless, but it isn't. Since we got the code running, so as soon as we get call-home.cloudstack.org online we are good. Merging would just be sending a PR, before I do so, any objections? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter On 01-12-14 14:08, Wido den Hollander wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter -- Daan
RE: [DISCUSS] Automatic usage reporting / call-home for CloudStack
Hi Wido I recently had a thought on this. Although I would be loathed that anybody has to register ACS when they use it, it would be good if we could differentiate between production deployments and lab/bench deployments. From the marketing perspective, we've got this crazy situation where we KNOW that there are many 100's of people running ACS but still only 70 people told us about their use in the last survey. The problem is, if we don’t differentiate between prod/lab - then the data you collect will be padded with people running it in their home labs. I'm not sure how we can go about doing this. The simplest thing would be a global setting, but where would the incentive be for people to change that setting when they put something into production ? Another approach, which some may think controversial, would be that we do an about screen/page that allows people to change this setting. This screen could display a summary of version, #pods, clusters,etc (all the standard stuff people need to find out about their system) - so it would be useful for people It could also contain info on how to join/access the community/ML's for support and (most controversially) it could give people the option to put in some basic company information if they wished. I totally understand that the last bit may be a thought too far some some people Kind Regards Giles D: +44 20 3603 0541 | M: +44 796 111 2055 giles.sir...@shapeblue.com -Original Message- From: Wido den Hollander [mailto:w...@widodh.nl] Sent: 01 December 2014 18:08 To: dev@cloudstack.apache.org Subject: Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack On 12/01/2014 06:40 PM, Rohit Yadav wrote: +1 Wido, will the statistics be accessible online for everyone (like real-time) or will the PMC release aggregated data periodically? I was thinking to keep the data PMC-only to start with and indeed release aggregated data periodically. On Monday 01 December 2014 06:38 PM, Wido den Hollander wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h =refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=rep orter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff55 20855;hb=reporter -- Regards, Rohit Yadav Software Architect, ShapeBlue M. +41 779015219 | rohit.ya...@shapeblue.com Blog: bhaisaab.org | Twitter: @_bhaisaab Find out more about ShapeBlue and our range of CloudStack related
[DISCUSS] Automatic usage reporting / call-home for CloudStack
Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
On Dec 1, 2014, at 8:08 AM, Wido den Hollander w...@widodh.nl wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? We should ask users@ as well. I am +1 on this, Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
On Mon, Dec 1, 2014 at 2:15 PM, Sebastien Goasguen run...@gmail.com wrote: On Dec 1, 2014, at 8:08 AM, Wido den Hollander w...@widodh.nl wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? We should ask users@ as well. I am +1 on this, As a cloudstack user I don't have any problem with this, +1. -- Erik
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
+1 On 1 December 2014 at 14:31, Erik Weber terbol...@gmail.com wrote: On Mon, Dec 1, 2014 at 2:15 PM, Sebastien Goasguen run...@gmail.com wrote: On Dec 1, 2014, at 8:08 AM, Wido den Hollander w...@widodh.nl wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? We should ask users@ as well. I am +1 on this, As a cloudstack user I don't have any problem with this, +1. -- Erik -- Andrija Panić
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
On 12/01/2014 02:15 PM, Sebastien Goasguen wrote: On Dec 1, 2014, at 8:08 AM, Wido den Hollander w...@widodh.nl wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? We should ask users@ as well. Yes, that is probably a good idea. First I'd like to have the politics set before we take it to users. In the end I'd like to see a VOTE coming up for this feature to go into CloudStack. Wido I am +1 on this, Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
+1, good job! -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Wido den Hollander w...@widodh.nl To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 13:08:02 Subject: [DISCUSS] Automatic usage reporting / call-home for CloudStack Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
On 12/01/2014 04:00 PM, Nux! wrote: In addition: this should be opt-out, but clearly specified in the release notes, as well as visible in the UI. But where do we display this in the UI? I don't think we have a mechanism to show a one-time message after a upgrade. Do we? -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Nux! n...@li.nux.ro To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 13:52:35 Subject: Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack +1, good job! -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Wido den Hollander w...@widodh.nl To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 13:08:02 Subject: [DISCUSS] Automatic usage reporting / call-home for CloudStack Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
Hm, after an upgrade it might be tricky. Perhaps display it here somehow http://img.nux.ro/3hH-callhome.png Perhaps some UI/javascript specialist can help us here ... Lucian -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Wido den Hollander w...@widodh.nl To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 15:05:55 Subject: Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack On 12/01/2014 04:00 PM, Nux! wrote: In addition: this should be opt-out, but clearly specified in the release notes, as well as visible in the UI. But where do we display this in the UI? I don't think we have a mechanism to show a one-time message after a upgrade. Do we? -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Nux! n...@li.nux.ro To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 13:52:35 Subject: Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack +1, good job! -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Wido den Hollander w...@widodh.nl To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 13:08:02 Subject: [DISCUSS] Automatic usage reporting / call-home for CloudStack Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter
RE: [DISCUSS] Automatic usage reporting / call-home for CloudStack
1+ from a user -Original Message- From: Wido den Hollander [mailto:w...@widodh.nl] Sent: Monday, December 01, 2014 10:21 AM To: dev@cloudstack.apache.org Subject: Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack On 12/01/2014 04:12 PM, Nux! wrote: Hm, after an upgrade it might be tricky. Perhaps display it here somehow http://img.nux.ro/3hH-callhome.png Perhaps some UI/javascript specialist can help us here ... Good idea. In that Menu we probably need a Call-Home link pointing to http://cloudstack.apache.org/callhome.html (doesn't exist yet!) Wido Lucian -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Wido den Hollander w...@widodh.nl To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 15:05:55 Subject: Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack On 12/01/2014 04:00 PM, Nux! wrote: In addition: this should be opt-out, but clearly specified in the release notes, as well as visible in the UI. But where do we display this in the UI? I don't think we have a mechanism to show a one-time message after a upgrade. Do we? -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Nux! n...@li.nux.ro To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 13:52:35 Subject: Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack +1, good job! -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro - Original Message - From: Wido den Hollander w...@widodh.nl To: dev@cloudstack.apache.org Sent: Monday, 1 December, 2014 13:08:02 Subject: [DISCUSS] Automatic usage reporting / call-home for CloudStack Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlo g;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f= reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460 cfff5520855;hb=reporter
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
+1 Wido, will the statistics be accessible online for everyone (like real-time) or will the PMC release aggregated data periodically? On Monday 01 December 2014 06:38 PM, Wido den Hollander wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter -- Regards, Rohit Yadav Software Architect, ShapeBlue M. +41 779015219 | rohit.ya...@shapeblue.com Blog: bhaisaab.org | Twitter: @_bhaisaab Find out more about ShapeBlue and our range of CloudStack related services IaaS Cloud Design Buildhttp://shapeblue.com/iaas-cloud-design-and-build// CSForge – rapid IaaS deployment frameworkhttp://shapeblue.com/csforge/ CloudStack Consultinghttp://shapeblue.com/cloudstack-consultancy/ CloudStack Software Engineeringhttp://shapeblue.com/cloudstack-software-engineering/ CloudStack Infrastructure Supporthttp://shapeblue.com/cloudstack-infrastructure-support/ CloudStack Bootcamp Training Courseshttp://shapeblue.com/cloudstack-training/ This email and any attachments to it may be confidential and are intended solely for the use of the individual to whom it is addressed. Any views or opinions expressed are solely those of the author and do not necessarily represent those of Shape Blue Ltd or related companies. If you are not the intended recipient of this email, you must neither take any action based upon its contents, nor copy or show it to anyone. Please contact the sender if you believe you have received this email in error. Shape Blue Ltd is a company incorporated in England Wales. ShapeBlue Services India LLP is a company incorporated in India and is operated under license from Shape Blue Ltd. Shape Blue Brasil Consultoria Ltda is a company incorporated in Brasil and is operated under license from Shape Blue Ltd. ShapeBlue SA Pty Ltd is a company registered by The Republic of South Africa and is traded under license from Shape Blue Ltd. ShapeBlue is a registered trademark.
Re: [DISCUSS] Automatic usage reporting / call-home for CloudStack
On 12/01/2014 06:40 PM, Rohit Yadav wrote: +1 Wido, will the statistics be accessible online for everyone (like real-time) or will the PMC release aggregated data periodically? I was thinking to keep the data PMC-only to start with and indeed release aggregated data periodically. On Monday 01 December 2014 06:38 PM, Wido den Hollander wrote: Hello, As a project we currently don't have a lot of insight information on about how CloudStack is being used. Surveys tell us a lot, but not everybody fills in the survey, so we still miss a lot of information. That's why I've written the Usage Reporting functionality for the management server which automatically sends back anonymous information about a CloudStack deployment. It's currently in the 'reporter' branch. [0] By default, every 7 days it generates a JSON document with: - Hosts (Number, version, type, hypervisor) - Clusters (Hypervisor en Management type) - Primary storage (Type and provider) - Zones (Network type and providers) - Instances (Number and types) This report is not complete yet, I'd like to add more information, but that will be Management Server information. The code on how this report is generated is obviously 100% Open Source, so end-users can always exactly see how the information was compiled. I want to discuss this new feature for CloudStack and the possible implications it might have. I'm opting for a opt-out. So every new or upgraded install to 4.6.0 (master) will have this enabled. Yes, we have to be very explicit in the Release Notes that this has been added. Why? It's the small price we as a project ask for using CloudStack. We want a little bit of information on how CloudStack is being used so that we can use this to make CloudStack even better. Turning it off is also just one global setting and it will never turn on again. On the server-side there is a Python flask application [1] (found in the reporter directory) which stores all the incoming information in a ElasticSearch database. From there analytics can be gathered on CloudStack deployments. It currently points to http://cs-report.widodh.nl/report which will NOT be the endpoint when this is merged into master. For 'production' I want to have https://report.cloudstack.apache.org/report where all reports are submitted. For every setup a unique ID is determined by hashing the first row in the 'version' table. This is the version + timestamp and that is hashed using SHA256. Using this unique ID we can track changes in deployments and see how they grow or shrink. Technically this wasn't that hard to implement, but the politics surrounding it might be the hardest part. What do other have to say about this? Should there be a VOTE for this feature to come into CloudStack? Opt-in, opt-out? Wido [0]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=shortlog;h=refs/heads/reporter [1]: https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;a=blob;f=reporter/usage-report-collector.py;h=500a4d284b5172fd93acea08f5460cfff5520855;hb=reporter -- Regards, Rohit Yadav Software Architect, ShapeBlue M. +41 779015219 | rohit.ya...@shapeblue.com Blog: bhaisaab.org | Twitter: @_bhaisaab Find out more about ShapeBlue and our range of CloudStack related services IaaS Cloud Design Buildhttp://shapeblue.com/iaas-cloud-design-and-build// CSForge – rapid IaaS deployment frameworkhttp://shapeblue.com/csforge/ CloudStack Consultinghttp://shapeblue.com/cloudstack-consultancy/ CloudStack Software Engineeringhttp://shapeblue.com/cloudstack-software-engineering/ CloudStack Infrastructure Supporthttp://shapeblue.com/cloudstack-infrastructure-support/ CloudStack Bootcamp Training Courseshttp://shapeblue.com/cloudstack-training/ This email and any attachments to it may be confidential and are intended solely for the use of the individual to whom it is addressed. Any views or opinions expressed are solely those of the author and do not necessarily represent those of Shape Blue Ltd or related companies. If you are not the intended recipient of this email, you must neither take any action based upon its contents, nor copy or show it to anyone. Please contact the sender if you believe you have received this email in error. Shape Blue Ltd is a company incorporated in England Wales. ShapeBlue Services India LLP is a company incorporated in India and is operated under license from Shape Blue Ltd. Shape Blue Brasil Consultoria Ltda is a company incorporated in Brasil and is operated under license from Shape Blue Ltd. ShapeBlue SA Pty Ltd is a company registered by The Republic of South Africa and is traded under license from Shape Blue Ltd. ShapeBlue is a registered trademark.