In general we've put statistics-gathering into server rather than client because - it gives uniform data over the entire host population - it puts the data all in one place
Currently these statistics are just the bare essentials: mean and standard deviation of elapsed time, turnaround time, and credit-related quantities. We maintain these per (host, app version) and per app version. We use them to estimate job duration and to compute credit. As you point out, there are many other types of info we could track, and many visualizations that could offered. This is an area were having a few CS grad students working on BOINC would be a big help. -- David On 10-Feb-2014 4:01 PM, Max Power wrote:
Many types of distributed computing applications don't due uniform processing (and reporting on percent done) like SETI, Astropulse or Einstein ... and the biological science applications (and image rendering ones) have taken some time to discipline the reporting of percent done. What the BOINC Client does not do is use the hashsums of computing applications (as sometimes they run in pairs as in Climate Prediction) to form a local knowledge base of -- work unit size (average, median, standard deviation) -- work unit computation length (average, median, standard deviation) -- completed work unit average size (average, median, standard deviation) -- disk use (average, median, standard deviation) -- these could be uplinked to the BOINC design groups and the projects themselves ... as you probably have to do an SQL query to find this stuff out -- THE "STATS" tab is almost totally devoid of usable statistics ... and the ones above relating to runtime are graphable and usable ... I am not saying this will fix the wonky estimated run time problem ... only regular application reporting to the BOINC client will ever do that. However, the averaged knowledge from these parameters could improve it when the daft application is not reporting. MP, DSN @ H -----Original Message----- From: McLeod, John Sent: 10 February 2014 05:48 To: Jon Sonntag ; BOINC Developers Mailing [email protected] Subject: Re: [boinc_dev] Estimated Time Remaining Not all applications report smooth % complete. So the calculation of time remaining involve the initial estimate as well. Given the bad information given for both % complete and initial estimate, there is no method of predicting how much longer the task will take that is completely right. The most reliable appears to be to combine the initial estimate the DCF (if in use for the project) the % complete, and the time spent already (the only really well known item in the list) to come up with an estimate. _______________________________________________ boinc_dev mailing list [email protected] http://lists.ssl.berkeley.edu/mailman/listinfo/boinc_dev To unsubscribe, visit the above URL and (near bottom of page) enter your email address.
_______________________________________________ boinc_dev mailing list [email protected] http://lists.ssl.berkeley.edu/mailman/listinfo/boinc_dev To unsubscribe, visit the above URL and (near bottom of page) enter your email address.
