Hi there,
Bug #24064 (submitted by [EMAIL PROTECTED]) requests a standard deviation
function for PHP. I realise that any of you could implement this in 10
minutes, but according to the bug database it is still Open so I figured I
would give it a try myself!
There are probably a dozen errors in the code and/or places where it could
be better optimised, but I'm hoping one of you might be able to help with
that. So, the attached diff file implements the function array_std_dev(),
to calculate standard deviation using the deviation method.
With the function in place, standard deviation is calculated like this:
<?php
$scores = array(18,5,7,18,3,2,10);
print array_std_dev($score);
// prints 6.6833125519211
?>
My first attempt at implementing this was using an extra array to buffer the
deviations - this was more out of curiosity to see how the array stuff
works. Sadly, it caused PHP to segfault and I couldn't figure out why - can
any of you help me spot the brain fart? (I've attached the offending code
in bad_stddev_code.txt)
Yours,
Paul
PS: I'm not on the internals list, so I would appreciate it if you would CC
me on your reply.
/* {{{ proto mixed array_std_dev(array input)
Returns the standard deviation of the array entries */
PHP_FUNCTION(array_std_dev)
{
zval **input,
**entry,
*entry_n,
*squareddev;
int argc = ZEND_NUM_ARGS();
HashPosition pos;
double dval;
double total;
double mean;
int numelements = 0; // note this is calcuated by hand, not using
zend_hash_num_elements()
double deviation;
if (argc != 1 || zend_get_parameters_ex(argc, &input) == FAILURE) {
WRONG_PARAM_COUNT;
}
if (Z_TYPE_PP(input) != IS_ARRAY) {
php_error_docref(NULL TSRMLS_CC, E_WARNING, "The argument should be an
array");
return;
}
ZVAL_LONG(return_value, 0);
total = 0;
// step one: sum the values of the array
for (zend_hash_internal_pointer_reset_ex(Z_ARRVAL_PP(input), &pos);
zend_hash_get_current_data_ex(Z_ARRVAL_PP(input), (void **)&entry,
&pos) == SUCCESS;
zend_hash_move_forward_ex(Z_ARRVAL_PP(input), &pos)) {
if (Z_TYPE_PP(entry) == IS_ARRAY || Z_TYPE_PP(entry) == IS_OBJECT)
continue;
entry_n = *entry;
zval_copy_ctor(entry_n);
convert_scalar_to_number(entry_n TSRMLS_CC);
convert_to_double(entry_n);
total += Z_DVAL_P(entry_n);
// this is incremented by hand so that it doesn't count object and
array elements as an element
numelements++;
}
// step two: calculate the mean of the input array
mean = total / numelements;
// step three: get the deviation from the mean of each number in the input
array, and add it to the squareddev array
MAKE_STD_ZVAL(squareddev);
array_init(squareddev);
for (zend_hash_internal_pointer_reset_ex(Z_ARRVAL_PP(input), &pos);
zend_hash_get_current_data_ex(Z_ARRVAL_PP(input), (void **)&entry,
&pos) == SUCCESS;
zend_hash_move_forward_ex(Z_ARRVAL_PP(input), &pos)) {
if (Z_TYPE_PP(entry) == IS_ARRAY || Z_TYPE_PP(entry) == IS_OBJECT)
continue;
entry_n = *entry;
zval_copy_ctor(entry_n);
convert_scalar_to_number(entry_n TSRMLS_CC);
convert_to_double(entry_n);
Z_DVAL_P(entry_n) -= mean;
zend_hash_next_index_insert(Z_ARRVAL_P(squareddev), entry_n,
sizeof(zval *), NULL);
}
// step four: sum the squared deviation array
total = 0;
for (zend_hash_internal_pointer_reset_ex(Z_ARRVAL_P(squareddev), &pos);
zend_hash_get_current_data_ex(Z_ARRVAL_P(squareddev), (void
**)&entry, &pos) == SUCCESS;
zend_hash_move_forward_ex(Z_ARRVAL_P(squareddev), &pos)) {
entry_n = *entry;
zval_copy_ctor(entry_n);
convert_scalar_to_number(entry_n TSRMLS_CC);
convert_to_double(entry_n);
total += Z_DVAL_P(entry_n);
}
// step five: divide the sum of the squared deviation array by the number of
elements - 1
total /= numelements - 1;
Z_DVAL_P(return_value) = sqrt(total);
}
/* }}} */
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php