Erik Shilts created HIVE-3711:
---------------------------------
Summary: Create UDAF to calculate an array of Benford's Law
Key: HIVE-3711
URL: https://issues.apache.org/jira/browse/HIVE-3711
Project: Hive
Issue Type: New Feature
Components: UDF
Reporter: Erik Shilts
Priority: Minor
Benford's Law is a useful analytical tool to determine if a number was
generated with a random process by evaluating the relative proportions of the
leading digit. It can be used to detect accounting, financial, and election
fraud.
[Wikipedia's|http://en.wikipedia.org/wiki/Benford's_law] Benford's Law page has
a good overview.
Hive is well suited to calculate Benford's Law. The result should be a named
struct with names 1-9 and values being the corresponding proportions of each
digit.
An alternative is to calculate the deviations from Benford's Law for each
digit. The structure of the resulting array would be the same, but the result
would be the difference between the actual proportions and the proportions
given the by
[formula|http://en.wikipedia.org/wiki/Benford's_law#Mathematical_statement] on
Wikipedia.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira