[ 
https://issues.apache.org/jira/browse/ATLAS-4878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17855802#comment-17855802
 ] 

ASF subversion and git services commented on ATLAS-4878:
--------------------------------------------------------

Commit 978087a882348f1fc1b6002a0aeb29192d8cc00a in atlas's branch 
refs/heads/master from Madhan Neethiraj
[ https://gitbox.apache.org/repos/asf?p=atlas.git;h=978087a88 ]

ATLAS-4878: utility to analyze hook notifications


> utility to analyze hook notifications
> -------------------------------------
>
>                 Key: ATLAS-4878
>                 URL: https://issues.apache.org/jira/browse/ATLAS-4878
>             Project: Atlas
>          Issue Type: Improvement
>          Components:  atlas-core
>            Reporter: Madhan Neethiraj
>            Assignee: Madhan Neethiraj
>            Priority: Major
>             Fix For: 3.0.0
>
>         Attachments: ATLAS-4878.patch
>
>
> A utility to analyze notifications received from hooks to gather following 
> details will be useful in troubleshooting:
>  # number of notifications per notification type (CREATE, UPDATE, 
> PARTIAL_UPDATE, DELETE, ..)
>  # number of entities referenced in notifications per entity type
>  # number of entity operations performed while processing the notifications 
> (create/update/delete)
>  
> For example, following details by analyzing 114k notifications from Hive hook 
> show that 94% of entities processed are of type hive_column and 
> hive_column_lineage :
> {noformat}
> {
>   "notifications":         114755,
>   "entities":              598435,
>   "notificationEntities": 2575347,
>   "notificationByType": {
>     "ENTITY_CREATE_V2":         49428,
>     "ENTITY_FULL_UPDATE_V2":     1597,
>     "ENTITY_PARTIAL_UPDATE_V2": 36561,
>     "ENTITY_DELETE_V2":         27169
>   },
>   "notificationEntityByType": {
>     "hdfs_path":             16417,
>     "hive_db":               20471,
>     "hive_table":            57143,
>     "hive_storagedesc":      30018,
>     "hive_column":          685384,
>     "hive_process":          41512
>     "hive_column_lineage": 1724402,
>   },
>   "entityOperations": {
>     "CREATE":         598435,
>     "UPDATE":        1913182
>     "PARTIAL_UPDATE":  36561,
>     "DELETE":          27169
>   },
>   "entityOperationsByType": {
>     "CREATE": {
>       "hdfs_path":            10940,
>       "hive_db":                224,
>       "hive_table":           22154,
>       "hive_storagedesc":     15280,
>       "hive_column":         332332,
>       "hive_process":         23462,
>       "hive_column_lineage": 194043
>     },
>     "UPDATE" {
>       "hdfs_path":              5477,
>       "hive_column":          319559,
>       "hive_column_lineage": 1530359,
>       "hive_db":               20203,
>       "hive_process":          18050,
>       "hive_storagedesc":      13204,
>       "hive_table":             6330
>     },
>    "PARTIAL_UPDATE": {
>      "hive_column":      33493,
>      "hive_storagedesc":  1534,
>      "hive_table":        1534
>     },
>     "DELETE": {
>       "hive_db":       44,
>       "hive_table": 27125
>     }
>   }
> } {noformat}
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to