[ 
https://issues.apache.org/jira/browse/SENTRY-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15982745#comment-15982745
 ] 

Hadoop QA commented on SENTRY-1687:
-----------------------------------

Here are the results of testing the latest attachment
https://issues.apache.org/jira/secure/attachment/12864894/SENTRY-1687.001-sentry-ha-redesign.patch
 against sentry-ha-redesign.

{color:red}Overall:{color} -1 due to 2 errors

{color:red}ERROR:{color} mvn test exited 1
{color:red}ERROR:{color} Failed: 
org.apache.sentry.tests.e2e.hdfs.TestHDFSIntegrationEnd2End

Console output: 
https://builds.apache.org/job/PreCommit-SENTRY-Build/2561/console

This message is automatically generated.

> FullUpdateInitializer can be more efficient
> -------------------------------------------
>
>                 Key: SENTRY-1687
>                 URL: https://issues.apache.org/jira/browse/SENTRY-1687
>             Project: Sentry
>          Issue Type: Sub-task
>          Components: Sentry
>    Affects Versions: sentry-ha-redesign
>            Reporter: Alexander Kolbasov
>            Assignee: Alexander Kolbasov
>         Attachments: SENTRY-1687.001-sentry-ha-redesign.patch
>
>
> The FullUpdateInitializer follows the {{MetastoreCacheInitializer}}. It reads 
> a bunch of information from HMS and uses Thrift structures to pass around, 
> but in the end it just constructs {{Map<String, Set<String>>}}. It uses 
> concurrent fetches from HMS, but synchronizes a lot on common data structures 
> to update them.
> I think that we can refactor all this code to make it faster and consume less 
> memory. The idea is the following:
> Use background threads to collect Thrift results from HMS calls (database, 
> table and partition data). Then we can use a single thread to construct the 
> resulting update and return it without using intermediate Thrift methods.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to