[jira] [Commented] (SENTRY-2183) Increase default sentry-hdfs rpc timeout to 20 mins

2018-03-28 Thread Na Li (JIRA)

[ 
https://issues.apache.org/jira/browse/SENTRY-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16417611#comment-16417611
 ] 

Na Li commented on SENTRY-2183:
---

[~arjunmishra13] I agree. Having a relatively large timeout as default would 
not hurt small users, and help avoid issues for large users. Otherwise, the 
large users will encounter issues, and then spend a lot of time to figure out 
that they need to increase this value.

> Increase default sentry-hdfs rpc timeout to 20 mins
> ---
>
> Key: SENTRY-2183
> URL: https://issues.apache.org/jira/browse/SENTRY-2183
> Project: Sentry
>  Issue Type: Improvement
>  Components: Sentry
>Affects Versions: 2.1.0
>Reporter: Arjun Mishra
>Assignee: Arjun Mishra
>Priority: Major
> Attachments: SENTRY-2183.001.patch, SENTRY-2183.002.patch, 
> SENTRY-2183.01.patch
>
>
> Full HMS Snapshots for large amounts of data (millions of databases, or 
> tables or partitions etc) take a lot more time than the current default of 
> 3.3mins. Also ideally since a full HMS snapshot should only occur once it is 
> best to increase the default to 20mins. This highly reduce the chance of 
> failure while sending a full snapshot which could be critical
> Set sentry.hdfs.service.client.server.rpc-connection-timeout = 120



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (SENTRY-2183) Increase default sentry-hdfs rpc timeout to 20 mins

2018-03-28 Thread Arjun Mishra (JIRA)

[ 
https://issues.apache.org/jira/browse/SENTRY-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16417081#comment-16417081
 ] 

Arjun Mishra commented on SENTRY-2183:
--

I think we should go with 20 mins. It doesn't affect performance. If small 
users take 3.3 mins this configuration won't hurt them. Full updates is usually 
large and it makes sense to have a larger default threshold than smaller. Let 
me know what you think.

> Increase default sentry-hdfs rpc timeout to 20 mins
> ---
>
> Key: SENTRY-2183
> URL: https://issues.apache.org/jira/browse/SENTRY-2183
> Project: Sentry
>  Issue Type: Improvement
>  Components: Sentry
>Affects Versions: 2.1.0
>Reporter: Arjun Mishra
>Assignee: Arjun Mishra
>Priority: Major
> Attachments: SENTRY-2183.001.patch, SENTRY-2183.002.patch, 
> SENTRY-2183.01.patch
>
>
> Full HMS Snapshots for large amounts of data (millions of databases, or 
> tables or partitions etc) take a lot more time than the current default of 
> 3.3mins. Also ideally since a full HMS snapshot should only occur once it is 
> best to increase the default to 20mins. This highly reduce the chance of 
> failure while sending a full snapshot which could be critical
> Set sentry.hdfs.service.client.server.rpc-connection-timeout = 120



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (SENTRY-2183) Increase default sentry-hdfs rpc timeout to 20 mins

2018-03-27 Thread JIRA

[ 
https://issues.apache.org/jira/browse/SENTRY-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16415808#comment-16415808
 ] 

Sergio Peña commented on SENTRY-2183:
-

[~arjunmishra13] After thinking this for a while, I wonder if we really need 
this change? The timeout will vary based on the amount of data users use. For 
small users, 3.3min is enough, for large users, 20 or 30min is enough, for 
others 10min is enough, so being this a configuration value that can be 
customized on the sentry-site.xml file, then I don't think we should try to 
change the value. What do you think?

> Increase default sentry-hdfs rpc timeout to 20 mins
> ---
>
> Key: SENTRY-2183
> URL: https://issues.apache.org/jira/browse/SENTRY-2183
> Project: Sentry
>  Issue Type: Improvement
>  Components: Sentry
>Affects Versions: 2.1.0
>Reporter: Arjun Mishra
>Assignee: Arjun Mishra
>Priority: Major
> Attachments: SENTRY-2183.001.patch, SENTRY-2183.002.patch, 
> SENTRY-2183.01.patch
>
>
> Full HMS Snapshots for large amounts of data (millions of databases, or 
> tables or partitions etc) take a lot more time than the current default of 
> 3.3mins. Also ideally since a full HMS snapshot should only occur once it is 
> best to increase the default to 20mins. This highly reduce the chance of 
> failure while sending a full snapshot which could be critical
> Set sentry.hdfs.service.client.server.rpc-connection-timeout = 120



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (SENTRY-2183) Increase default sentry-hdfs rpc timeout to 20 mins

2018-03-15 Thread Arjun Mishra (JIRA)

[ 
https://issues.apache.org/jira/browse/SENTRY-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16401422#comment-16401422
 ] 

Arjun Mishra commented on SENTRY-2183:
--

That is true. It is purely empirical. Cases with million partitions took about 
10 mins. Sometimes they take much longer. We can't factor in all cases. It is 
significantly larger than 3.3mins which is very conservative. I can run some 
tests to get numbers on the threshold time to number of hive objects

> Increase default sentry-hdfs rpc timeout to 20 mins
> ---
>
> Key: SENTRY-2183
> URL: https://issues.apache.org/jira/browse/SENTRY-2183
> Project: Sentry
>  Issue Type: Improvement
>  Components: Sentry
>Affects Versions: 2.1.0
>Reporter: Arjun Mishra
>Assignee: Arjun Mishra
>Priority: Major
> Attachments: SENTRY-2183.001.patch, SENTRY-2183.002.patch, 
> SENTRY-2183.01.patch
>
>
> Full HMS Snapshots for large amounts of data (millions of databases, or 
> tables or partitions etc) take a lot more time than the current default of 
> 3.3mins. Also ideally since a full HMS snapshot should only occur once it is 
> best to increase the default to 20mins. This highly reduce the chance of 
> failure while sending a full snapshot which could be critical
> Set sentry.hdfs.service.client.server.rpc-connection-timeout = 120



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (SENTRY-2183) Increase default sentry-hdfs rpc timeout to 20 mins

2018-03-15 Thread Alexander Kolbasov (JIRA)

[ 
https://issues.apache.org/jira/browse/SENTRY-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16401143#comment-16401143
 ] 

Alexander Kolbasov commented on SENTRY-2183:


Why do you think that 20 mins is a good number? What if it takes 25 mins?

> Increase default sentry-hdfs rpc timeout to 20 mins
> ---
>
> Key: SENTRY-2183
> URL: https://issues.apache.org/jira/browse/SENTRY-2183
> Project: Sentry
>  Issue Type: Improvement
>  Components: Sentry
>Affects Versions: 2.1.0
>Reporter: Arjun Mishra
>Assignee: Arjun Mishra
>Priority: Major
> Attachments: SENTRY-2183.001.patch, SENTRY-2183.002.patch, 
> SENTRY-2183.01.patch
>
>
> Full HMS Snapshots for large amounts of data (millions of databases, or 
> tables or partitions etc) take a lot more time than the current default of 
> 3.3mins. Also ideally since a full HMS snapshot should only occur once it is 
> best to increase the default to 20mins. This highly reduce the chance of 
> failure while sending a full snapshot which could be critical
> Set sentry.hdfs.service.client.server.rpc-connection-timeout = 120



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (SENTRY-2183) Increase default sentry-hdfs rpc timeout to 20 mins

2018-03-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/SENTRY-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16401118#comment-16401118
 ] 

Hadoop QA commented on SENTRY-2183:
---

Here are the results of testing the latest attachment
https://issues.apache.org/jira/secure/attachment/12914760/SENTRY-2183.002.patch 
against master.

{color:green}Overall:{color} +1 all checks pass

{color:green}SUCCESS:{color} all tests passed

Console output: 
https://builds.apache.org/job/PreCommit-SENTRY-Build/3701/console

This message is automatically generated.

> Increase default sentry-hdfs rpc timeout to 20 mins
> ---
>
> Key: SENTRY-2183
> URL: https://issues.apache.org/jira/browse/SENTRY-2183
> Project: Sentry
>  Issue Type: Improvement
>  Components: Sentry
>Affects Versions: 2.1.0
>Reporter: Arjun Mishra
>Assignee: Arjun Mishra
>Priority: Major
> Attachments: SENTRY-2183.001.patch, SENTRY-2183.002.patch, 
> SENTRY-2183.01.patch
>
>
> Full HMS Snapshots for large amounts of data (millions of databases, or 
> tables or partitions etc) take a lot more time than the current default of 
> 3.3mins. Also ideally since a full HMS snapshot should only occur once it is 
> best to increase the default to 20mins. This highly reduce the chance of 
> failure while sending a full snapshot which could be critical
> Set sentry.hdfs.service.client.server.rpc-connection-timeout = 120



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (SENTRY-2183) Increase default sentry-hdfs rpc timeout to 20 mins

2018-03-15 Thread Arjun Mishra (JIRA)

[ 
https://issues.apache.org/jira/browse/SENTRY-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16400886#comment-16400886
 ] 

Arjun Mishra commented on SENTRY-2183:
--

Test passes locally. Will re-submit the same patch
{noformat}
$ mvn -f sentry-tests/sentry-tests-hive/pom.xml test 
-Dtest=TestHmsNotificationProcessingWithOutSyncOnDrop

[INFO] ---
[INFO]  T E S T S
[INFO] ---
[INFO] Running 
org.apache.sentry.tests.e2e.dbprovider.TestHmsNotificationProcessingWithOutSyncOnDrop
[INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 24.931 s 
- in 
org.apache.sentry.tests.e2e.dbprovider.TestHmsNotificationProcessingWithOutSyncOnDrop
[INFO]
[INFO] Results:
[INFO]
[INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0
[INFO]
[INFO] 
[INFO] BUILD SUCCESS
[INFO] 
[INFO] Total time: 47.291 s
[INFO] Finished at: 2018-03-15T23:10:22+05:30
[INFO] Final Memory: 85M/836M
[INFO] 
{noformat}

> Increase default sentry-hdfs rpc timeout to 20 mins
> ---
>
> Key: SENTRY-2183
> URL: https://issues.apache.org/jira/browse/SENTRY-2183
> Project: Sentry
>  Issue Type: Improvement
>  Components: Sentry
>Affects Versions: 2.1.0
>Reporter: Arjun Mishra
>Assignee: Arjun Mishra
>Priority: Major
> Attachments: SENTRY-2183.01.patch
>
>
> Full HMS Snapshots for large amounts of data (millions of databases, or 
> tables or partitions etc) take a lot more time than the current default of 
> 3.3mins. Also ideally since a full HMS snapshot should only occur once it is 
> best to increase the default to 20mins. This highly reduce the chance of 
> failure while sending a full snapshot which could be critical
> Set sentry.hdfs.service.client.server.rpc-connection-timeout = 120



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (SENTRY-2183) Increase default sentry-hdfs rpc timeout to 20 mins

2018-03-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/SENTRY-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16400788#comment-16400788
 ] 

Hadoop QA commented on SENTRY-2183:
---

Here are the results of testing the latest attachment
https://issues.apache.org/jira/secure/attachment/12914717/SENTRY-2183.01.patch 
against master.

{color:red}Overall:{color} -1 due to 2 errors

{color:red}ERROR:{color} mvn test exited 1
{color:red}ERROR:{color} Failed: 
org.apache.sentry.tests.e2e.dbprovider.TestHmsNotificationProcessingWithOutSyncOnDrop

Console output: 
https://builds.apache.org/job/PreCommit-SENTRY-Build/3700/console

This message is automatically generated.

> Increase default sentry-hdfs rpc timeout to 20 mins
> ---
>
> Key: SENTRY-2183
> URL: https://issues.apache.org/jira/browse/SENTRY-2183
> Project: Sentry
>  Issue Type: Improvement
>  Components: Sentry
>Affects Versions: 2.1.0
>Reporter: Arjun Mishra
>Assignee: Arjun Mishra
>Priority: Major
> Attachments: SENTRY-2183.01.patch
>
>
> Full HMS Snapshots for large amounts of data (millions of databases, or 
> tables or partitions etc) take a lot more time than the current default of 
> 3.3mins. Also ideally since a full HMS snapshot should only occur once it is 
> best to increase the default to 20mins. This highly reduce the chance of 
> failure while sending a full snapshot which could be critical
> Set sentry.hdfs.service.client.server.rpc-connection-timeout = 120



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)