[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16217624#comment-16217624 ]

Billie Rinaldi commented on YARN-7326:
--------------------------------------

Patch 006 resolves the issue I was seeing where the first lookup of an external domain would time out.

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch,
>                      YARN-7326.yarn-native-services.002.patch,
>                      YARN-7326.yarn-native-services.003.patch,
>                      YARN-7326.yarn-native-services.004.patch,
>                      YARN-7326.yarn-native-services.005.patch,
>                      YARN-7326.yarn-native-services.006.patch
>
>
> [~aw] helped to identify these issues:
> Now some general bad news, not related to this patch:
> Ran a few queries, but this one is a bit concerning:
> {code}
> root@ubuntu:/hadoop/logs# dig @localhost -p 54 .
> ;; Warning: query response not set
> ; <<>> DiG 9.10.3-P4-Ubuntu <<>> @localhost -p 54 .
> ; (2 servers found)
> ;; global options: +cmd
> ;; Got answer:
> ;; ->>HEADER<<- opcode: QUERY, status: NOTAUTH, id: 47794
> ;; flags: rd ad; QUERY: 0, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 0
> ;; WARNING: recursion requested but not available
> ;; Query time: 0 msec
> ;; SERVER: 127.0.0.1#54(127.0.0.1)
> ;; WHEN: Thu Oct 12 16:04:54 PDT 2017
> ;; MSG SIZE rcvd: 12
> root@ubuntu:/hadoop/logs# dig @localhost -p 54 axfr .
> ;; Connection to ::1#54(::1) for . failed: connection refused.
> ;; communications error to 127.0.0.1#54: end of file
> root@ubuntu:/hadoop/logs#
> {code}
> It looks like it effectively fails when asked about a root zone, which is bad.
> It's also kind of interesting in what it does and doesn't log. Probably
> should be configured to rotate logs based on size not date.
> The real showstopper though: RegistryDNS basically eats a core. It is running
> with 100% cpu utilization with and without jsvc. On my laptop, this is
> triggering my fan.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16217394#comment-16217394 ]

Eric Yang commented on YARN-7326:
---------------------------------

[~billie.rinaldi] This happens if /etc/resolv.conf is set to use itself as the upstream DNS server. I will add some logic to filter out local IP addresses.

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch,
>                      YARN-7326.yarn-native-services.002.patch,
>                      YARN-7326.yarn-native-services.003.patch,
>                      YARN-7326.yarn-native-services.004.patch,
>                      YARN-7326.yarn-native-services.005.patch
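The filtering idea mentioned in the comment above could look roughly like the following. This is only a sketch under stated assumptions, not the logic from the actual patch: the class `UpstreamFilter` and its methods are hypothetical names, and "local" is taken to mean loopback, wildcard, or any address bound to an interface on this host.

```java
import java.net.InetAddress;
import java.net.NetworkInterface;
import java.net.SocketException;
import java.net.UnknownHostException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of RegistryDNS: drops nameserver entries
// that point back at the local host so queries are not forwarded in a loop.
public class UpstreamFilter {

    // True for loopback, wildcard, or any address bound to a local interface.
    static boolean isLocal(InetAddress addr) {
        if (addr.isLoopbackAddress() || addr.isAnyLocalAddress()) {
            return true;
        }
        try {
            return NetworkInterface.getByInetAddress(addr) != null;
        } catch (SocketException e) {
            return false; // interface lookup failed; treat as non-local
        }
    }

    // Keeps only the entries that do not resolve to a local address.
    // Entries are expected to be IP literals, as in /etc/resolv.conf.
    static List<String> filterUpstreams(List<String> servers) {
        List<String> kept = new ArrayList<>();
        for (String server : servers) {
            try {
                if (!isLocal(InetAddress.getByName(server))) {
                    kept.add(server);
                }
            } catch (UnknownHostException e) {
                // Unparseable entry: skip it rather than forward to it.
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // 192.0.2.1 is a documentation-only address used as a stand-in upstream.
        System.out.println(filterUpstreams(
            Arrays.asList("127.0.0.1", "192.0.2.1", "::1")));
    }
}
```

One caveat with this approach: a host that legitimately runs a separate caching resolver on a loopback address would have that resolver filtered out as well, so a real implementation might need to compare ports too.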
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16217184#comment-16217184 ]

Billie Rinaldi commented on YARN-7326:
--------------------------------------

I tried out the patch and it also worked for me for looking up the root zone and a well-known host (in addition to still supporting lookups for container hostnames). I saw one strange behavior: the first lookup would fail with a timeout, but the second would succeed. This consistently happened when looking up the root zone, google.com, and example.com. After I restart RegistryDNS, it happens on the first lookup of each external hostname (it doesn't happen for container hostnames). Did this happen for anyone else?
{noformat}
; (1 server found)
;; global options: +cmd
;; connection timed out; no servers could be reached
{noformat}

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch,
>                      YARN-7326.yarn-native-services.002.patch,
>                      YARN-7326.yarn-native-services.003.patch,
>                      YARN-7326.yarn-native-services.004.patch,
>                      YARN-7326.yarn-native-services.005.patch
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16215851#comment-16215851 ]

Jian He commented on YARN-7326:
-------------------------------

Thanks Eric. I tested the new patch, and it successfully forwards the DNS lookup. [~billie.rinaldi], do you want to take a look?

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch,
>                      YARN-7326.yarn-native-services.002.patch,
>                      YARN-7326.yarn-native-services.003.patch,
>                      YARN-7326.yarn-native-services.004.patch,
>                      YARN-7326.yarn-native-services.005.patch
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16215633#comment-16215633 ]

Hadoop QA commented on YARN-7326:
---------------------------------

| (/) *+1 overall* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 16s | Docker mode activated. |
|| || || || Prechecks ||
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| +1 | test4tests | 0m 0s | The patch appears to include 1 new or modified test files. |
|| || || || yarn-native-services Compile Tests ||
| +1 | mvninstall | 23m 5s | yarn-native-services passed |
| +1 | compile | 0m 32s | yarn-native-services passed |
| +1 | checkstyle | 0m 25s | yarn-native-services passed |
| +1 | mvnsite | 0m 38s | yarn-native-services passed |
| +1 | shadedclient | 13m 34s | branch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 0m 50s | yarn-native-services passed |
| +1 | javadoc | 0m 29s | yarn-native-services passed |
|| || || || Patch Compile Tests ||
| +1 | mvninstall | 0m 32s | the patch passed |
| +1 | compile | 0m 30s | the patch passed |
| +1 | javac | 0m 30s | the patch passed |
| +1 | checkstyle | 0m 19s | the patch passed |
| +1 | mvnsite | 0m 31s | the patch passed |
| +1 | whitespace | 0m 0s | The patch has no whitespace issues. |
| +1 | shadedclient | 14m 42s | patch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 2s | the patch passed |
| +1 | javadoc | 0m 26s | the patch passed |
|| || || || Other Tests ||
| +1 | unit | 1m 19s | hadoop-yarn-registry in the patch passed. |
| +1 | asflicense | 0m 35s | The patch does not generate ASF License warnings. |
| | | 60m 35s | |
\\
\\
|| Subsystem || Report/Notes ||
| Docker | Image:yetus/hadoop:0de40f0 |
| JIRA Issue | YARN-7326 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12893552/YARN-7326.yarn-native-services.005.patch |
| Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux c8050f592166 3.13.0-119-generic #166-Ubuntu SMP Wed May 3 12:18:55 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh |
| git revision | yarn-native-services / 16ecb9c |
| maven | version: Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; 2015-11-10T16:41:47+00:00) |
| Default Java | 1.8.0_151 |
| findbugs | v3.1.0-RC1 |
| Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/18089/testReport/ |
| modules | C: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-registry U: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-registry |
| Console output | https://builds.apache.org/job/PreCommit-YARN-Build/18089/console |
| Powered by | Apache Yetus 0.6.0-SNAPSHOT http://yetus.apache.org |

This message was automatically generated.
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16215450#comment-16215450 ]

Hadoop QA commented on YARN-7326:
---------------------------------

| (x) *-1 overall* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 22m 38s | Docker mode activated. |
|| || || || Prechecks ||
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| +1 | test4tests | 0m 0s | The patch appears to include 1 new or modified test files. |
|| || || || yarn-native-services Compile Tests ||
| +1 | mvninstall | 14m 56s | yarn-native-services passed |
| +1 | compile | 0m 18s | yarn-native-services passed |
| +1 | checkstyle | 0m 14s | yarn-native-services passed |
| +1 | mvnsite | 0m 20s | yarn-native-services passed |
| +1 | shadedclient | 9m 36s | branch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 0m 31s | yarn-native-services passed |
| +1 | javadoc | 0m 15s | yarn-native-services passed |
|| || || || Patch Compile Tests ||
| +1 | mvninstall | 0m 16s | the patch passed |
| +1 | compile | 0m 15s | the patch passed |
| +1 | javac | 0m 15s | the patch passed |
| +1 | checkstyle | 0m 11s | the patch passed |
| +1 | mvnsite | 0m 17s | the patch passed |
| -1 | whitespace | 0m 0s | The patch has 4 line(s) that end in whitespace. Use git apply --whitespace=fix <>. Refer https://git-scm.com/docs/git-apply |
| +1 | shadedclient | 10m 20s | patch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 0m 37s | the patch passed |
| +1 | javadoc | 0m 17s | the patch passed |
|| || || || Other Tests ||
| +1 | unit | 1m 10s | hadoop-yarn-registry in the patch passed. |
| +1 | asflicense | 0m 26s | The patch does not generate ASF License warnings. |
| | | 63m 31s | |
\\
\\
|| Subsystem || Report/Notes ||
| Docker | Image:yetus/hadoop:0de40f0 |
| JIRA Issue | YARN-7326 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12893538/YARN-7326.yarn-native-services.004.patch |
| Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux daf96fa784b5 3.13.0-116-generic #163-Ubuntu SMP Fri Mar 31 14:13:22 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh |
| git revision | yarn-native-services / 16ecb9c |
| maven | version: Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; 2015-11-10T16:41:47+00:00) |
| Default Java | 1.8.0_151 |
| findbugs | v3.1.0-RC1 |
| whitespace | https://builds.apache.org/job/PreCommit-YARN-Build/18085/artifact/patchprocess/whitespace-eol.txt |
| Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/18085/testReport/ |
| modules | C: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-registry U: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-registry |
| Console
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16213535#comment-16213535 ]

Hadoop QA commented on YARN-7326:
---------------------------------

| (x) *-1 overall* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 0s | Docker mode activated. |
| -1 | docker | 0m 15s | Docker failed to build yetus/hadoop:0de40f0. |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Issue | YARN-7326 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12893368/YARN-7326.yarn-native-services.003.patch |
| Console output | https://builds.apache.org/job/PreCommit-YARN-Build/18064/console |
| Powered by | Apache Yetus 0.6.0-SNAPSHOT http://yetus.apache.org |

This message was automatically generated.

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch,
>                      YARN-7326.yarn-native-services.002.patch,
>                      YARN-7326.yarn-native-services.003.patch
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212967#comment-16212967 ]

Eric Yang commented on YARN-7326:
---------------------------------

[~jianhe] I will add comments to the updateDNSServer method to describe what it does. For testing, try:
{code}
dig @localhost -p 5353 .
dig @localhost -p 5353 google.com.
{code}

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch,
>                      YARN-7326.yarn-native-services.002.patch
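For readers who would rather script the check than run dig, the same two queries can be sent from Java. This is a minimal sketch that hand-encodes a DNS query per RFC 1035 using only the JDK; it assumes the server listens on UDP port 5353 on localhost as in the dig commands above, and the class and method names (`DnsProbe`, `buildQuery`, `rcode`) are made up for illustration. It does not use the DNS library that RegistryDNS itself depends on.

```java
import java.io.ByteArrayOutputStream;
import java.net.DatagramPacket;
import java.net.DatagramSocket;
import java.net.InetAddress;
import java.nio.charset.StandardCharsets;

// Hypothetical probe: sends one A/IN query over UDP and prints the RCODE.
public class DnsProbe {

    // Encodes a standard query with RD set, one question, type A, class IN.
    static byte[] buildQuery(int id, String name) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        out.write(id >> 8);
        out.write(id);                       // 16-bit message ID
        out.write(0x01);
        out.write(0x00);                     // flags: recursion desired
        out.write(0);
        out.write(1);                        // QDCOUNT = 1
        for (int i = 0; i < 6; i++) {
            out.write(0);                    // ANCOUNT, NSCOUNT, ARCOUNT = 0
        }
        for (String label : name.split("\\.")) {
            if (label.isEmpty()) {
                continue;
            }
            byte[] bytes = label.getBytes(StandardCharsets.US_ASCII);
            out.write(bytes.length);         // length-prefixed label
            out.write(bytes, 0, bytes.length);
        }
        out.write(0);                        // root label terminates the name
        out.write(0);
        out.write(1);                        // QTYPE = A
        out.write(0);
        out.write(1);                        // QCLASS = IN
        return out.toByteArray();
    }

    // RCODE is the low four bits of the fourth header byte
    // (0 = NOERROR, 9 = NOTAUTH).
    static int rcode(byte[] response) {
        return response[3] & 0x0F;
    }

    public static void main(String[] args) throws Exception {
        String name = args.length > 0 ? args[0] : "google.com.";
        byte[] query = buildQuery(0x1234, name);
        try (DatagramSocket socket = new DatagramSocket()) {
            socket.setSoTimeout(2000);       // fail fast instead of hanging
            socket.send(new DatagramPacket(query, query.length,
                InetAddress.getByName("localhost"), 5353));
            byte[] buf = new byte[512];
            socket.receive(new DatagramPacket(buf, buf.length));
            System.out.println("rcode=" + rcode(buf));
        }
    }
}
```

Passing `.` as the argument queries the root zone; a receive timeout surfaces as a `SocketTimeoutException`.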
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212950#comment-16212950 ]

Jian He commented on YARN-7326:
-------------------------------

[~eyang], I'm not familiar with the Java DNS libs. Could you add some comments in the code to explain what the new methods are doing, for example the updateDNSServer method? That will help people who aren't familiar with these libs to understand the code. Also, how can I test this change?

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch,
>                      YARN-7326.yarn-native-services.002.patch
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211337#comment-16211337 ]

Hadoop QA commented on YARN-7326:
---------------------------------

| (x) *-1 overall* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 0s | Docker mode activated. |
| -1 | docker | 0m 14s | Docker failed to build yetus/hadoop:0de40f0. |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Issue | YARN-7326 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12893070/YARN-7326.yarn-native-services.002.patch |
| Console output | https://builds.apache.org/job/PreCommit-YARN-Build/18029/console |
| Powered by | Apache Yetus 0.6.0-SNAPSHOT http://yetus.apache.org |

This message was automatically generated.

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch,
>                      YARN-7326.yarn-native-services.002.patch
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16210429#comment-16210429 ]

Hadoop QA commented on YARN-7326:
---------------------------------

| (x) *-1 overall* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 0s | Docker mode activated. |
| -1 | docker | 0m 11s | Docker failed to build yetus/hadoop:0de40f0. |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Issue | YARN-7326 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12892961/YARN-7326.yarn-native-services.001.patch |
| Console output | https://builds.apache.org/job/PreCommit-YARN-Build/18022/console |
| Powered by | Apache Yetus 0.6.0-SNAPSHOT http://yetus.apache.org |

This message was automatically generated.

> Some issues in RegistryDNS
> --------------------------
>
>                 Key: YARN-7326
>                 URL: https://issues.apache.org/jira/browse/YARN-7326
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Jian He
>            Assignee: Eric Yang
>         Attachments: YARN-7326.yarn-native-services.001.patch
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208588#comment-16208588 ]

Jian He commented on YARN-7326:
---

I have found why the DNS server eats up one core and opened YARN-7351 to fix it. The main cause is that the implementation creates a new thread for every DNS query. After the patch, CPU usage drops from 100% to 0.1% on my Mac.

> Some issues in RegistryDNS
> --
>
> Key: YARN-7326
> URL: https://issues.apache.org/jira/browse/YARN-7326
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Jian He
> Assignee: Jian He
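The comment above attributes the CPU usage to creating a new thread for every DNS query. As a rough illustration of the general alternative, the sketch below answers queries on a small fixed pool of worker threads. It is not the YARN-7351 patch, and it is written in Python for brevity even though RegistryDNS is Java; `handle_query`, `serve`, and the `dns-worker` thread prefix are hypothetical names used only for this example.

```python
from concurrent.futures import ThreadPoolExecutor
import threading


def handle_query(name: str) -> str:
    """Hypothetical stand-in for answering a single DNS query."""
    # Record which worker thread served the query so reuse is visible.
    return f"{name} {threading.current_thread().name}"


def serve(queries, max_workers=4):
    """Answer all queries on a fixed pool instead of one new thread per query."""
    with ThreadPoolExecutor(max_workers=max_workers,
                            thread_name_prefix="dns-worker") as pool:
        # map() preserves input order and reuses the same few threads.
        return list(pool.map(handle_query, queries))


if __name__ == "__main__":
    results = serve([f"host{i}.example.com." for i in range(100)])
    workers = {r.rsplit(" ", 1)[1] for r in results}
    print(len(results), "queries answered by", len(workers), "threads")
```

With a bounded pool, the number of live threads stays at or below `max_workers` regardless of how many queries arrive, which is the property the thread-per-query design lacks.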
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16204241#comment-16204241 ]

Jian He commented on YARN-7326:
---

Thanks [~aw],

bq. For 3) Integrate the yarn service commands into yarn application as mentioned by Eric Yang.

I left a comment [here|https://issues.apache.org/jira/browse/YARN-7127?focusedCommentId=16200955=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16200955] explaining the rationale. I do see the value of a coherent user experience, but I would also like to see an elegant way to handle the issues I mentioned there, mainly two things:
1) Certain subcommands such as flex or upgrade do not apply to a generic YARN app; they are specific to this custom framework AM. Will it confuse users if we merge all these custom subcommands into the generic yarn app command?
2) Certain overloaded subcommands such as status mean different things: the app status from the RM versus the status from the customized AM. We need a way to differentiate them, and adding one more option may not be user-friendly.

bq. 1) Actually integrate the docs with the rest of yarn-site. I'm not sure what benefit there is of having a separate documentation section, especially given #2 above and that the registrydns server could be used independently of the REST API.

Sorry, I didn't follow this. Which doc section are you referring to? Can you clarify?

> Some issues in RegistryDNS
> --
>
> Key: YARN-7326
> URL: https://issues.apache.org/jira/browse/YARN-7326
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Jian He
> Assignee: Jian He
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16204208#comment-16204208 ]

Allen Wittenauer commented on YARN-7326:

Without looking too hard at the current state (so I apologize if I've missed something), to me there are three showstopper issues:
1) Obviously the RegistryDNS 100% CPU issue. [I'm truly surprised that no one else had noticed its awful performance characteristics.]
2) Banish the separate API server, now that YARN-6626 has been committed. It's confusing and greatly increases the operating costs (and worse, potential security exposure) for little-to-no real benefit vs just using the REST API from the RM. So just remove it from the docs and the yarn command.
3) Integrate the yarn service commands into yarn application as mentioned by Eric Yang.

Things I'd really like to see, but wouldn't block the merge for:
1) Actually integrate the docs with the rest of yarn-site. I'm not sure what benefit there is of having a separate documentation section, especially given #2 above and that the registrydns server could be used independently of the REST API.
2) A more complex example that doesn't use Docker. This is important given that the Docker bits in YARN have some significant security problems. A lot of sites probably can't or won't enable the Docker subsystem for quite a while as a result.
3) Slider migration guide.

> Some issues in RegistryDNS
> --
>
> Key: YARN-7326
> URL: https://issues.apache.org/jira/browse/YARN-7326
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Jian He
> Assignee: Jian He
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16204181#comment-16204181 ]

Eric Yang commented on YARN-7326:
-

Sounds like dnsjava is misconfigured to use itself as the upstream DNS server, so the query runs into an infinite loop. When setting up DNS, the code could probably read /etc/resolv.conf, filter out its own IP address, and use the remaining servers as upstream DNS servers.

> Some issues in RegistryDNS
> --
>
> Key: YARN-7326
> URL: https://issues.apache.org/jira/browse/YARN-7326
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Jian He
> Assignee: Jian He
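The suggestion above (read /etc/resolv.conf, drop the server's own address, forward to the rest) can be sketched roughly as follows. This is only an illustration of the idea, written in Python rather than the Java used by RegistryDNS; the function name `upstream_servers` and the choice to pass the server's own addresses in as a parameter are assumptions, not anything taken from a patch on this issue.

```python
import ipaddress


def upstream_servers(resolv_conf_text, own_addresses):
    """Return nameserver addresses from resolv.conf text, minus our own.

    own_addresses is an assumed input: the addresses this DNS server
    listens on, which must not be used as upstream resolvers.
    """
    own = {ipaddress.ip_address(a) for a in own_addresses}
    servers = []
    for line in resolv_conf_text.splitlines():
        # Strip comments ('#' or ';') and surrounding whitespace.
        line = line.split("#", 1)[0].split(";", 1)[0].strip()
        fields = line.split()
        if len(fields) >= 2 and fields[0] == "nameserver":
            try:
                addr = ipaddress.ip_address(fields[1])
            except ValueError:
                continue  # skip entries that are not valid IP addresses
            if addr not in own and addr not in servers:
                servers.append(addr)
    return [str(a) for a in servers]


if __name__ == "__main__":
    sample = (
        "# sample resolv.conf\n"
        "nameserver 127.0.0.1\n"
        "nameserver 10.0.0.2\n"
        "nameserver 8.8.8.8\n"
        "search example.com\n"
    )
    print(upstream_servers(sample, ["127.0.0.1", "::1"]))
```

Comparing parsed address objects rather than raw strings avoids missing a match because of formatting differences, for example alternative spellings of the same IPv6 address.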
[jira] [Commented] (YARN-7326) Some issues in RegistryDNS
[ https://issues.apache.org/jira/browse/YARN-7326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16203984#comment-16203984 ]

Jian He commented on YARN-7326:
---

Hi [~aw], we have started investigating the issues. Before we restart the vote for trunk and GA, are there any issues that you think must be resolved before the merge?

cc [~gsaha], [~billie.rinaldi]

> Some issues in RegistryDNS
> --
>
> Key: YARN-7326
> URL: https://issues.apache.org/jira/browse/YARN-7326
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Jian He
> Assignee: Jian He