[jira] [Commented] (YARN-9040) LevelDBCacheTimelineStore in ATS 1.5 leaks native memory
    [ https://issues.apache.org/jira/browse/YARN-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17862548#comment-17862548 ]

Eric Yang commented on YARN-9040:
---------------------------------

[~suyeon1222] Please backport the patch to branch-2.10 as needed. Leaving it out of branch-2.10 was not intentional. Thanks.

> LevelDBCacheTimelineStore in ATS 1.5 leaks native memory
> ---------------------------------------------------------
>
>                 Key: YARN-9040
>                 URL: https://issues.apache.org/jira/browse/YARN-9040
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: timelineserver
>    Affects Versions: 2.8.0
>            Reporter: Tarun Parimi
>            Assignee: Tarun Parimi
>            Priority: Major
>             Fix For: 3.0.4, 3.1.2, 3.3.0, 2.8.6, 3.2.1, 2.9.3
>
>         Attachments: YARN-9040.001.patch, YARN-9040.002.patch
>
>
> When LevelDBCacheTimelineStore from YARN-4219 is used as ATS 1.5 entity
> caching storage, we observe a memory leak due to leveldb files, even after
> the fix of YARN-5368.
> Top output shows 0.024TB (25GB) RES, even though the heap size is only 8GB.
>
> {code:java}
>   PID USER  PR NI    VIRT    RES   SHR S %CPU %MEM    TIME+ COMMAND
> 25519 yarn  20  0 33.024g 0.024t 41468 S  6.2 26.0 21:07.39
> /usr/java/default/bin/java -Dproc_timelineserver -Xmx8192m
> {code}
>
> Lsof shows a lot of open timeline-cache.ldb files which are still referenced
> by ATS even though they are deleted (DEL): they are not present when listing
> the directory.
>
> {code:java}
> java 25519 yarn DEL REG 253,28 9438452
> /var/yarn/timeline/timelineEntityGroupId_1542280269959_55569_dag_1542280269959_55569_2-timeline-cache.ldb/07.sst
> java 25519 yarn DEL REG 253,28 9438438
> /var/yarn/timeline/timelineEntityGroupId_1542280269959_55569_dag_1542280269959_55569_2-timeline-cache.ldb/07.sst
> java 25519 yarn DEL REG 253,28 9438437
> /var/yarn/timeline/timelineEntityGroupId_1542280269959_55569_dag_1542280269959_55569_2-timeline-cache.ldb/05.sst
> {code}
>
> It looks like LevelDBCacheTimelineStore is not closing these files because
> the LevelDB DBIterator is not closed.
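The diagnosis above (an iterator that is opened but never closed keeps its underlying files referenced) can be illustrated with a minimal sketch. This is not the YARN-9040 patch, whose contents are not shown in this thread. `TrackedIterator`, `OPEN_HANDLES`, `countLeaky` and `countClosed` are hypothetical names, and the counter merely stands in for the native resources a real LevelDB iterator would hold. The sketch assumes only that the iterator type is closeable, as the issue description implies.

```java
import java.io.Closeable;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

public class IteratorCloseSketch {

  /** Number of iterators opened but not yet closed (stand-in for native handles). */
  static final AtomicInteger OPEN_HANDLES = new AtomicInteger();

  /** Hypothetical stand-in for a LevelDB iterator: iterable and closeable. */
  static final class TrackedIterator implements Iterator<String>, Closeable {
    private final Iterator<String> delegate;
    private boolean closed;

    TrackedIterator(List<String> keys) {
      this.delegate = keys.iterator();
      OPEN_HANDLES.incrementAndGet();
    }

    @Override public boolean hasNext() { return delegate.hasNext(); }

    @Override public String next() { return delegate.next(); }

    @Override public void close() {
      // Release the handle exactly once, even if close() is called twice.
      if (!closed) {
        closed = true;
        OPEN_HANDLES.decrementAndGet();
      }
    }
  }

  /** Leaky pattern: the iterator is scanned but never closed. */
  static int countLeaky(List<String> keys) {
    TrackedIterator it = new TrackedIterator(keys);
    int n = 0;
    while (it.hasNext()) {
      it.next();
      n++;
    }
    return n;
  }

  /** Fixed pattern: try-with-resources closes the iterator on every exit path. */
  static int countClosed(List<String> keys) {
    try (TrackedIterator it = new TrackedIterator(keys)) {
      int n = 0;
      while (it.hasNext()) {
        it.next();
        n++;
      }
      return n;
    }
  }

  public static void main(String[] args) {
    List<String> keys = Arrays.asList("entity-1", "entity-2", "entity-3");
    countLeaky(keys);
    System.out.println("open after leaky scan: " + OPEN_HANDLES.get());
    OPEN_HANDLES.set(0);
    countClosed(keys);
    System.out.println("open after closed scan: " + OPEN_HANDLES.get());
  }
}
```

In the leaky variant the counter stays at 1 after the scan, which loosely mirrors the deleted-but-still-open `.sst` files in the lsof output. The try-with-resources variant returns it to 0 even if the loop body throws.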
-- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org
[jira] [Commented] (YARN-9040) LevelDBCacheTimelineStore in ATS 1.5 leaks native memory
    [ https://issues.apache.org/jira/browse/YARN-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16723170#comment-16723170 ]

Eric Yang commented on YARN-9040:
---------------------------------

Patch 002 looks good. Committing to trunk, branch-3.2, branch-3.1, branch-3.0, branch-2.9, branch-2.8.
[jira] [Commented] (YARN-9040) LevelDBCacheTimelineStore in ATS 1.5 leaks native memory
    [ https://issues.apache.org/jira/browse/YARN-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16722720#comment-16722720 ]

Tarun Parimi commented on YARN-9040:
------------------------------------

Thanks [~csingh] for your review. Yeah, I was not sure how to write unit tests for this. The implementations of KeyValueBasedTimelineStore - MemoryTimelineStore and LevelDBCacheTimelineStore have unit tests. But this patch doesn't bring any functionality changes to them and just closes the LevelDB Iterator.
[jira] [Commented] (YARN-9040) LevelDBCacheTimelineStore in ATS 1.5 leaks native memory
    [ https://issues.apache.org/jira/browse/YARN-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16720512#comment-16720512 ]

Chandni Singh commented on YARN-9040:
-------------------------------------

[~tarunparimi] the change looks good to me. There aren't any existing tests for {{KeyValueBasedTimelineStore}} so any changes made to it cannot be verified by unit tests. We should create tests for {{KeyValueBasedTimelineStore}} but that doesn't have to be part of this change.

[~rohithsharma] [~eyang] Could you please help review.
[jira] [Commented] (YARN-9040) LevelDBCacheTimelineStore in ATS 1.5 leaks native memory
    [ https://issues.apache.org/jira/browse/YARN-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16702860#comment-16702860 ]

Tarun Parimi commented on YARN-9040:
------------------------------------

[~rohithsharma] Can you please review this ?
[jira] [Commented] (YARN-9040) LevelDBCacheTimelineStore in ATS 1.5 leaks native memory
    [ https://issues.apache.org/jira/browse/YARN-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16694610#comment-16694610 ]

Hadoop QA commented on YARN-9040:
---------------------------------

| (x) *-1 overall* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 13s | Docker mode activated. |
|| || || || Prechecks ||
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| -1 | test4tests | 0m 0s | The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. |
|| || || || trunk Compile Tests ||
| 0 | mvndep | 0m 43s | Maven dependency ordering for branch |
| +1 | mvninstall | 23m 38s | trunk passed |
| +1 | compile | 2m 38s | trunk passed |
| +1 | checkstyle | 1m 4s | trunk passed |
| +1 | mvnsite | 0m 57s | trunk passed |
| +1 | shadedclient | 14m 21s | branch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 12s | trunk passed |
| +1 | javadoc | 0m 41s | trunk passed |
|| || || || Patch Compile Tests ||
| 0 | mvndep | 0m 12s | Maven dependency ordering for patch |
| +1 | mvninstall | 0m 51s | the patch passed |
| +1 | compile | 2m 32s | the patch passed |
| +1 | javac | 2m 32s | the patch passed |
| +1 | checkstyle | 0m 58s | the patch passed |
| +1 | mvnsite | 0m 52s | the patch passed |
| +1 | whitespace | 0m 0s | The patch has no whitespace issues. |
| +1 | shadedclient | 13m 19s | patch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 22s | the patch passed |
| +1 | javadoc | 0m 34s | the patch passed |
|| || || || Other Tests ||
| +1 | unit | 3m 18s | hadoop-yarn-server-applicationhistoryservice in the patch passed. |
| +1 | unit | 2m 19s | hadoop-yarn-server-timeline-pluginstorage in the patch passed. |
| +1 | asflicense | 0m 26s | The patch does not generate ASF License warnings. |
| | | 71m 35s | |
\\
\\
|| Subsystem || Report/Notes ||
| Docker | Client=17.05.0-ce Server=17.05.0-ce Image:yetus/hadoop:8f97d6f |
| JIRA Issue | YARN-9040 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12949023/YARN-9040.002.patch |
| Optional Tests | dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux aa1c9ecb937f 4.4.0-134-generic #160~14.04.1-Ubuntu SMP Fri Aug 17 11:07:07 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/patchprocess/precommit/personality/provided.sh |
| git revision | trunk / f63e4e4 |
| maven | version: Apache Maven 3.3.9 |
| Default Java | 1.
[jira] [Commented] (YARN-9040) LevelDBCacheTimelineStore in ATS 1.5 leaks native memory
    [ https://issues.apache.org/jira/browse/YARN-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16694159#comment-16694159 ]

Hadoop QA commented on YARN-9040:
---------------------------------

| (x) *-1 overall* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 18s | Docker mode activated. |
|| || || || Prechecks ||
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| -1 | test4tests | 0m 0s | The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. |
|| || || || trunk Compile Tests ||
| 0 | mvndep | 0m 34s | Maven dependency ordering for branch |
| +1 | mvninstall | 22m 45s | trunk passed |
| +1 | compile | 2m 56s | trunk passed |
| +1 | checkstyle | 1m 12s | trunk passed |
| +1 | mvnsite | 0m 59s | trunk passed |
| +1 | shadedclient | 14m 22s | branch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 13s | trunk passed |
| +1 | javadoc | 0m 43s | trunk passed |
|| || || || Patch Compile Tests ||
| 0 | mvndep | 0m 12s | Maven dependency ordering for patch |
| +1 | mvninstall | 0m 49s | the patch passed |
| +1 | compile | 2m 34s | the patch passed |
| +1 | javac | 2m 34s | the patch passed |
| -0 | checkstyle | 0m 56s | hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server: The patch generated 1 new + 22 unchanged - 0 fixed = 23 total (was 22) |
| +1 | mvnsite | 0m 49s | the patch passed |
| +1 | whitespace | 0m 0s | The patch has no whitespace issues. |
| +1 | shadedclient | 13m 30s | patch has no errors when building and testing our client artifacts. |
| +1 | findbugs | 1m 22s | the patch passed |
| +1 | javadoc | 0m 36s | the patch passed |
|| || || || Other Tests ||
| +1 | unit | 3m 45s | hadoop-yarn-server-applicationhistoryservice in the patch passed. |
| +1 | unit | 2m 21s | hadoop-yarn-server-timeline-pluginstorage in the patch passed. |
| +1 | asflicense | 0m 27s | The patch does not generate ASF License warnings. |
| | | 71m 45s | |
\\
\\
|| Subsystem || Report/Notes ||
| Docker | Client=17.05.0-ce Server=17.05.0-ce Image:yetus/hadoop:8f97d6f |
| JIRA Issue | YARN-9040 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12948962/YARN-9040.001.patch |
| Optional Tests | dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle |
| uname | Linux 8b1df4d6ae2e 3.13.0-153-generic #203-Ubuntu SMP Thu Jun 14 08:52:28 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/patchprocess/precommit/personality/pro