[ https://issues.apache.org/jira/browse/YARN-9063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Akira Ajisaka reassigned YARN-9063: ----------------------------------- Assignee: Ashutosh Gupta (was: Tarun Parimi) > ATS 1.5 fails to start if RollingLevelDb files are corrupt or missing > --------------------------------------------------------------------- > > Key: YARN-9063 > URL: https://issues.apache.org/jira/browse/YARN-9063 > Project: Hadoop YARN > Issue Type: Bug > Components: timelineserver, timelineservice > Affects Versions: 2.8.0 > Reporter: Tarun Parimi > Assignee: Ashutosh Gupta > Priority: Major > Labels: pull-request-available > Time Spent: 5h 20m > Remaining Estimate: 0h > > ATS v1.5 fails to start up if there are some missing files in > RollingLevelDBTimelineStore. YARN-6054 fixes this issue only for the > LevelDBTimelineStore. Since RollingLevelDBTimelineStore opens multiple level > db and rolls them, we need a separate fix for this. The error is shown below > {code} > 18/11/13 07:00:56 FATAL applicationhistoryservice.ApplicationHistoryServer: > Error starting ApplicationHistoryServer > org.apache.hadoop.service.ServiceStateException: > org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: 1 > missing files; e.g.: > /tmp/ats_folder/yarn/timeline/leveldb-timeline-store/owner-ldb/000005.sst > at > org.apache.hadoop.service.ServiceStateException.convert(ServiceStateException.java:59) > > at org.apache.hadoop.service.AbstractService.init(AbstractService.java:172) > at > org.apache.hadoop.service.CompositeService.serviceInit(CompositeService.java:107) > > at > org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryServer.serviceInit(ApplicationHistoryServer.java:111) > > at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163) > at > org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryServer.launchAppHistoryServer(ApplicationHistoryServer.java:174) > > at > org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryServer.main(ApplicationHistoryServer.java:184) > > Caused by: org.fusesource.leveldbjni.internal.NativeDB$DBException: > Corruption: 1 missing files; e.g.: > /tmp/ats-folder/yarn/timeline/leveldb-timeline-store/owner-ldb/000005.sst > at org.fusesource.leveldbjni.internal.NativeDB.checkStatus(NativeDB.java:200) > at org.fusesource.leveldbjni.internal.NativeDB.open(NativeDB.java:218) > at org.fusesource.leveldbjni.JniDBFactory.open(JniDBFactory.java:168) > at > org.apache.hadoop.yarn.server.timeline.RollingLevelDBTimelineStore.serviceInit(RollingLevelDBTimelineStore.java:321) > > at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163) > {code} -- This message was sent by Atlassian Jira (v8.20.1#820001) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org