Hello,
Our ARCO database suddenly stopped updating records about two weeks ago. In the
past, when we started dbwriter, we saw this in dbwriter.log:
08/09/2016 12:41:10||.ReportingDBWriter.initLogging|I|Starting up dbwriter (Version 6.2) ---------------------------
08/09/2016 12:41:10||r.ReportingDBWriter.initialize|I|Connection to db jdbc:mysql://server:3306/oge_arco
08/09/2016 12:41:10||r.ReportingDBWriter.initialize|I|Found database model version 8
08/09/2016 12:41:10||er.file.FileParser.processFile|I|Renaming reporting to reporting.processing
08/09/2016 12:41:10||tingDBWriter.getDbWriterConfig|I|calculation file /home/oge/ge2011.11/dbwriter/database/mysql/dbwriter.xml has changed, reread it
08/09/2016 12:41:10||iter.file.FileParser.parseFile|W|0 lines marked as erroneous, these will be skipped
08/09/2016 12:41:11||iter.file.FileParser.parseFile|I|Deleting file reporting.processing
08/09/2016 12:41:11||le.FileParser.createStatistics|I|Processed 14 lines in 0.35s (40.35 lines/s)
08/09/2016 12:41:11||tingDBWriter.getDbWriterConfig|I|calculation file /home/oge/ge2011.11/dbwriter/database/mysql/dbwriter.xml has changed, reread it
08/09/2016 12:41:11||ngDBWriter$StatisticThread.run|I|Next statistic calculation will be done at 9/8/16 1:30 PM
Now, when we restart it (we're not aware of any changes having been made), we
get this instead:
29/09/2016 13:02:36||.ReportingDBWriter.initLogging|I|Starting up dbwriter (Version 6.2) ---------------------------
29/09/2016 13:02:36||dbwriter.ReportingDBWriter.run|D|ENTRY
29/09/2016 13:02:36||ReportingDBWriter.writePidFile|D|pid 5152 written into pid file /home/oge/ge2011.11/default/spool/dbwriter/dbwriter.pid
29/09/2016 13:02:36||r.ReportingDBWriter.initialize|I|Connection to db jdbc:mysql://server:3306/oge_arco
29/09/2016 13:02:36||ting.dbwriter.db.Database.init|D|register jdbc driver 'com.mysql.jdbc.Driver'
29/09/2016 13:02:36||iter.db.Database.getConnection|D|opening connection to 'jdbc:mysql://server:3306/oge_arco' as user 'oge_arco_write'
29/09/2016 13:02:36||iter.db.Database.getConnection|D|connection Connection 1 (null@jdbc:mysql://server:3306/oge_arco) opened
29/09/2016 13:02:36||iter.db.Database.getConnection|D|Thread dbwriter gots connection Connection 1 (null@jdbc:mysql://server:3306/oge_arco)
29/09/2016 13:02:36||riter.db.Database.executeQuery|D|Execute sql: select count(*) from sge_host
We don't see any FileParser messages or anything like them. Here's the output of
qconf -mconf regarding reporting:
reporting_params accounting=true reporting=true \
flush_time=00:00:15 joblog=true sharelog=00:00:00
And the output of qconf -me global:
hostname global
load_scaling NONE
complex_values imc_lic=150,matlab_lic=40,sas_lic=32,clc_lic=1
user_lists NONE
xuser_lists NONE
projects NONE
xprojects NONE
usage_scaling NONE
report_variables NONE
Everything appears to be in order. We haven't had report_variables defined
before, but it was still logging data to the database. Any idea of what could
be happening?
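
One way to confirm the missing FileParser activity is to look only at the log
lines written since the most recent startup. Below is a minimal Python sketch
of that check, not a tested tool. The function name is made up for
illustration. It assumes each entry in the actual dbwriter.log file sits on a
single line, and that a startup is marked by the text "Starting up dbwriter",
as in the logs shown here.

```python
def fileparser_lines_since_last_start(log_text):
    """Return the FileParser log lines written after the most recent startup.

    Assumption: a startup is marked by the text "Starting up dbwriter" and
    every FileParser entry contains the text "FileParser", as in the
    excerpts in this message.
    """
    lines = log_text.splitlines()

    # Index of the last startup banner; stays -1 if the log has none,
    # in which case the whole log is scanned.
    last_start = -1
    for i, line in enumerate(lines):
        if "Starting up dbwriter" in line:
            last_start = i

    return [ln for ln in lines[last_start + 1:] if "FileParser" in ln]
```

Reading dbwriter.log into a string and passing it to this function returns an
empty list when dbwriter has not processed any reporting file since the last
restart, which is the situation described above.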
Scott Lucas
HPC Applications Support
208-776-0209
[email protected]