Hi All, we face a weird problem in our 3 node slony setup. * node1 (db1.domain.tld ) is the master provider and node2 (db2.domain.tld ), node3 (db3.domain.tld ) are subscribers. currently nodes have 5 replication sets and the replication is working fine. * the problem is sl_log_1 and sl_log_2 tables in node1 gets cleaned up properly, but the tables in the node2 and node3 doesn't. On node1 the total number of rows in sl_log_1 table is 24845 and in sl_log_2 it is 0. whereas
node2: relation | size ----------------------------------------------------------+--------- _mhb_replication.sl_log_2 | 130 GB _mhb_replication.sl_log_2_idx1 | 47 GB _mhb_replication.PartInd_mhb_replication_sl_log_2-node-1 | 30 GB node3: relation | size ----------------------------------------------------------+-------- _mhb_replication.sl_log_2 | 133 GB _mhb_replication.sl_log_2_idx1 | 47 GB _mhb_replication.PartInd_mhb_replication_sl_log_2-node-1 | 30 GB _mhb_replication.sl_log_1 | 352 MB in node2 and node3 could see the following lines frequently. slon[20695]: [4031-1] FATAL cleanupThread: "delete from "_mhb_replication".sl_log_1 where log_origin = '1' and log_xid < '2130551154'; delete from slon[20695]: [4031-2] "_mhb_replication".sl_log_2 where log_origin = '1' and log_xid < '2130551154'; delete from "_mhb_replication".sl_seqlog where slon[20695]: [4031-3] seql_origin = '1' and seql_ev_seqno < '51449379'; select "_mhb_replication".logswitch_finish(); " - ERROR: canceling statement slon[20695]: [4031-4] due to statement timeout slon[20695]: [4032-1] DEBUG2 slon_retry() from pid=20695 please find the slony_tools.conf here https://gist.github.com/clement1289/d928acb771ca01a89281 and sl_status /sl_listen output here https://gist.github.com/clement1289/88df40f77c03c691eee5 . Hoping for some help. Regards, Clement _______________________________________________ Slony1-general mailing list Slony1-general@lists.slony.info http://lists.slony.info/mailman/listinfo/slony1-general