Time to checkpoint is increasing on Goldengate integrated replicat
Hi,
Fairly new to GoldenGate.
Recently experienced a couple of replication outages on an integrated replicat instance. No errors or abends in the GG logs but when I check the summary status I see a lag on the time since checkpoint value:
Program Status Group Lag at Chkpt Time Since Chkpt
MANAGER RUNNING
JAGENT STOPPED
REPLICAT RUNNING <NAME> 00:00:00 01:19:46
REPLICAT RUNNING <NAME> 00:00:00 01:19:32
Replicat processes stopped responding to simple console commands, such as status <name>.... just gets a timeoout error after a while.
The only way I could recover was by issuing a KILL on the replicat processes and then restarting - wouldn't respond to a normal stop. Problem is intermittent and will probably occur again in a day or two. Had a look online and seen suggestions that it could be related to long running transactions, but should not be any on this target system, it is characterized by lots of small insert/updates that tend commit sub-second.