How to Raise an Alert if an OIC Scheduled Job Fails to Trigger?
Summary:
We aim to set up an alarm for cases where OIC scheduled jobs fail to run due to unexpected issues—such as the instance being down, performance degradation, or jobs getting stuck in the queue.
Currently, we're using Alarm Definitions in OCI to monitor and trigger an absence alarm when a job doesn’t run. However, the challenge lies in monitoring for job absence only during its scheduled execution window. For example, if a job is scheduled to run daily at 3:00 AM, monitoring it at 1-hour intervals would result in the alert being triggered repeatedly every hour after 3:00 AM, since the job would not run again until the following day.