2 Replies Latest reply: Feb 12, 2013 2:12 PM by PradeepKPathak RSS

    Best practice on monitoring Endeca health / defining outage

    Jim Song
      I am looking for best practice on how to define Endeca service outage and monitor the health of the system. I understand this depends on your user requirements and it may vary from customer to customer. Specifically what criteria do you use to notify your engineer there is a problem? We have our load balancers pinging dgraphs on an interval. However the ping operation is not sufficient in our use case. We are also experimenting running a "low cost" query to the dgraphs on an interval and using some query latency thresholds to determine outage. I want to hear from people on the field running large commercial web site about your best practice of monitoring/notifying health of the system.

      Thanks.

      Edited by: Jim Song on Jan 3, 2013 10:51 AM