Skip to Main Content

DevOps, CI/CD and Automation

Announcement

For appeals, questions and feedback about Oracle Forums, please email oracle-forums-moderators_us@oracle.com. Technical questions should be asked in the appropriate category. Thank you!

Interested in getting your voice heard by members of the Developer Marketing team at Oracle? Check out this post for AppDev or this post for AI focus group information.

Ideas for implementing Database Reliability Engineering

User_45CRXFeb 4 2022 — edited Feb 4 2022

Hey Folks,
I am just exploring the Ideas for implementing Database Reliability Engineering at my Org. We have almost 4000+ DB's (mostly with Oracle) and a few database techs. We have infrastructure spanned across on-premise and OCI ExadataCS.
Looking for ideas or anything that has been in place related to DBRE. I am trying to get a few metrics like DB Uptime, Query Response time (for top 5 pinned queries by app), security, etc and feed that as an input to showcase it in a dashboard view.
By such many metrics, we are planning to show the database reliability score on live time. I understand it's a vast topic but still, if someone has some ideas already around this space, it could be better to start with them.

Comments

Jasmin Fluri

If you already have a metrics aggregation tool in place like Prometheus (Overview | Prometheus) one way would be to create custom dashboard views with something like Grafana (Grafana: The open observability platform | Grafana Labs).
But there's a plethora of tools out there - depending on what you already have, they could be a possibility or not.
For DBRE in distributed systems (not only limited to the Oracle database) tools like Honeycomb (Observability for Distributed Services | Honeycomb) can help you analyse system behaviours.

1 - 1

Post Details

Added on Feb 4 2022
1 comment
451 views