out of buffer metric on Exadata-X10M
Hi All,
We see a constant increase in the ExaWatcher rocestat out_of_buffer metric.
This metric consistently increases on both compute and cell nodes.
As a result, our DB is affected and we see "Cluster gc events".
We have opened several SRs with the DB, Linux OS, and Cisco switch teams. On the OS side, we increased the net.ipv4.tcp_[rw]mem max size from 16MB to 128MB and the ring buffer size from 1024 to 2048. Finally, we upgraded the image version to 25.1.10.0.0.251012.1, but unfortunately none of these actions resolved our problem: the steady increase of the RoCE out_of_buffer metric.
Has anyone faced the same issue and resolved it?
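
For anyone who wants to compare numbers, here is a minimal sketch of how the increase could be expressed as a per-second rate instead of a raw counter value. It is not part of ExaWatcher. The sysfs path, the device name mlx5_0, and port 1 are assumptions about a typical Mellanox RoCE adapter on Linux and may differ on your nodes; the function names are made up for illustration.

```python
import time
from pathlib import Path

# Assumption: the RoCE adapter exposes out_of_buffer under hw_counters.
# The device name (mlx5_0) and port number (1) are examples only.
COUNTER_PATH = "/sys/class/infiniband/mlx5_0/ports/1/hw_counters/out_of_buffer"


def read_counter(path):
    """Read a single integer counter value from a sysfs-style text file."""
    return int(Path(path).read_text().strip())


def rates_per_second(samples):
    """Turn [(timestamp_seconds, counter_value), ...] into per-second deltas.

    Each result is the counter increase between two consecutive samples
    divided by the elapsed time. Pairs with no elapsed time are skipped.
    """
    rates = []
    for (t0, v0), (t1, v1) in zip(samples, samples[1:]):
        elapsed = t1 - t0
        if elapsed <= 0:
            continue
        rates.append((v1 - v0) / elapsed)
    return rates


def sample(path, count=5, interval=1.0):
    """Collect `count` (timestamp, value) samples, `interval` seconds apart."""
    samples = []
    for _ in range(count):
        samples.append((time.monotonic(), read_counter(path)))
        time.sleep(interval)
    return samples


if __name__ == "__main__":
    for rate in rates_per_second(sample(COUNTER_PATH)):
        print(f"{rate:.1f} out_of_buffer increments/s")
```

Running this on a compute node and a cell node at the same time would show whether the rate is similar on both sides or whether one side drops noticeably more.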