OCI: OKE Oracle Linux instance hangs intermittently and mlx5_core: transmit queue 0 timed out

edited Apr 18, 2025 7:54AM in Linux

Applies To:
Oracle Cloud Infrastructure - Version N/A and later
Linux x86-64 on Oracle Public Cloud

Symptoms:
In an OKE cluster environment, an Oracle Linux VM intermittently goes into a hung state. The VM must be rebooted to recover.

Cause:
This is a known issue with Mellanox firmware version 22.38.1002.
Bug 37745582 - Node hang and (mlx5_core): transmit queue 0 timed out messages are seen

The firmware version in use is shown in the ethtool driver information for the interface:
# ethtool -i <interface>
driver: mlx5_core
version: 5.15.0-210.163.7.el8uek.x86_64
firmware-version: 22.38.1002 (ORC0000000007)
expansion-rom-version:
bus-info: 0000:00:03.0
supports-statistics: yes
supports-test: yes
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: yes

Solution:

Update the Mellanox firmware to version 22.39.1002 or later.
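
To check several nodes quickly, the ethtool output shown under Cause can be compared against the fixed version from the Solution. The following is a minimal sketch, not an Oracle-provided tool: the function names (fw_version, fw_older_than, check_all_interfaces) and the FIXED_FW variable are hypothetical, and it assumes bash, awk, ethtool, and GNU sort (for the -V version sort) are available on the node.

```shell
#!/usr/bin/env bash
# Sketch: report mlx5_core interfaces whose firmware predates the fixed version.
# FIXED_FW is taken from the Solution section; all helper names are hypothetical.

FIXED_FW="22.39.1002"

# Read `ethtool -i` output on stdin and print only the numeric firmware version,
# e.g. "firmware-version: 22.38.1002 (ORC0000000007)" -> "22.38.1002".
fw_version() {
    awk '/^firmware-version:/ {print $2; exit}'
}

# Return 0 (true) if version $1 is strictly older than version $2.
# Uses GNU `sort -V` to order the two version strings.
fw_older_than() {
    [ "$1" != "$2" ] &&
        [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n1)" = "$1" ]
}

# Walk every network interface and report the firmware state of mlx5_core devices.
check_all_interfaces() {
    local dev info fw
    for dev in /sys/class/net/*; do
        dev="${dev##*/}"
        info="$(ethtool -i "$dev" 2>/dev/null)" || continue
        printf '%s\n' "$info" | grep -q '^driver: mlx5_core$' || continue
        fw="$(printf '%s\n' "$info" | fw_version)"
        [ -n "$fw" ] || continue   # skip devices that report no firmware version
        if fw_older_than "$fw" "$FIXED_FW"; then
            echo "$dev: firmware $fw is older than $FIXED_FW"
        else
            echo "$dev: firmware $fw is $FIXED_FW or later"
        fi
    done
}
```

After sourcing the file, run check_all_interfaces on the node. Any interface reported as older than 22.39.1002 is a candidate for the firmware update described above. The kernel log can also be searched for the timeout message, for example with dmesg | grep 'transmit queue'.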
