Problem Status : open
Diag Engine : fdd 1.0
System
Manufacturer : Oracle Corporation
Name : ORACLE SERVER X9-2L
Part_Number : 7603374-24643
Serial_Number : 2309XC0002
----------------------------------------
Suspect 1 of 1
Problem class : fault.chassis.device.sppost
Certainty : 100%
Affects : /SYS/MB
Status : faulted
FRU
Status : faulty
Location : /SYS/MB
Manufacturer : Oracle Corporation
Name : ASM,MTHRBD,2U
Part_Number : 8207825
Revision : 12
Serial_Number : 465136N+2253Y5004B
Chassis
Manufacturer : Oracle Corporation
Name : ORACLE SERVER X9-2L
Part_Number : 7603374-24643
Serial_Number : 2309XCF01G
Description : The Service Processor power-on self test has detected a
problem.
Response : The service-required LED may be illuminated on the affected
FRU and chassis.
Impact : The Service Processor may not be able to perform necessary
functions to power on, monitor, or manage the system.
Action : Please refer to the associated reference document at http://support.oracle.com/msg/ILOM-8000-4T for the latest
service procedures and policies regarding this diagnosis.
HWdiag - Build Number 145377 (Apr 16 2022, 11:43:58)
Current Date/Time: Dec 16 2022, 10:28:08
Note: Turn off host to access DIMMs over i2c.
I2C DEVICE CHIP NAME BUS/MUX1/CH1/MUX0/CH0/ADDR RESULT
-------------------------------------------------------------------------------------------------------------------
/SYS/MB/CPLD XC6SLX16 CPLD 1/FF/FF/FF/FF/4E OK
/SYS/MB/FM0 ADT7462 FAN_CTRL_0 1/FF/FF/FF/FF/B0 OK
/SYS/MB AT24C64 MB_FRU 3/FF/FF/FF/FF/A0 OK
/SYS/MB/CPU0_DIMM_LED PCA9554 CPU0_DIMM_LED 3/FF/FF/FF/FF/70 OK
/SYS/MB/CPU1_DIMM_LED PCA9554 CPU1_DIMM_LED 3/FF/FF/FF/FF/7E OK
/SYS/MB/CPU0_CPU1_DIMM_LED PCA9555 CPU0_CPU1_DIMM_LED 3/FF/FF/FF/FF/40 OK
/SYS/MB/CPU0_CPU1_LED PCA9554 CPU0_CPU1_LED 3/FF/FF/FF/FF/76 OK
/SYS/MB/PCA9547_ROT PCA9547 PCA9547_ROT 4/FF/FF/FF/FF/E0 OK
/SYS/MB/P0/CPU0_PIROM PIROM CPU0_PIROM 8/FF/FF/FF/FF/A0 OK
/SYS/MB/P1/CPU1_PIROM PIROM CPU1_PIROM 8/FF/FF/FF/FF/A2 OK
/SYS/MB/RTC DS1338 RTC 8/FF/FF/FF/FF/D0 OK
/SYS/MB/VCORE_CPU0 XDPE12284 VCORE_CPU0 8/FF/FF/FF/FF/B0 OK
/SYS/MB/VCORE_CPU1 XDPE12284 VCORE_CPU1 8/FF/FF/FF/FF/B4 OK
/SYS/MB/VDDQ_CPU0 XDPE12284 VDDQ_CPU0 8/FF/FF/FF/FF/D8 OK
/SYS/MB/VDDQ_CPU1 XDPE12284 VDDQ_CPU1 8/FF/FF/FF/FF/DC OK
/SYS/MB/VCCIO_CPU0 XDPE12284 VCCIO_CPU0 8/FF/FF/FF/FF/C0 OK
/SYS/MB/VCCIO_CPU1 XDPE12284 VCCIO_CPU1 8/FF/FF/FF/FF/C4 OK
/SYS/MB/VPCIEG4_CPU0 XDPE12284 VPCIEG4_CPU0 8/FF/FF/FF/FF/B8 OK
/SYS/MB/VPCIEG4_CPU1 XDPE12284 VPCIEG4_CPU1 8/FF/FF/FF/FF/BC OK
/SYS/MB/PCA9546_SMB8 PCA9546 PCA9546_SMB8 9/FF/FF/FF/FF/E0 OK
/SYS/MB/PCA9546_SMB9 PCA9546 PCA9546_SMB9 10/FF/FF/FF/FF/E0 OK
/SYS/MB/FAN_9554 PCA9554 FAN_9554 10/FF/FF/E0/00/70 OK
/SYS/MB/FRONT_9552 PCA9552 FRONT_9552 10/FF/FF/E0/01/C0 OK
/SYS/12DBP AT24C64 DB12_FRU 10/FF/FF/E0/03/A4 OK
/SYS/DBP/CPLD XC6SLX16 DB_FPGA 10/FF/FF/E0/03/56 OK
/SYS/T_AMB ADT7461 TS_DBP 10/FF/FF/E0/03/98 OK
/SYS/MB/PCA9547_SMB9 PCA9547 PCA9547_SMB9 10/FF/FF/FF/FF/EE OK
/SYS/PS0/FRU AT24C64 PS0_FRU 10/FF/FF/EE/01/A0 OK
/SYS/PS0/DATA A269 PS0_DATA 10/FF/FF/EE/01/B0 OK
/SYS/PS1/FRU AT24C64 PS1_FRU 10/FF/FF/EE/02/A0 OK
/SYS/PS1/DATA A269 PS1_DATA 10/FF/FF/EE/02/B0 OK
/SYS/MB/REAR_9552 PCA9552 REAR_9552 10/FF/FF/EE/03/C0 OK
/SYS/MB/PCA9547_SMB0 PCA9547 PCA9547_SMB0 1/FF/FF/FF/FF/E0 OK
/SYS/MB/T_IN_ZONE0 ADT7461 TS_Z0_I 1/FF/FF/E0/00/98 OK
/SYS/MB/T_OUT_ZONE0 ADT7461 TS_Z0_E 1/FF/FF/E0/01/98 OK
/SYS/MB/T_IN_ZONE1 ADT7461 TS_Z1_I 1/FF/FF/E0/02/98 OK
/SYS/MB/T_OUT_ZONE1 ADT7461 TS_Z1_E 1/FF/FF/E0/03/98 OK
/SYS/MB/T_IN_ZONE2 ADT7461 TS_Z2_I 1/FF/FF/E0/04/98 OK
/SYS/MB/T_OUT_ZONE2 ADT7461 TS_Z2_E 1/FF/FF/E0/05/98 OK
/SYS/MB/T_IN_ZONE3 ADT7461 TS_Z3_I 1/FF/FF/E0/06/98 OK
/SYS/MB/T_OUT_ZONE3 ADT7461 TS_Z3_E 1/FF/FF/E0/07/98 OK
/SYS/DBP/PCA9547_DBP PCA9547 PCA9547_DBP 2/FF/FF/FF/FF/E2 OK
/SYS/DBP/PCA9547_2_DBP PCA9547 PCA9547_2_DBP 2/FF/FF/E2/00/E4 OK
/SYS/MB/PCA9547_SMB4 PCA9547 PCA9547_SMB4 5/FF/FF/FF/FF/E0 OK
/SYS/MB/PCA9547_SMB6 PCA9547 PCA9547_SMB6 7/FF/FF/FF/FF/E4 OK
/SYS/MB/PCA9546_RETIMER_SMB PCA9546 PCA9546_RETIMER_SMB 7/FF/FF/E4/05/E6 Access failed <<<<<<<<<
/SYS/MB/RETIMER_FRU AT24C64 RETIMER_FRU 7/E4/05/E6/00/A4 Access failed <<<<<<<<<
/SYS/MB/RETIMER_9554 PCA9554 RETIMER_9554 7/E4/05/E6/00/78 Access failed <<<<<<<<<
/SYS/MB/RETIMER_TEMP ADT7461 TS_RETIMER 7/E4/05/E6/00/98 Access failed <<<<<<<<<
/SYS/MB/RETIMER_ASIC1 DS160PT801 RETIMER_ASIC1 7/E4/05/E6/01/20 Access failed <<<<<<<<<
/SYS/MB/RETIMER_CLK IDT9DBL0851 RETIMER_CLK 7/E4/05/E6/02/D6 Access failed <<<<<<<<<
I2C Test Result: FAILED
改变:Factory setting in the BIOS for the Retimer card that is not used on ODA X9-2HA and ODA X9-2S systems.
原因:
PCIe slot 10 is designated on ODA X9-2L system for a Retimer card, which requires a special setting "x4x4x4x4 NVME Retimer Hotplug" in the BIOS to function properly.
On ODA X9-2HA and ODA X9-2S systems this card is not used and installed, but if the BIOS setting for the card is set to that specific "x4x4x4x4 NVME Retimer Hotplug" setting, the ILOM will try to access it when it is reset and because it does not find it, it will report the access failed errors in the i2c test.
解决办法:
BIOS中去掉这个设置即可。注意,需要停机
To correct the issue, enter the BIOS of the ODA node and change the setting:
1. To access the BIOS, press F2 (Ctrl+E from a serial connection) to launch the BIOS Setup Utility when prompted in the BIOS screen at boot.
Alternatively, the boot device can be set from the ILOM CLI by running the following commands, by first stopping the system, setting the boot device, starting the system and the console.
->stop -f /SYS
Are you sure you want to immediately stop /SYS (y/n)? y
Stopping /SYS immediately
->set /HOST boot_device=bios
Set 'boot_device' to 'bios'
->start /SYS
Are you sure you want to start /SYS (y/n)? y
Starting /SYS
->start /SP/console
Are you sure you want to start /SP/console (y/n)? y
Serial console started. To stop, type ESC (
2. On the BIOS Setup Utility screen, select the IO tab, select PCIE Connector Special Configuration, and press Enter.