cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
1342
Views
0
Helpful
7
Replies

UCS Fabric Interconnect hangs

Jay UCS
Level 1
Level 1

My Fabric Interconnects 6248 hangs with the following single line message "

N5000 BIOS v.3.5.0, Thu 02/03/2011, 05:12 PM 

 I've tried going into the Golden Bios and showing debug message and the following was displayed before it hanged. 

N5000 BIOS v.3.5.0, Thu 02/03/2011, 05:12 PM 

Booting Golden BIOS

Enable BIOS debug messages (y/n) : 

N5000 BIOS v.3.5.0, Thu 02/03/2011, 05:12 PM 

--- Turning ON Amber LED --- 
--- Turning ON Blue LED --- 
MEPlatformPEI.Entry(FFFA74E8)
ME Platform PEI Init start.
Register PPI Notify: 64c96700-6b4c-480c-a3e1-b8bde8f602b2
ME Platform PEI Init end.
HECIPEI.Entry(FFFA9E84)
HECI PEI Init start.
HECI PEI Init end.
CsiandMrcInit.Entry(FFFB1A90)
========== Entering UNCOREINIT ========== 
RtidProfile before: 0
Setup variable is valid - QPI BS (BIOS STUFF).
MemCeil : 0xD0000000
Pci Resource Mem32 Range: 0xD0000000 - 0xFBFFFFFF
PciResourceMem64Base: 0x100000000
RB[0] Mem64 Range : 0x100000000 - 0x7fffffffff
RB[1] Mem64 Range : 0x0 - 0x0
RtidProfile after: 0
Tohm : 0x0 
HighMmioBase : 0x100000000 
HighMmioLimit : 0x8000000000 
========== Enter CSI RC code ========== 
========== Exit CSI RC code ========== 
LowGap: 12
Notify: PPI Guid: 71a8917b-0891-4e27-8a73-a9b334840393, Peim notify entry point: fff9d798


<<<< SUP_NUOVA_OR2 System Information >>>>
  << CpuID_EAX(1) = 0x106e4>>  
  << CPU Revision ID = 0x10, C0-Stepping >>  
  << Cores : 0x0>>  
  << HyperThreading : Enabled>>  
<<IbexPeak Revision ID = 0x6, B3-Stepping>>
<<Loaded Microcode Revision on BSP = ffff0002>>

========== Enter MRC code ========== 


MRC rev: 00900000 
Memory behind processor 0 running at DDR3-1066
RDIMM population
Channel Early Config
DDR training
<<<< DDR Vref Training Disabled >>>>
Command/Clock TrainingRead DQ/DQS
Receive Enable
Write Leveling
Write Leveling Fix-up
Write DQ/DQS
Command phase 0  Re-center RdDqs  Re-center WrDq   Re-run Rd Vref   RTL              
Checking margins for all ranks with loop count = 10...

            RxDqLeft RxDqRight RxVLow RxVCenter RxVHigh TxDqLo TxDqHi TxVLow TxVCenter TxVHigh
---------------------------------------------------------------------------------------------------------------------
N0.C0.D0.R0:   18       18       27      -1       25      25     25      31      0      31
N0.C0.D0.R1:   17       17       28      -1       26      24     25      31      0      31
N0.C1.D0.R0:   18       17       27      -1       23      24     25      31      0      31
N0.C1.D0.R1:   18       17       27      -1       24      24     26      31      0      31
Independant channel mode enabled on socket 0
ECC is enabled
Start Hardware Memory Init

MemInit latency             1815 ms
Initialize Memory Map
Clear Errors
Set RAS Config

Total MRC latency = 7074 ms


MRC latency - MemTest and MemInit = 5259 ms
Software memory test Passed!

 DIMM location  | dimmPresent | Ranks |   Size   |  mapOut  | Mfg. ID |   Mfg. Date   | DRAM Id |    Part #
________________|_____________|_______|__________|__________|_________|_______________|_________|__________________
    N0.C0.D0    |      1      |   2   |  8192 MB |    0     | Samsung |   2017 WW35   | Samsung |M393B1K70CH0-YH9 
    N0.C0.D1    |      0
    N0.C1.D0    |      1      |   2   |  8192 MB |    0     | Samsung |   2017 WW35   | Samsung |M393B1K70CH0-YH9 
    N0.C1.D1    |      0
    N0.C2.D0    |      0
    N0.C2.D1    |      0

MRC is done!
========== Exit  MRC code ========== 
Notify: PPI Guid: 64c96700-6b4c-480c-a3e1-b8bde8f602b2, Peim notify entry point: fffa7938
Install PEI Memory.
Memory Installed: Address=BF800000; Length=10000000
PEI_STACK: Address=BF800000; Length=100000
HOBLIST address before memory init = 0xffae0000
HOBLIST address after memory init = 0xbf900000
InstallMrcPpi

Data in MemoryInit
sizeof(struct hostNvram): 3300
sizeof(MRC_HOST_HOB): 3336
sizeof(MEMORY_INFO): 3344
MrcHostHob->SizeOf_NvRam: 3300
MrcHostHob->SizeOf_MRCHOSTHOB: 3336
MrcHostHob->SizeOf_MEMORYINFO: 3344

MemoryInit Completion!!!! 
After PeiCpuWorkaround (UncoreInitLateInit) 
After PeiIohWorkaround (UncoreInitLateInit) 
Tohm : 0x42C000000 
HighMmioBase : 0x430000000 
HighMmioLimit : 0x8000000000 
Updated PciResourceMem64Base: 0x430000000
Updated RB[0] Mem64 Range : 0x430000000 - 0x7FFFFFFFFF
After OemUncoreInitHook  (UncoreInitLateIohChipsetInit) 
--- Turning OFF Blue Led --- 
After OemUncoreInitHook (UncoreInitLateInit)  
========== Exiting UNCOREINIT ========== 
PEI core reallocated to memory
Total Cache as RAM:    131072 bytes.
  CAR stack ever used: 65532 bytes.
  CAR heap used:       6904 bytes.
Notify: PPI Guid: f894643d-c449-42d1-8ea8-85bdd8c65bde, Peim notify entry point: fff9d908
OEMPEI.Entry(FFFF9D00)
S3Resume.Entry(FFC80A34)
CpuPei.Entry(FFF9493C)
Starting all APs.
Load AP Microcode. BSP and maybe NBSPs Microcode loaded in SEC.
Enable BSP and AP Cache.
APs waiting.
NumCpus = 2.
CPU initialization.
BSP APIC ID = 0.
Register PPI Notify: 605ea650-c65c-42e1-ba80-91a52ab618c6
Exit PEI CPU Driver.
CK505Pei.Entry(FFF9B374)
CK505Pei_Init Entry.
Program CK505 start.
Select the SMBUS mux. Assuming to be on SMBUS Segment 0
Program CK505 end.
PEIM 8401a046-6f70-4505-8471-7015b40355e3 was not started!!
PEIM e008b434-0e73-440c-8612-a143f6a07bcb was not started!!
DXE IPL Entry(CF7F67D0)
CORE_DXE.Entry(CF63FDF0)
Notify: PPI Guid: 605ea650-c65c-42e1-ba80-91a52ab618c6, Peim notify entry point: fff946bc
DXE Status Code Available
[13AC6DD0-73D0-11D4-B06B-00AA00BD6DE7].Entry(15996A0)
Runtime.Entry(CF423AE0)
GenericSIO: Variable Write Available!
GenericSIO: Get ISA_IRQ_MASK Status=EFI_NOT_FOUND Updating with DEFAULT E305
GenericSIO: Get ISA_DMA_MASK Status=EFI_NOT_FOUND Updating with DEFAULT 10
AmiBoardInfo.Entry(15F92C0)
SBRun.Entry(CF40CF90)
ACPIS3Save.Entry(160AA00)
SmbiosGetFlashData64.Entry(15F5720)
ReFlash.Entry(161EA70)
CpuDxe.Entry(164C190)

Intended Cpu Freq = 1729
Actual Cpu Freq = 1745
AcpiResLib: LibGetDsdt(): LocateProtocol(ACPISupport) returned EFI_NOT_FOUND 
ACPI.Entry(1674420)
IN ACPI Start: 0
IN ACPI 1: 0
DSDT21 addres 0x15FAB50; -> EFI_SUCCESS 
AcpiResLib: LibGetDsdt(): Found v1.0b   RSDT->DSDT @ 0x15FAB50; -> EFI_SUCCESS 
AcpiResLib: LibGetDsdt(): Found v2.0&UP XSDT->DSDT @ 0x15FAB50; -> EFI_SUCCESS 
SIO[0]: Aml=> Collected 5 DepFunc Items of UAR1 Object 
SIO: Updating ISA_IRQ_MASK = 0xE305 with 0x10 for IRQ# 4
SIO[1]: Aml=> Failed to Locate ASL Object UAR2 in DSDT at 015FAB50
SIO[1]: Tbl=> Entry is Empty! Check 'YourSioName_DevLst' table.
GenericSIO: FAIL to Enumerate SIO Chip # 1 EFI_NOT_FOUND
AcpiResLib: LibGetDsdt(): Found v1.0b   RSDT->DSDT @ 0x15FAB50; -> EFI_SUCCESS 
AcpiResLib: LibGetDsdt(): Found v2.0&UP XSDT->DSDT @ 0x15FAB50; -> EFI_SUCCESS 
ACPI: SetAcpiTable() Table=0x1006318; Handle=0xBF8FF520; *Handle=0x0
ACPI: SetAcpiTable() Exiting... Status = EFI_SUCCESS
PciRootBridge.Entry(16D2880)
PciHostCSHooks: LocateProtocol(ACPISupport)=EFI_SUCCESS
ACPI: SetAcpiTable() Table=0x161B298; Handle=0x16F7FB0; *Handle=0x0
ACPI: SetAcpiTable() Exiting... Status = EFI_SUCCESS
PciHostCSHooks: ACPISupport->SetAcpiTable(MCFG) = EFI_SUCCESS
CspLibDxe.Entry(161E240)
NBDXE.Entry(16B5560)
TSEG Base: cf800000
GCD: AddMemSpace   B=CF800000, L=800000, i=0, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=CF800000, L=800000, i=0, S=EFI_SUCCESS
GCD: AddMemSpace   B=FEE00000, L=100000, i=1, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FEE00000, L=100000, i=1, S=EFI_SUCCESS
GCD: AddMemSpace   B=FEB00000, L=100000, i=2, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FEB00000, L=100000, i=2, S=EFI_SUCCESS
GCD: AddMemSpace   B=FEA00000, L=20, i=3, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FEA00000, L=20, i=3, S=EFI_SUCCESS
GCD: AddMemSpace   B=FE800000, L=200000, i=4, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FE800000, L=200000, i=4, S=EFI_SUCCESS
GCD: AddMemSpace   B=FE000000, L=800000, i=5, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FE000000, L=800000, i=5, S=EFI_SUCCESS
GCD: AddMemSpace   B=FD000000, L=1000000, i=6, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FD000000, L=1000000, i=6, S=EFI_SUCCESS
GCD: AddMemSpace   B=FC000000, L=1000000, i=7, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FC000000, L=1000000, i=7, S=EFI_SUCCESS
GCD: AddMemSpace   B=FBFFE000, L=2000, i=8, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FBFFE000, L=2000, i=8, S=EFI_SUCCESS
GCD: AddMemSpace   B=D0000000, L=10000000, i=9, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=D0000000, L=10000000, i=9, S=EFI_SUCCESS
GCD: AddI/OSpace   B=CF8, L=8, i=10, S=EFI_ACCESS_DENIED
GCD: AllocI/OSpace B=CF8, L=8, i=10, S=EFI_SUCCESS
GCD: SpaceAttr A=8000000000000000 B=CF800000, L=800000, i=0, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FEE00000, L=100000, i=1, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FEB00000, L=100000, i=2, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FEA00000, L=20, i=3, S=EFI_UNSUPPORTED
GCD: SpaceAttr A=1 B=FE800000, L=200000, i=4, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FE000000, L=800000, i=5, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FD000000, L=1000000, i=6, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FC000000, L=1000000, i=7, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FBFFE000, L=2000, i=8, S=EFI_SUCCESS
GCD: SpaceAttr A=8000000000000001 B=D0000000, L=10000000, i=9, S=EFI_SUCCESS
Initializing NTB for IIO 0 
Port 0:3 Active.  Max Link Width = 4
Port 0:4 Active.  Max Link Width = 4
Port 0:5 Active.  Max Link Width = 4
Port 0:6 Active.  Max Link Width = 4
TSEG Address cf800000.
TSEG size 800000.
Creating Memory Data for SMBIOS.
 --DIMM Speed = 1067 MHz
 --Array[0] Device[0] Size = 8192 MB
 --Array[0] Device[2] Size = 8192 MB
LegacyInterrupt.Entry(16A7380)
LegacyRegion.Entry(16BB550)
VTdDXE.Entry(16C4210)
ERROR: VTdDXE.Entry(16C4210)=EFI_UNSUPPORTED
SMBiosBoard.Entry(16C1460)
OEMDXE.Entry(16C52F0)
SBDXE.Entry(17132A0)
GCD: AddMemSpace   B=FF000000, L=1000000, i=0, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FF000000, L=1000000, i=0, S=EFI_SUCCESS
GCD: AddMemSpace   B=FED1C000, L=4000, i=1, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FED1C000, L=4000, i=1, S=EFI_SUCCESS
GCD: AddMemSpace   B=FED00000, L=4000, i=2, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FED00000, L=4000, i=2, S=EFI_SUCCESS
GCD: AddMemSpace   B=FEC00000, L=100000, i=3, S=EFI_ACCESS_DENIED
GCD: AllocMemSpace B=FEC00000, L=100000, i=3, S=EFI_SUCCESS
GCD: AddI/OSpace   B=400, L=80, i=4, S=EFI_ACCESS_DENIED
GCD: AllocI/OSpace B=400, L=80, i=4, S=EFI_SUCCESS
GCD: AddI/OSpace   B=500, L=80, i=5, S=EFI_ACCESS_DENIED
GCD: AllocI/OSpace B=500, L=80, i=5, S=EFI_SUCCESS
GCD: AddI/OSpace   B=1180, L=20, i=6, S=EFI_ACCESS_DENIED
GCD: AllocI/OSpace B=1180, L=20, i=6, S=EFI_SUCCESS
GCD: SpaceAttr (UC ALL) B=FEC00000; L=1400000; 
GCD:(UC ALL) setting  B=FEC00000, L=104000, A=1; S=EFI_SUCCESS
GCD:(UC ALL) setting  B=FED04000, L=18000, A=1; S=EFI_SUCCESS
GCD:(UC ALL) setting  B=FED1C000, L=4000, A=1; S=EFI_SUCCESS
GCD:(UC ALL) setting  B=FED20000, L=E0000, A=1; S=EFI_SUCCESS
GCD:(UC ALL) skipping B=FEE00000, L=100000, A=1; S=EFI_SUCCESS
GCD:(UC ALL) setting  B=FEF00000, L=100000, A=1; S=EFI_SUCCESS
GCD:(UC ALL) setting  B=FF000000, L=1000000, A=1; S=EFI_SUCCESS
GCD: SpaceAttr A=8000000000000001 B=FF000000, L=1000000, i=0, S=EFI_SUCCESS
GCD: SpaceAttr A=8000000000000001 B=FED1C000, L=4000, i=1, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FED00000, L=4000, i=2, S=EFI_SUCCESS
GCD: SpaceAttr A=1 B=FEC00000, L=100000, i=3, S=EFI_SUCCESS
HPET: FED1F404 = 80
HPET LocateProtocol(ACPISupport)- EFI_SUCCESS Success
ACPI: SetAcpiTable() Table=0x16C3798; Handle=0x171F2B8; *Handle=0x0
ACPI: SetAcpiTable() Exiting... Status = EFI_SUCCESS
ACPISupport.SetAcpiTable() = EFI_SUCCESS 
MEPlatformDXE.Entry(1728780)
HECIDXE.Entry(17202E0)
MeCore.Entry(1724380)
BiosExtension.Entry(1730CF0)
FileSystem.Entry(17376F0)
PciBus.Entry(17B47D0)
SBIDE.Entry(1793CC0)
SBAHCI.Entry(1799D90)
AHCI.Entry(17CF2A0)
AINT13.Entry(179E280)
AMITSE.Entry(180F568)
CSMCORE.Entry(1830C00)
BIOSBLKIO.Entry(17EA760)
Terminal.Entry(1848990)
SMBios64.Entry(18A4990)
SmmBase.Entry(18B1050)
SMM.SmmDispatcher.Entry(CF81A310)
SMM.Runtime.Entry(CF822AE0)
FwBootDriver.Entry(180B510)
SmmChildDispatcher.Entry(1D07980)
SMM.SmmChildDispatcher.Entry(CF837980)
AhciSmm.Entry(18AA6B0)
SMM.AhciSmm.Entry(CF83E6B0)
ERROR: AhciSmm.Entry(CF83E6B0)=EFI_NOT_FOUND
ERROR: AhciSmm.Entry(18AA6B0)=EFI_NOT_FOUND
SmmRuntime.Entry(CF3FC310)
SMM.SmmRuntime.Entry(CF83D310)
MicrocodeUpdate.Entry(1D4A2B0)
SMM.MicrocodeUpdate.Entry(CF8442B0)
SBSMI.Entry(1D522F0)
SMM.SBSMI.Entry(CF84C2F0)
AcpiModeEnable.Entry(1D5B940)
SMM.AcpiModeEnable.Entry(CF850940)
SMIFlash.Entry(1D64C40)
SMM.SMIFlash.Entry(CF855C40)
SmiFlash: Flash Protocol CF831000
PowerButton.Entry(1D46380)
SMM.PowerButton.Entry(CF859380)
SupNuovaTcoHandler.Entry(1D70300)
SMM.SupNuovaTcoHandler.Entry(CF85C300)
InitializeSupNuovaTcoHandler - Not in Smm
InitializeSupNuovaTcoHandler - In SMM.
InitializeSupNuovaTcoHandler - Registering TCO handler
InitializeSupNuovaTcoHandler - Successfully registered
InitializeSupNuovaTcoHandler - Not in Smm
UHCD.Entry(1DA92A0)
SleepSmi.Entry(1D68290)
SMM.SleepSmi.Entry(CF85F290)
SmbiosDMIEdit.Entry(1DA0630)
SMM.SmbiosDMIEdit.Entry(CF861630)
USBRT.Entry(1DFA2A0)
SMM.USBRT.Entry(CF8682A0)
USBINT13.Entry(1DC52A0)
Driver 4a37320b-3fb3-4365-9730-9e89c600395d was discovered but not loaded!!
Driver 899407d7-99fe-43d8-9a21-79ec328cac21 was discovered but not loaded!!
AMI PCI Bus Driver.Start(17C6C30)=<<<< Found PCIe Device at B=0 D=0 F=0 : Dev_Ven=0x370a8086 >>>>
<<<< Found PCIe Device at B=0 D=3 F=0 : Dev_Ven=0x37218086 >>>>
<<<< Found PCIe Device at B=0 D=4 F=0 : Dev_Ven=0x37228086 >>>>
<<<< Found PCIe Device at B=0 D=5 F=0 : Dev_Ven=0x37238086 >>>>
<<<< Found PCIe Device at B=0 D=6 F=0 : Dev_Ven=0x37248086 >>>>
ASSERT in C:\Projects\AMI\O296\argon_o2_src_work\Core\EM\SMM\SmmDispatcher.c on 462: gUseSbsp

 

 

 

 

7 Replies 7

Keny Perez
Level 8
Level 8

We don't  have 5548 Fabric Interconnects... what is the hardware you have? Is it Nexus 5548 instead?

 

-kenny

Sorry, I meant a UCS6248. I've edit the message above. Finger wasn't listening to the brain. Anyway, any idea ? The eqpt was offline for quite some time, and I'm not sure if this is somehow related to the CMOS battery.  

In all my years working with UCS I have not seen that on a case for a Fabric Interconnect yet... is that what you see from the console? (sorry I had to ask)

 

-Kenny

Yes, it is. It hanged with a single message "N5000 BIOS v.3.5.0, Thu 02/03/2011, 05:12 PM " . 

The rest of it appears only if I turn on debug. 

Based on the errors, it would appear that there was a incompatibility with certain BIOS settings on the Fabric Interconnect. And I suspect that these are because of the CMOS battery.

It would appear that it hanged during the BOOT when it try to poll the status of the EFI , AHCI. I suspected the CMOS of the BIOS in the UCS-6248 and had replaced the CMOS battery, so the only question would be , how do I get into the BIOS of the UCS-6248 and reset back to default or is there a button on the motherboard that I could hold and clear everything, or a magic key combination in the console that I can use to either go to the Bios settings or just clear this up. I don't mind a ping offline if you have. 

I would actually reboot the FI and press Ctrl+L at the very beginning so you can get to the load> prompt, load the kickstart and system image and then do a wipe out of the config for that FI.

Process is pretty much as when you are doing a Password recovery and Jeff does a very good kob guiding ppl thru the process:

http://jeffsaidso.com/2011/10/password-recovery-in-cisco-ucs/

 

-Kenny

I may be wrong here and I will have no access to my UCS-6248 for the next 2 weeks as I'm on the road. But the link you forward mentioned, 

---------------

  1. Press ctrl+shift+r a few times as the interconnect boots to interrupt the boot process

    <side note> In a normal boot, the FI is programmed to three specific images:

  • kickstart (kernel)
  • system (system)
  • management (UCSM)

 -----------------

I believe that is not totally correct. The UCS6248 effectively do the following 

BIOS -> Kickstart -> System -> UCSM ,

and right now, this

ASSERT in C:\Projects\AMI\O296\argon_o2_src_work\Core\EM\SMM\SmmDispatcher.c on 462: gUseSbsp

 suggest to me the problem is during the loading of the BIOS and is before the loading of Kickstart, so I will never get a chance get to the KICKStart. 

 

Then you will need to open a TAC case and have them provide you with the file (s)

 

-Kenny

Review Cisco Networking products for a $25 gift card