cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
12457
Views
5
Helpful
45
Replies

Need Help with Cisco C240 M4SX RAID Controller and Fan Issues!

ServerNewbie
Level 1
Level 1

I recently got my hands on a Cisco C240 M4SX with 24 drive bays. It’s equipped with a Cisco 12G SAS Modular RAID Controller, but I’ve run into an issue I can’t seem to solve.

From what I’ve read, using a non-Cisco PCIe adapter causes the system fans to stay in high-power mode. However, in my case, I am using a Cisco controller, so I’m unsure why the fans are still running at full speed.

I’ve searched everywhere, but it seems like no one has a clear solution. If you’ve dealt with this or have any advice, I’d love to hear:

  • Debugging steps I can take?
  • Any potential fixes or workarounds?
  • Is reflashing the controller firmware worth trying?

I’m open to all suggestions—please share your expertise!

TL;DR: Cisco C240 M4SX fans stuck in high-power mode even with a Cisco 12G SAS Modular RAID Controller. Any advice on fixing this?

45 Replies 45

Steven Tardy
Cisco Employee
Cisco Employee

You also posted in another thread where I recommended changing "fan control setting" to balanced.

What is your CIMC fan control set to (Balanced/Performance/Low Power/High Power/Maximum Power)?

Ideally collect and review a CIMC tech support to understand WHY the fans are spinning fast.

 

Hello! 

The CIMC Fan Control is set to High Power And won't go any lower. I have tried to put it on balanced and low power.

I have attached an image as well. The applied policy says Fan Policy Override - Card(s) "Cisco 12G SAS Modular Pass Through Controller" Present

 

 

Don't see any bugs matching this after a quick search.

CSCwd09815 :: Advisable to use Balance Power Fan Policy

Can you set the fan policy to Balanced and see what happens?

Do any of the temperature sensors have a high value (85+)?

Are you on the latest firmware? 4.1(2m)
If not, then upgrade to 4.1(2m).
If so, then collect a tech support and private message me with the tech support file.

 

Hello,

1. Switched to Balance but I am receiving the same Fan Policy Override issue that I shared in the image to the last message.
2. I do see some sensors that have a higher value: MLOM_TMP 36.0/96.80F. I have attached an image of the temps below
3. I am on the latest firmware
4. I went to Admin > Utilities > Generate Technical Support Data for Local Download and got a tar.gz. Do you need the full tar or did you need a specific file from it?

Thanks!

Steven Tardy
Cisco Employee
Cisco Employee

These older M4 logs make my head hurt as the M5+ newer logs have a much better layout for fan speed logging.

I don't see any issue with the HBA itself.

But did notice one strange thing in the logs somewhat related to the storage controller:

  • var/storphydrive_info_decoded.txt
Status Desc = Drive temperature 34362097685 C: Exceeded threshold of 70 C
  • tmp/storage-data.HBA
%controller "SLOT-HBA" %type "HBA" %physical-drive "1" %group inquiry-data
+host-power: on
+has-error: No
+info-valid: Yes
+error:
+vendor: ATA
+product-id: KINGSTON SA400S37240G
+product-revision-level: SBFKT1A3
+vendor-specific-info: **YOUR DISK SERIAL NUMBER REMOVED**
%controller "SLOT-HBA" %type "HBA" %physical-drive "1" %group smart-info
+host-power: on
+has-error: No
+info-valid: Yes
+error:
+power-cycle-count: 65
+power-on-hours: 2435
+percentage-life-left: 100
+wear-status-in-days: 1825
+operating-temperature: 2359317
+threshold-operating-temperature: 70

This reminded me of another post on the community forums.

34362097685 decimal is 0x800240015 hexadecimal.

2359317 decimal is 0x240015 hexadecimal.

If you remove that disk, then do the fans slow down to a "normal" speed?

 

Hey! I'll def try it once I'm back home and get back to you!

I'm buying some new SAS drives this weekend, I'm hopeful that it's an issue
with the drives!

Hey!

Sorry for the delay, been waiting to get the drives.
I bought Seagate Cisco Branded SAS drives and still no luck.
Can I send you a new Support file?

Thanks!

Yes. Send a new tech support file.

 

I sent you the newest tar via email  
Thanks!

The M4 and M5 systems override the fan speed when the SAS HBA MRAID card is present. I have replaced the Cisco HBA with a LSI HBA in M3, M4, and M5 systems. You lose visibility in the CIMC to the drives and SAS backplane, but it works fine otherwise. I have also experienced where drives that aren't on the Cisco compatibility list don't present the SMART temperature information in the way the CIMC expects it, so the CIMC thinks the drives are molten lava. This the fan goes to max speed. The generic LSI PCI controllers don't have this problem. It is a bit of a kludge, but it does work. You might need to purchase longer SAS cables.

[Wrong msg was sent and I couldn't figure out how to delete it, sorry for the extra messages!]

Oh I see!
Do you have a link to the replacement card you bought?
Also, I'm not understanding how the longer SAS cables play into this.
Don't the cables just need to be long enough to reach the card?

Thanks!

The PCIe cards are further back than the MRAID card, so that was why I needed the longer cables. I used 2 cards to support all the drives. 1 LSI 9300-16i and 1 LSI 9300-8i. Each card was under $100 on eBay.

Hi!

I've started looking into purchasing the cards but wanted to double-check everything before placing the order.

Currently, the Cisco PCIe RAID card that came with the server (which is causing the fan issue) is plugged into the SAS RAID slot and connected to the backplane via Mini-SAS cables (I think).

I should replace the existing RAID card with the LSI 9300-16i and as you suggested, I'll need longer Mini-SAS cables to properly reach the new card.

Can you please confirm my understanding?
I've also shared images I took of the server as well 

Thanks in advance!

Review Cisco Networking for a $25 gift card

Review Cisco Networking for a $25 gift card