Article ID: 000059777 Content Type: Error Messages Last Reviewed: 08/23/2022

Getting Critical Errors on SEL for Intel® Data Center Blocks for HPC (Intel® DCB for HPC)

BUILT IN - ARTICLE INTRO SECOND COMPONENT
Summary

Updating the BIOS firmware can potentially fix the errors reflecting on SEL

Description

Critical error in SEL:
SmaRT-CLST Stat reports it has been asserted, Assertion event - transition to critical from less severe from PSU sensor 0x67Y

Resolution
  • The errors reflecting on the debug logs are not errors but rather indicates that there is a slight power drops, since Intel PSU's are designed to be sensitive about a power signal and can detect little power drops.
  • Recommended to use an Online UPS instead of Line-Interactive UPS.
  • Check the model/type of UPS.
  • Update the BIOS to the latest version.
Additional information

Debug log errors;

EventID:0367 Time Stamp:06/04/2020 16:16:43 SensorName:SmaRT & CLST     Sensor Type:OEM Reserved               Description:transition to critical from less severe from PSU sensor 0x66. reports it has been asserted. -Asserted

EventID:0368 Time Stamp:06/04/2020 16:16:43 SensorName:PS1 Status      Sensor Type:Power Supply               Description:reports the power supply's input (AC/DC) has been lost -Asserted

EventID:0369 Time Stamp:06/04/2020 16:16:43 SensorName:SmaRT & CLST     Sensor Type:OEM Reserved               Description:transition to non-critical from more severe from PSU sensor 0x66. reports it has been deasserted. -Asserted

EventID:0370 Time Stamp:06/04/2020 16:16:44 SensorName:SmaRT & CLST     Sensor Type:OEM Reserved               Description:transition to critical from less severe from PSU sensor 0x67. reports it has been asserted. -Asserted

EventID:0371 Time Stamp:06/04/2020 16:16:44 SensorName:Pwr Unit Status    Sensor Type:Power Unit                Description:reports the power unit has suffered a failure -Asserted

EventID:0372 Time Stamp:01/01/1970 00:00:30 SensorName:Pwr Unit Status    Sensor Type:Power Unit                Description:reports the power unit is powered off or being powered down -Asserted

EventID:0373 Time Stamp:01/01/1970 00:00:30 SensorName:Pwr Unit Status    Sensor Type:Power Unit                Description:reports the power unit's AC is lost -Asserted

EventID:0374 Time Stamp:01/01/1970 00:00:30 SensorName:P1 Status       Sensor Type:Processor                 Description:reports the processor's presence has been detected -Asserted

EventID:0375 Time Stamp:01/01/1970 00:00:30 SensorName:P2 Status       Sensor Type:Processor                 Description:reports the processor's presence has been detected -Asserted

EventID:0376 Time Stamp:01/01/1970 00:00:31 SensorName:Pwr Unit Status    Sensor Type:Power Unit                Description:reports the power unit's AC is lost -Deasserted

EventID:0377 Time Stamp:08/24/2030 16:19:43 SensorName:P1 Status       Sensor Type:Processor                 Description:reports the processor's presence has been detected -Asserted

EventID:0378 Time Stamp:08/24/2030 16:19:43 SensorName:P2 Status       Sensor Type:Processor                 Description:reports the processor's presence has been detected -Asserted

 

SDR:

0628 - RID:0274 TS:06/12/2020 19:25:46 SN:unnamed sensor 0x00 (no SDR) ST:OS Boot ED:C: boot completed ET:Asserted EC:OK

0629 - RID:0275 TS:06/12/2020 19:25:46 SN:unnamed sensor 0xB4 (no SDR) ST:Unknown ED:OEM ET:Deasserted EC:OK

0630 - RID:0276 TS:06/13/2020 11:35:24 SN:unnamed sensor 0x00 (no SDR) ST:OS Stop ED:OS Graceful Shutdown ET:Asserted EC:OK

0631 - RID:0277 TS:06/13/2020 11:35:24 SN:unnamed sensor 0x00 (no SDR) ST:Unknown ED:Unknown ET:Asserted EC:OK

0632 - RID:0278 TS:06/13/2020 11:35:24 SN:unnamed sensor 0x00 (no SDR) ST:OS Stop ED:OS Graceful Shutdown ET:Asserted EC:OK
 

Related Products

This article applies to 2 products

Intel® Server System LWF1304LNETCENT