Raptor Computing Systems Community Forums (BETA)
Raptor Computing Systems Hardware => Talos II => Topic started by: jas on August 24, 2022, 07:26:08 am
-
Hi. I'm using freeipmi (version 1.6.6-4+deb11u1 from Debian 11) ipmi-sensors but it complains about SDR problems:
root@vello:~# ipmi-sensors --flush-cache
Flushing cache: /root/.freeipmi/sdr-cache/sdr-cache-vello.localhost
root@vello:~# ipmi-sensors
Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-vello.localhost
Caching SDR record 105 of 105 (current record ID 275)
ipmi_sdr_cache_create: SDR record count invalid
root@vello:~#
According to
https://www.mail-archive.com/freeipmi-users@gnu.org/msg01542.html
this suggests something is wrong in the hardware. Any ideas?
For reference, the workaround works fine, and I get nice outputs like below.
root@vello:~# ipmi-sensors -W assumemaxsdrrecordcount
Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-vello.localhost
Caching SDR record 105 of 105 (current record ID 275)
ID | Name | Type | Reading | Units | Event
3 | occ | Processor | N/A | N/A | N/A
4 | occ | Processor | N/A | N/A | N/A
8 | occ0 | Power Unit | N/A | N/A | 'Device Enabled'
9 | occ1 | Power Unit | N/A | N/A | 'Device Disabled'
17 | p0_core0_temp | Temperature | N/A | C | N/A
20 | p0_core1_temp | Temperature | N/A | C | N/A
23 | p0_core2_temp | Temperature | 37.00 | C | 'OK'
26 | p0_core3_temp | Temperature | 37.00 | C | 'OK'
29 | p0_core4_temp | Temperature | 37.00 | C | 'OK'
32 | p0_core5_temp | Temperature | 37.00 | C | 'OK'
35 | p0_core6_temp | Temperature | 37.00 | C | 'OK'
38 | p0_core7_temp | Temperature | 37.00 | C | 'OK'
41 | p0_core8_temp | Temperature | 37.00 | C | 'OK'
44 | p0_core9_temp | Temperature | 37.00 | C | 'OK'
47 | p0_core10_temp | Temperature | 37.00 | C | 'OK'
50 | p0_core11_temp | Temperature | 37.00 | C | 'OK'
53 | p0_core12_temp | Temperature | N/A | C | N/A
56 | p0_core13_temp | Temperature | N/A | C | N/A
59 | p0_core14_temp | Temperature | N/A | C | N/A
62 | p0_core15_temp | Temperature | N/A | C | N/A
65 | p0_core16_temp | Temperature | 37.00 | C | 'OK'
68 | p0_core17_temp | Temperature | 37.00 | C | 'OK'
71 | p0_core18_temp | Temperature | 37.00 | C | 'OK'
74 | p0_core19_temp | Temperature | 37.00 | C | 'OK'
77 | p0_core20_temp | Temperature | 37.00 | C | 'OK'
80 | p0_core21_temp | Temperature | 37.00 | C | 'OK'
83 | p0_core22_temp | Temperature | 37.00 | C | 'OK'
86 | p0_core23_temp | Temperature | 37.00 | C | 'OK'
91 | p1_core0_temp | Temperature | N/A | C | N/A
94 | p1_core1_temp | Temperature | N/A | C | N/A
97 | p1_core2_temp | Temperature | N/A | C | N/A
100 | p1_core3_temp | Temperature | N/A | C | N/A
103 | p1_core4_temp | Temperature | N/A | C | N/A
106 | p1_core5_temp | Temperature | N/A | C | N/A
109 | p1_core6_temp | Temperature | N/A | C | N/A
112 | p1_core7_temp | Temperature | N/A | C | N/A
115 | p1_core8_temp | Temperature | N/A | C | N/A
118 | p1_core9_temp | Temperature | N/A | C | N/A
121 | p1_core10_temp | Temperature | N/A | C | N/A
124 | p1_core11_temp | Temperature | N/A | C | N/A
127 | p1_core12_temp | Temperature | N/A | C | N/A
130 | p1_core13_temp | Temperature | N/A | C | N/A
133 | p1_core14_temp | Temperature | N/A | C | N/A
136 | p1_core15_temp | Temperature | N/A | C | N/A
139 | p1_core16_temp | Temperature | N/A | C | N/A
142 | p1_core17_temp | Temperature | N/A | C | N/A
145 | p1_core18_temp | Temperature | N/A | C | N/A
148 | p1_core19_temp | Temperature | N/A | C | N/A
151 | p1_core20_temp | Temperature | N/A | C | N/A
154 | p1_core21_temp | Temperature | N/A | C | N/A
157 | p1_core22_temp | Temperature | N/A | C | N/A
160 | p1_core23_temp | Temperature | N/A | C | N/A
161 | p0_vdd_temp | Temperature | 41.00 | C | 'OK'
162 | p1_vdd_temp | Temperature | N/A | C | N/A
165 | dimm0_temp | Temperature | N/A | C | N/A
167 | dimm1_temp | Temperature | N/A | C | N/A
169 | dimm2_temp | Temperature | 42.00 | C | 'OK'
171 | dimm3_temp | Temperature | 41.00 | C | 'OK'
173 | dimm4_temp | Temperature | N/A | C | N/A
175 | dimm5_temp | Temperature | N/A | C | N/A
177 | dimm6_temp | Temperature | 39.00 | C | 'OK'
179 | dimm7_temp | Temperature | 39.00 | C | 'OK'
181 | dimm8_temp | Temperature | N/A | C | N/A
183 | dimm9_temp | Temperature | N/A | C | N/A
185 | dimm10_temp | Temperature | N/A | C | N/A
187 | dimm11_temp | Temperature | N/A | C | N/A
189 | dimm12_temp | Temperature | N/A | C | N/A
191 | dimm13_temp | Temperature | N/A | C | N/A
193 | dimm14_temp | Temperature | N/A | C | N/A
195 | dimm15_temp | Temperature | N/A | C | N/A
221 | fan0 | Fan | 19900.00 | RPM | 'OK'
222 | fan1 | Fan | 19900.00 | RPM | 'OK'
223 | fan2 | Fan | 0.00 | RPM | 'OK'
226 | fan3 | Fan | 0.00 | RPM | 'OK'
227 | fan4 | Fan | 1700.00 | RPM | 'OK'
228 | fan5 | Fan | 0.00 | RPM | 'OK'
229 | fan6 | Fan | N/A | RPM | N/A
231 | p0_power | Power Supply | 30.00 | W | 'OK'
232 | p0_vdd_power | Power Supply | 2.00 | W | 'OK'
233 | p0_vdn_power | Power Supply | 9.00 | W | 'OK'
234 | p1_power | Power Supply | N/A | W | N/A
235 | p1_vdd_power | Power Supply | N/A | W | N/A
236 | p1_vdn_power | Power Supply | N/A | W | N/A
252 | cpu_1_ambient | Temperature | 26.10 | C | 'OK'
253 | pcie | Temperature | 39.00 | C | 'OK'
254 | ambient | Temperature | 27.20 | C | 'OK'
root@vello:~#
/Simon
-
No, I get this on all my systems, so I don't think there's anything wrong with your specific hardware (one can argue about whether the SDR records are misconfigured, but that's separate). I don't know why this isn't the default.
That said, I just use ipmitool, not ipmi-sensors.