Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1018551.1
Update Date:2009-12-03
Keywords:

Solution Type  Problem Resolution Sure

Solution  1018551.1 :   picld reports "ERROR running psvc_check_temperature_policy_0 on CPUx_DIE_TEMPERATURE_SENSOR"  


Related Items
  • Sun Fire V480 Server
  •  
  • Sun Fire V880z Visualization Server
  •  
  • Sun Fire V890 Server
  •  
  • Sun Fire V880 Server
  •  
  • Sun Fire V490 Server
  •  
Related Categories
  • GCS>Sun Microsystems>Servers>Entry-Level Servers
  •  

PreviouslyPublishedAs
230232


Symptoms
After adding or removing system boards from Sun Fire[TM] V480/V490 or V880/V890, picld daemon reports "ERROR running psvc_check_temperature_policy_0 on CPUX_DIE_TEMPERATURE_SENSOR".


Resolution
After adding or removal of any system boards on the Sun Fire V480/V490 or V880/V890 systems, ensure that the i2c device nodes are regenerated properly in the /devices/pci@9,700000/ebus@1/i2c@1,30" directory.

The picld error messages usually appears after adding or removing the systemboard(s) to/from the system without properly regenerating the i2c sensor devices nodes.

Picld relies on devices nodes located in /devices/pci@9,700000/ebus@1/i2c@1,30 directory to read sensory information from the the system boards.

Examples

An existing Sun Fire V480 system had a new additional system board installed, let's say Board B (CPU1 and CPU3). The system was normally booted without the -r option and the following picld error messages started to appear.

PICLD Error Messages

May 13 16:58:08 ISU001 picld[103]: [ID 478985 daemon.error] ERROR running psvc_check_temperature_policy_0 on CPU1_DIE_TEMPERATURE_SENSOR (719584)
May 13 16:58:08 ISU001 picld[103]: [ID 875627 daemon.error] No such file or directory
May 13 16:58:08 ISU001 picld[103]: [ID 478985 daemon.error] ERROR running psvc_check_temperature_policy_0 on CPU3_DIE_TEMPERATURE_SENSOR (719648)
May 13 16:58:08 ISU001 picld[103]: [ID 875627 daemon.error] No such file or directory

A check on the i2c directory /devices/pci@9,700000/ebus@1/i2c@1,30 only says that the following nodes below are present.

crw-r--r--   1 root     sys      180,  0 Apr 19 01:32 temperature-sensor@0,9c:lm75
crw-------   1 root     sys       10,  0 Apr 19 01:32 temperature@0,30:amb_temp
crw-------   1 root     sys       10,  1 Apr 19 01:32 temperature@0,30:die_temp
crw-------   1 root     sys       10, 16 Apr 19 01:32 temperature@0,34:amb_temp
crw-------   1 root     sys       10, 17 Apr 19 01:32 temperature@0,34:die_temp

Note: It is missing the node entries for CPU1 and CPU3 Die and Ambient temperature sensor nodes. Below are full listing of the i2c sensor nodes for fully populated machine.

A 2xSystemboard (4 CPUs) Sun Fire V480/V490 i2c device nodes

$ cd /devices/pci@9,700000/ebus@1/i2c@1,30
$ ls | grep temperature
...
temperature@0,30:amb_temp <--- CPU0
temperature@0,30:die_temp   <--- CPU0
temperature@0,32:amb_temp <--- CPU1
temperature@0,32:die_temp   <--- CPU1
temperature@0,34:amb_temp <--- CPU2
temperature@0,34:die_temp   <--- CPU2
temperature@0,52:amb_temp <--- CPU3
temperature@0,52:die_temp   <--- CPU3

A 4xSystemboards (8 CPUs) Sun Fire V880/V890 i2c device nodes

$ cd /devices/pci@9,700000/ebus@1/i2c@1,30
$ ls | grep temperature
...
temperature@0,30:amb_temp <--- CPU0
temperature@0,30:die_temp   <--- CPU0
temperature@0,32:amb_temp <--- CPU1
temperature@0,32:die_temp   <--- CPU1
temperature@0,34:amb_temp <--- CPU2
temperature@0,34:die_temp   <--- CPU2
temperature@0,52:amb_temp <--- CPU3
temperature@0,52:die_temp   <--- CPU3
temperature@0,54:amb_temp <--- CPU4
temperature@0,54:die_temp   <--- CPU4
temperature@0,56:amb_temp <--- CPU5
temperature@0,56:die_temp   <--- CPU5
temperature@0,98:amb_temp <--- CPU6
temperature@0,98:die_temp   <--- CPU6
temperature@0,9a:amb_temp <--- CPU7
temperature@0,9a:die_temp   <--- CPU7

If any of the i2c device nodes are missing as defined in the above example, proceed to regenerate the device nodes and restart the picld deamon, you need to be root. This should stop the error messages.

For Example:

root@v4u-480rb # devfsadm -C
root@v4u-480rb # devfsadm
root@v4u-480rb # /etc/init.d/picld stop
root@v4u-480rb # /etc/init.d/picld start

Alternatively the system can be rebooted with the -r option or just plain boot -r from the OK prompt.

For Example:

# reboot -- -r
ok  boot -r


Product
Sun Fire V480 Server
Sun Fire V890 Server
Sun Fire V880z Visualization Server
Sun Fire V880 Server
Sun Fire V490 Server

Internal Comments
Audited/updated 12/03/09 - [email protected], Entry Level SPARC Content Team Member




picld, psvc_check_temperature_policy_0, 480, 490, 880, 890, sensors, environmental, psvc
Previously Published As
81557

Change History
Date: 2005-05-26
User Name: 97961
Action: Approved
Comment: - Converted to STM formatting for better readability
- Audience changed to "Contract" per FvF http://kmo.central/howto/FvF.html
- Applied trademarking
Version: 3

Attachments
This solution has no attachment
  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.
 Feedback