Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1007691.1
Update Date:2011-04-28
Keywords:

Solution Type  Problem Resolution Sure

Solution  1007691.1 :   [3310] ssmon can generate ASC: 0x29 (power on, reset, or bus reset occurred)  


Related Items
  • Sun Storage 3310 Array
  •  
  • Sun Storage 3320 SCSI Array
  •  
Related Categories
  • GCS>Sun Microsystems>Storage - Disk>Modular Disk - 3xxx Arrays
  •  

PreviouslyPublishedAs
210658


Applies to:

Sun Storage 3310 Array
Sun Storage 3320 SCSI Array - Version: Not Applicable and later    [Release: N/A and later]
All Platforms

Symptoms

If you have the following messages in the host messages, 3 times a day and more when the I/O activity increases, they can be generated by ssmon daemon or ssconsole GUI (graphical user interface) when "in-band" monitoring is configured.


Jan 3 08:02:59 bretagne scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/pci@2/scsi@5/sd@1,4 (sd154):
Jan 3 08:02:59 bretagne Error for Command: read Error Level: Retryable
Jan 3 08:02:59 bretagne scsi: [ID 107833 kern.notice] Requested Block: 533456 Error Block: 533456
Jan 3 08:02:59 bretagne scsi: [ID 107833 kern.notice] Vendor: SUN Serial Number: 25AFBEBB-01

Jan 3 08:02:59 bretagne scsi: [ID 107833 kern.notice] Sense Key: Unit Attention
Jan 3 08:02:59 bretagne scsi: [ID 107833 kern.notice] ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0
x0
Jan 3 11:16:33 bretagne scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/pci@2/scsi@5/sd@1,4 (sd154):
Jan 3 11:16:33 bretagne Error for Command: read(10) Error Level: Retryable
Jan 3 11:16:33 bretagne scsi: [ID 107833 kern.notice] Requested Block: 136343840 Error Block: 136343840
Jan 3 11:16:33 bretagne scsi: [ID 107833 kern.notice] Vendor: SUN Serial Number: 25AFBEBB-01

Jan 3 11:16:33 bretagne scsi: [ID 107833 kern.notice] Sense Key: Unit Attention
Jan 3 11:16:33 bretagne scsi: [ID 107833 kern.notice] ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0
x0
Jan 3 14:28:23 bretagne write: [ID 201458 auth.crit] /dev/dtremote in utmpx is not a tty
Jan 3 14:48:56 bretagne su: [ID 810491 auth.crit] 'su root' failed for 98000 on /dev/pts/9
Jan 3 22:36:59 bretagne scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/pci@2/scsi@5

Jan 3 22:36:59 bretagne Error for Command: read(10) Error Level: Retryable
Jan 3 22:36:59 bretagne scsi: [ID 107833 kern.notice] Requested Block: 139878414 Error Block: 139878414
Jan 3 22:36:59 bretagne scsi: [ID 107833 kern.notice] Vendor: SUN Serial Number: 25AFBEBB-01

Jan 3 22:36:59 bretagne scsi: [ID 107833 kern.notice] Sense Key: Unit Attention
Jan 3 22:36:59 bretagne scsi: [ID 107833 kern.notice] ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0

Notice : "in-band" means  ssmon or ssconsole communicate through the SCSI connection, vs out-of-band through the network connection

Changes

{CHANGE}

Cause

{CAUSE}

Solution

Before planning any hardware replacement, it's compulsory  to apply the minimum
level of patch for the I/O drivers on the host, and  upgrade the firmware of  the array controllers  and
internal disks before verifing the ssmon configuration as described below:


1) Verify software/firmware:

o All array firmwares should be at the latest revisions: 4.23C for 3310, and 4.21E for 3320 Controller firmware (minimum requirement),
SAF-TE firmware rev 1180 for SE3310, and 1185 for the SE3320 (minimum requirement).
o All HBA, and SCSI drivers should be up-to-date for the particular Solaris release.
Use an available diagnostic tool such as explominer, or patchdiag,  to obtain the required patches to install.
o Ensure SSCS (ssconsole) is at the latest version: 2.5 (minimum requirement).

2) Check if the ssagent is configured "out-of-band":

Edit the ssagent configuration file: /var/opt/SUNWsscs/ssagent/ssagent.cfg

o Verify the value of variable PRIAGENT_OVER_INBAND

A value of 0 for PRIAGENT_OVER_INBAND indicates that the direct "in-band" scsi
access method is selected. This is the default value. If this value is set to 0, proceed to the next step if  you wish to configure ssagent "out-of-band".

3) How to configure ssagent "out-of-band":

When the network interface of the SE3310 or SE3320 is configured to a LAN.

There are two methods:

o By command line:

a) Stop the ssagent:

/etc/init.d/ssagent stop

b)Edit the file: /var/opt/SUNWsscs/ssagent/ssagent.cfg
and set to 1 the value of variable PRIAGENT_OVER_INBAND.

PRIAGENT_OVER_INBAND=1

c) Check that the 2 following lines exist for each array in this file:

PRIAGENT_IP=xxx.xxx.xxx.xxx

PRIAGENT_PORT=58632

d) Verify the network interface of [3310] is correctly configured in the ssagent.cfg file:

PRIAGENT_IP=xxx.xxx.xxx.xxx ( IP address of [3310] )

PRIAGENT_PORT=58632 ( address port by default )

e)Start the ssagent:

/etc/init.d/ssagent start

o By the GUI (sscs):

Follow the steps described in the Sun Storedge 3000 Family Configuration
Services 2.5 User's Guide (817-3337-18) in the section: "To Use In-Band Management".

Relief/Workaround

If "in-band" monitoring is the only alternative for a particular array, you can stop the ssagent
software:

/etc/init.d/ssagent stop

and monitor for one week or more depending upon the frequency of the messages,
to see if they stop.

If the messages still occur, it's advised to place the array on a private network
and apply the above step 3.

Otherwise, if errors still occur, the cause is more complex and will need additional troubleshooting.


For out-of-band it's recommended to configure the network interface of the array
on a private network and apply the above step 3.
This recommendation is to prevent network port scanning software interfering with
the TCP/IP stack on the array.





Attachments
This solution has no attachment
  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.
 Feedback