Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition | |||
|
|
Solution Type FAB (standard) Sure Solution 1000882.1 : Sun StorEdge 3310 Arrays with firmware prior to 4.13 and CLI software prior to 2.1, and SAF-TE firmware prior to 1170, may experience downtime, drives offline, and inaccurate status reporting for components.
PreviouslyPublishedAs 201168 Product Sun StorageTek 3310 SCSI Array Impact Sun StorEdge 3310 Arrays with firmware (FW) versions earlier than 4.13, CLI software earlier than 2.1, and SAF-TE FW earlier than 1170 may experience system downtime and data integrity issues. FW 4.13, available in patch 113722-11, addresses these issues and provides product improvements. Below are details for some of the major issues addressed by the 4.13 FW:
This issue can occur when the controller firmware fails to distinguish between single-bit ECC errors and multi-bit ECC errors. The controller seems to continue to work normally even for multi-bit errors, which leads to loss in file system integrity. A single-bit ECC error is recoverable, while a multi-bit ECC error is not. With 4.13 FW if this issue happens the controller will shutdown itself.
During the recovery from a failure, Sun StorEdge 3310/3510/3511 FC array controllers may incorrectly offline good drives causing multiple drive failures. As a result, logical devices may become degraded thereby causing applications to stop running. The 4.13 FW has proper procedure for fault handling and will not cause this issue.
In the event of a disk failure, disk rebuilding would commence on the spare drive (if configured) and the rebuilding may stop after 99% and not complete. The rebuild will remain incomplete and the logical device state would remain as degraded. Should another drive failure occur, this condition could result in loss of data integrity. This issue has not been seen throughout the extensive tests with 4.13 FW and it is believed the issue is fixed due to the big changes in the fault handling area of the 4.13 FW.
The 1.6.2 CLI release and subsequent releases prevent the user from changing the cache optimization mode while there is an existing LD. Also, 4.13 controller firmware release has the section of code rewritten so that different mode LDs can exist in a controller, thus making the issue nonexistent.
The "show frus" command is performed by doing a series of Read Buffer commands that read the FRU data from I2C EEPROMs in the chassis. A FRU would not be displayed when the SAF-TE controller firmware returned a FRU Read Failed sense code. The I2C driver firmware in the SAF-TE controller had several issues that caused I2C messages to be missed. The driver was improved to detect and recover from I2C errors. In addition, message retries were implemented so that failed messages would be recovered. This is resolved in SAF-TE revision shipped with 4.13/2.1 release.
The issue occurs when multiple Send Diag commands are received by the controller. The Send Diag command is single threaded. The firmware does not properly handle returning a BUSY status when a Send Diag is received with a Send Diag already in process. This is caused by the pass through structure being overwritten by the receipt of the second command resulting in inconsistent results including bug hang, bus phase error and/or data returned incorrectly. In most cases a bus reset is issued by the initiator when this issue occurs. This is fixed in FW 4.13.
More Info about the 4.13 FW / CLI 2.1 Release The 4.13/2.1 release is a FW and software upgrade and does not require a hardware change. While the FW updates for this product have been non-disruptive for previous releases, due to the big difference between the current code(s) and this release, the upgrade is a disruptive process and requires a controller reset. Firmware version 4.13 and CLI software version 2.1 add the following new features to StorEdge 3310 RAID arrays: 1. Common source code for RAID controller firmware with separate bindings specific to FC, SATA, and SCSI. 2. Improves the interoperability with StorADE in regards to:
3. New features: 3.1. Cache specific
3.2. Fault management specific
3.3. Logical Device and Logical Volume specific
Resolution Upgrade the software for the SE3310 Array by installing patch 113722-11. Follow the detailed procedure given in the README file. Please use this Customer List to identify sites which may be affected.
Use this Customer Letter as needed to communicate the issue to customers.
Modification History Date: 19-SEP-2005
Previously Published As 101901 Internal Comments Please reference the following product manuals as needed.
Related Information
Internal Eng Business Unit Group KE Authors Internal Eng Responsible Engineer [email protected] Internal Resolution Patches 113722-11 Internal Kasp FAB Legacy ID 101901 Internal Sun Alert & FAB Admin Info Critical Category: Significant Change Date: Avoidance: Firmware Responsible Manager: null Original Admin Info: null Product_uuid 3db30178-43d7-4d85-8bbe-551c33040f0d|Sun StorageTek 3310 SCSI Array Attachments This solution has no attachment |
||||||||||||
|