Document Audience: INTERNAL
Document ID: I0779-1
Title: Ultra 5/10 workstations may encounter uncorrectable data errors on IDE disks during boot after using shutdown -y -i5 -g0 command.
Copyright Notice: Copyright © 2005 Sun Microsystems, Inc. All Rights Reserved
Update Date: 2004-06-08
------------------------------------------------------------
- Sun Proprietary/Confidential: Internal Use Only -
------------------------------------------------------------------------
FIELD INFORMATION NOTICE
(For Authorized Distribution by Sun Services)
FIN #: I0779-1
Synopsis: Ultra 5/10 workstations may encounter uncorrectable data errors on IDE disks during boot after using shutdown -y -i5 -g0 command.
Create Date: May/26/04
SunAlert: Yes
Top FIN/FCO Report: Yes
Products Reference: Ultra 5/10 with IDE drives
Product Category: Storage / Disks
Product Affected:
Systems Affected
----------------
Mkt_ID   Platform   Model   Description   Serial Number
------   --------   -----   -----------   -------------
  -        A21       ALL    Ultra 5             -
  -        A22       ALL    Ultra 10            -
X-Options Affected
------------------
Mkt_ID   Platform   Model   Description                                Serial Number
------   --------   -----   -----------                                -------------
X6174A      -         -     X-Opt Internal 20GB Drive for U5/U10 SB1         -
X5236A      -         -     X-Opt Internal 9.1GB Drive for U5/U10            -
X5227A      -         -     X-Opt Internal 4.3GB Drive for U10               -
Parts Affected:
Part Number   Description                                 Model
-----------   -----------                                 -----
370-4327-03   20GB Disk 3 1/2" 7200 RPM Ultra ATA/100       -
370-3693-01   9.1GB Disk 3 1/2" 7200 RPM Ultra ATA/66       -
370-3176-01   4.3GB Disk 3 1/2" 4500 RPM Ultra ATA-3        -
370-3692-01   4.3GB Disk 3 1/2" 5400 RPM Ultra ATA-3        -
370-3863-02   8.4GB Disk 3 1/2" 5400 RPM Ultra ATA-4        -
References:
BugId:   4380416 - init 5 corrupts filesystems on ultra-10 440MHz on
                   2.5.1 systems.
         4435428 - darwin:EIDE disk with write-cache enabled should be
                   flushed before power-off.
         4337637 - grover: IDE doesn't complete all writes.
PatchId: 106197-06 - SunOS 5.5.1: Jumbo patch for ide and atapi fixes.
         103640-38 - SunOS 5.5.1: kernel, nisopaccess, & libthread patch.
         106407-07 - SunOS 5.6: Jumbo patch for ide and atapi fixes.
         105181-31 - SunOS 5.6: Kernel update patch.
         106541-21 - SunOS 5.7: Kernel update patch.
         108974-05 - SunOS 5.8: dada, uata, dad, sd and scsi drivers patch.
         108528-03 - SunOS 5.8: kernel update patch.
ESC:     529449 - init 5 corrupts filesystems on ultra5.
         530062 - darwin:EIDE disk with write-cache enabled should be
                  flushed before power-off.
Issue Description:
On Sun Ultra 5/10 workstations, shutting down with the shutdown(1M)
command 'shutdown -y -i5 -g0' might leave the IDE disk drive not
properly "sync'ed" (see sync(1M)). When this happens, "Uncorrectable
data Error" messages are reported at the next boot, which may lead to
unnecessary disk replacement.
Failing Configuration: Ultra 5/10 systems with internal IDE drives.
The following two cases show error messages that may appear in the
/var/adm/messages file at boot after the shutdown -y -i5 -g0 command
has been used:
Case 1
------
Jan 9 15:04:16 Ultra-10 unix: WARNING:
/pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):
Jan 9 15:04:16 Ultra-10 Uncorrectable data Error: Block 289ed0
Jan 9 15:04:18 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0
(dad1):
Jan 9 15:04:18 Ultra-10 disk not responding to selection
Jan 9 15:04:18 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@0,0
(dad0):
Jan 9 15:04:18 Ultra-10 ATA transport failed: reason `reset`: retrying
command
Jan 9 15:04:22 Ultra-10 unix: dad1: disk okay
Jan 9 15:42:06 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@2,0
(dad1):
Jan 9 15:42:06 Ultra-10 Uncorrectable data Error: Block 8ca48
Jan 9 15:42:07 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@2,0
(dad1):
Jan 9 15:42:07 Ultra-10 disk not responding to selection
Jan 9 15:42:07 Ultra-10 unix: dad1: disk okay
Jan 9 15:45:26 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@2,0
(dad1):
Jan 9 15:45:26 Ultra-10 Uncorrectable data Error: Block 5201c4
Jan 9 15:45:27 Ultra-10 disk not responding to selection
Case 2
------
Mar 29 11:04:16 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0
(dad1):
Mar 29 11:04:16 Ultra-10 Uncorrectable data Error: Block 289ed0
Mar 29 11:04:18 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0
(dad1):
Mar 29 11:04:18 Ultra-10 disk not responding to selection
Mar 29 11:04:18 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@0,0
(dad0):
Mar 29 11:04:18 Ultra-10 ATA transport failed: reason 'reset':
retrying command
Mar 29 11:04:22 Ultra-10 unix: dad1: disk okay
NOTE: In some cases, the fsck(1M) command may need to be run manually
to repair file systems when the system is booted. In most cases,
however, fsck runs automatically during reboot.
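As an illustration, the error signature shown in the two cases above
could be picked out of a saved copy of the messages file with a short
script. The following Python sketch is not part of the original notice;
the function name find_uncorrectable_errors and both regular expressions
are assumptions derived only from the sample output above, and real log
lines may differ.

```python
import re

# Both patterns are assumptions based solely on the sample messages in this notice.
DEVICE_RE = re.compile(r"(/\S*dad@\w+,\w+)")
ERROR_RE = re.compile(r"Uncorrectable data Error: Block ([0-9a-fA-F]+)")


def find_uncorrectable_errors(lines):
    """Return (device_path, block_number) pairs, one per uncorrectable data error.

    The device path is taken from the most recent line that named a dad
    device; it is None if no such line preceded the error.
    """
    results = []
    current_device = None
    for line in lines:
        device = DEVICE_RE.search(line)
        if device:
            current_device = device.group(1)
        error = ERROR_RE.search(line)
        if error:
            # Block numbers in the sample messages are printed in hexadecimal.
            results.append((current_device, int(error.group(1), 16)))
    return results
```

Applied to the lines of Case 2, this sketch would report a single entry
pairing /pci@1f,0/pci@1,1/ide@3/dad@1,0 with block 0x289ed0. A match
only indicates that the symptom is present; it does not distinguish this
problem from a genuinely failing drive.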
Unlike SCSI disks, IDE drives have their internal write cache enabled.
When the system is powered down, data still held in the disk cache is
not flushed to the media, which can result in data corruption.
The fix requires writing a new entry point in the IDE driver that
sends a disk flush command before the disk is powered down.
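The effect of flushing before power-off can be illustrated with a toy
model. The Python sketch below is purely illustrative and is not driver
code; the class CachedDisk and its methods are hypothetical names that
model a drive whose writes are acknowledged once they reach a volatile
cache.

```python
class CachedDisk:
    """Toy model of a drive with a volatile write cache (not real driver code)."""

    def __init__(self):
        self.media = {}  # blocks that have reached the platters
        self.cache = {}  # blocks acknowledged to the host but still in volatile cache

    def write(self, block, data):
        # With write caching enabled, the write is reported complete once cached.
        self.cache[block] = data

    def flush(self):
        # Models a cache-flush request: commit every cached block to the media.
        self.media.update(self.cache)
        self.cache.clear()

    def power_off(self, flush_first):
        """Remove power; return the sorted list of block numbers that were lost."""
        if flush_first:
            self.flush()
        lost = sorted(self.cache)
        self.cache.clear()  # volatile cache contents vanish with power
        return lost
```

In this model, a write followed by power_off(flush_first=False) loses
the block even though the host saw the write succeed, while
power_off(flush_first=True) leaves it on the media. This corresponds to
the ordering the driver fix is described as enforcing.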
Implementation:
---
| | MANDATORY (Fully Pro-Active)
---
---
| X | CONTROLLED PRO-ACTIVE (per Sun Geo Plan)
---
---
| | REACTIVE (As Required)
---
Corrective Action:
The following recommendation is provided as a guideline for authorized
Enterprise Services Field Representatives who may encounter the above
mentioned problem.
The recommended workaround is to use the "-i0" option instead of the
"-i5" option for shutdown:
# shutdown -y -i0 -g0
Please install the following patches as a final solution:
---------------------------------------------------------
| Solaris | Solaris patchId | IDE PatchId |
|=========================================================|
| 2.5.1 | 103640-38 (or later) | 106197-06 (or later) |
|---------+------------------------+----------------------|
| 2.6 | 105181-31 (or later) | 106407-07 (or later) |
|---------+------------------------+----------------------|
| 7 | 106541-21 (or later) | See NOTE |
|---------+------------------------+----------------------|
| 8 | 108528-03 (or later) | 108974-05 (or later) |
---------------------------------------------------------
NOTE: For Solaris 7, there is no IDE patch available, so the fix
is not complete for that release.
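The minimum revisions in the table above could be checked against a
list of installed patch IDs with a small helper. The Python sketch below
is an assumption-laden illustration: the names REQUIRED_PATCHES,
split_patch_id and missing_patches are hypothetical, the installed IDs
are assumed to have been collected separately (for example from the
output of showrev -p), and a later patch that supersedes one of these
under a different base number is not recognized.

```python
# Minimum patch revisions copied from the table in this notice.
REQUIRED_PATCHES = {
    "2.5.1": ["103640-38", "106197-06"],
    "2.6": ["105181-31", "106407-07"],
    "7": ["106541-21"],  # no IDE patch is available for Solaris 7
    "8": ["108528-03", "108974-05"],
}


def split_patch_id(patch_id):
    """Split an ID such as '108528-03' into ('108528', 3)."""
    base, revision = patch_id.strip().split("-")
    return base, int(revision)


def missing_patches(release, installed_ids):
    """Return the required patches whose minimum revision is not installed."""
    installed = {}
    for patch_id in installed_ids:
        base, revision = split_patch_id(patch_id)
        # Keep only the highest installed revision of each base number.
        installed[base] = max(revision, installed.get(base, -1))
    missing = []
    for required in REQUIRED_PATCHES[release]:
        base, minimum = split_patch_id(required)
        if installed.get(base, -1) < minimum:
            missing.append(required)
    return missing
```

For example, missing_patches("8", ["108528-13", "108974-04"]) would
return ["108974-05"], since the kernel patch meets its minimum revision
but the driver patch does not. An empty result for Solaris 7 still
leaves the fix incomplete, as stated in the NOTE.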
Comments:
There have been cases in the field in which disk drives were replaced
unnecessarily.
The boot disk is less prone to data loss because it is not normally
used to store user data. On any other internal IDE drive, however, the
problem could cause substantial damage to the customer, because data
that applications perceive as written to disk is lost.
--------------------------------------------------------------------------
Implementation Footnote:
________________________
i) In case of MANDATORY FINs, Enterprise Services will attempt to contact
all affected customers to recommend implementation of the FIN.
ii) For CONTROLLED PROACTIVE FINs, Enterprise Services mission critical
support teams will recommend implementation of the FIN (to their
respective accounts), at the convenience of the customer.
iii) For REACTIVE FINs, Enterprise Services will implement the FIN as the need
arises.
--------------------------------------------------------------------------
All released FINs and FCOs can be accessed using your favorite network
browser as follows:
SunWeb Access:
______________
* Access the top level URL of http://cte.ebay/FIN_FCO/
* From there, select the appropriate link to query or browse the FIN and
FCO Homepage collections.
SunSolve Online Access:
_______________________
* Access the SunSolve Online URL at http://sunsolve.Corp/
* From there, select the appropriate link to browse the FIN or FCO index.
Internet Access:
_______________
* Access the top level URL of https://infoserver.Sun.COM
--------------------------------------------------------------------------
General:
________
Send questions or comments to [email protected]
--------------------------------------------------------------------------