Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition

Solution Type Problem Resolution Sure Solution 1002435.1 : Disconnecting any SCSI cable in a dual-hosted StorEdge[TM] D1000 causes SCSI bus errors in remaining host due to loss of termination.
Previously Published As 203408

Applies to:
Sun Storage D1000 Array
Sun Netra st D1000 Array
All Platforms

Symptoms
Fatal SCSI bus errors occurred on one of two SunCluster[TM] hosts while the other host was down. The following messages were observed:

(timestamp) test unix: WARNING: /sbus@49,0/QLGC,isp@0,10000/sd@9,0 (sd53):

If data redundancy is not properly configured, this event could result in data loss and/or service downtime.

Changes
In the customer's situation, during maintenance of one of the two clustered hosts, customer or service personnel unwittingly disconnected a host SCSI cable from one of the D1000's host ports (either the far-right or far-left port). The host still connected to the D1000 began experiencing recurring and varied SCSI bus errors, which eventually turned fatal. Because this host's applications were live, I/O was active, and the disks were managed by VxVM, the fatal bus errors caused active plexes to be detached.

Cause
The StorEdge[TM] D1000 is a JBOD whose enclosure can be configured in one of two basic ways: as a single SCSI bus connected to a single host SCSI bus adapter with access to all of the disks, or as two separate buses connected to two separate hosts, each allowed to access half of the enclosed disks. A third configuration allows two separate but "clustered" hosts (running Sun Cluster software) to share the entire enclosure and all of its disks on a single SCSI bus.

In this dual-hosted/clustered configuration, the single bus is connected to two SCSI HBAs on separate hosts, and SCSI bus termination is provided by the host connections, because the D1000 requires external bus termination. Disconnecting either host cable therefore removes the termination at that end of the shared bus, which leads to bus errors on the remaining host.

For diagrams of the configurations, refer to "Sun StorEdge A1000 & D1000 Installation, Operation & Service Manual (805-2624)". For a description of the cabling, refer to Document 1018089.1 "Sun StorEdge[TM] D1000 array cabling and address" (formerly SunSolve Doc 18451).

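The termination dependency described under Cause can be illustrated with a small model. This is only a sketch for reasoning about the failure mode, not a diagnostic tool: the names BusEnd and bus_is_terminated are hypothetical, and the model assumes nothing beyond what is stated above, namely that each end of the shared bus is terminated either by an attached host cable or by an external terminator fitted in its place.

```python
from dataclasses import dataclass


@dataclass
class BusEnd:
    """One end of the shared SCSI bus (a D1000 host port). Hypothetical model."""
    host_cable_attached: bool = False   # cable to a host HBA, which terminates this end
    external_terminator: bool = False   # external terminator fitted in place of the cable

    def terminated(self) -> bool:
        # The D1000 does not terminate the bus itself, so one of these must be present.
        return self.host_cable_attached or self.external_terminator


def bus_is_terminated(left: BusEnd, right: BusEnd) -> bool:
    """The shared bus is properly terminated only if both ends are terminated."""
    return left.terminated() and right.terminated()


# Normal dual-hosted configuration: both hosts cabled.
both_cabled = bus_is_terminated(BusEnd(host_cable_attached=True),
                                BusEnd(host_cable_attached=True))

# One host cable disconnected during maintenance: that end loses termination.
one_pulled = bus_is_terminated(BusEnd(host_cable_attached=True),
                               BusEnd())

# Same situation, with an external terminator installed in place of the cable.
terminator_fitted = bus_is_terminated(BusEnd(host_cable_attached=True),
                                      BusEnd(external_terminator=True))

print(both_cabled, one_pulled, terminator_fitted)
```

Only the middle case leaves the bus unterminated, which corresponds to the situation in which the remaining host reported SCSI bus errors.
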
Solution
To resolve the problem, reconnect the SCSI cable. If the cable needs to remain disconnected for an extended period of time, install an external SCSI terminator in its place.

The connected host may need to be rebooted so that the SCSI devices are re-scanned and properly recognized, since the device sync speed may have been reduced or the devices may have been marked offline.

To avoid this situation, disconnect a SCSI cable only when BOTH hosts are shut down, or at least ensure that no application is accessing the D1000 while the cable is disconnected.

Internal Comments
For internal Sun use only.
Service Request ID: 10775412
Escalation ID: 1-13232796
Solution ID: 1-13411978
Keywords: D1000, SunCluster, dual-host, scsi termination
Previously Published As 83317

Change History
Date: 2005-12-05
User Name: 95826
Action: Approved
Comment:
- fixed typo
- verified metadata
- changed review date to 2006-12-05
- checked for TM - none added
- checked audience : contract
Publishing Version: 3

Date: 2005-12-05
User Name: 95826
Action: Accept
Comment:
Version: 0

Attachments
This solution has no attachment