Summary of Contents for IBM Storwize V7000 Unified
IBM Storwize V7000 Unified Problem Determination Guide GA32-1057-14...
- The information in the “Safety and environmental notices” on page xi
- The information in the IBM Environmental Notices and User Guide (provided on a DVD)
This edition applies to IBM Storwize V7000 Unified and to all subsequent releases and modifications until otherwise indicated in new editions.
In the preceding examples, the numbers (C001) and (D002) are the identification numbers.
2. Locate the IBM Systems Safety Notices with the user publications that were provided with the Storwize V7000 Unified hardware.
3. Find the matching identification number in the IBM Systems Safety Notices. Then review the topics concerning the safety notices to ensure that you are in compliance.
“Labels” section. Note: You can find and download the current IBM System Safety Notices by searching for Publication number G229-9054 in the IBM Publications Center.
CAUTION: The battery contains lithium. To avoid possible explosion, do not burn or charge the battery. Do not: Throw or immerse into water, heat to more than 100°C (212°F), repair or disassemble. (C003) CAUTION: Electrical current from power, telephone, and communication cables can be hazardous.
It is intended that equipment installed within this rack will have its own enclosure. (R005). CAUTION: Tighten the stabilizer brackets until they are flush against the rack. (R006) CAUTION: Use safe practices when lifting. (R007) Storwize V7000 Unified: Problem Determination Guide 2073-720...
(R009)
Danger notices for Storwize V7000 Unified
Ensure that you are familiar with the danger notices for Storwize V7000 Unified. Use the reference numbers in parentheses at the end of each notice, such as (C003), to find the matching translated notice in IBM Systems Safety Notices.
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard:
- If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Observe the following precautions when working on or around your IT rack system:
- Heavy equipment–personal injury or equipment damage might result if mishandled.
- Always lower the leveling pads on the rack cabinet.
- Always install stabilizer brackets on the rack cabinet.
- To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet.
General safety
When you service the Storwize V7000 Unified, follow general safety guidelines. Use the following general rules to ensure safety to yourself and others.
- Observe good housekeeping in the area where the devices are kept during and after maintenance.
Attention: Depending on local conditions, the sound pressure can exceed 85 dB(A) during service operations. In such cases, wear appropriate hearing protection. Environmental notices This information contains all of the required environmental notices for IBM Systems products in English and other languages. Safety and environmental notices...
The IBM Systems Environmental Notices (http://ibm.co/1fBgWFI) information includes statements on limitations, product information, product recycling and disposal, battery information, flat panel display, refrigeration and water-cooling systems, external power supplies, and safety data sheets.
Storwize V7000 Unified. IBM Knowledge Center for Storwize V7000 Unified The information collection in the IBM Knowledge Center contains all of the information that is required to install, configure, and manage the system. The information collection in the IBM Knowledge Center is updated between product releases to provide the most current documentation.
Each of the PDF publications in the Table 2 library is also available in the IBM Knowledge Center by clicking the number in the “Order number” column: Table 2. Storwize V7000 Unified library Title Description Order number IBM Storwize V7000 Model...
Table 2. Storwize V7000 Unified library (continued) Title Description Order number Safety Information The guide contains translated caution and danger statements for the file module documentation. Each caution and danger statement in the Storwize V7000 Unified documentation has a number. Use the number to...
Some publications are available for you to view or download at no charge. You can also order publications. The publications center displays prices in your local currency. You can access the IBM Publications Center through the following website: www.ibm.com/e-business/linkweb/publications/servlet/pbi.wss...
Before calling for support, be sure to have your IBM Customer Number available. If you are in the US or Canada, you can call 1 (800) IBM SERV for help and service. From other parts of the world, see http://www.ibm.com/planetwide for the number that you can call.
Software option Identify the Storwize V7000 Unified product as your product and supply your customer number as proof of purchase. The customer number is a 7-digit number (0000000 to 9999999) assigned by IBM when the product is purchased. Your customer number should be located on the customer information worksheet or on the invoice from your storage purchase.
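The customer number format described above (seven decimal digits, 0000000 to 9999999) can be checked before you call. The following is a minimal Python sketch; the function name `is_valid_customer_number` is hypothetical and is not part of any IBM tool, and the check covers only the format, not whether IBM actually assigned the number.

```python
import re

# Hypothetical helper: checks only the format stated in the guide
# (exactly seven decimal digits, 0000000 to 9999999).
_CUSTOMER_NUMBER = re.compile(r"[0-9]{7}")


def is_valid_customer_number(value: str) -> bool:
    """Return True if value, ignoring surrounding whitespace, is seven ASCII digits."""
    return _CUSTOMER_NUMBER.fullmatch(value.strip()) is not None
```

A value copied from the customer information worksheet or the invoice can be passed in as a string; leading zeros are preserved because the number is never converted to an integer.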
At times, you might need expert advice about using a function provided by the system or about how to configure the system. Purchasing the IBM Support Line offering gives you access to this professional advice while deploying your system, and in the future.
Chapter 1. Storwize V7000 Unified hardware components
A Storwize V7000 Unified system consists of 1 or more machine type 2076 rack-mounted enclosures and 2 machine type 2073 rack-mounted file modules. Control enclosures contain the node canisters that manage the system operation and provide the host interfaces.
Figure 5. Rear view of 2073-720 file module
1 8 Gbps Fibre Channel port 1 (connected to the control enclosure)
2 8 Gbps Fibre Channel port 2 (connected to the control enclosure)
Important: Drive slots cannot be empty. Install a drive assembly or blank carrier in each slot. Note: Drives that are sold as Storwize V7000 Unified options are the only drives that are supported. For more information, see the Support website.
Figure 9. Storwize V7000 Gen2 Small form factor vertical drive Drive indicators for control enclosures Storwize V7000 Unified enclosures use different drive indicators, depending on the generation of your control enclosure model. Drives have two light-emitting diode (LED) indicators each; they have no controls or connectors.
- If the LED is on, a fault exists on the drive.
- If the LED is off, no known fault exists on the drive.
- If the LED is flashing, the drive is being identified. A fault might or might not exist.
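The three LED states listed above map directly onto a small lookup table. The following Python sketch is illustrative only; the names `LedState` and `describe_fault_led` are assumptions introduced here, and the meanings are copied from the list above.

```python
from enum import Enum


class LedState(Enum):
    """Observed state of the drive LED (names are illustrative)."""
    ON = "on"
    OFF = "off"
    FLASHING = "flashing"


# Meanings taken from the drive LED description in this guide.
FAULT_LED_MEANING = {
    LedState.ON: "A fault exists on the drive.",
    LedState.OFF: "No known fault exists on the drive.",
    LedState.FLASHING: "The drive is being identified; a fault might or might not exist.",
}


def describe_fault_led(state: str) -> str:
    """Translate an observed LED state ('on', 'off', 'flashing') into its meaning.

    Raises ValueError for any other state name.
    """
    return FAULT_LED_MEANING[LedState(state.strip().lower())]
```

Keeping the states in an enumeration means an unexpected observation fails loudly instead of silently returning a wrong meaning.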
Enclosure end cap indicators Storwize V7000 Unified enclosure end cap indicators vary, depending on the generation of your control enclosure model. Storwize V7000 Gen1 Figure 12 shows where the end caps are located on the front of an enclosure with 12 drives.
Figure 15 on page 9 shows the rear view of a model 2076-312 or a model 2076-324 control enclosure with the 10 Gbps Ethernet port ( 5 ). Figure 16 on page 9 shows the rear of an expansion enclosure.
Power supply units for control enclosures Storwize V7000 Unified enclosures use different power supply units, depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table:
Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 8. Storwize V7000 Unified Gen2 model numbers...
Figure 18. LEDs on the power supply units of the control enclosure
Table 9 identifies the LEDs in the rear of the control enclosure.
Table 9. Power supply unit LEDs in the rear of the control enclosure
Name              Color  Symbol
ac power failure  Amber
Storwize V7000 Unified enclosures use different power supply units, depending on the generation of your expansion enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 10. Storwize V7000 Unified Gen1 model numbers Machine...
There is a power switch on each of the power supply units. The switch must be on for the power supply unit to be operational. If the power switches are turned off, the power supply units stop providing power to the system.
A node canister contains a battery that provides power to the canister as it stores cache and system data to an internal drive in the event of a power failure. This process is known as a fire hose dump.
Figure 22. Storwize V7000 2076-524 node canister indicators
Storwize V7000 2076-524 node canister SAS port LEDs
Table 14 on page 16 depicts the status LEDs for SAS ports 1 and 2, and their location in Figure 22.
4 phys connected.
- Not all 4 phys are at the same speed.
- One or more of the connected phys are attached to an address different from the others
Storwize V7000 2076-524 node canister battery status LEDs
Table 15 on page 18 shows the battery status LEDs and their location in Figure 22 on page 15.
There is a fault in the battery.
Storwize V7000 2076-524 node canister system status LEDs
Table 16 on page 19 shows the system status LEDs and their location in Figure 22 on page 15.
FAST BLINK The canister is active, able to complete I/O operations, or starting. The node is part of a cluster.
Two USB ports are located on each Storwize V7000 Gen2 node canister. The USB ports are numbered 1 on top and 2 on the bottom as shown in Figure 23 on page 21. One port is used during installation.
Figure 24 on page 22. Each port can have up to an 8 Gbps SW SFP transceiver installed. Each transceiver connects to a host or Fibre Channel switch with an LC-to-LC Fibre Channel cable.
Fibre Channel over Ethernet connections to host systems or storage systems. Each port can support simultaneous FCoE and iSCSI connections. The Small Form-factor Pluggable (SFP) transceivers that are installed on the adapter support data transfer speeds of 10 Gbps.
Figure 26. 10 Gbps Fibre Channel over Ethernet/iSCSI host interface adapter ports
Storwize V7000 2076-524 10 Gbps Fibre Channel over Ethernet/iSCSI host interface adapter indicators
Each port has two LED indicators, one green and one amber (see Figure 27 on page 24).
The ports are numbered 1 - 4 from left to right and top to bottom. Note: The reference to the left and right locations applies to canister 1, which is the upper canister. The port locations are inverted for canister 2, which is the lower canister.
Table 19. Fibre Channel port LED locations on canister 1
Associated port  LED location                          LED status
Port 3 (3)       First LED between ports 1 and 3 (1)   Speed
Port 1 (1)       Second LED between ports 1 and 3 (2)  Speed
27. One port is used during installation. Note: The reference to the left and right locations applies to canister 1, which is the upper canister. The port locations are inverted for canister 2, which is the lower canister.
Two LEDs are associated with each port. Note: The reference to the left and right locations applies to canister 1, which is the upper canister. The port locations are inverted for canister 2, which is the lower canister.
Figure 32 shows the location of the 10 Gbps Ethernet ports.
Figure 32. 10 Gbps Ethernet ports on the 2076-312 and 2076-324 node canisters
Table 22 on page 29 provides a description of the LEDs.
The port locations are inverted for canister 2, which is the lower canister. Figure 33. SAS ports on the node canisters. SAS ports must be connected to Storwize V7000 Unified enclosures only. See “Problem: Storwize V7000 Gen1 SAS cabling not valid” on page 248 for help in attaching the SAS cables.
It is not able to perform I/O in a system. When the node is in either of these states, it can be removed. Do not remove the canister unless directed by a service procedure.
There are no defined procedures that use the port.
Storwize V7000 Gen2 expansion canister SAS ports and indicators
Two SAS ports are located in the rear of the Storwize V7000 Gen2 expansion canister.
One or more, but not all, of the 4 phys are connected.
- Not all 4 phys are at the same speed.
- One or more of the connected phys are attached to an address different from the others
The link is connected and has activity. The link is connected.
Storwize V7000 Gen2 expansion canister LEDs
Each Storwize V7000 Gen2 expansion canister has three LEDs that provide status and identification for the expansion canister.
The two LEDs are located in a vertical row on the left side of the canister. Figure 38 on page 35 shows the LEDs ( 1 ) in the rear of the expansion canister.
- If the LED is on, a fault exists.
- If the LED is off, no fault exists.
- If the LED is flashing, the canister is being identified. This status might or might not be a fault.
Use this address if the control enclosure CLI is not working. These addresses are not set during the installation of a Storwize V7000 Unified system, but you can set these IP addresses later by using the management GUI or the chserviceip CLI command.
RAID arrays for the disk system. The Storwize V7000 Unified system uses a pair of file modules for redundancy. Follow the appropriate power down procedures to minimize impacts to the system operations.
Call Home. When the event is received, IBM automatically opens a problem report, and if appropriate, contacts you to verify if replacement parts are required. If you set up Call Home to IBM, ensure that the contact details that you configure are correct and kept up to date as personnel change.
The management GUI provides the capability to review these issues from the Events panel. For file module issues, use the Storwize V7000 Unified information center to look up the events and perform the actions listed for the events. For Storwize V7000 issues, resolve these problems through the Recommended actions only option from the Events panel.
Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table:
Table 31. Storwize V7000 Unified Gen2 model numbers
Machine type/model  Description
2076-524            Storwize V7000 Unified control enclosure, with up to 24 2.5-inch (6.35 cm) drives
2076-12F            Storwize V7000 Unified expansion enclosure for up to 12 3.5-inch (8.89...
Before calling for support, be sure to have your IBM Customer Number available. If you are in the US or Canada, you can call 1 (800) IBM SERV for help and service. From other parts of the world, see http://www.ibm.com/planetwide for the number that you can call.
If you call from somewhere other than the US or Canada, you must choose the software or hardware option when calling for assistance. Choose the software option if you are uncertain if the problem involves the Storwize V7000 Unified software or hardware. Choose the hardware option only if you are certain the problem solely involves the Storwize V7000 Unified hardware.
At times, you might need expert advice about using a function provided by the system or about how to configure the system. Purchasing the IBM Support Line offering gives you access to this professional advice while deploying your system, and in the future.
If users or applications are having trouble accessing data that is held on the Storwize V7000 Unified system, or if the management GUI is not accessible or is running slowly, the Storwize V7000 control enclosure might have a problem.
187; otherwise, see “Checking the GPFS file system mount on each file module” on page 189. If you have lost access to the files, but there is no sign that anything is wrong with the Storwize V7000 Unified system, see “Host to file modules connectivity” on page 63. Installation troubleshooting This topic provides information for troubleshooting problems encountered during the installation.
– Product Family: Disk Systems
– Product: IBM Storwize V7000 Unified
– Release: All
– Platform: All
Before loading the USB flash drive, verify that it has a FAT32 formatted file system. Plug the USB flash drive into the laptop. Go to Start (my computer), right-click the USB drive.
SONAS_results.txt file and open it. Check for errors and corrective actions (refer to Storwize V7000 Unified Problem Determination Guide PDF on the CD). If no errors are listed, reboot both file modules, allow file modules to boot completely, reinsert the USB flash drive as originally instructed and try again.
3. Refer to Table 33 to match the code (A-I) to the recommended action. Follow the suggested action, in order, completing one before trying the next. 4. If the recommended action or actions fail, call the IBM Support Center. Table actions defined This table serves as a legend for defining the precise action to follow.
Verify that the Ethernet cabling connections are seated properly between the Storwize V7000 Unified control enclosure and the customer network, and between the file modules and the customer network. Then press the Restart button if the management GUI has already started; otherwise, reinsert the USB flash drive into the original file module.
Table 34. Error messages and actions (continued)
Error code  Error message  Action key
0A0D  Error querying settings through ASU.
0A0E  Error setting ASU command.
0A0F  Unable to determine adapter name from VPD.
0A10  Unable to open the ifcfg file.
0A11  Unable to write to the ifcfg file.
      No host name provided to exchange keys with.
0AD5  Host name is invalid.
0AD6  Invalid parameters.
0AD7  Unable to open vpdnew.txt file.
0AD8  VPD failed to update a value.
0AD9  Invalid option.
0ADA  Error while parsing adapter ID.
0ADB  Unable to open /proc/scsi/scsi.
0AF8  Trying to install management stack on non-management node.
0AF9  Invalid site ID. Currently only 'st001' is supported on physical systems.
0AFA  This node is already a part of a cluster.
There was an error while installing GPFS callbacks.
0B92  Rsync failed between management nodes.
0B94  There were too many potential peer storage nodes. Storage controllers may be cabled incorrectly or UUIDs might not be set properly.
Error running update test utility on controller, see Storwize V7000 for more details.
0BD7  Yum is reporting a package error on a node. Try running yum manually.
01B2  Unable to start performance collection daemon. Contact IBM Remote Technical Support.
Chapter 3. Getting started troubleshooting...
01D5  Storwize V7000 stalled. Contact IBM Remote Technical Support.
01D6  Storwize V7000 stalled_non_redundant
01DA  GPFS cluster is unhealthy. Refer to “Checking the GPFS file system mount on each file module” on page 189.
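The error code tables above pair a four-character code with a message, which lends itself to a simple lookup. The following Python sketch uses only a few rows copied from the Table 34 excerpt earlier in this chapter; the names `INSTALL_ERRORS` and `describe_error` are hypothetical, and the action keys are omitted because they do not appear in this excerpt.

```python
# Codes and messages copied from the Table 34 excerpt in this guide.
# Action keys are not present in the excerpt, so they are left out.
INSTALL_ERRORS = {
    "0A0D": "Error querying settings through ASU.",
    "0A0E": "Error setting ASU command.",
    "0A0F": "Unable to determine adapter name from VPD.",
    "0A10": "Unable to open the ifcfg file.",
    "0A11": "Unable to write to the ifcfg file.",
    "0AD5": "Host name is invalid.",
    "0AD6": "Invalid parameters.",
    "0AFA": "This node is already a part of a cluster.",
}


def describe_error(code: str) -> str:
    """Return the message for an error code, ignoring case and surrounding whitespace."""
    key = code.strip().upper()
    return INSTALL_ERRORS.get(key, f"{key}: not in this excerpt; see the full guide")
```

An unknown code returns a pointer back to the full guide rather than raising, because the excerpt covers only part of the table.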
/opt/IBM/sofs/cli/cfgperfcenter --stop. If successful, restart the update. If you are unable to stop the performance center, contact IBM Remote Technical Support.
Problems reported by the CLI commands during software configuration
Use this information when troubleshooting problems reported by the CLI commands during software configurations.
1. Does the GUI launch and are there problems logging into the system? v Yes: Check that the user ID being used was set up to access the GUI. Refer to “Authentication basic concepts” in the IBM Storwize V7000 Unified Information Center.
- Yes:
  a. Run the CLI command lsnode and determine the status of the file nodes.
  b. If lsnode reports that the management service is not running, refer to “Management node role failover procedures” on page 183. If lsnode provides the system configuration information, check the connection status under the appropriate heading.
About this task Within the Storwize V7000 Unified system, the system Health Status is based on a set of predefined software and hardware health status sensors. The status of each component is displayed against the corresponding logical host name in the System and System Details pages.
This topic instructs you where to go to view the information that is displayed, how to check the status of the various sensors, and how to manually close out sensor events. By performing these tasks, you ensure that the overall Health Status reflects the current system health.
) public file access If you are looking at a problem regarding built-in Ethernet port 1 or built-in Ethernet port 2, refer to “Ethernet connectivity between file modules” on page 65. Isolation procedures:
These connections are used for internal management operations between the file modules. They use the internal IP address range that you provided when initializing the Storwize V7000 Unified system. About this task This procedure is used to troubleshoot Ethernet connectivity between the file modules.
If you are looking at a problem regarding built-in Ethernet port 3, built-in Ethernet port 4, or any network connections to PCI slot 4, refer to “Host to file modules connectivity” on page 63. Isolation procedures:
It is always possible that somebody at your site could set up another machine to use one or more IP addresses that your Storwize V7000 Unified system is already using. Use the management GUI to check which four IP addresses the file modules are currently using to communicate with each other.
Use the lsstoragesystem CLI command to show the IP address that the active management node, running on one of the file modules, uses to send commands over SSH to the storage system CLI. For example:
CLI command). Otherwise you may have plugged the USB flash drive into the wrong control enclosure (such as one that is not part of this Storwize V7000 Unified system). The node_status should be active for each node canister in the cluster under sainfo lsservicestatus. Otherwise follow the service action under sainfo lsservicerecommendation.
CLI command. Here is an example:
>ssh superuser@<system IP address>
$ chsystemip -clusterip 9.20.136.5 -gw 9.20.136.1 -mask 255.255.255.0 -port 1
The default password for superuser is passw0rd.
Update the file module's record of the control enclosure system IP:
To find the file module's current record of the control enclosure system IP address, use the Storwize V7000 Unified management CLI to issue the lsstoragesystem command. Here is an example:
>ssh admin@<management_IP>
[kd01ghf.ibm]$ lsstoragesystem
name primaryIP secondaryIP id
StorwizeV7000 9.11.137.130 9.11.137.130 00000200A2601508
EFSSG1000I The command completed successfully.
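The lsstoragesystem output in the example above can be compared against the control enclosure system IP address that you expect. The following Python sketch assumes whitespace-separated columns with the header on the first line and a trailing EFSSG status line, based only on the single example in this guide; real output may differ. The function names `parse_lsstoragesystem` and `record_matches` are hypothetical.

```python
def parse_lsstoragesystem(output: str) -> list[dict[str, str]]:
    """Parse whitespace-separated lsstoragesystem output into one dict per row.

    Assumes the first non-empty line is the header and that status lines
    start with an EFSSG message ID, as in the guide's example.
    """
    lines = [ln.strip() for ln in output.splitlines() if ln.strip()]
    if not lines:
        return []
    header = lines[0].split()
    rows = []
    for line in lines[1:]:
        if line.startswith("EFSSG"):
            continue  # status message, not a data row
        fields = line.split()
        if len(fields) == len(header):
            rows.append(dict(zip(header, fields)))
    return rows


def record_matches(output: str, system_ip: str) -> bool:
    """Return True if any row lists system_ip in its primaryIP column."""
    return any(row.get("primaryIP") == system_ip
               for row in parse_lsstoragesystem(output))


# Sample text copied from the example in this guide.
SAMPLE = """\
name primaryIP secondaryIP id
StorwizeV7000 9.11.137.130 9.11.137.130 00000200A2601508
EFSSG1000I The command completed successfully.
"""
```

If `record_matches(SAMPLE, ...)` is false for the address that the control enclosure now uses, the file module's record is out of date and needs the update described in this procedure.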
Both ports are used to connect to the Storwize V7000 control enclosure with a connection going to each control canister as shown in Figure 41 on page 73 or Figure 42 on page 74.
Figure 41. Connecting the file modules to the Storwize V7000 Gen1 control enclosure using Fibre Channel cables
A File module 1
B File module 2
C Storwize V7000 control enclosure
1 File module 1 - Fibre Channel port 1...
Table 38. How to connect Fibre Channel cables from file modules to the control enclosure.
File module                     Control enclosure
A File module 1                 C Control enclosure
1 Fibre Channel slot 2, port 1  5 Node canister 1 Fibre Channel port 1
2 Fibre Channel slot 2, port 2  7 Node canister 2 Fibre Channel port 1
B File module 2...
Fibre Channel port but a broken connection at the Storwize V7000 node canister. The most likely cause is either a faulty Fibre Channel cable or a bad Fibre Channel port on the Storwize V7000 node canister.
Table 41. LED states and associated actions. For the Fibre Channel adapters on the file module check the amber LED lights next to the port. (continued) LED State Definition and Action Rapid flashing amber LED This state indicates the Fibre Channel adapter is attempting to resync the Fibre Channel connection.
LEDs on the light path diagnostics panel. This information and the information in Light path diagnostics LEDs can often provide enough information to diagnose the error.
12v channel error LEDs indicate an overcurrent condition. Refer to the procedure “Solving power problems” in the “Troubleshooting the System x3650” in the IBM Storwize V7000 Unified Information Center to identify the components that are associated with each power channel, and the order in which to troubleshoot the components.
supplies are damaged.
Use the IBM Power Configurator utility to determine current system power consumption. For more information and to download the utility, go to http://www-03.ibm.com/systems/bladecenter/...
- PCI riser cards
- ServeRAID adapter
- Optional network adapter
- (Trained technician only) System board
e. If the failure remains, go to http://www.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=SERV-CALL.
2. If the PCI LED and the CONFIG LED are lit, complete the following steps to correct the problem: a.
E5-2690. If it is, check that fewer than eight 2.5-inch hard disk drives are installed.
b. Check the system-error logs for information about the error. Replace any component that is identified in the error log.
LINK Reserved.
LED on the system board, are installed correctly.
b. (Trained technician only) Replace the failing microprocessor.
c. For more information, go to http://www.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=SERV-CALL.
2. If the CONFIG LED and the CPU LED are lit, the system issues an invalid microprocessor configuration error.
5. Make sure that the heat sink, the fan on the adapter, or the optional network adapter is seated correctly. If the fan has failed, replace it.
6. If the failure remains, go to http://www.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=SERV-CALL.
A fan that failed, is operating too 1.
1) Replace the hard disk drive.
2) Replace the hard disk drive backplane.
e. If the problem remains, go to http://www.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=SERV-CALL.
2. If the HDD LED and the CONFIG LED are lit, complete the following steps to correct the problem: a.
4. If the problem remains, replace the power supply.
The power supply has failed. Replace the power supply.
The power supply has failed. Replace the power supply.
The power supply has failed. Replace the power supply.
The LEDs provide a general idea of the volume system status. For specifics about the status of control enclosures, expansion enclosures, node canisters, and expansion canisters, see Chapter 1, “Storwize V7000 Unified hardware components,” on page 1. Also refer to “Procedure: Understanding the system status using the LEDs”...
GUI first to diagnose and resolve the problem. Use the views that are available in the management GUI to verify the status of the system, the hardware devices, the physical storage, and the available volumes. The...
The fix procedures automatically perform configuration changes that are required to return the system to its optimum state. Accessing the Storwize V7000 Unified management GUI This procedure describes how to access the Storwize V7000 Unified management GUI. About this task You must use a supported web browser.
You can use fix procedures to diagnose and resolve problems with the Storwize V7000 Unified. About this task For example, to repair a Storwize V7000 Unified system, you might complete the following tasks: v Analyze the event log (if it is available, or view node errors)
Removing a file module to perform a maintenance action You can remove an IBM Storwize V7000 Unified file module to perform maintenance. The procedure that you follow differs slightly, depending on whether you must unplug the power cables.
Page 120
Removing a file module and disconnecting power You must remove an IBM Storwize V7000 file module from the file cluster and disconnect it from its power line cords before performing a maintenance action that requires the file module to have no power.
Page 121
To remove the mgmt001st001 file module from the system, for example, issue the following command:
# suspendnode mgmt001st001
3. Wait for the Storwize V7000 Unified system to stop the file module at the clustered trivial database (CTDB) level. The command does not unmount any mounted file systems.
About this task Installation guidelines To help you work safely with IBM Storwize V7000 Unified file modules, read the safety information in , Safety information statements, and these guidelines. Before you remove or replace a component, read the following information: v When you install a file module, take the opportunity to download and apply the most recent firmware updates.
Page 123
  – To avoid straining the muscles in your back, lift by standing or by pushing up with your leg muscles.
- Make sure that you have an adequate number of properly grounded electrical outlets for the PDUs.
- Back up all important data before you make changes to disk drives.
- Have a small flat-blade screwdriver available.
When returning a device or component, follow all packaging instructions and use any supplied packaging materials for shipping. Resolving hard disk drive problems Use this information to address various hard disk drive issues.
Page 125
About this task
- Before running a procedure, refer to “Removing a file module to perform a maintenance action” on page 91.
- Follow the suggested actions for a Symptom in the order in which they are listed in the Action column until the problem is solved.
Page 126
Turn on the server and observe the activity of the hard disk drive LEDs. Displaying node mirror and hard drive status The Storwize V7000 Unified system provides a method to check the node mirror status and hard drive status for each file module.
File modules in this Storwize V7000 Unified Cluster
Node  Node Name      Node Details
--------------------------------------------------------------------------------
1.    mgmt001st001   x3650m3 KQ186WX
2.    mgmt002st001   x3650m3 KQ186WV
B. Back to Menus
Choice:
Figure 44. Selecting a file module to display node status
3. Select the number for a file module to display its status. For example, type 1 to select mgmt001st001.
The volume is Active. The user data is not fully protected due to a configuration change or drive failure.
Rebuilding (RBLD) or Resyncing (RSY): A data resynchronization or rebuild might be in progress.
Table 44. Status of volume (continued)
Status of volume: Description
Inactive, Okay (OKY): The volume is inactive and the drives are functioning correctly. The user data is protected if the current RAID level is RAID 1 (IM) or RAID 1E (IME).
Inactive, Degraded (DGD): The volume is inactive and the user data is not fully protected due...
SMART ASCQ : none
Figure 46. Example that shows that mirroring is re-synchronizing
If a drive were not synchronized, the status might appear like the status shown in Figure 47 on page 103:
The mirror is not created/configured. If the mirror is not created, refer to “Troubleshooting the System x3650” in the IBM Storwize V7000 Unified Information Center for information on launching the LSI configuration tool. Chapter 4. File module...
ASC/ ASCQ error of 05/00. For isolation and the repair of hard disk problems, refer to “Troubleshooting the System x3650” in the IBM Storwize V7000 Unified Information Center. For a list of SMART (ASC/ASCQ) error codes and their descriptions, go to “SMART ASC/ASCQ error codes and messages”...
Device is a Hard disk
Enclosure #
Slot #
Connector ID
Target ID
State : Online (ONL)
Size (in MB)/(in sectors) : 286102/585937500
Manufacturer : IBM-ESXS
Model Number : MBD2300RC
Firmware Revision : SB19
Serial No : D009P9A01SJC
Drive Type : SAS
Protocol...
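As a quick sanity check on the Size (in MB)/(in sectors) field in the example above, the two numbers appear to be consistent with 512-byte sectors and a size reported in units of 1024 x 1024 bytes. The sector size is an assumption; the listing itself does not state it. The following minimal sketch only shows the arithmetic, and its function names are illustrative:

```python
# Assumption: 512-byte sectors. The drive listing does not state the sector size.
SECTOR_BYTES = 512
MB_BYTES = 1024 * 1024


def sectors_to_mb(sectors: int, sector_bytes: int = SECTOR_BYTES) -> int:
    """Convert a sector count to whole megabytes (1024 * 1024 bytes), rounding down."""
    return (sectors * sector_bytes) // MB_BYTES


def parse_size_field(value: str):
    """Split an 'MB/sectors' value such as '286102/585937500' into two integers."""
    mb_text, sectors_text = value.split("/")
    return int(mb_text), int(sectors_text)


reported_mb, sectors = parse_size_field("286102/585937500")
print(reported_mb, sectors_to_mb(sectors))  # 286102 286102
```

If the two values disagree for a given drive, the sector-size assumption probably does not hold for that drive.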
Page 134
LOGICAL UNIT NOT READY, START STOP UNIT COMMAND IN PROGRESS
LOGICAL UNIT DOES NOT RESPOND TO SELECTION
NO REFERENCE POSITION FOUND
MULTIPLE PERIPHERAL DEVICES SELECTED
LOGICAL UNIT COMMUNICATION FAILURE
LOGICAL UNIT COMMUNICATION TIME-OUT
LOGICAL UNIT COMMUNICATION PARITY ERROR
Page 135
Table 46. SMART ASC/ASCQ error codes and messages (continued)
ASCQ Description
LOGICAL UNIT COMMUNICATION CRC ERROR (ULTRA-DMA/32)
UNREACHABLE COPY TARGET
TRACK FOLLOWING ERROR
HEAD SELECT FAULT
ERROR LOG OVERFLOW
WARNING
WARNING - SPECIFIED TEMPERATURE EXCEEDED
WARNING - ENCLOSURE DEGRADED
WARNING - BACKGROUND SELF-TEST FAILED
WARNING - BACKGROUND PRE-SCAN DETECTED MEDIUM ERROR
WARNING - BACKGROUND MEDIUM SCAN DETECTED MEDIUM...
Page 136
RECOVERED DATA WITHOUT ECC - RECOMMEND REWRITE
RECOVERED DATA WITHOUT ECC - DATA REWRITTEN
RECOVERED DATA WITH ERROR CORRECTION APPLIED
RECOVERED DATA WITH ERROR CORR. & RETRIES APPLIED
RECOVERED DATA - DATA AUTO-REALLOCATED
RECOVERED DATA - RECOMMEND REASSIGNMENT
Page 137
Table 46. SMART ASC/ASCQ error codes and messages (continued)
ASCQ Description
RECOVERED DATA - RECOMMEND REWRITE
RECOVERED DATA WITH ECC - DATA REWRITTEN
DEFECT LIST ERROR
DEFECT LIST NOT AVAILABLE
DEFECT LIST ERROR IN PRIMARY LIST
DEFECT LIST ERROR IN GROWN LIST
PARAMETER LIST LENGTH ERROR
SYNCHRONOUS DATA TRANSFER ERROR
DEFECT LIST NOT FOUND...
Page 138
TIMESTAMP CHANGED
SA CREATION CAPABILITIES DATA HAS CHANGED
COPY CANNOT EXECUTE SINCE HOST CANNOT DISCONNECT
COMMAND SEQUENCE ERROR
ILLEGAL POWER CONDITION REQUEST
PREVIOUS BUSY STATUS
PREVIOUS TASK SET FULL STATUS
PREVIOUS RESERVATION CONFLICT STATUS
Page 139
Table 46. SMART ASC/ASCQ error codes and messages (continued)
ASCQ Description
ORWRITE GENERATION DOES NOT MATCH
COMMANDS CLEARED BY ANOTHER INITIATOR
COMMANDS CLEARED BY POWER LOSS NOTIFICATION
COMMANDS CLEARED BY DEVICE SERVER
INCOMPATIBLE MEDIUM INSTALLED
CANNOT READ MEDIUM - UNKNOWN FORMAT
CANNOT READ MEDIUM - INCOMPATIBLE FORMAT
CLEANING CARTRIDGE INSTALLED
CANNOT WRITE MEDIUM - UNKNOWN FORMAT...
Page 140
ATA DEVICE FAILED SET FEATURES
SELECT OR RESELECT FAILURE
UNSUCCESSFUL SOFT RESET
SCSI PARITY ERROR
DATA PHASE CRC ERROR DETECTED
SCSI PARITY ERROR DETECTED DURING ST DATA PHASE
INFORMATION UNIT IUCRC ERROR DETECTED
Page 141
Table 46. SMART ASC/ASCQ error codes and messages (continued)
ASCQ Description
ASYNCHRONOUS INFORMATION PROTECTION ERROR DETECTED
PROTOCOL SERVICE CRC ERROR
PHY TEST FUNCTION IN PROGRESS
SOME COMMANDS CLEARED BY ISCSI PROTOCOL EVENT
INITIATOR DETECTED ERROR MESSAGE RECEIVED
INVALID MESSAGE ERROR
COMMAND PHASE ERROR
DATA PHASE ERROR
INVALID TARGET PORT TRANSFER TAG RECEIVED...
Page 142
DATA CHANNEL IMPENDING FAILURE GENERAL HARD DRIVE FAILURE
DATA CHANNEL IMPENDING FAILURE DRIVE ERROR RATE TOO HIGH
DATA CHANNEL IMPENDING FAILURE DATA ERROR RATE TOO HIGH
DATA CHANNEL IMPENDING FAILURE SEEK ERROR RATE TOO HIGH
Page 143
Table 46. SMART ASC/ASCQ error codes and messages (continued)
ASCQ Description
DATA CHANNEL IMPENDING FAILURE TOO MANY BLOCK REASSIGNS
DATA CHANNEL IMPENDING FAILURE ACCESS TIMES TOO HIGH
DATA CHANNEL IMPENDING FAILURE START UNIT TIMES TOO HIGH
DATA CHANNEL IMPENDING FAILURE CHANNEL PARAMETRICS
DATA CHANNEL IMPENDING FAILURE CONTROLLER DETECTED
DATA CHANNEL IMPENDING FAILURE THROUGHPUT PERFORMANCE...
Page 144
UNABLE TO DECRYPT PARAMETER LIST
SA CREATION PARAMETER VALUE INVALID
SA CREATION PARAMETER VALUE REJECTED
INVALID SA USAGE
SA CREATION PARAMETER NOT SUPPORTED
AUTHENTICATION FAILED
LOGICAL UNIT ACCESS NOT AUTHORIZED
SECURITY CONFLICT IN TRANSLATED DEVICE
Understanding error codes
The Storwize V7000 Unified error codes convey specific information in an alphanumeric sequence.
Tip: To search for an error code or event ID, prefix it with EFS. For example, for 66012FC, search on EFS66012FC.
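The search tip can be sketched as a small helper that builds the search term from a code. This is an illustration only: the function name is hypothetical, and trimming whitespace and upper-casing the input are assumptions made for convenience, not rules stated in this guide.

```python
PREFIX = "EFS"


def to_search_term(code: str) -> str:
    """Return the code with the EFS prefix, without doubling an existing prefix."""
    # Assumption: codes are matched in upper case and surrounding spaces are noise.
    code = code.strip().upper()
    return code if code.startswith(PREFIX) else PREFIX + code


print(to_search_term("66012FC"))  # EFS66012FC
```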
Optional Ethernet port 7 (Dual Port 10G card)
Fibre channel adapter 1 (both ports) – Storage node only
Fibre channel adapter 2 (both ports) – Storage node only
Bonded device (data0 mgmt0)
System x internal hard disk drives
Table 50. Originating file module specific software code – Code 1, 3, 5. Listing devices for variable C in the specific software code sequence of ABBCDDDD. C = Originating specific software code in sequence ABBCDDDD
Code Device
Red Hat Linux
GPFS
CIFS server
CTDB...
Unique error code
Severity of the error
Understanding event IDs
The Storwize V7000 Unified messages follow a specific format, which is detailed here.
About this task
Tip: To search for an error code or event ID, prefix it with EFS. For example, for 66012FC, search on EFS66012FC.
  I for Asynchronous Replication
  J for SCM
  L for HSM
  AK for NDMP
- The element nnnn is a 4-digit message number.
- The element x indicates the severity of the error. The value x can be:
  A for Action: GUI error messages. The user must perform a specific action.
  C for Critical: A critical error occurred, which must be corrected by the user or system administrator.
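The nnnn and x elements described above can be pulled out of an event ID with a small parser. The sketch below assumes that an ID is laid out as EFS, then a run of letters, then the 4-digit message number, then the one-letter severity, which matches an ID such as EFSSG0654I; the exact layout of the letters after EFS is not shown in this excerpt, so they are returned unparsed. Only the two severity values listed here (A and C) are mapped; any other letter is reported as unmapped rather than guessed.

```python
import re
from typing import NamedTuple, Optional

# Assumed layout: "EFS" + letters + 4-digit message number + severity letter.
EVENT_ID = re.compile(r"^EFS([A-Z]+)(\d{4})([A-Z])$")

# Only the severities named in this section; other letters are left unmapped.
SEVERITY = {"A": "Action", "C": "Critical"}


class EventId(NamedTuple):
    letters: str             # letters between "EFS" and the number, unparsed
    number: str              # the 4-digit message number (nnnn)
    severity_code: str       # the severity letter (x)
    severity: Optional[str]  # readable name, or None if not listed above


def parse_event_id(text: str) -> Optional[EventId]:
    """Split an event ID into its parts, or return None if it does not fit the layout."""
    match = EVENT_ID.match(text.strip())
    if match is None:
        return None
    letters, number, code = match.groups()
    return EventId(letters, number, code, SEVERITY.get(code))


print(parse_event_id("EFSSG0654I"))
```

Returning None for input that does not fit keeps the helper from raising on codes in other formats, such as 66012FC.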
162
“Installing the operator information panel assembly” on page 163
“Removing the hot-swap drive backplane” on page
“Installing the hot-swap drive backplane” on page
“Removing the 240 VA safety cover” on page 127
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 153
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 156
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 157
[Figure callouts: Screw, Safety cover]
1. Line up and insert the tabs on the bottom of the safety cover into the slots on the system board.
2. Slide the safety cover toward the back of the file module until it is secure.
3.
2. To disconnect the SAS signal cables, make sure that you first disconnect the power cable, and then the signal cable and configuration cable.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 160
Statement 2 CAUTION: When you are replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery that is recommended by the manufacturer. If your system has a module that contains a lithium battery, replace it only with the same module type made by the same manufacturer.
In the United States, IBM has established a return process for reuse, recycling, or proper disposal of used IBM sealed lead acid, nickel cadmium, nickel metal hydride, and other battery packs from IBM Equipment. For information on proper disposal of these batteries, contact IBM at 1-800-426-4333.
Page 162
For proper collection and treatment, contact your local IBM representative.
Page 163
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
For more information, see the IBM Environmental Notices and User's Guide on the IBM Documentation CD. To install the replacement battery, complete the following steps: Procedure 1. Follow any special handling and installation instructions that come with the replacement battery.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 167
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 170
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
5. Carefully grasp the adapter by its top edge or upper corners, and pull the adapter from the PCI expansion slot.
6. If you are instructed to return the adapter, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 174
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 175
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 176
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 179
About this task
To remove the DVD drive, complete the following steps.
[Figure callout: Release tab]
Procedure
1. Read the Safety information and “Installation guidelines” on page 94. Follow the procedure in “Removing a file module and disconnecting power” on page 92 to suspend the file module from the cluster and shut it down, and then disconnect all power cords and external cables.
Page 180
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 181
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Figure 68. Locations of the DIMM connectors on the system board To install a DIMM, complete the following procedure. See Table 54 on page 155 for a listing of the eight DIMM slots populated with the memory RDIMM.
Page 184
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 185
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 186
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 188
Hazardous voltage, current, and energy levels are present inside any component that has this label attached. There are no serviceable parts inside these components. If you suspect a problem with one of these parts, contact a service technician.
Page 189
Attention: During normal operation, each power-supply bay must have either a power supply or power-supply filler installed for proper cooling.
To install a hot-swap ac power supply, complete the following steps:
Procedure
1. Read the Safety information and “Installation guidelines” on page 94.
2.
Page 190
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 191
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Page 192
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
Removing a microprocessor and heat sink IBM authorized service providers can remove and replace a microprocessor and heat sink in the file module. The following procedure is for a field replaceable unit (FRU).
Page 195
About this task
Attention:
- Always use the microprocessor installation tool to remove a microprocessor. Failing to use the microprocessor installation tool may damage the microprocessor sockets on the system board. Any damage to the microprocessor sockets may require replacing the system board.
- Microprocessors are to be removed only by trained service technicians.
Page 196
Note: If you are replacing a microprocessor, use the empty installation tool that comes with the CRU to remove the microprocessor.
a. Twist the handle on the microprocessor tool counterclockwise so that it is in the open position.
[Figure callouts: Handle, Installation tool]
Page 197
b. Align the installation tool with the alignment pins on the microprocessor socket and lower the tool on the microprocessor. The installation tool rests flush on the socket only if aligned correctly.
[Figure callouts: Installation tool, Alignment pins, Microprocessor]
c. Twist the handle on the installation tool clockwise.
[Figure callouts: Handle, Installation tool...]
Page 198
The air baffle must be installed to provide proper system cooling.
- If you have to replace the microprocessor, call IBM Remote Technical Support for service.
- If the thermal-grease protective cover (for example, a plastic cap or tape liner) is removed from the heat sink, do not touch the thermal grease on the bottom of the heat sink or set down the heat sink.
Page 199
[Figure callouts: Heat sink release lever, Lock tab, Retainer bracket]
6. Open the microprocessor socket release levers and retainer:
[Figure callouts: Microprocessor release lever, Microprocessor release lever]
a. Identify which release lever is labeled as the first release lever to open and open it.
b.
Page 200
Twist the handle on the microprocessor tool counterclockwise to insert the microprocessor into the socket. The microprocessor is keyed to ensure that the microprocessor is installed correctly. The microprocessor rests flush on the socket only if properly installed.
Page 201
Attention:
- Do not press the microprocessor into the socket.
- Make sure that the microprocessor is oriented and aligned correctly in the socket before you try to close the microprocessor retainer.
- Do not touch the thermal material on the bottom of the heat sink or on top of the microprocessor.
Page 202
Removing and replacing the thermal grease IBM authorized service providers must replace the thermal grease when the heat sink has been removed from the top of a microprocessor in the file module and the
Page 203
heat sink is going to be reused or when debris is found in the grease. The following procedure is for a field replaceable unit (FRU). FRUs must be installed only by trained service technicians. About this task The thermal grease must be replaced whenever the heat sink has been removed from the top of the microprocessor and is going to be reused or when debris is found in the grease.
Page 204
Removing a heat-sink retention module IBM authorized service providers can remove and replace a heat-sink retention module in the file module. The following procedure is for a field replaceable unit (FRU). FRUs must be installed only by trained service technicians.
Page 205
Removing the system board IBM authorized service providers can remove and replace the system board in the file module. The following procedure is for a field replaceable unit (FRU). FRUs must be installed only by trained service technicians.
Page 206
(see “Removing a microprocessor and heat sink” on page 166).
12. Pull out and lift up the pin and the thumbscrews on each side of the system board.
Page 207
Installing the system board IBM authorized service providers can remove and replace the system board in the file module. The following procedure is for a field replaceable unit (FRU). FRUs must be installed only by trained service technicians.
Page 208
1. Align the system board at an angle, as shown in the illustration; then, rotate and lower it flat and slide it back toward the rear of the file module. Make sure that the rear connectors extend through the rear of the chassis.
Page 209
MT-M SN label on the front of the file module. About this task The ASU package is part of the Storwize V7000 Unified code. ASU is available to authorized service personnel from the command-line interface (CLI) on the file module.
IBM Advanced Settings Utility version 3.62.71B
Licensed Materials - Property of IBM
(C) Copyright IBM Corp. 2007-2010 All Rights Reserved
Try to connect to the primary node to get nodes number.
Connected via IPMI device driver (KCS interface)
Connected to primary node....
Table 55. Default logical devices and physical port locations for a 2073-720 file module
Logical Ethernet device name   Device description                             Physical location information
mgmtsl0_0                      Internal connection between the file modules   Port 1 - Built-In xSeries Ethernet Port
mgmtsl0_1                      Internal connection between the file modules   Port 2 - Built-In xSeries Ethernet Port
ethXsl0_0                      1-Gbps Public Network                          Port 3 - Built-In xSeries Ethernet Port...
Note: If you run the startmgtsrv command from the node that is becoming active, you first need to run the setcluster command to set the cluster...
Page 213
If you see the following error message when running the command, wait until the initialization has completed before running setcluster again:
IBM SONAS management service is starting up
EFSSG0654I The Management Service is starting up.
After you run the startmgtsrv command, the system displays information that is similar to the following example:
[yourlogon@yourmachine.mgmt002st001 ~]# startmgtsrv...
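The "wait until the initialization has completed" advice can be expressed as a simple retry loop. The sketch below is generic and makes no assumption about how the command is run: the caller supplies a hypothetical `run` callable that returns the command output as text. The attempt count and delay are arbitrary illustrative defaults, not values from this guide.

```python
import time
from typing import Callable

# Text taken from the starting-up message shown above.
STARTING_UP = "The Management Service is starting up"


def run_when_ready(run: Callable[[], str],
                   attempts: int = 10,
                   delay_s: float = 30.0,
                   sleep: Callable[[float], None] = time.sleep) -> str:
    """Call run() until its output no longer contains the starting-up message."""
    for attempt in range(attempts):
        output = run()
        if STARTING_UP not in output:
            return output
        # Wait before the next try, but not after the final attempt.
        if attempt < attempts - 1:
            sleep(delay_s)
    raise TimeoutError("management service still starting after %d attempts" % attempts)
```

Passing `sleep` as a parameter makes it possible to exercise the loop without real delays, for example with `sleep=lambda seconds: None`.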
Page 214
9. Using the GUI event log, follow the troubleshooting documentation against the file module with the failed management node role to isolate the software or hardware problem that might have caused this issue.
(CTDB) on each file module. About this task CTDB checks the health status of the Storwize V7000 Unified file modules, scanning elements such as storage access, General Parallel File System (GPFS), networking, Common Internet File System (CIFS) shares, and Network File System (NFS) exports.
“Checking the GPFS file system mount on each file module” on page 189.
- Refer to the “Troubleshooting the System x3650 server” topic in the IBM Storwize V7000 Unified Information Center to determine whether any additional hardware problems might be causing the “unhealthy” CTDB status.
System (GPFS) file system mounts on IBM Storwize V7000 Unified file modules. About this task A GPFS file system that is not mounted on a Storwize V7000 Unified file module can cause the clustered trivial database (CTDB) status to be 'UNHEALTHY'. The...
To identify and resolve problems in file system mounts, perform this procedure: 1. To identify all the currently created file systems on the Storwize V7000 Unified system, log in as the admin user, then enter the lsfs -r command from the...
If file systems remain unmounted, contact IBM support. Resolving stale NFS file systems You can resolve problems with stale NFS file systems on Storwize V7000 Unified file modules. A file module might have the file system mounted, but the file system remains inaccessible due to a stale NFS file handle.
Refer to these topics in the IBM Storwize V7000 Unified Information Center “Planning for user authentication”, “Verifying the authentication configuration”, “Establishing user and group mapping for client access”, and “chkauth”. If you cannot resolve the issue, contact the authentication server administrator to validate or reestablish your account.
V7000 Unified configurations are correct
About this task
If you cannot access an export and the server and Storwize V7000 Unified configurations are correct, the cause might be one of the following:
- If Storwize V7000 Unified authentication is configured against an LDAP server, the user entries are case-sensitive when you access exports.
Use the cfgidmap command to import the ID map XML file. The XML file must be in the /ftdc/files folder.
5. After the import operation completes successfully, try to access the data on the subordinate Storwize V7000 Unified system.
DNS server for Storwize V7000 Unified. Ideally, these IP addresses should be the same as the addresses that are configured on the Storwize V7000 Unified cluster itself. To check this, issue the lsnw CLI command.
3. Issue the chkfs file_system_name -v | tee /ftdc/chkfs_fs_name.log1 command to capture the output to a file. Review the output file for errors and save it for IBM support to investigate any problems. If the file contains a TSM ERROR message, perform the following steps: a.
Issue the chkfs file_system_name command again. Review the new output file for errors and save it for IBM support to investigate any problems. It is expected that the file contains Lost blocks were found messages. It is normal to have some missing file system blocks. If the only errors that are reported are missing blocks, no further repair is needed.
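The review of the saved chkfs output can be partly scripted. The following Python sketch only searches the captured text for the two message strings that this procedure names, TSM ERROR and Lost blocks were found. The function name and the returned labels are illustrative assumptions and are not part of the product; the sketch does not interpret any other chkfs output, so keep the original file for IBM support.

```python
def classify_chkfs_log(text):
    """Return a set of labels for known messages found in saved chkfs output.

    Only the two message strings quoted in this procedure are recognized.
    The label names ("tsm_error", "lost_blocks") are hypothetical.
    """
    findings = set()
    for line in text.splitlines():
        if "TSM ERROR" in line:
            findings.add("tsm_error")
        if "Lost blocks were found" in line:
            findings.add("lost_blocks")
    return findings


# Illustrative input only; real chkfs output contains many more lines.
sample = "checking file system\nLost blocks were found\ncheck finished\n"
print(sorted(classify_chkfs_log(sample)))
```

A result that contains only the lost-blocks label matches the case that this procedure describes as needing no further repair. Any other error text still requires manual review.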
The issue should be resolved after the reboot and within five minutes after the file module displays Host State OK again. Error for “The mount state of the file system /ibm/ Filesystem_Name changed to error level” About this task If the command lshealth -i gpfs_fs -r returns “The mount state of the file...
Page 227
either different or missing, when providing the VLAN ID to a management network bond, as well as to a shared data network bond. Unless you have intentionally configured your switching network to support this unique case, where the VLAN ID for the management network and data network are not the same and you are confident about how this will be routed from the clustered system to your switch, you might incur unpredictable routing path behavior and even network connectivity loss.
If there is no storage space available, contact IBM support. Analyzing GPFS logs Use this procedure when reviewing GPFS log entries. About this task Note: Contact IBM support if you want to analyze GPFS log entries.
Kerberos tickets, for example, can expire and then no one can access the cluster. For the Storwize V7000 Unified file module, the ntpq -p command shows you which server is used for synchronization and any peers and a set of data about their status.
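As a rough aid for reading ntpq -p output, the following Python sketch picks out the peer that is currently selected for synchronization. It assumes the standard ntpq peer listing, in which the selected peer is flagged with an asterisk in the first column. The sample text and host names are invented for illustration and are not output from a Storwize V7000 Unified file module.

```python
def selected_ntp_peer(ntpq_output):
    """Return the remote name flagged with '*' in ntpq -p output, or None.

    Assumes the standard ntpq peer listing, where the first character of
    each peer line is a tally code and '*' marks the current sync source.
    """
    for line in ntpq_output.splitlines():
        if line.startswith("*"):
            fields = line[1:].split()
            if fields:
                return fields[0]
    return None


# Invented sample in the usual ntpq -p layout (illustration only).
SAMPLE = """\
     remote           refid      st t when poll reach   delay   offset  jitter
==============================================================================
*ntp1.example.com .GPS.            1 u   33   64  377    0.512   -0.104   0.021
+ntp2.example.com 192.0.2.10       2 u   40   64  377    1.204    0.310   0.055
"""
print(selected_ntp_peer(SAMPLE))
```

If the function returns None, no peer is currently selected, which is consistent with a file module that is not synchronized to any time server.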
Page 230
You cannot manage a system by using the 10 Gbps Ethernet ports. You can perform almost all of the configuration, troubleshooting, recovery, and maintenance of the storage system from within the Storwize V7000 Unified management GUI or the CLI commands that are running on the Storwize V7000 file modules.
Page 232
Use the service assistant in the following situations: v When you cannot access the system from the management GUI and you cannot access the Storwize V7000 Unified to run the recommended actions v When the recommended action directs you to use the service assistant.
For a full description of the storage system commands and how to start an SSH command-line session, see the “Command-line interface” topic in the “Reference” section of the Storwize V7000 Unified Information Center. When to use the storage system CLI The storage system CLI is intended for use by advanced users who are confident at using a command-line interface.
Accessing the storage system CLI Follow the steps that are described in the “Command-line interface” topic in the “Reference” section of the Storwize V7000 Unified Information Center to initialize and use a CLI session. Service command-line interface Use the service command-line interface (CLI) to manage a node canister in a control enclosure by using the task commands and information commands.
Page 235
v When you cannot connect to a node canister in a control enclosure using the service assistant and you want to see the status of the node. v When you do not know, or cannot use, the service IP address for the node canister in the control enclosure and must set the address.
Page 236
USB flash drive and note the IP address after the -gw switch. Make sure this IP address is the gateway for this subnet. If an IP address is needed then check this with your 1 Gbps Ethernet administrator.
Page 237
You should be able to access the management GUI or CLI from a computer that is on a different subnet or Ethernet switch than the Storwize V7000 Unified system. The link to the management GUI from the InitTool.exe panel should now work.
Page 238
255.255.255.0. If the command is run on the lower canister, the default value is 192.168.70.122 subnet mask: 255.255.255.0. If the node canister is active in a system, the superuser password for the system is reset; otherwise, the superuser password is reset on the node canister.
Page 239
Use this command when you are unable to log on to the system because you have forgotten the superuser password and want to reset it. Attention: Run this command only when instructed by IBM support. Running this command directly on a Storwize V7000 can affect your I/O operations on the file modules.
Page 240
Install software command: Use this command to install a specific update package on the node canister. Attention: Run this command only when instructed by IBM support. Running this command directly on a Storwize V7000 can affect your I/O operations on the file modules.
Page 241
Note: The reference to cluster is not the same as the file system cluster on the Storwize V7000 file modules. Attention: Run this command only when instructed by IBM support. Running this command directly on a Storwize V7000 can affect your I/O operations on the file modules.
-mask The IPv4 subnet for Ethernet port 1 on the system. -consolip The management IPv4 address of the Storwize V7000 Unified system. Description This command is only supported in the satask.txt file on a USB flash drive. It calls the svctask chsystemip command if the USB flash drive is inserted in the configuration node canister; otherwise, it blinks the amber identify LED of the node canister that is the configuration node.
Query status command: Use this command to determine the current service state of the node canister. Syntax ►► sainfo getstatus ►◄ Parameters None. Description This command writes the output from each node canister to the USB flash drive. This command calls the sainfo lsservicenodes command, the sainfo lsservicestatus command, and the sainfo lsservicerecommendation command.
The elapsed time is added to the cumulative counter. Indicates the worst read response time in microseconds for each volume since the last time statistics were collected. This value is reset to zero after each statistics collection sample.
Indicates the total number of fixed or unfixed overlapping writes. When all nodes in all clusters are running Storwize V7000 Unified version 4.3.1, this records the total number of write I/O requests received by the Global Mirror feature on the primary that have overlapped. When any nodes in either cluster are running Storwize V7000 Unified versions earlier than 4.3.1, this...
Table 61 describes the node information that is reported for each node. Table 61. Statistic collection for nodes Statistic name Description cluster_id Indicates the name of the cluster. cluster Indicates the name of the cluster.
Table 61. Statistic collection for nodes (continued) busy - Indicates the total CPU average core busy milliseconds since the node was reset. This statistic reports the amount of the time the processor has spent polling while waiting for work versus actually doing work. This statistic accumulates from zero.
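Because a statistic such as busy accumulates from zero after a node reset, a single sample is hard to interpret on its own. The following Python sketch shows one way to derive the change between two consecutive samples of a cumulative counter. The function names are illustrative, and two points are assumptions rather than documented behavior: that a value lower than the previous sample means the counter restarted at zero, and that dividing busy milliseconds by the sample interval gives a busy fraction.

```python
def counter_delta(previous, current):
    """Increase of a cumulative counter between two samples.

    Assumption: if the current value is lower than the previous one, the
    counter restarted from zero (for example, after a node reset), so the
    current value is taken as the increase since the restart.
    """
    if current < previous:
        return current
    return current - previous


def busy_fraction(prev_busy_ms, curr_busy_ms, interval_ms):
    """Fraction of the sample interval that was spent busy.

    Assumption: the counter is an average-per-core figure in milliseconds,
    so dividing the increase by the interval length gives a value in 0..1.
    """
    if interval_ms <= 0:
        raise ValueError("interval_ms must be positive")
    return counter_delta(prev_busy_ms, curr_busy_ms) / interval_ms


# Example with made-up values: 600 ms busy during a 1200 ms interval.
print(busy_fraction(1000, 1600, 1200))
```

Statistics that are reset to zero after each collection sample, such as the worst read response time, do not need this step; each sample already describes only its own interval.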
Page 248
Average non-cumulative fullness Max non-cumulative fullness Min non-cumulative Destage Target dtav IOs capped Average 9999, non-cumulative Destage Target dtmx IOs, non-cumulative Destage Target dtmn IOs, non-cumulative
Page 249
Table 62. Cache statistics collection for volumes and volume copies (continued) Statistics for Statistics for Statistics for Statistics for Statistics for volume volume the Node Cache volume volume cache copy cache Overall statistics for Units and Statistic Acronym cache copy cache partition partition Cache...
Page 250
Owner Remote Credit Queue Time: Average µs, non-cumulative
Non-Owner Remote Credit Queue Time: Average µs, non-cumulative
Admin Remote Credit Queue Time: Average µs, non-cumulative
Cdcb Queue Time: Average µs, non-cumulative
Table 62. Cache statistics collection for volumes and volume copies (continued) Statistics for Statistics for Statistics for Statistics for Statistics for volume volume the Node Cache volume volume cache copy cache Overall statistics for Units and Statistic Acronym cache copy cache partition partition Cache...
Indicates the bytes retransmitted to other nodes in other clusters by the IP partnership driver. iprt Indicates the average round-trip time in microseconds for the IP partnership link since the last statistics collection period.
Page 253
Table 64. XML statistics for an IP Partnership port (continued) Statistic name Description iprx Indicates the bytes received from other nodes in other clusters by the IP partnership driver. ipsz Indicates the average size (in bytes) of data that are being transmitted by the IP partnership driver since the last statistics collection period.
Event reporting process The following methods are used to notify you and the IBM Support Center of a new event: v If you enabled Simple Network Management Protocol (SNMP), an SNMP trap is sent to an SNMP manager that is configured by the customer.
Viewing the event log You can view the event log by using the management GUI or the command-line interface (CLI). About this task You can view the event log by using the Monitoring > Events options in the management GUI. The event log contains many entries. You can, however, select only the type of information that you need.
Event notifications Storwize V7000 Unified can use Simple Network Management Protocol (SNMP) traps, syslog messages, emails, and Call Home to notify you and IBM® Remote Technical Support when significant events are detected. Any combination of these notification methods can be used simultaneously. Notifications are normally sent immediately after an event is raised.
Table 66. Notification levels Notification level Description Error Error notification is sent to indicate a problem that must be corrected as soon as possible. This notification indicates a serious problem with the system. For example, the event that is being reported could indicate a loss of redundancy in the system, and it is possible that another failure could result in loss of access to data.
You can view information about collecting log files or you can view examples of a configuration dump, error log, or featurization log. To do this, click Reference in the left pane of the IBM online information, and then expand the Logs and traces section.
Page 259
If power to a node canister fails, the node canister uses battery power to write cache and state data to its boot drive. Note: Storwize V7000 Gen2 expansion canisters do not cache volume data or store state information in volatile memory. Therefore, expansion canisters do not require battery power.
The batteries within the control enclosure provide the power to write the cache and state data to a local drive. Note: Storwize V7000 Unified expansion canisters do not cache volume data or store state information in volatile memory. Therefore, expansion canisters do not require battery power.
Page 261
Design parameters Consider the following important design parameters: v The design life of the battery in the Storwize V7000 Unified is five years of service after one year on the shelf. v No periodic learning mode or reconditioning cycle occurs in the battery of this product.
Page 262
2 critical saves or 10 brown outs. Preventing this maintenance cycle from occurring increases the risk that the system accumulates a sufficient number of power outages to cause the remaining battery to be discounted when calculating whether...
Understanding the medium errors and bad blocks A storage system returns a medium error response to a host when it is unable to successfully read a block. The Storwize V7000 Unified response to a host read follows this behavior. The volume virtualization that is provided extends the time when a medium error is returned to a host.
The management GUI provides extensive facilities to help you troubleshoot and correct problems on your system. You can connect to and manage a Storwize V7000 Unified system as soon as you have completed the USB initialization.
Page not found or similar error, this information might help you resolve the issue. The connection information differs, depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Chapter 5. Control enclosure...
Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 71. Storwize V7000 Unified Gen2 model numbers...
Page 267
v Ping the management address to see if the Ethernet network permits the connection. If the ping fails, check the Ethernet network configuration to see if there is a routing or a firewall issue. Ensure that the Ethernet network configuration is compatible with the gateway and subnet or prefix settings. Ensure that you did not use the Ethernet address of another device as the management address.
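The compatibility of an address with its gateway and subnet settings can be checked before you change any configuration. The following Python sketch uses the standard ipaddress module to test whether a gateway lies on the same subnet as an address. The function name is illustrative, and the gateway values in the example are hypothetical; only the address 192.168.70.122 and the mask 255.255.255.0 are taken from the default values quoted elsewhere in this guide.

```python
import ipaddress


def gateway_on_subnet(address, mask_or_prefix, gateway):
    """Return True if the gateway is inside the subnet of the address.

    mask_or_prefix can be a dotted IPv4 mask such as "255.255.255.0"
    or a prefix length such as "24" (IPv4) or "64" (IPv6).
    """
    network = ipaddress.ip_network(f"{address}/{mask_or_prefix}", strict=False)
    return ipaddress.ip_address(gateway) in network


# Hypothetical gateways checked against a documented default address.
print(gateway_on_subnet("192.168.70.122", "255.255.255.0", "192.168.70.1"))
print(gateway_on_subnet("192.168.70.122", "255.255.255.0", "192.168.71.1"))
```

A False result suggests that the gateway or the subnet mask was entered incorrectly. It does not detect routing or firewall problems, which still require the ping test that is described in this topic.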
Use this information if your attempt to create a clustered system has failed. This information varies depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 72. Storwize V7000 Unified Gen1 model numbers...
Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 73. Storwize V7000 Unified Gen2 model numbers...
You can use several methods to determine the service address of a node canister. The methods of determining the service address differ, depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 74. Storwize V7000 Unified Gen1 model numbers Machine...
Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 75. Storwize V7000 Unified Gen2 model numbers...
Some types of errors can prevent nodes from communicating with each other; in that event, it might be necessary to point your browser directly at the service assistant of the node that requires administering, rather than change the current node in the service assistant.
If you are unable to find the service address of the node using the management GUI or service assistant, you can also use a USB flash drive to find it. For more information, see “Procedure: Getting node canister and system information using a USB flash drive”...
Use this procedure if you receive errors to determine if your SAS cabling is valid. The procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 78. Storwize V7000 Unified Gen1 model numbers Machine type/model...
Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 79. Storwize V7000 Unified Gen2 model numbers...
Ensure that each SAS cable is fully inserted. See the topic about installing SAS cables in the IBM Storwize V7000 Gen2 Quick Installation Guide. Problem: Storwize V7000 Gen1 SAS cabling not valid This topic provides information to be aware of if you receive errors that indicate the SAS cabling is not valid.
Problem: Control enclosure not detected If a control enclosure is not detected by the system, this procedure can help you resolve the problem. When installing a new control enclosure, use the Add Enclosures wizard in the management GUI. To access this wizard, select Monitoring > System. On the Systems page, select Actions >...
The password procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 80. Storwize V7000 Unified Gen1 model numbers Machine...
Procedure: Resetting the superuser password for Storwize V7000 Gen2 The primary method for resetting the superuser password is to change the password as you log in, with the link on the log-in page. You can also access the service assistant from the technician port to change the password. If the password reset function is enabled, the log-in page displays a link for resetting the password.
About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 82. Storwize V7000 Unified Gen1 model numbers Machine type/model Description 2076-112 Storwize V7000 Unified control enclosure for up to 12 3.5-inch (8.89 cm) drives 2076-124 Storwize V7000 Unified control enclosure for up to 24 2.5-inch (6.35...
The model type and serial number of the enclosure are found at the bottom of the left bezel. Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 84. Storwize V7000 Unified Gen2 model numbers...
The Home page shows a table of node errors that exist on the node canister and a table of node details for the current node. The node errors are shown in priority order.
Use this procedure to determine the system status using the LED indicators on the system. The procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 85. Storwize V7000 Unified Gen1 model numbers Machine type/model...
Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 86. Storwize V7000 Unified Gen2 model numbers...
Table 87. LED state descriptions used in the Storwize V7000 2076-524 enclosure (continued) State description Detail Flashing The LED turns on and off at a frequency of 2 Hz: It is on for 250 ms, then off for 250 ms, then repeats. Flashing fast The LED turns on and off at a frequency of 4 Hz: It is on for 125 ms, then off for 125 ms, then repeats.
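The on and off times in Table 87 follow directly from the flash frequency: one full cycle lasts 1000 ms divided by the frequency, and the LED is on for half of that cycle. The following Python sketch reproduces this arithmetic. The function name is illustrative, and the equal split between on and off time is assumed from the two states that are described in the table.

```python
def flash_timing_ms(frequency_hz):
    """Return (on_ms, off_ms) for an LED that flashes with equal on/off time.

    One cycle lasts 1000 / frequency_hz milliseconds; the LED is assumed
    to be on for half of the cycle and off for the other half.
    """
    if frequency_hz <= 0:
        raise ValueError("frequency_hz must be positive")
    period_ms = 1000.0 / frequency_hz
    return period_ms / 2, period_ms / 2


print(flash_timing_ms(2))  # Flashing: on and off times at 2 Hz
print(flash_timing_ms(4))  # Flashing fast: on and off times at 4 Hz
```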
Restart the node canister, as described in “Procedure: Reseating a Storwize V7000 Gen2 node canister” on page 278. The node canister is doing a self test during start-up. Flashing fast Wait for the canister to complete its start-up sequence.
Page 287
Otherwise, there might be a fault in the node canister, enclosure midplane, or power supply units. The Storwize V7000 Unified software is running, and the node canister is participating in the system. The canister must not be removed.
Page 288
GUI or service assistant to turn off the Identify function, then check the node canister status LEDs, again. The Storwize V7000 Unified software is not running. The BIOS might have detected a fault. It is safe to remove or reseat the canister.
Power Fault (amber) status (green) (green) The Storwize V7000 Unified software is running but there might be an error alert in the event log, such as error code 550. The canister must not be removed. If possible, go to the management GUI and run the fix procedure for the error alerts listed there.
Page 290
263 shows the LEDs on the power supply unit for the 2076-112 or 2076-124. The LEDs on the power supply units for the 2076-312 and 2076-324 are similar, but they are not shown here.
Figure 73. LEDs on the power supply units of the control enclosure Table 91. Power-supply unit LEDs Power supply failure failure failure Status Action Communication Replace the power failure between supply unit. If failure is the power still present, replace the supply unit and enclosure chassis.
There is no power to the canister. Try reseating the canister. Go to “Procedure: Reseating a node canister” on page 278. If the state persists, follow the hardware replacement procedures for the parts in the following order: node canister, enclosure chassis.
Table 92. Power LEDs (continued) Power LED status Description Slow Power is available, but the canister is in standby mode. Try to start the node flashing (1 canister by reseating it. Go to “Procedure: Reseating a node canister” on page 278.
The battery is either charging or a maintenance discharge is in process. Nonrecoverable battery fault. Replace the battery. If replacing the battery does not fix the issue, replace the power supply unit.
Use this procedure to find the status of Ethernet connections when you cannot connect. This procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 95. Storwize V7000 Unified Gen1 model numbers Machine type/model...
Page 296
1. Verify that each end of the cable is securely connected. 2. Verify that the port on the Ethernet switch or hub is configured correctly. 3. Connect the cable to a different port on your Ethernet network.
About this task Ensure that the Storwize V7000 Unified machine code is active on the node before you begin this procedure. To determine if the machine code is active, see “Procedure: Understanding the Storwize V7000 Gen2 system status from the LEDs”...
2. Use the service assistant node action to hold the node in service state. 3. Use the Manage System option to remove the system data from the node. 4. Repeat steps 1 through 3 on the second node canister in the enclosure.
5. On one node, open the service assistant Configure Enclosure and select the Reset System ID option. This action causes the system to reset. Procedure: Fixing node errors To fix node errors that are detected by node canisters in your system, use this procedure.
For other command options, see “Create system command” on page 213. 4. Save the file to a USB flash drive. 5. Plug the USB flash drive into a USB port on a control canister.
Use this procedure to initialize a clustered system using the service assistant. This procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 97. Storwize V7000 Unified Gen1 model numbers Machine...
Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 98. Storwize V7000 Unified Gen2 model numbers Machine type/model Description 2076-524 Storwize V7000 Unified control enclosure, with up to 24 2.5-inch (6.35 cm) drives 2076-12F Storwize V7000 Unified expansion enclosure for up to 12 3.5-inch (8.89...
Procedure: Initializing a Storwize V7000 Gen1 system using the service assistant To initialize a Storwize V7000 Gen1 system using the service assistant rather than the USB flash drive, use this procedure. About this task Note: The service assistant gives you the option to create a clustered system only if the node state is candidate.
1. Connect one end of an Ethernet cable to Ethernet port 1 of the node canister. Note: A cross-over Ethernet cable is not required. 2. Connect the other end of the Ethernet cable directly to the Ethernet port on a personal computer that has a web browser installed.
Use this procedure to reseat a node canister. The procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 99. Storwize V7000 Unified Gen1 model numbers Machine...
Page 306
6. Grasp the canister and pull it out 2 or 3 inches. 7. Push the canister back into the slot until the handle starts to move. 8. Finish inserting the canister by closing the handle until the locking catch clicks into place.
9. Verify that the cables were not displaced. 10. Verify that the LEDs are on. Results Procedure: Removing a Storwize V7000 Gen2 node canister Follow this procedure to remove a node canister. About this task Attention: Before a node canister can be removed it must be powered off or in service state;...
7. As you pay attention to the number scale, slide the canister out of the slot. Procedure: Powering off your system You must power off your Storwize V7000 Unified system in order to service it, or to permit other maintenance actions in your data center. To turn off the Storwize V7000 Unified system, see “Turning off the system”...
Fault is off. If a canister is not ready, refer to the “Procedure: Understanding the system status using the LEDs” topic in the troubleshooting section of the Storwize V7000 Unified information center. Procedure: Powering off a Storwize V7000 Gen2 control...
While a node canister is powered off, some volumes can become inaccessible. Refer to “Procedure: Understanding Storwize V7000 Gen2 volume dependencies” on page 286 to determine whether it is appropriate to continue this procedure.
The status LEDs on the canister indicate that the node is powered off. Procedure: Collecting information for support IBM support might ask you to collect trace files and dump files from your system to help them resolve a problem. Typically, you perform this task from the Storwize V7000 Unified management GUI.
About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 101. Storwize V7000 Unified Gen1 model numbers Machine type/model Description 2076-112 Storwize V7000 Unified control enclosure for up to 12 3.5-inch (8.89 cm) drives 2076-124 Storwize V7000 Unified control enclosure for up to 24 2.5-inch (6.35...
Verify that Storwize V7000 Unified and host get an fcid on FCF. If not, check the VLAN configuration. b. Verify that Storwize V7000 Unified and host port are part of a zone and that zone is currently in force.
If a control enclosure only has one node canister online, access to a volume depends on the online node canister if the volume is stored partially or wholly on an array that uses drives in the control enclosure or its expansion enclosures.
SFP transceivers, canisters, power supply units, battery assemblies, and enclosure chassis. The parts list varies, depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 103. Storwize V7000 Unified Gen1 model numbers Machine...
Table 105. Control enclosure replaceable units (continued)
Part number | Part name | CRU or FRU | Notes
64P8473 | 4-port 8 Gbps Fibre Channel host interface adapter | | No SFPs
00AR316 | 4-port 10 Gbps Ethernet host interface adapter | | No SFPs
00WY984 | 4-port 16 Gbps Fibre Channel host interface adapter | | No SFPs
...
LFF HDD - 6 TB NL 12 Gbps SAS (Notes: Requires system software version 7.4 or later.) Table 108. Cable replaceable units Part number Part name CRU or FRU Notes Optical 39M5699 1 m FC cable
Table 108. Cable replaceable units (continued) Part number Part name CRU or FRU Notes 39M5700 5 m FC cable 39M5701 25 m FC cable 41V2120 10 m OM3 FC cable 00AR272 0.6 m 12 Gbps SAS For connecting Cable (mini SAS HD expansion enclosures.
2.8 m power cord (South Africa) | 39M5144 | Customer replaced
2.8 m power cord (Switzerland) | 39M5158 | Customer replaced
2.8 m power cord (Chile) | 39M5165 | Customer replaced
2.8 m power cord (Israel) | 39M5172 | Customer replaced
Page 321
Table 109. Replaceable units (continued) Applicable FRU or customer Part Part number models replaced 2.8 m power cord (Group 1 39M5081 Customer including the United States) replaced 2.8 m power cord (Argentina) 39M5068 Customer replaced 2.8 m power cord (China) 39M5206 Customer replaced...
I/O operations, go to the management GUI and follow the fix procedures. Initiating the replacement actions without the assistance of the fix procedures can result in loss of data or loss of access to data.
Before you remove and replace parts, you must be aware of all safety issues. Before you begin First, read the safety precautions in the IBM Systems Safety Notices. These guidelines help you safely work with the Storwize V7000 Unified. Replacing a node canister Remove and replace a node canister.
Page 324
Do not remove a node canister unless directed to do so by a service procedure. To replace the node canister, perform the following steps: Procedure 1. Read the safety information to which “Preparing to remove and replace parts” on page 295 refers.
2. Confirm that you know which canister to replace. Go to “Procedure: Identifying which Storwize V7000 Gen1 enclosure or canister to service” on page 254. 3. Record which data cables are plugged into the specific ports of the node canister. The cables must be inserted back into the same ports after the replacement is complete;...
9. Finish inserting the node canister by closing its release lever so that the orange catch engages the enclosure. If the enclosure is powered and the canister is correctly installed, the canister starts automatically. Remove the canister and repeat the procedure from step 5 on page 298, if the canister is not correctly installed.
I/O operations, go to the management GUI and follow the fix procedures. Initiating the replacement actions without the assistance of the fix procedures can result in loss of data or loss of access to data.
When you replace hardware components that are located in the back of the system, be careful not to disturb or remove any cables that you are not instructed to remove. Be aware of the following canister LED states: v If the power LED is on, do not remove an expansion canister unless directed to do so by a service procedure.
When you replace hardware components that are located in the back of the system, be careful not to disturb or remove any cables that you are not instructed to remove.
Important: For correct operation, use the correct SFP transceivers with each adapter card. The topic “Storwize V7000 2076-524 Gen2 replaceable units” identifies the suitable IBM parts. v Use only 8 Gbps SFP transceivers in the 8 Gbps Fibre Channel adapter cards.
Important: Always check that the replacement SFP transceiver matches the SFP transceiver that you remove.
4. Push the new SFP transceiver into the aperture and ensure that it is securely pushed home. The SFP transceiver usually locks into place without having to swing the release handle until it locks flush with the SFP transceiver. Figure 85 illustrates an SFP transceiver and its release handle.
Figure 85. SFP transceiver
5. Reconnect the optical cable.
6. Confirm that the error is now fixed. Either mark the error as fixed or restart the node depending on the failure indication that you originally noted.
You can replace either of the two 764 watt hot-swap redundant power supplies in the control enclosure. These redundant power supplies operate in parallel, one continuing to power the canister if the other fails.
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Power supply unit 1 is top side up, and power supply unit 2 is inverted.
a. Depress the black locking catch from the side with the colored sticker as shown in Figure 87 on page 309.
Figure 87. Directions for lifting the handle on the power supply unit
b. Grip the handle to pull the power supply out of the enclosure as shown in Figure 88.
Figure 88. Using the handle to remove a power supply unit
6.
4. On the left side of the power supply, press the orange release tab to the right just enough to release the handle (no more than 6 mm [0.25 in.]) as you rotate the handle downward.
5. Using the handle, gently slide the power supply out of the enclosure, as shown in Figure 89.
Figure 89. Removing the power supply unit from the left side of the expansion enclosure
6. Hold the new power supply so that the handle is fully extended. Slide the power supply into the enclosure until it stops.
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Attention: A powered-on enclosure must not have a power supply removed for more than five minutes because the cooling does not function correctly with an empty slot. Ensure that you have read and understood all these instructions and have the replacement available, and unpacked, before you remove the existing power supply.
6. Insert the replacement power supply unit into the enclosure with the handle pointing towards the center of the enclosure. Insert the unit in the same orientation as the one that you removed.
7. Push the power supply unit back into the enclosure until the handle starts to move.
8. Finish inserting the power supply unit in the enclosure by closing the handle until the locking catch clicks into place.
9. Reattach the power cable and cable retention bracket.
10.
13. When the canister is back online, check the event log for any new events that might indicate a problem with the reassembly.
Replacing a battery in a power supply unit
Remove and replace the battery in a control enclosure power-supply unit.
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
2. Follow the removal steps in the procedure for replacing a power-supply unit. Go to “Replacing a Storwize V7000 Gen1 power supply unit for a control enclosure” on page 306.
3. Remove the battery, as shown in Figure 93 on page 319.
Figure 93. Removing the battery from the control enclosure power-supply unit
a. Press the catch to release the handle (1).
b. Lift the handle on the battery (2).
c. Lift the battery out of the power supply unit (3).
4.
About this task
The drive status must not be spare or member. The status is shown in Pools > Internal Storage in the management GUI.
Attention:
• Do not replace a drive unless the drive fault LED is on or you are instructed to do so by a fix procedure.
• If the drive is a member of an array, go to the management GUI and follow the fix procedures.
The process can take a few minutes.
Replacing a 3.5-inch drive assembly or blank carrier
This topic describes how to replace a 3.5-inch drive assembly or blank carrier.
About this task Attention: If your drive is configured for use, go to the management GUI and follow the fix procedures. Initiating the replacement actions without the assistance of the fix procedures results in loss of data or loss of access to data. Attention: Do not leave a drive slot empty.
Do not leave a drive slot empty for extended periods. Do not remove a drive assembly or a blank filler without having a replacement drive or a blank filler with which to replace it.
Procedure
To prepare to replace a drive assembly, complete the following steps.
1. Read the safety information in “Preparing to remove and replace parts” on page 295.
2. Locate the slot that contains the drive assembly that you want to replace.
a. Refer to “Procedure: Identifying which Storwize V7000 Gen2 enclosure or canister to service”...
The process can take a few minutes.
Replacing a 2.5-inch drive assembly or blank carrier
This topic describes how to replace a 2.5-inch drive assembly or blank carrier.
About this task Attention: If your drive is configured for use, go to the management GUI and follow the fix procedures. Initiating the replacement actions without the assistance of the fix procedures results in loss of data or loss of access to data. Attention: Do not leave a drive slot empty.
2. Grasp the end cap by the blue touch point and pull it until the bottom edge of the end cap is clear of the bottom tab on the chassis flange.
3. Lift the end cap off the chassis flange.
4. Fit the slot on the top of the new end cap over the tab on the top of the chassis flange.
5. Rotate the end cap down until it snaps into place. Ensure that the inside surface of the end cap is flush with the chassis.
3. The connector is released and slides out of the port.
4. Repeat steps 2 and 3 on the other end of the SAS cable.
5. To connect the replacement expansion enclosure attachment SAS cable, connect each end to the vacated ports.
Attention: When inserting a SAS connector into a SAS port, ensure that the orientation of the connector matches the orientation of the port before pushing the connector into the port.
• The cable connector and socket are keyed and it is important that you have proper alignment of the keys when the cable is inserted.
Remove and replace a control enclosure chassis. This procedure only applies to Storwize V7000 Gen1 control enclosure models.
About this task
Storwize V7000 Unified Gen1 refers to the enclosure models in the following table:
Table 110. Storwize V7000 Unified Gen1 model numbers
Machine...
Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table:
Table 111. Storwize V7000 Unified Gen2 model numbers
Machine type/model | Description
2076-524 | Storwize V7000 Unified control enclosure, with up to 24 2.5-inch (6.35 cm) drives
2076-12F | Storwize V7000 Unified expansion enclosure for up to 12 3.5-inch (8.89...
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Attention: Perform this procedure only if instructed to do so by a service action or the IBM support center. If you have a single control enclosure, this procedure requires that you shut down your system to replace the control enclosure. If you...
Using the left end cap that you removed preserves the model and serial number identification.
21. Reinstall the drives in the new enclosure. The drives must be inserted back into the same location from which they were removed on the old enclosure.
If you still do not have a full set of values, contact IBM support. After you modify the configuration, the node attempts to restart.
The procedures for replacing a control enclosure chassis differ from the procedures for replacing an expansion enclosure chassis. To replace an expansion enclosure chassis, see “Replacing an expansion enclosure chassis” on page 347.
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Attention: Perform this procedure only if instructed to do so by a service action or the IBM support center. If you have a single control enclosure, this procedure requires that you shut down your system to replace the control enclosure. If you...
Dependent volume names that start with IFS are file volumes that are used by the file modules to provide file systems. Turn off these file modules. See the procedure “Turning off the system”.
5. If the I/O group is still online, shut down the I/O group by using the control enclosure CLI.
If any of the node copy values are all zeroes, connect the service assistant to the other node canister and configure the enclosure there. If you still do not have a full set of values, contact IBM support.
Remove and replace an expansion enclosure chassis. This procedure only applies to Storwize V7000 Gen1 enclosure models.
About this task
Storwize V7000 Unified Gen1 refers to the enclosure models in the following table:
Table 112. Storwize V7000 Unified Gen1 model numbers
Machine...
Storwize V7000 Unified expansion enclosure for 3.5-inch drives
2076-224 | Storwize V7000 Unified expansion enclosure for 2.5-inch drives
Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table:
Table 113. Storwize V7000 Unified Gen2 model numbers...
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
13. Replace the end caps. Use the new right end cap and use the left end cap that you removed in step 8. Using the left end cap that you removed preserves the model and serial number identification.
14. Reinstall drives in the new enclosure. You must insert the drives back into the same location from which they were removed on the old enclosure.
15. Reinstall the canisters (and drives) in the enclosure.
16. Install the power supply units.
17.
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Attention: If your system is powered on and performing I/O operations, go to the management GUI and follow the fix procedures. Performing the replacement actions without the assistance of the fix procedures can result in loss of data or loss of access to data. Even though many of the parts are hot-swappable, these procedures are intended to be used only when your system is not up and running and performing I/O operations.
22. Go to the management GUI to use the fix procedure to change the machine type and model and serial number in the expansion enclosure.
Replacing a Storwize V7000 Gen2 enclosure midplane
A trained service provider must replace the midplane assembly of a Storwize V7000 Gen2 enclosure.
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Attention:
• The enclosure midplane must be replaced only by a trained service provider. Perform this procedure only if instructed to do so by a service action or the IBM support center.
• When you replace hardware components at the back of the system, be careful not to disturb or remove any cables that you are not instructed to remove.
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Attention: The control enclosure must be replaced only by a trained service provider. Complete this procedure only if instructed to do so by a service action or the IBM support center. If you have a single control enclosure, this procedure requires that you shut down your system to replace the control enclosure midplane assembly.
9. Remove the two power supplies from the enclosure. Refer to “Replacing a Storwize V7000 Gen2 power supply unit for a control enclosure” on page 305 for guidance.
10. Remove the node canisters from the enclosure. Label each canister to indicate the slot from which it was removed.
19. Remove the midplane assembly from the chassis by rotating up the midplane assembly to about 45°, then withdraw the midplane assembly from the front of the enclosure. Figure 107 on page 357 shows the midplane assembly at a 45° angle.
Figure 107. Angled midplane assembly
20. Unpack the replacement midplane assembly. Grasp the midplane assembly with two hands to hold the assembly at a 45° angle.
21. Insert the tabs on the midplane assembly into the tab holes in the enclosure and rotate down the front of the assembly.
If any of the node copy values are all zeros, connect the service assistant to the other node canister and configure the enclosure there. If you still do not have a full set of values, contact IBM support.
Before you begin
Three persons are required at step 11 on page 360.
About this task
Attention: To prevent data loss, you must shut down the system before you begin the procedure to replace an expansion enclosure midplane assembly. The expansion enclosure midplane assembly must be replaced only by a trained service provider.
361). Remove the three screws that are near the front and the screw that is near the middle. Label these screws to indicate the location from which they are removed and place them aside.
Figure 110. Removing the screws of an expansion enclosure assembly
13. Turn the enclosure top side up and place it on a flat surface.
14. Remove the three screws and one screw-pin on the right side that secure the midplane assembly to the enclosure (see Figure 110). Label the screws to indicate the location from which they are removed and place them aside.
Remove and replace the support rails. The procedure differs, depending on the generation of your control enclosure model.
About this task
Storwize V7000 Unified Gen1 refers to the enclosure models in the following table:
Table 114. Storwize V7000 Unified Gen1 model numbers
Machine...
Before you begin
Three persons are required at step 7.
About this task
Follow all safety precautions when completing this procedure.
Procedure
To replace the support rails, complete the following steps.
1. Identify the enclosure mounted on the rails being replaced. Follow the steps in “Procedure: Identifying which Storwize V7000 Gen2 enclosure or canister to service”...
10. At the front of the rack, hold onto the rail and open the front hinge bracket.
11. Compress the rail against its spring to shorten it, then remove it from inside the rack (Figure 112 on page 365).
Figure 112. Compressing rail for removal from rack
12. Repeat steps 9 on page 363 to 11 on page 364 on the right support rail.
13. Install the new support rails at the rack position that is recorded at step 8 on page 363 by following the instructions in Step 6.
9. At the rear of the rack, remove the securing M5 screw from the bottom hole of the rear bracket of the rail, then open the rear hinge bracket (Figure 113 on page 367).
Figure 113. Opening rear hinge bracket of mounting rail
10. At the front of the rack, hold onto the rail and open the front hinge bracket.
11. Compress the rail against its spring to shorten it, then remove it from inside the rack (Figure 114 on page 368).
The system starts.
18. After the system is online, use the management GUI to verify that the system is correct.
Replacing the Storwize V7000 Gen1 support rails
You can replace the support rails.
Procedure
To replace the support rails, complete the following steps:
1. Remove the enclosure.
2. Record the location of the rail assembly in the rack cabinet.
3. Working from the back of the rack cabinet, remove the clamping screw (1) from the rail assembly on both sides of the rack cabinet.
12. Reconnect the cables to the canister, ensuring cables go into the same ports from which they were removed in step 1.
13. When the canister is back online, check the event log for new events, particularly events that relate to hardware changes.
Important: For correct operation, use the correct SFP transceivers with each adapter card. The topic “Storwize V7000 2076-524 Gen2 replaceable units” identifies the suitable IBM parts.
• Use only 8 Gbps SFP transceivers in the 8 Gbps Fibre Channel adapter cards.
9. Maintain alignment while applying pressure to the top edge of the host interface adapter opposite the connecting edge to push the host interface adapter into the connector (4 and 5).
Figure 118. Installing the host interface adapter
10. Check that the host interface adapter is installed squarely in its slot. If the small tab of the mounting bracket is not positioned correctly, repeat the procedure from step 5 on page 371 onward to install the adapter correctly.
11.
2. Open the canister and remove the lid as described in “Procedure: Removing and replacing the lid of a Storwize V7000 Gen2 node canister” on page 285.
3. Locate the CMOS battery inside the node canister. See Figure 119 on page 375.
Procedure: SAN problem determination
About this task
SAN failures might cause Storwize V7000 Unified volumes to be inaccessible to host systems. Failures can be caused by SAN configuration changes or by hardware failures in SAN components. The following list identifies some of the hardware that might cause failures:...
Procedure
1. Verify that the power is turned on to all switches and storage controllers that the Storwize V7000 Unified system uses, and that they are not reporting any hardware failures. If problems are found, resolve those problems before you proceed further.
Each port is assigned to one CPU, so balancing the logins across ports maximizes CPU utilization and improves performance. Ideally, configure as many subnets as there are iSCSI ports on the Storwize V7000 Unified node. Configure each port of a node with an IP address on a different subnet, and use the same subnet for the corresponding port on the other nodes.
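The subnet guidance above can be checked mechanically. The following is a minimal sketch, not an IBM tool: it assumes that you have collected the iSCSI port addresses yourself into a dictionary. The helper name, node names, addresses, and the /24 prefix length are all hypothetical, and only the Python standard library is used.

```python
import ipaddress


def check_port_subnets(config, prefix_len=24):
    """Return a list of problems found in a {node: [ip_per_port, ...]} layout.

    Hypothetical helper: the layout and the prefix length are assumptions
    supplied by the caller, not values read from the system.
    """
    nets = {
        node: [ipaddress.ip_interface(f"{ip}/{prefix_len}").network for ip in ips]
        for node, ips in config.items()
    }
    problems = []
    reference = None
    for node, node_nets in nets.items():
        # Each port on a node should sit on its own subnet.
        if len(set(node_nets)) != len(node_nets):
            problems.append(f"{node}: two ports share a subnet")
        # Port N should use the same subnet on every node.
        if reference is None:
            reference = node_nets
        elif node_nets != reference:
            problems.append(f"{node}: subnet layout differs from the first node")
    return problems


# Hypothetical two-node, two-port layout.
layout = {
    "node1": ["192.168.1.10", "192.168.2.10"],
    "node2": ["192.168.1.11", "192.168.2.11"],
}
print(check_port_subnets(layout))
```

An empty list means that every port on a node uses a distinct subnet and that the nodes agree on which subnet each port position uses.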
You do not need to enable PFC on the Storwize V7000 Unified system. Storwize V7000 Unified reads the data center bridging exchange (DCBx) packet and enables PFC for iSCSI automatically if it is enabled on the switch. In the lsportip command output, the fields lossless_iscsi and lossless_iscsi6 show on or off, depending on whether PFC is enabled for iSCSI on the system.
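As an illustration of reading those two fields, here is a minimal sketch. It assumes that the lsportip output has already been captured as delimited text with a header row. The helper name, the sample text, and its abbreviated column set are hypothetical; real output contains more columns.

```python
def lossless_iscsi_by_port(output, delim=","):
    """List (id, node_name, lossless_iscsi, lossless_iscsi6) for each row.

    Assumes delimited text whose first line is a header row. The column
    layout of the sample below is hypothetical and abbreviated.
    """
    lines = [line for line in output.strip().splitlines() if line.strip()]
    header = lines[0].split(delim)
    rows = []
    for line in lines[1:]:
        row = dict(zip(header, line.split(delim)))
        rows.append(
            (row["id"], row["node_name"], row["lossless_iscsi"], row["lossless_iscsi6"])
        )
    return rows


# Hypothetical captured output.
sample = """id,node_name,lossless_iscsi,lossless_iscsi6
1,node1,on,off
2,node1,off,off"""

for port_id, node, v4, v6 in lossless_iscsi_by_port(sample):
    print(f"port {port_id} on {node}: lossless_iscsi={v4} lossless_iscsi6={v6}")
```

A comma is used as the delimiter in this sketch because IPv6 addresses in real output contain colons.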
Removing the wrong SFP transceiver might result in loss of data access.
4. Contact the IBM Support Center for assistance in replacing the node canister.
Ethernet iSCSI host-link problems
If you are having problems attaching to the Ethernet hosts, your problem might be related to the network, the Storwize V7000 Unified system, or the host.
Turning on the system, located in the Information Center, to power the file modules back on. Contact IBM Remote Technical support if the health indicator in the management GUI does not turn back to green within 30 minutes. They can assist you with recovering the file modules so that access to the file systems can be restored.
Attention:
• Run service actions only when directed by the fix procedures. If used inappropriately, service actions can cause loss of access to data or even data loss. Before you attempt to recover a storage system, investigate the cause of the failure and attempt to resolve those issues by using other fix procedures.
Attention: If you experience failures at any time while running the recover system procedure, call the IBM Support Center. Do not attempt to do further recovery actions, because these actions might prevent support from restoring the system to an operational status.
Note: If, after resolving all these scenarios, half or more of the nodes are reporting node error 578, it is appropriate to run the recovery procedure. Call the IBM Support Center for further assistance.
– For any nodes that are reporting a node error 550, ensure that all the missing hardware that is identified by these errors is powered on and connected without faults.
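The "half or more of the nodes" threshold in the note about node error 578 can be expressed as a small check. This is only a sketch of the counting rule: the function name and the input mapping are hypothetical, and the other conditions in this procedure (resolving the listed scenarios first and calling the IBM Support Center) still apply.

```python
def recovery_is_appropriate(node_errors):
    """Apply the counting rule for node error 578.

    node_errors is a hypothetical mapping of node name to the node error
    code that the node reports (or None if it reports no node error).
    """
    if not node_errors:
        return False
    reporting_578 = sum(1 for code in node_errors.values() if code == 578)
    # "Half or more" of the nodes: compare the doubled count to avoid fractions.
    return 2 * reporting_578 >= len(node_errors)


# Hypothetical two-node system in which one node reports node error 578.
print(recovery_is_appropriate({"node1": 578, "node2": None}))
```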
Do not run the recovery procedure on different node canisters in the same system.
Before you begin
Note: Ensure that the web browser is not blocking pop-up windows. If pop-up windows are blocked, progress windows cannot open.
Before you begin this procedure, read the recover system procedure introductory information; see “Recover system procedure” on page 380.
About this task
Attention: This service action has serious implications if not completed properly. If at any time you encounter an error that is not covered by this procedure, stop and call the support center.
Complete the following steps to recover an offline volume after the recovery procedure has completed:
1. Delete all IBM FlashCopy function mappings and Metro Mirror or Global Mirror relationships that use the offline volumes.
2. Run the recovervdisk or recovervdiskbysystem command. (This will only bring the volume back online so that you can attempt to deal with the data loss.)
Refer to “What to check after running the system recovery” for what to do with volumes that have been corrupted by the loss of data from the write-cache.
4. Recreate all FlashCopy mappings and Metro Mirror or Global Mirror relationships that use the volumes.
What to check after running the system recovery
Several tasks must be completed before you use the system.
Before using the file volumes that are used by GPFS on the file modules to provide Network Attached Storage (NAS), complete the following task:
• Contact IBM support for assistance with recovering the GPFS quorum state so that access to files as NAS can be restored.
Contact the IBM support center to help you prepare the Storwize V7000 Unified system to restore the system configuration on the control enclosure.
1. Before you begin, hardware recovery must be complete. The following hardware must be operational: hosts, Storwize V7000 Unified enclosures, internal flash drives and expansion enclosures (if applicable), the Ethernet network, the SAN fabric, and any external storage systems (if applicable).
data that you wrote to the volumes is not backed up. Any application that uses the volumes on the system as storage must use the appropriate backup methods to back up its application data. To avoid data loss, back up your configuration data and your application data regularly, and also after any significant changes to the system configuration.
MDisks and the array will be re-created and configured. If there are multiple storage enclosures involved, the arrays and MDisks will be restored on the proper enclosures based on the enclosure IDs.
If you do not understand the instructions to run the CLI commands, see the command-line interface reference information. To restore your configuration data, follow these steps:
Procedure
1. Verify that all nodes are available as candidate nodes before you run this recovery procedure.
If you find errors, correct the condition that caused the errors and reissue the command. You must correct all errors before you can proceed to step 12.
• If you need assistance, contact the IBM Support Center.
12. Issue the following CLI command to restore the configuration:...
Procedure
1. Issue the following command to log on to the system:
plink -i ssh_private_key_file superuser@control_enclosure_management_ip
where ssh_private_key_file is the name of the SSH private key file for the superuser and control_enclosure_management_ip is the IP address or DNS name of the system from which you want to delete the configuration.
2.
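If the logon in step 1 is scripted, the two placeholders can be filled in programmatically. The sketch below only assembles the argument list for the plink command shown in step 1; it does not run anything. The function name and the example key file and address are hypothetical.

```python
def plink_logon_command(ssh_private_key_file, control_enclosure_management_ip):
    """Build the argument list for the plink logon shown in step 1.

    The two parameters correspond to the placeholders in the procedure.
    Any values passed in are supplied by the caller.
    """
    return [
        "plink",
        "-i",
        ssh_private_key_file,
        f"superuser@{control_enclosure_management_ip}",
    ]


# Hypothetical key file name and documentation-range IP address.
command = plink_logon_command("superuser.ppk", "192.0.2.10")
print(" ".join(command))
```

Keeping the command as a list rather than a single string lets it be passed to `subprocess.run` without shell quoting concerns.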
IBM Support can contact the system administrator in case of any issues.
Testing a call home connection
Use this information to test a call home connection to IBM support.
From the block-level storage system
If call home actions fail, perform the following steps:
1.
Enter the customer name, the case number (use the PMR number), and the geography.
f. Talk to the IBM authorized servicer at the customer site to make sure that the servicer is ready to establish the link before you submit the form.
For example, click Active. Active mode gives full remote access. Monitor mode restricts the IBM support representative to a view of the console, where the representative can offer guidance on what actions you might take to analyze and correct the problem.
SCSI protocol.
Before you begin
During the USB initialization of the Storwize V7000 Unified system, one of the node canisters in the control enclosure creates a public/private key pair to use for SSH. The node canister stores the public key and writes the private key to the USB flash drive memory.
You are prompted for the Storwize V7000 superuser password.
5. Log on to the Storwize V7000 Unified management CLI as admin via the management IP and run the following command to register the new NAS SSH key:
chstoragesystem --sonasprivkey /tmp/NAS.ppk...
This section covers the recovery procedures related to file module issues.
Restoring System x firmware (BIOS) settings
During critical repair actions such as the replacement of a system planar in an IBM Storwize V7000 Unified file module, you might have to reset the System x firmware.
5. Scroll down to select the USB cable, then press Enter.
6. Turn on the affected file module.
7. From the IBM System x Server Firmware screen, press F1 to set up the firmware. A few seconds after the IBM System x Server Firmware screen is displayed,...
The system now reboots. During the reboot, the Storwize V7000 Unified code automatically modifies the configuration of the System x firmware (BIOS) to change the default settings to the required settings.
Recovering from file systems that are offline after the volumes...
The multipath -ll command shows whether each storage device is active or not active. The following output shows that all storage devices are active.
[root@yourmachine.mgmt001st001 ~]# multipath -ll
array1_sas_89360007 (360001ff070e9c0000000001989360007) fm-0 IBM,2073-720
[size=3.1T][features=1 queue_if_no_path][hwhandler=0][rw]
\_ round-robin 0 [prio=50][active]
 \_ 6:0:0:0 sdb 8:16 [active][ready]...
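When many devices are listed, the path lines can be scanned programmatically. The following is a minimal sketch, assuming the output has the same layout as the sample above: a path line carries an H:C:T:L address, a device name, a major:minor pair, and two bracketed state fields. The function name is hypothetical, and the `[failed][faulty]` state used to exercise it is an assumption about what an unhealthy path looks like.

```python
import re

# Matches path lines such as "\_ 6:0:0:0 sdb 8:16 [active][ready]".
PATH_RE = re.compile(r"(\d+:\d+:\d+:\d+)\s+(\w+)\s+\d+:\d+\s+\[(\w+)\]\[(\w+)\]")


def inactive_paths(output):
    """Return the path lines whose state is not [active][ready]."""
    return [
        (hctl, dev, state, status)
        for hctl, dev, state, status in PATH_RE.findall(output)
        if (state, status) != ("active", "ready")
    ]


# Output copied from the example above (prompt line omitted).
sample = r"""array1_sas_89360007 (360001ff070e9c0000000001989360007) fm-0 IBM,2073-720
[size=3.1T][features=1 queue_if_no_path][hwhandler=0][rw]
\_ round-robin 0 [prio=50][active]
 \_ 6:0:0:0 sdb 8:16 [active][ready]"""

print(inactive_paths(sample))
```

An empty list corresponds to the situation in which all listed paths are active and ready.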
Issue the sc service http start command.
2. When you complete the service action, refer to “Health status and recovery” on page 62.
Recovering from an sshd_data service error
Use this procedure to recover from an sshd_data service error.
About this task
This recovery procedure starts the sshd_data service when it is down.
Procedure
1. Log in as a CLI user with privileged authority.
2. Issue the service sc sshd_data start command.
3. If the problem persists, restart the node.
4.
About this task
Procedure
To run the fix procedures, perform the following steps:
1. Log in to the Storwize V7000 Unified management GUI.
2. Go to Monitoring > Events and click the Block tab.
3. Run any Next recommended action.
Free the unusable blocks in the compressed volumes
If you cannot increase the storage pool capacity, contact IBM Remote Technical Support to help you.
Recovering from a 1001 error code
A 1001 error code indicates that the Storwize V7000 control enclosure has automatically performed a recovery.
You can immediately remount any remaining unmounted file systems without waiting for IBM support to tell you that it is safe for you to re-enable the control enclosure CLI. Note: The management GUI can become very slow when the control enclosure CLI is restricted, so the following procedure shows how to use the management CLI to check if the file systems are mounted.
CLI command to check if all of your file volumes that should be online are online. Note that the names of file volumes are the same as the names of the disks. For example:
[kd52v6h.ibm]$ lsvdisk
id name IO_group_id IO_group_name status mdisk_grp_id mdisk_grp_name capacity type FC_id FC_name RC_id RC_name vdisk_UID fc_map_count copy_count fast_write_state
0 IFS1350385068630 0 io_grp0 online 1 meta1 100.00GB striped...
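The check described above can be sketched in a few lines. The sketch assumes whitespace-separated lsvdisk output that begins with the header row (prompt line removed), and it relies only on the first five columns (id, name, IO_group_id, IO_group_name, status) because later columns can be empty. The function name and the second and third sample rows are hypothetical; the IFS name prefix for file volumes is taken from this guide.

```python
def offline_file_volumes(lsvdisk_output):
    """Return (name, status) for file volumes that are not online.

    File volumes are identified by names that start with IFS. Only the
    first five whitespace-separated columns of each row are used.
    """
    offline = []
    for line in lsvdisk_output.strip().splitlines()[1:]:  # skip the header row
        fields = line.split()
        if len(fields) < 5:
            continue
        name, status = fields[1], fields[4]
        if name.startswith("IFS") and status != "online":
            offline.append((name, status))
    return offline


# The first data row is from the example above; the others are hypothetical.
sample = """id name IO_group_id IO_group_name status mdisk_grp_id mdisk_grp_name capacity type
0 IFS1350385068630 0 io_grp0 online 1 meta1 100.00GB striped
1 IFS1350385068631 0 io_grp0 offline 1 meta1 100.00GB striped
2 hostvol0 0 io_grp0 offline 1 meta1 50.00GB striped"""

print(offline_file_volumes(sample))
```

Volumes whose names do not start with IFS are ignored because they are not file volumes.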
5. Log back on to the Storwize V7000 Unified CLI. Wait until both nodes show OK in the Connection status column of the output from the CLI command:
lsnode -r
6. Resume the file module back into the cluster using the CLI command:
resumenode <node name>...
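The wait in step 5 is a polling loop. The sketch below shows one way to structure it under stated assumptions: `get_statuses` is a hypothetical caller-supplied function that returns a mapping of node name to the value in the Connection status column (for example, by running lsnode -r and parsing its output, which is not shown here). The timeout and interval values, and the `DOWN` status in the demonstration, are arbitrary.

```python
import time


def wait_for_nodes_ok(get_statuses, timeout_s=600, interval_s=10,
                      sleep=time.sleep, clock=time.monotonic):
    """Poll until every node reports OK, or until the timeout expires.

    get_statuses is a hypothetical callable that returns a mapping such as
    {"mgmt001st001": "OK", "mgmt002st001": "OK"}.
    """
    deadline = clock() + timeout_s
    while True:
        statuses = get_statuses()
        if statuses and all(status == "OK" for status in statuses.values()):
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


# Demonstration with canned results instead of a real CLI call.
canned = iter([{"a": "OK", "b": "DOWN"}, {"a": "OK", "b": "OK"}])
print(wait_for_nodes_ok(lambda: next(canned), timeout_s=5, interval_s=0))
```

Passing `sleep` and `clock` as parameters keeps the loop testable without real delays.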
Site A by using the rmtask CLI command.
Restoring Tivoli Storage Manager data
The Storwize V7000 Unified system contains a Tivoli Storage Manager client that works with your Tivoli Storage Manager server system to perform high-speed data backup and recovery operations.
2. After each recommended fix, restart the upgrade by issuing the applysoftware command again. If the action fails, try the next recommended action.
3. If the recommended actions fail to resolve the issue, call the IBM Support Center.
Table 119. Upgrade error codes from using the applysoftware command and recommended...
Page 445
Table 119. Upgrade error codes from using the applysoftware command and recommended actions (continued)
Error Code | The applysoftware command explanation | Action
EFSSG4101A | The applysoftware command returned required parameter not specified. |
EFSSG4102 | The software package does not exist. | Verify that the file actually...
Page 446
EFSSG4159 | The system is in an unhealthy state and the upgrade cannot start. | See Chapter 3, “Getting started troubleshooting,” on page 47. Determine if the system has issues.
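The retry procedure that accompanies these error codes (apply a recommended action, issue applysoftware again, move to the next action if the upgrade still fails, and call the IBM Support Center when no actions remain) can be sketched as a simple loop. This is a sketch of the control flow only. The callables apply_software and the entries in recommended_actions are hypothetical placeholders for whatever you do manually or by script, and the return strings are illustrative.

```python
from typing import Callable, Optional, Sequence


def upgrade_with_recommended_actions(
    apply_software: Callable[[], Optional[str]],
    recommended_actions: Sequence[Callable[[], None]],
) -> str:
    """Retry the upgrade after each recommended action.

    apply_software is a hypothetical callable that runs the upgrade and
    returns None on success or an error code string on failure.
    """
    # First attempt, before any fix has been applied.
    if apply_software() is None:
        return "upgraded"

    # After each recommended fix, restart the upgrade.
    for action in recommended_actions:
        action()
        if apply_software() is None:
            return "upgraded"

    # No recommended action resolved the issue.
    return "call IBM Support Center"
```

The loop stops at the first successful upgrade, so later actions are not applied unnecessarily.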
2. After each recommended fix, restart the upgrade by issuing the applysoftware command again. If the action fails, try the next recommended action.
3. If the recommended actions fail to resolve the issue, call the IBM Support Center.
Table 120. Upgrade error codes and recommended actions...
Page 448
2. Attempt to remove the backup by typing rmtask StartBackupTSM.
3. Contact IBM Remote Technical Support.
01A6 | Unable to install CNCSM callbacks. | Contact IBM Remote Technical Support.
01A7 | Internal vital product data (VPD) error. | Contact IBM Remote Technical Support.
01A8 Check the health of management 1....
Page 449
Table 120. Upgrade error codes and recommended actions (continued)
Error Code | Explanation | Action
01A9 | Unable to stop performance collection daemon. | Contact IBM Remote Technical Support.
01AB | Internal upgrade error in node_setup_system. | Contact IBM Remote Technical Support.
01B1 Management node replication 1....
Page 450
stop. Stop asynchronous replication and continue with the upgrade. | 1. Stop asynchronous replication by typing stoprepl gpfs0 --kill. Asynchronous replication is considered active if in RUNNING or KILLING state. 2. Contact IBM Remote Technical Support.
Page 451
sonas_update_yum. | Contact IBM Remote Technical Support.
01C7 | Unable to get list of cluster nodes. | Contact IBM Remote Technical Support.
01C8 | Failed while running cnrsscconfig. | Contact IBM Remote Technical Support.
01C9 | Unable to install CIM configuration. | Contact IBM Remote Technical Support.
01CA Unable to get name of cluster....
Page 452
Failed | Contact IBM Remote Technical Support.
01E3 | mmchfs Failed | Contact IBM Remote Technical Support.
01E4 | Disable HSM failed | Contact IBM Remote Technical Support.
01E5 | Enable HSM failed | Contact IBM Remote Technical Support.
Page 453
2. Restart the node, and then ping the node.
3. Check the network connections and correct them, if required.
4. Contact IBM Remote Technical Support.
01E8 | Unable to apply firmware to | Contact IBM Remote Technical Support.
Page 454
Database replication suspend or resume error. | Contact IBM Remote Technical Support.
0522 | Unable to clean the CTDB configuration file. | Contact IBM Remote Technical Support.
0523 | Unable to upgrade Samba packages. | Contact IBM Remote Technical Support.
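Table 120 states that asynchronous replication is considered active if it is in the RUNNING or KILLING state, and that it can be stopped by typing stoprepl gpfs0 --kill. A minimal sketch of that rule follows. The function names are hypothetical, and substituting a file system name other than gpfs0 into the command is an assumption that extends the single example given in the table.

```python
# States in which asynchronous replication counts as active, per Table 120.
ACTIVE_REPLICATION_STATES = frozenset({"RUNNING", "KILLING"})


def replication_is_active(state: str) -> bool:
    """Return True if the reported replication state counts as active."""
    return state.strip().upper() in ACTIVE_REPLICATION_STATES


def stop_replication_command(file_system: str = "gpfs0") -> str:
    """Build the stop command shown in Table 120.

    gpfs0 is the file system used in the table; other names are an
    assumption made for illustration.
    """
    return f"stoprepl {file_system} --kill"


if __name__ == "__main__":
    if replication_is_active("RUNNING"):
        print(stop_replication_command())
```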
Storage pool is full and the file system pool is offline, but no additional storage is available to add to the pool. | Contact IBM Remote Technical Support or your service representative.
In the Preset field, select the RAID configuration for the storage you are configuring.
c. Select Optimize for capacity to configure all available capacity.
d. Verify the configuration and click Next.
e. Click Expand an existing pool and select the storage pool that is used for compression.
4. Click Finish.
Allocate storage from available external storage:
The system supports adding external storage systems to provide additional capacity and virtualization. If your environment has external storage systems, you can increase capacity to the storage pool by completing these steps:
1.
Page 458
Note: If you are unfamiliar with managing spare goals and spare disks, contact IBM support for guidance. Increasing capacity in this way is meant only as a short-term solution to this problem. Further provisioning to permanently resolve capacity constraints can be conducted with the help of IBM service personnel, who might recommend that additional drives be added to your system.
Page 459
Click OK.
To add drives to the system, complete these steps:
a. Acquire additional drives from IBM or your vendor.
b. Install drives into available drive slots on the enclosure. See “Installing a hot-swap hard disk drive” on page 149.
In most cases, data does not keep the same compression rate because it is constantly changing over the course of its life cycle. Incompressible data or data that does not compress well can be added to a file
Page 461
system, which impacts compression rates. The system default for the contingency threshold is 80% of the physical capacity, which provides 20% contingency capacity for the storage pool and is adequate for most environments. For example, if an administrator has a storage pool with 10 TB of physical storage and sets the threshold to 80%, only 8 TB of the physical 10 TB are available in the pool.
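The arithmetic in this example can be checked with a short calculation. This is a minimal sketch. The function name is hypothetical, and the only inputs are the physical capacity and the threshold percentage described above.

```python
def capacity_at_threshold(
    physical_tb: float, threshold_percent: float = 80.0
) -> tuple[float, float]:
    """Return (available_tb, contingency_tb) for a contingency threshold."""
    if not 0 <= threshold_percent <= 100:
        raise ValueError("threshold_percent must be between 0 and 100")
    # Capacity that can be used before the threshold is reached.
    available_tb = physical_tb * threshold_percent / 100.0
    # Capacity held back as contingency.
    contingency_tb = physical_tb - available_tb
    return available_tb, contingency_tb


if __name__ == "__main__":
    # 10 TB pool at the default 80% threshold: 8 TB available, 2 TB held back.
    print(capacity_at_threshold(10.0))
```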
Accessibility features
These are the major accessibility features for the Storwize V7000 Unified:
• You can use screen-reader software and a digital speech synthesizer to hear what is displayed on the screen. HTML documents have been tested using JAWS version 15.0.
Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead.
Page 466
The materials at those websites are not part of the materials for this IBM product and use of those websites is at your own risk. IBM may use or distribute any of the information you provide in any way it believes appropriate without incurring any obligation to you.
IBM, therefore, cannot guarantee or imply reliability, serviceability, or function of these programs. The sample programs are provided "AS IS", without warranty of any kind. IBM shall not be liable for any damages arising out of your use of the sample programs.
Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. Attention: This is an EN 55022 Class A product. In a domestic environment this product might cause radio interference in which case the user might be required to take adequate measures.
Class A. To ensure this, the devices must be installed and operated as described in the manuals. In addition, only cables recommended by IBM may be connected. IBM assumes no responsibility for compliance with the protection requirements if the product is modified without the consent of IBM or...
This explains the Japan Voluntary Control Council for Interference (VCCI) statement.
Japan Electronics and Information Technology Industries Association Statement
This explains the Japan Electronics and Information Technology Industries Association (JEITA) statement for less than or equal to 20 A per phase.
This explains the JEITA statement for greater than 20 A per phase.
Korean Communications Commission Class A Statement
This explains the Korean Communications Commission (KCC) statement.
Russia Electromagnetic Interference Class A Statement
This explains the Russia Electromagnetic Interference (EMI) statement.