IBM Storwize V7000 Unified Problem Determination Manual

IBM Storwize V7000 Unified
Problem Determination Guide
IBM
GA32-1057-14

Summary of Contents for IBM Storwize V7000 Unified

  • Page 1 IBM Storwize V7000 Unified Problem Determination Guide GA32-1057-14...
  • Page 2 The information in the “Safety and environmental notices” on page xi; the information in the IBM Environmental Notices and User Guide (provided on a DVD). This edition applies to IBM Storwize V7000 Unified and to all subsequent releases and modifications until otherwise indicated in new editions.
  • Page 3: Table Of Contents

    Back up your data . Safety notices and labels . Manage your spare and failed drives . Caution notices for the Storwize V7000 Unified Resolve alerts in a timely manner . Danger notices for Storwize V7000 Unified . Keep your software up to date .
  • Page 4 Procedure: Powering off your system . Battery operation for Storwize V7000 Gen2 Procedure: Powering on the Storwize V7000 control enclosures . Gen2 system. Battery operation for Storwize V7000 Unified Gen1 control enclosures . Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 5 Running system recovery using the service Appendix. Accessibility features for assistant . IBM Storwize V7000 Unified ..Recovering from offline volumes using the CLI What to check after running the system Notices ....
  • Page 6 Industries Association Statement . European Union Electromagnetic Compatibility Korean Communications Commission Class A Directive . Statement Germany Electromagnetic Compatibility Russia Electromagnetic Interference Class A Directive . Statement People's Republic of China Class A Statement Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 7: Figures

    16. Rear view of a model 2076-212 or a model 49. Example of a SMART error . 2076-224 expansion enclosure . 50. Removing the cover . 17. Rear view of a Storwize V7000 Unified control 51. Installing the cover enclosure . 52. Removing the bezel 18.
  • Page 8 94. Unlocking and removing a 3.5-inch drive 118. Installing the host interface adapter . from its slot . 119. Replacing a CMOS Gen2 battery . 95. Installing and locking a 3.5-inch drive into its slot . viii Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 9 65. Description of data fields for the event log information 66. Notification levels . 33. Installation error code actions 67. Storwize V7000 Unified Gen1 model numbers 230 34. Error messages and actions . 68. Storwize V7000 Unified Gen2 model numbers 230 35. CLI command problems .
  • Page 10 81. Storwize V7000 Unified Gen2 model numbers 250 103. Storwize V7000 Unified Gen1 model numbers 287 82. Storwize V7000 Unified Gen1 model numbers 252 104. Storwize V7000 Unified Gen2 model numbers 288 83. Storwize V7000 Unified Gen2 model numbers 252 105.
  • Page 11: Safety And Environmental Notices

    In the preceding examples, the numbers (C001) and (D002) are the identification numbers. 2. Locate the IBM Systems Safety Notices with the user publications that were provided with the Storwize V7000 Unified hardware. 3. Find the matching identification number in the IBM Systems Safety Notices. Then review the topics concerning the safety notices to ensure that you are in compliance.
  • Page 12: Caution Notices For The Storwize V7000 Unified

    “Labels” section. Note: You can find and download the current IBM System Safety Notices by searching for Publication number G229-9054 in the IBM Publications Center.
  • Page 13 CAUTION: The battery contains lithium. To avoid possible explosion, do not burn or charge the battery. Do not: Throw or immerse into water, heat to more than 100°C (212°F), repair or disassemble. (C003) CAUTION: Electrical current from power, telephone, and communication cables can be hazardous.
  • Page 14 It is intended that equipment installed within this rack will have its own enclosure. (R005). CAUTION: Tighten the stabilizer brackets until they are flush against the rack. (R006) CAUTION: Use safe practices when lifting. (R007) Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 15: Danger Notices For Storwize V7000 Unified

    (R009) Danger notices for Storwize V7000 Unified Ensure that you are familiar with the danger notices for Storwize V7000 Unified. Use the reference numbers in parentheses at the end of each notice, such as (C003), to find the matching translated notice in IBM Systems Safety Notices.
  • Page 16 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 17 Observe the following precautions when working on or around your IT rack system: Heavy equipment–personal injury or equipment damage might result if mishandled. Always lower the leveling pads on the rack cabinet. Always install stabilizer brackets on the rack cabinet. To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet.
  • Page 18: Special Caution And Safety Notices

    General safety When you service the Storwize V7000 Unified, follow general safety guidelines. Use the following general rules to ensure safety to yourself and others. Observe good housekeeping in the area where the devices are kept during and after maintenance.
  • Page 19: Handling Static-Sensitive Devices

    Attention: Depending on local conditions, the sound pressure can exceed 85 dB(A) during service operations. In such cases, wear appropriate hearing protection. Environmental notices This information contains all of the required environmental notices for IBM Systems products in English and other languages. Safety and environmental notices...
  • Page 20 The IBM Systems Environmental Notices (http://ibm.co/1fBgWFI) information includes statements on limitations, product information, product recycling and disposal, battery information, flat panel display, refrigeration and water-cooling systems, external power supplies, and safety data sheets. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 21: About This Guide

    Storwize V7000 Unified. IBM Knowledge Center for Storwize V7000 Unified The information collection in the IBM Knowledge Center contains all of the information that is required to install, configure, and manage the system. The information collection in the IBM Knowledge Center is updated between product releases to provide the most current documentation.
  • Page 22 Each of the PDF publications in the Table 2 library is also available in the IBM Knowledge Center by clicking the number in the “Order number” column: Table 2. Storwize V7000 Unified library Title Description Order number IBM Storwize V7000 Model...
  • Page 23 Table 2. Storwize V7000 Unified library (continued) Title Description Order number Safety Information The guide contains translated caution and danger statements for the file module documentation. Each caution and danger statement in the Storwize V7000 Unified documentation has a number. Use the number to...
  • Page 24: How To Order Ibm Publications

    Some publications are available for you to view or download at no charge. You can also order publications. The publications center displays prices in your local currency. You can access the IBM Publications Center through the following website: www.ibm.com/e-business/linkweb/publications/servlet/pbi.wss...
  • Page 25: How To Get Information, Help, And Technical Assistance

    Before calling for support, be sure to have your IBM Customer Number available. If you are in the US or Canada, you can call 1 (800) IBM SERV for help and service. From other parts of the world, see http://www.ibm.com/planetwide for the number that you can call.
  • Page 26 Software option Identify the Storwize V7000 Unified product as your product and supply your customer number as proof of purchase. The customer number is a 7-digit number (0000000 to 9999999) assigned by IBM when the product is purchased. Your customer number should be located on the customer information worksheet or on the invoice from your storage purchase.
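The customer number format described above (seven digits, 0000000 to 9999999) can be checked before a support call. The following is a minimal sketch in Python; the function name `is_valid_customer_number` is hypothetical and is not part of any IBM tool.

```python
import re

# Assumption: the customer number is exactly seven ASCII digits,
# matching the range 0000000 to 9999999 given in the text.
_CUSTOMER_NUMBER = re.compile(r"[0-9]{7}")


def is_valid_customer_number(value: str) -> bool:
    """Return True if value is a 7-digit customer number (hypothetical helper)."""
    # Surrounding whitespace is ignored; anything else must be digits only.
    return _CUSTOMER_NUMBER.fullmatch(value.strip()) is not None
```

Leading zeros are significant, so the number is best handled as a string rather than an integer.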
  • Page 27: What's New

    At times, you might need expert advice about using a function provided by the system or about how to configure the system. Purchasing the IBM Support Line offering gives you access to this professional advice while deploying your system, and in the future.
  • Page 29: Chapter 1. Storwize V7000 Unified Hardware Components

    A Storwize V7000 Unified system consists of 1 or more machine type 2076 rack-mounted enclosures and 2 machine type 2073 rack-mounted file modules. Control enclosures contain the node canisters that manage the system operation and provide the host interfaces.
  • Page 30: Components In The Front Of The 2073-720 File Module

    1: Power-control button and power-on LED (green); 2: Ethernet icon; 3: System-locator button and LED (blue); 4: Release latch for the light path diagnostics panel; 5: Ethernet activity LEDs; 6: Check log LED; 7: System-error LED (yellow)
  • Page 31: Components In The Rear Of The 2073-700 File Module

    Figure 5. Rear view of 2073-720 file module. 1: 8 Gbps Fibre Channel port 1 (connected to the control enclosure); 2: 8 Gbps Fibre Channel port 2 (connected to the control enclosure). Chapter 1. Storwize V7000 Unified hardware components...
  • Page 32: Components In The Front Of The Enclosure

    Important: Drive slots cannot be empty. Install a drive assembly or blank carrier in each slot. Note: Drives that are sold as Storwize V7000 Unified options are the only drives that are supported. For more information, see the Support website.
  • Page 33: Drive Indicators For Control Enclosures

    Figure 9. Storwize V7000 Gen2 Small form factor vertical drive Drive indicators for control enclosures Storwize V7000 Unified enclosures use different drive indicators, depending on the generation of your control enclosure model. Drives have two light-emitting diode (LED) indicators each; they have no controls or connectors.
  • Page 34: Led Indicators On A Single 3.5 Inch Drive

    If the LED is on, a fault exists on the drive. If the LED is off, no known fault exists on the drive. If the LED is flashing, the drive is being identified. A fault might or might not exist.
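The three LED states described above map directly to a lookup table. This is a small illustrative sketch in Python; `LedState` and `describe_drive_led` are hypothetical names introduced only for this example, and the wording of each meaning is taken from the text.

```python
from enum import Enum


class LedState(Enum):
    """Observed state of the drive LED (hypothetical type for illustration)."""
    ON = "on"
    OFF = "off"
    FLASHING = "flashing"


# Meanings as stated in the drive LED description.
_MEANINGS = {
    LedState.ON: "A fault exists on the drive.",
    LedState.OFF: "No known fault exists on the drive.",
    LedState.FLASHING: "The drive is being identified; a fault might or might not exist.",
}


def describe_drive_led(state: LedState) -> str:
    """Return the documented meaning of a drive LED state."""
    return _MEANINGS[state]
```

Note that a flashing LED does not rule out a fault, so the event log should still be checked in that case.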
  • Page 35: Enclosure End Cap Indicators

    Enclosure end cap indicators Storwize V7000 Unified enclosure end cap indicators vary, depending on the generation of your control enclosure model. Storwize V7000 Gen1 Figure 12 shows where the end caps are located on the front of an enclosure with 12 drives.
  • Page 36: Components In The Rear Of The Enclosure

    Figure 15 on page 9 shows the rear view of a model 2076-312 or a model 2076-324 control enclosure with the 10 Gbps Ethernet port ( 5 ). Figure 16 on page 9 shows the rear of an expansion enclosure. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 37: Power Supply Units For Control Enclosures

    Power supply units for control enclosures Storwize V7000 Unified enclosures use different power supply units, depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Chapter 1. Storwize V7000 Unified hardware components...
  • Page 38: Rear View Of A Storwize V7000 Unified Control

    Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 8. Storwize V7000 Unified Gen2 model numbers...
  • Page 39: Leds On The Power Supply Units Of The Control

    Figure 18. LEDs on the power supply units of the control enclosure Table 9 identifies the LEDs in the rear of the control enclosure. Table 9. Power supply unit LEDs in the rear of the control enclosure Name Color Symbol ac power failure Amber Chapter 1. Storwize V7000 Unified hardware components...
  • Page 40: Power Supply Units For Expansion Enclosures

    Storwize V7000 Unified enclosures use different power supply units, depending on the generation of your expansion enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 10. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 41: Rear View Of A Storwize V7000 Gen2 Expansion

    There is a power switch on each of the power supply units. The switch must be on for the power supply unit to be operational. If the power switches are turned off, the power supply units stop providing power to the system. Chapter 1. Storwize V7000 Unified hardware components...
  • Page 42: Storwize V7000 2076-524 Node Canister Ports And Indicators

    A node canister contains a battery that provides power to the canister as it stores cache and system data to an internal drive in the event of a power failure. This process is known as a fire hose dump. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 43: Storwize V7000 2076-524 Node Canister Ports

    Figure 22. Storwize V7000 2076-524 node canister indicators Storwize V7000 2076-524 node canister SAS port LEDs Table 14 on page 16 depicts the status LEDs for SAS ports 1 and 2, and their location in Figure 22.
  • Page 44: Storwize V7000 2076-524 Sas Ports 1 And

    4 phys connected. Not all 4 phys are at the same speed. One or more of the connected phys are attached to an address different from the others.
  • Page 45 Storwize V7000 2076-524 node canister battery status LEDs Table 15 on page 18 shows battery status LEDs and their location in Figure 22 on page 15.
  • Page 46: Storwize V7000 2076-524 Battery Status Leds

    There is a fault in the battery. Storwize V7000 2076-524 node canister system status LEDs Table 16 on page 19 shows system status LEDs and their location in Figure 22 on page 15.
  • Page 47 FAST BLINK The canister is active, able to complete I/O operations, or starting. The canister is active, able to complete I/O operations, or starting. The node is part of a cluster. Chapter 1. Storwize V7000 Unified hardware components...
  • Page 48 Two USB ports are located on each Storwize V7000 Gen2 node canister. The USB ports are numbered 1 on top and 2 on the bottom as shown in Figure 23 on page 21. One port is used during installation. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 49: Usb Ports On The Storwize V7000 Gen2 Node Canister

    Figure 24 on page 22. Each port can have up to an 8 Gbps SW SFP transceiver installed. Each transceiver connects to a host or Fibre Channel switch with an LC-to-LC Fibre Channel cable. Chapter 1. Storwize V7000 Unified hardware components...
  • Page 50: Fibre Channel Host Interface Adapter Port-State

    Fibre Channel over Ethernet connections to host system or storage systems. Each port can support simultaneous FCoE and iSCSI connections. The Small Form-factor Pluggable (SFP) transceivers that are installed on the adapter support data transfer speeds of 10 Gbps. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 51: Example Of Installed 10 Gbps Fibre Channel Over Ethernet/Iscsi Host Interface Adapters

    Figure 26. 10 Gbps Fibre Channel over Ethernet/iSCSI host interface adapter ports Storwize V7000 2076-524 10 Gbps Fibre Channel over Ethernet/iSCSI host interface adapter indicators Each port has two LED indicators, one green and one amber (see Figure 27 on page 24). Chapter 1. Storwize V7000 Unified hardware components...
  • Page 52: Node Canister Ports And Indicators

    The ports are numbered 1 - 4 from left to right and top to bottom. Note: The reference to the left and right locations applies to canister 1, which is the upper canister. The port locations are inverted for canister 2, which is the lower canister. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 53: Fibre Channel Ports On The Node Canisters

    Table 19. Fibre Channel port LED locations on canister 1. Columns: Associated port, LED location, LED status. Port 3 (3): first LED between ports 1 and 3 (1), Speed. Port 1 (1): second LED between ports 1 and 3 (2), Speed.
  • Page 54: Fibre Channel Port Led Locations On Canister 1

    27. One port is used during installation. Note: The reference to the left and right locations applies to canister 1, which is the upper canister. The port locations are inverted for canister 2, which is the lower canister. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 55: Usb Ports On The Node Canisters

    Two LEDs are associated with each port. Note: The reference to the left and right locations applies to canister 1, which is the upper canister. The port locations are inverted for canister 2, which is the lower canister. Chapter 1. Storwize V7000 Unified hardware components...
  • Page 56: Ethernet Ports On The 2076-112 And 2076-124 Node Canisters

    Figure 32 shows the location of the 10 Gbps Ethernet ports. Figure 32. 10 Gbps Ethernet ports on the 2076-312 and 2076-324 node canisters Table 22 on page 29 provides a description of the LEDs. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 57: Sas Ports On The Node Canisters

    The port locations are inverted for canister 2, which is the lower canister. Figure 33. SAS ports on the node canisters. SAS ports must be connected to Storwize V7000 Unified enclosures only. See “Problem: Storwize V7000 Gen1 SAS cabling not valid” on page 248 for help in attaching the SAS cables.
  • Page 58: Leds On The Node Canisters

    It is not able to perform I/O in a system. When the node is in either of these states, it can be removed. Do not remove the canister unless directed by a service procedure. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 59: Expansion Canister Ports And Indicators

    There are no defined procedures that use the port. Storwize V7000 Gen2 expansion canister SAS ports and indicators Two SAS ports are located in the rear of the Storwize V7000 Gen2 expansion canister. Chapter 1. Storwize V7000 Unified hardware components...
  • Page 60: Sas Ports And Leds At Rear Of Expansion Canister

    One or more, but not all, of the 4 phys are connected. Not all 4 phys are at the same speed. One or more of the connected phys are attached to an address different from the others.
  • Page 61: Storwize V7000 Gen1 Expansion Canister Sas Ports And Indicators

    The link is connected and has activity. The link is connected. Storwize V7000 Gen2 expansion canister LEDs Each Storwize V7000 Gen2 expansion canister has three LEDs that provide status and identification for the expansion canister. Chapter 1. Storwize V7000 Unified hardware components...
  • Page 62: Expansion Canister Leds

    The two LEDs are located in a vertical row on the left side of the canister. Figure 38 on page 35 shows the LEDs ( 1 ) in the rear of the expansion canister. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 63: Expansion Canister Leds

    If the LED is on, a fault exists. If the LED is off, no fault exists. If the LED is flashing, the canister is being identified. This status might or might not be a fault.
  • Page 65: Chapter 2. Best Practices For Troubleshooting

    Use this address if the control enclosure CLI is not working. These addresses are not set during the installation of a Storwize V7000 Unified system, but you can set these IP addresses later by using the management GUI or the chserviceip CLI command.
  • Page 66: Follow Proper Power Management Procedures

    RAID arrays for the disk system. The Storwize V7000 Unified system uses a pair of file modules for redundancy. Follow the appropriate power down procedures to minimize impacts to the system operations.
  • Page 67: Follow Proper Storwize V7000 Gen2 Power Management Procedures

    RAID arrays for the disk system. The Storwize V7000 Unified system uses a pair of file modules for redundancy. Follow the appropriate power down procedures to minimize impacts to the system operations.
  • Page 68: Set Up Inventory Reporting

    Call Home. When the event is received, IBM automatically opens a problem report, and if appropriate, contacts you to verify if replacement parts are required. If you set up Call Home to IBM, ensure that the contact details that you configure are correct and kept up to date as personnel change.
  • Page 69: Keep Your Software Up To Date

    The management GUI provides the capability to review these issues from the Events panel. For file module issues, use the Storwize V7000 Unified information center to look up the events and perform the actions listed for the events. For Storwize V7000 issues, resolve these problems through the Recommended actions only option from the Events panel.
  • Page 70: Keep Your Storwize V7000 Gen2 Records Up To Date

    Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 31. Storwize V7000 Unified Gen2 model numbers Machine type/model Description 2076-524 Storwize V7000 Unified control enclosure, with up to 24 2.5-inch (6.35 cm) drives 2076-12F Storwize V7000 Unified expansion enclosure for up to 12 3.5-inch (8.89...
  • Page 71: Know Your Ibm Warranty And Maintenance Agreement Details

    Before calling for support, be sure to have your IBM Customer Number available. If you are in the US or Canada, you can call 1 (800) IBM SERV for help and service. From other parts of the world, see http://www.ibm.com/planetwide for the number that you can call.
  • Page 72 If you call from somewhere other than the US or Canada, you must choose the software or hardware option when calling for assistance. Choose the software option if you are uncertain if the problem involves the Storwize V7000 Unified software or hardware. Choose the hardware option only if you are certain the problem solely involves the Storwize V7000 Unified hardware.
  • Page 73 At times, you might need expert advice about using a function provided by the system or about how to configure the system. Purchasing the IBM Support Line offering gives you access to this professional advice while deploying your system, and in the future.
  • Page 75: Chapter 3. Getting Started Troubleshooting

    If users or applications are having trouble accessing data that is held on the Storwize V7000 Unified system, or if the management GUI is not accessible or is running slowly, the Storwize V7000 control enclosure might have a problem.
  • Page 76: Installation Troubleshooting

    187; otherwise, see “Checking the GPFS file system mount on each file module” on page 189. If you have lost access to the files, but there is no sign that anything is wrong with the Storwize V7000 Unified system, see “Host to file modules connectivity” on page 63. Installation troubleshooting This topic provides information for troubleshooting problems encountered during the installation.
  • Page 77 – Product Family: Disk Systems – Product: IBM Storwize V7000 Unified – Release: All – Platform: All Before loading the USB flash drive, verify that it has a FAT32-formatted file system. Plug the USB flash drive into the laptop. Go to Start (my computer), right-click the USB drive.
  • Page 78 SONAS_results.txt file and open it. Check for errors and corrective actions (refer to Storwize V7000 Unified Problem Determination Guide PDF on the CD). If no errors are listed, reboot both file modules, allow file modules to boot completely, reinsert the USB flash drive as originally instructed and try again.
  • Page 79: Installation Error Codes

    3. Refer to Table 33 to match the code (A-I) to the recommended action. Follow the suggested actions in order, completing one before trying the next. 4. If the recommended action or actions fail, call the IBM Support Center. Table actions defined This table serves as a legend for defining the precise action to follow.
  • Page 80: Error Messages And Actions

    Verify that the Ethernet cabling connections are seated properly between the Storwize V7000 Unified control enclosure and the customer network, and between the file modules and the customer network. Then press the Restart button if the management GUI has already started; otherwise, reinsert the USB flash drive into the original file module.
  • Page 81 Table 34. Error messages and actions (continued) Error code Error message Action key 0A0D Error querying settings through ASU. 0A0E Error setting ASU command. 0A0F Unable to determine adapter name from VPD. 0A10 Unable to open the ifcfg file. 0A11 Unable to write to the ifcfg file.
  • Page 82 No host name provided to exchange keys with. 0AD5 Host name is invalid. 0AD6 Invalid parameters. 0AD7 Unable to open vpdnew.txt file. 0AD8 VPD failed to update a value. 0AD9 Invalid option. 0ADA Error while parsing adapter ID. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 83 Table 34. Error messages and actions (continued) Error code Error message Action key 0ADB Unable to open /proc/scsi/scsi. 0AF8 Trying to install management stack on non-management node. 0AF9 Invalid site ID. Currently only 'st001' is supported on physical systems. 0AFA This node is already a part of a cluster.
  • Page 84 There was an error while installing GPFS callbacks. 0B92 Rsync failed between management nodes. 0B94 There were too many potential peer storage nodes. Storage controllers may be cabled incorrectly or UUIDs might not be set properly. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 85 Error running update test utility on controller, see Storwize V7000 for more details. 0BD7 Yum is reporting a package error on a node. Try running yum manually. 01B2 Unable to start performance collection daemon. Contact IBM Remote Technical Support. Chapter 3. Getting started troubleshooting...
  • Page 86 01D5 Storwize V7000 stalled. Contact IBM Remote Technical Support. 01D6 Storwize V7000 stalled_non_redundant 01DA GPFS cluster is unhealthy Refer to “Checking the GPFS file system mount on each file module” on page 189 Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 87: Problems Reported By The Cli Commands During Software Configuration

    /opt/IBM/sofs/cli/cfgperfcenter --stop. If successful, restart the update. If you are unable to stop the performance center, contact IBM Remote Technical Support. Problems reported by the CLI commands during software configuration Use this information when troubleshooting problems reported by the CLI commands during software configurations.
  • Page 88: Management Gui Wizard Failure

    1. Does the GUI launch and are there problems logging into the system? Yes: Check that the user ID being used was set up to access the GUI. Refer to “Authentication basic concepts” in the IBM Storwize V7000 Unified Information Center.
  • Page 89 Yes: a. Run the CLI command lsnode and determine the status of the file nodes. b. If lsnode reports that the management service is not running, refer to “Management node role failover procedures” on page 183. If lsnode provides the system configuration information, check the connection status under the appropriate heading.
  • Page 90: Health Status And Recovery

    About this task Within the Storwize V7000 Unified system, the system Health Status is based on a set of predefined software and hardware health status sensors. The status of each component is displayed against the corresponding logical host name in the System and System Details pages.
  • Page 91: Connectivity Issues For The 2073-720

    This topic instructs you where to go to view the information that is displayed, how to check the status of the various sensors, and how to manually close out sensor events. By performing these tasks, you ensure that the overall Health Status reflects the current system health.
  • Page 92: File Module Ethernet Connections

    ) public file access If you are looking at a problem regarding built-in Ethernet port 1 or built-in Ethernet port 2, refer to “Ethernet connectivity between file modules” on page 65. Isolation procedures: Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 93: Ethernet Connectivity Between File Modules

    These connections are used for internal management operations between the file modules. They make use of the Internal IP address range that you provided when initializing the Storwize V7000 Unified system. About this task This procedure is used to troubleshoot Ethernet connectivity between the file modules.
  • Page 94: File Module Ethernet Connections

    If you are looking at a problem regarding built-in Ethernet port 3, built-in Ethernet port 4, or any network connections to PCI slot 4, refer to “Host to file modules connectivity” on page 63. Isolation procedures: Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 95 It is always possible that somebody at your site could set up another machine to use one or more IP addresses that your Storwize V7000 Unified system is already using. Use the management GUI to check which four IP addresses the file modules are currently using to communicate with each other.
  • Page 96: Ethernet Connectivity From File Modules To The Control Enclosure

    Use the lsstoragesystem CLI command to show the IP address that the active management node, running on one of the file modules, uses to send ssh commands to the storage system CLI. For example:
  • Page 97 CLI command). Otherwise you may have plugged the USB flash drive into the wrong control enclosure (such as one that is not part of this Storwize V7000 Unified system). The node_status should be active for each node canister in the cluster under sainfo lsservicestatus. Otherwise follow the service action under sainfo lsservicerecommendation.
  • Page 98 CLI command. Here is an example: >ssh superuser@<system IP address> $ chsystemip -clusterip 9.20.136.5 -gw 9.20.136.1 -mask 255.255.255.0 -port 1 The default password for superuser is passw0rd. Update the file module's record of the control enclosure system IP: Storwize V7000 Unified: Problem Determination Guide 2073-720...
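Before issuing a command such as the chsystemip example above, it can help to confirm that the cluster IP address and the gateway fall in the same subnet for the given mask. The following Python sketch uses only the standard `ipaddress` module; the function name `same_subnet` is hypothetical, and the check is a convenience run on your workstation, not something the system CLI performs.

```python
import ipaddress


def same_subnet(cluster_ip: str, gateway: str, netmask: str) -> bool:
    """Return True if gateway lies in the subnet defined by cluster_ip and netmask."""
    # strict=False lets a host address (not only a network address) define the network.
    network = ipaddress.ip_network(f"{cluster_ip}/{netmask}", strict=False)
    return ipaddress.ip_address(gateway) in network


# Values taken from the chsystemip example in the text.
print(same_subnet("9.20.136.5", "9.20.136.1", "255.255.255.0"))
```

A False result suggests a typing error in one of the three values and is worth resolving before changing the system IP.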
  • Page 99 To find the file module's current record of the control enclosure system IP address, use the Storwize V7000 Unified management CLI to issue the lsstoragesystem command. Here is an example: >ssh admin@<management_IP> [kd01ghf.ibm]$ lsstoragesystem name primaryIP secondaryIP id StorwizeV7000 9.11.137.130 9.11.137.130 00000200A2601508 EFSSG1000I The command completed successfully.
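The lsstoragesystem output shown above can be turned into records for comparison against the control enclosure's actual system IP. This is a minimal Python sketch under stated assumptions: the output consists of one whitespace-separated header line followed by data rows, values contain no spaces, and status lines begin with `EFSSG` as in the example. The function name `parse_lsstoragesystem` is hypothetical.

```python
def parse_lsstoragesystem(output: str) -> list[dict[str, str]]:
    """Parse captured lsstoragesystem text into one dict per storage system.

    Assumes a header line, then whitespace-separated rows, then optional
    EFSSG status lines (as in the example output in the text).
    """
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    # Drop status messages such as "EFSSG1000I The command completed successfully."
    lines = [line for line in lines if not line.startswith("EFSSG")]
    if not lines:
        return []
    header = lines[0].split()
    return [dict(zip(header, line.split())) for line in lines[1:]]


sample = """name primaryIP secondaryIP id
StorwizeV7000 9.11.137.130 9.11.137.130 00000200A2601508
EFSSG1000I The command completed successfully.
"""
for row in parse_lsstoragesystem(sample):
    print(row["name"], row["primaryIP"])
```

Comparing `primaryIP` with the address set on the control enclosure shows whether the file module's record needs to be updated.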
  • Page 100: Fibre Channel Connectivity Between File Modules And Control Enclosure

    Both ports are used to connect to the Storwize V7000 control enclosure with a connection going to each control canister as shown in Figure 41 on page 73 or Figure 42 on page 74. Storwize V7000 Unified: Problem Determination Guide 2073-720...
  • Page 101: Connecting The File Modules To The Storwize V7000 Gen1 Control Enclosure Using Fibre Channel Cables

    Figure 41. Connecting the file modules to the Storwize V7000 Gen1 control enclosure using Fibre Channel cables A File module 1 B File module 2 C Storwize V7000 control enclosure 1 File module 1 - Fibre Channel port 1...
  • Page 102: Connecting The File Modules To A Storwize V7000 Gen2 Control Enclosure That Has A Fibre Channel Interface Adapter In Pci Slot 2 Of Each Node Canister

    7 Node canister 2 (right) - Fibre Channel port 1 8 Node canister 2 (right) - Fibre Channel port 2 Table 38 on page 75 describes the diagrams shown in Figure 41 on page 73 and Figure 42.
  • Page 103: How To Connect Fibre Channel Cables From File Modules To The Control Enclosure

    Table 38. How to connect Fibre Channel cables from file modules to the control enclosure. File module Control enclosure A File module 1 C Control enclosure 1 Fibre Channel slot 2, port 1 5 Node canister 1 Fibre Channel port 1 2 Fibre Channel slot 2, port 2 7 Node canister 2 Fibre Channel port 1 B File module 2...
  • Page 104: Fibre Channel Cabling From The File Module To The Control Enclosure

    Fibre Channel port but a broken connection at the Storwize V7000 node canister. The most likely cause is either a faulty Fibre Channel cable or a bad Fibre Channel port on the Storwize V7000 node canister.
  • Page 105: Understanding Led Hardware Indicators

    Table 41. LED states and associated actions. For the Fibre Channel adapters on the file module check the amber LED lights next to the port. (continued) LED State Definition and Action Rapid flashing amber LED This state indicates the Fibre Channel adapter is attempting to resync the Fibre Channel connection.
  • Page 106 LEDs on the light path diagnostics panel. This information and the information in Light path diagnostics LEDs can often provide enough information to diagnose the error.
  • Page 107: Enclosure)

    12v channel error LEDs indicate an overcurrent condition. Refer to the procedure “Solving power problems” in the “Troubleshooting the System x3650” in the IBM Storwize V7000 Unified Information Center to identify the components that are associated with each power channel, and the order in which to troubleshoot the components.
  • Page 108: Led Indicators, Corresponding Problem Causes, And Corrective Actions

    supplies are damaged. Use the IBM Power Configurator utility to determine current system power consumption. For more information and to download the utility, go to http://www-03.ibm.com/systems/bladecenter/...
  • Page 109 PCI riser cards v ServeRAID adapter v Optional network adapter v (Trained technician only) System board e. If the failure remains, go to http://www.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=SERV-CALL. 2. If the PCI LED and the CONFIG LED are lit, complete the following steps to correct the problem: a.
  • Page 110 E5-2690. If it is, check that fewer than eight 2.5-inch hard disk drives are installed. b. Check the system-error logs for information about the error. Replace any component that is identified in the error log. LINK Reserved.
  • Page 111 LED on the system board, are installed correctly. b. (Trained technician only) Replace the failing microprocessor. c. For more information, go to http://www.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=SERV-CALL. 2. If the CONFIG LED and the CPU LED are lit, the system issues an invalid microprocessor configuration error.
  • Page 112 5. Make sure that the heat sink, the fan on the adapter, or the optional network adapter is seated correctly. If the fan has failed, replace it. 6. If the failure remains, go to http://www.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=SERV-CALL. A fan that failed, is operating too 1.
  • Page 113 1) Replace the hard disk drive. 2) Replace the hard disk drive backplane. e. If the problem remains, go to http://www.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=SERV-CALL. 2. If the HDD LED and the CONFIG LED are lit, complete the following steps to correct the problem: a.
  • Page 114: Locations Of The Power-Supply Leds

    4. If the problem remains, replace the power supply. The power supply has failed. Replace the power supply. The power supply has failed. Replace the power supply. The power supply has failed. Replace the power supply.
  • Page 115: Enclosure Hardware Indicators

    The LEDs provide a general idea of the volume system status. For specifics about the status of control enclosures, expansion enclosures, node canisters, and expansion canisters, see Chapter 1, “Storwize V7000 Unified hardware components,” on page 1. Also refer to “Procedure: Understanding the system status using the LEDs”...
  • Page 116: When To Use The Management Gui

    GUI first to diagnose and resolve the problem. Use the views that are available in the management GUI to verify the status of the system, the hardware devices, the physical storage, and the available volumes. The...
  • Page 117: Accessing The Storwize V7000 Unified Management Gui

    The fix procedures automatically perform configuration changes that are required to return the system to its optimum state. Accessing the Storwize V7000 Unified management GUI This procedure describes how to access the Storwize V7000 Unified management GUI. About this task You must use a supported web browser.
  • Page 118: Diagnosing And Resolving Problems With Fix Procedures

    You can use fix procedures to diagnose and resolve problems with the Storwize V7000 Unified. About this task For example, to repair a Storwize V7000 Unified system, you might complete the following tasks: v Analyze the event log (if it is available, or view node errors)
  • Page 119: Chapter 4. File Module

    Removing a file module to perform a maintenance action You can remove an IBM Storwize V7000 Unified file module to perform maintenance. The procedure that you follow differs slightly, depending on whether you must unplug the power cables.
  • Page 120 Removing a file module and disconnecting power You must remove an IBM Storwize V7000 file module from the file cluster and disconnect it from its power line cords before performing a maintenance action that requires the file module to have no power.
  • Page 121 To remove the mgmt001st001 file module from the system, for example, issue the following command: # suspendnode mgmt001st001 3. Wait for the Storwize V7000 Unified system to stop the file module at the clustered trivial database (CTDB) level. The command does not unmount any mounted file systems.
  • Page 122: Removing And Replacing File Module Components

    About this task Installation guidelines To help you work safely with IBM Storwize V7000 Unified file modules, read the safety information in Safety information statements and these guidelines. Before you remove or replace a component, read the following information: v When you install a file module, take the opportunity to download and apply the most recent firmware updates.
  • Page 123 – To avoid straining the muscles in your back, lift by standing or by pushing up with your leg muscles. v Make sure that you have an adequate number of properly grounded electrical outlets for the PDUs. v Back up all important data before you make changes to disk drives. v Have a small flat-blade screwdriver available.
  • Page 124: Resolving Hard Disk Drive Problems

    When returning a device or component, follow all packaging instructions and use any supplied packaging materials for shipping. Resolving hard disk drive problems Use this information to address various hard disk drive issues.
  • Page 125 About this task v Before running a procedure, refer to “Removing a file module to perform a maintenance action” on page 91. v Follow the suggested actions for a Symptom in the order in which they are listed in the Action column until the problem is solved.
  • Page 126 Turn on the server and observe the activity of the hard disk drive LEDs. Displaying node mirror and hard drive status The Storwize V7000 Unified system provides a method to check the node mirror status and hard drive status for each file module.
  • Page 127: Selecting A File Module To Display Node Status

    File modules in this Storwize V7000 Unified Cluster Node Node Name Node Details -------------------------------------------------------------------------------- 1. mgmt001st001 x3650m3 KQ186WX 2. mgmt002st001 x3650m3 KQ186WV B. Back to Menus Choice: Figure 44. Selecting a file module to display node status 3. Select the number for a file module to display its status. For example, type 1 to select mgmt001st001.
  • Page 128: Displaying Node Status

    The volume is Active. The user data is not fully protected due to a configuration change or drive failure. Rebuilding (RBLD) or Resyncing (RSY) A data resynchronization or rebuild might be in progress.
  • Page 129: State Of Drives

    Table 44. Status of volume (continued) Status of volume Description Inactive, Okay (OKY) The volume is inactive and the drives are functioning correctly. The user data is protected if the current RAID level is RAID 1 (IM) or RAID 1E (IME). Inactive, Degraded (DGD) The volume is inactive and the user data is not fully protected due...
  • Page 130: Example That Shows That Mirroring Is Re-Synchronizing

    SMART ASCQ : none Figure 46. Example that shows that mirroring is re-synchronizing If a drive were not synchronized, the status might appear like the status shown in Figure 47 on page 103:
  • Page 131: Example That Shows That A Drive Is Not Synchronized

    The mirror is not created/configured. If the mirror is not created, refer to “Troubleshooting the System x3650” in the IBM Storwize V7000 Unified Information Center for information on launching the LSI configuration tool. Chapter 4. File module...
  • Page 132: Example That Shows That The Mirror Is Not Created

    ASC/ASCQ error of 05/00. For isolation and repair of hard disk problems, refer to “Troubleshooting the System x3650” in the IBM Storwize V7000 Unified Information Center. For a list of SMART (ASC/ASCQ) error codes and their descriptions, go to “SMART ASC/ASCQ error codes and messages”...
  • Page 133: Example Of A Smart Error

    Device is a Hard disk Enclosure # Slot # Connector ID Target ID State : Online (ONL) Size (in MB)/(in sectors) : 286102/585937500 Manufacturer : IBM-ESXS Model Number : MBD2300RC Firmware Revision : SB19 Serial No : D009P9A01SJC Drive Type : SAS Protocol...
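The two numbers in the Size (in MB)/(in sectors) field of the drive listing above can be cross-checked against each other. The following worked example is a sketch that assumes 512-byte sectors and that MB here means 2^20 bytes, truncated to a whole number. Neither assumption is stated in this guide.

```python
SECTOR_BYTES = 512  # assumption: the drive reports 512-byte sectors
MIB = 1024 * 1024   # assumption: "MB" in the listing means 2**20 bytes


def sectors_to_mb(sectors: int, sector_bytes: int = SECTOR_BYTES) -> int:
    """Convert a sector count to whole megabytes (truncated)."""
    return (sectors * sector_bytes) // MIB


# 585937500 sectors * 512 bytes = 300,000,000,000 bytes (a 300 GB drive).
print(sectors_to_mb(585937500))  # 286102, matching the listing
```

Under these assumptions the two values agree, which is one quick way to confirm that the reported drive size is consistent.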
  • Page 134 LOGICAL UNIT NOT READY, START STOP UNIT COMMAND IN PROGRESS LOGICAL UNIT DOES NOT RESPOND TO SELECTION NO REFERENCE POSITION FOUND MULTIPLE PERIPHERAL DEVICES SELECTED LOGICAL UNIT COMMUNICATION FAILURE LOGICAL UNIT COMMUNICATION TIME-OUT LOGICAL UNIT COMMUNICATION PARITY ERROR
  • Page 135 Table 46. SMART ASC/ASCQ error codes and messages (continued) ASCQ Description LOGICAL UNIT COMMUNICATION CRC ERROR (ULTRA-DMA/32) UNREACHABLE COPY TARGET TRACK FOLLOWING ERROR HEAD SELECT FAULT ERROR LOG OVERFLOW WARNING WARNING - SPECIFIED TEMPERATURE EXCEEDED WARNING - ENCLOSURE DEGRADED WARNING - BACKGROUND SELF-TEST FAILED WARNING - BACKGROUND PRE-SCAN DETECTED MEDIUM ERROR WARNING - BACKGROUND MEDIUM SCAN DETECTED MEDIUM...
  • Page 136 RECOVERED DATA WITHOUT ECC - RECOMMEND REWRITE RECOVERED DATA WITHOUT ECC - DATA REWRITTEN RECOVERED DATA WITH ERROR CORRECTION APPLIED RECOVERED DATA WITH ERROR CORR. & RETRIES APPLIED RECOVERED DATA - DATA AUTO-REALLOCATED RECOVERED DATA - RECOMMEND REASSIGNMENT
  • Page 137 Table 46. SMART ASC/ASCQ error codes and messages (continued) ASCQ Description RECOVERED DATA - RECOMMEND REWRITE RECOVERED DATA WITH ECC - DATA REWRITTEN DEFECT LIST ERROR DEFECT LIST NOT AVAILABLE DEFECT LIST ERROR IN PRIMARY LIST DEFECT LIST ERROR IN GROWN LIST PARAMETER LIST LENGTH ERROR SYNCHRONOUS DATA TRANSFER ERROR DEFECT LIST NOT FOUND...
  • Page 138 TIMESTAMP CHANGED SA CREATION CAPABILITIES DATA HAS CHANGED COPY CANNOT EXECUTE SINCE HOST CANNOT DISCONNECT COMMAND SEQUENCE ERROR ILLEGAL POWER CONDITION REQUEST PREVIOUS BUSY STATUS PREVIOUS TASK SET FULL STATUS PREVIOUS RESERVATION CONFLICT STATUS
  • Page 139 Table 46. SMART ASC/ASCQ error codes and messages (continued) ASCQ Description ORWRITE GENERATION DOES NOT MATCH COMMANDS CLEARED BY ANOTHER INITIATOR COMMANDS CLEARED BY POWER LOSS NOTIFICATION COMMANDS CLEARED BY DEVICE SERVER INCOMPATIBLE MEDIUM INSTALLED CANNOT READ MEDIUM - UNKNOWN FORMAT CANNOT READ MEDIUM - INCOMPATIBLE FORMAT CLEANING CARTRIDGE INSTALLED CANNOT WRITE MEDIUM - UNKNOWN FORMAT...
  • Page 140 ATA DEVICE FAILED SET FEATURES SELECT OR RESELECT FAILURE UNSUCCESSFUL SOFT RESET SCSI PARITY ERROR DATA PHASE CRC ERROR DETECTED SCSI PARITY ERROR DETECTED DURING ST DATA PHASE INFORMATION UNIT IUCRC ERROR DETECTED
  • Page 141 Table 46. SMART ASC/ASCQ error codes and messages (continued) ASCQ Description ASYNCHRONOUS INFORMATION PROTECTION ERROR DETECTED PROTOCOL SERVICE CRC ERROR PHY TEST FUNCTION IN PROGRESS SOME COMMANDS CLEARED BY ISCSI PROTOCOL EVENT INITIATOR DETECTED ERROR MESSAGE RECEIVED INVALID MESSAGE ERROR COMMAND PHASE ERROR DATA PHASE ERROR INVALID TARGET PORT TRANSFER TAG RECEIVED...
  • Page 142 DATA CHANNEL IMPENDING FAILURE GENERAL HARD DRIVE FAILURE DATA CHANNEL IMPENDING FAILURE DRIVE ERROR RATE TOO HIGH DATA CHANNEL IMPENDING FAILURE DATA ERROR RATE TOO HIGH DATA CHANNEL IMPENDING FAILURE SEEK ERROR RATE TOO HIGH
  • Page 143 Table 46. SMART ASC/ASCQ error codes and messages (continued) ASCQ Description DATA CHANNEL IMPENDING FAILURE TOO MANY BLOCK REASSIGNS DATA CHANNEL IMPENDING FAILURE ACCESS TIMES TOO HIGH DATA CHANNEL IMPENDING FAILURE START UNIT TIMES TOO HIGH DATA CHANNEL IMPENDING FAILURE CHANNEL PARAMETRICS DATA CHANNEL IMPENDING FAILURE CONTROLLER DETECTED DATA CHANNEL IMPENDING FAILURE THROUGHPUT PERFORMANCE...
  • Page 144 UNABLE TO DECRYPT PARAMETER LIST SA CREATION PARAMETER VALUE INVALID SA CREATION PARAMETER VALUE REJECTED INVALID SA USAGE SA CREATION PARAMETER NOT SUPPORTED AUTHENTICATION FAILED LOGICAL UNIT ACCESS NOT AUTHORIZED SECURITY CONFLICT IN TRANSLATED DEVICE
  • Page 145: Monitoring Memory Usage On A File Module

    Understanding error codes The Storwize V7000 Unified error codes convey specific information in an alphanumeric sequence. Tip: Search for error codes or event IDs by adding the prefix EFS. For 66012FC, for example, search on EFS66012FC.
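The search tip above can be expressed as a small helper that builds the search term from a code read off the system. This is an illustrative sketch only, and the function name `efs_search_term` is hypothetical.

```python
def efs_search_term(code: str) -> str:
    """Return the code with the EFS prefix used when searching the documentation."""
    code = code.strip().upper()
    # Leave the code unchanged if the prefix is already present.
    return code if code.startswith("EFS") else "EFS" + code


print(efs_search_term("66012FC"))  # EFS66012FC
```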
  • Page 146: Originating File Module And File Module

    Optional Ethernet port 7 (Dual Port 10G card) Fibre channel adapter 1 (both ports) – Storage node only Fibre channel adapter 2 (both ports) – Storage node only Bonded device (data0 mgmt0) System x internal hard disk drives
  • Page 147: Originating File Module Specific Software Code

    Table 50. Originating file module specific software code – Code 1, 3, 5. Listing devices for variable C in the specific software code sequence of ABBCDDDD. C = Originating specific software code in sequence ABBCDDDD Code Device Red Hat Linux GPFS CIFS server CTDB...
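The ABBCDDDD sequence named in Table 50 can be split into its positional fields before each field is looked up in the corresponding table. The sketch below shows only the positional split. This excerpt states the meaning of field C alone (the originating specific software code), so the other fields are left unlabeled. The input value is a made-up placeholder, not a real error code.

```python
def split_error_code(code: str) -> dict[str, str]:
    """Split an 8-character code into the A, BB, C, and DDDD positions."""
    if len(code) != 8:
        raise ValueError("expected an 8-character code in the form ABBCDDDD")
    return {
        "A": code[0],
        "BB": code[1:3],
        "C": code[3],      # originating specific software code (Table 50)
        "DDDD": code[4:],
    }


# "12345678" is a placeholder used only to show the positions.
print(split_error_code("12345678"))
```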
  • Page 148: Understanding Event Ids

    Unique error code Severity of the error Understanding event IDs The Storwize V7000 Unified messages follow a specific format, which is detailed here. About this task Tip: Search for error codes or event IDs by adding the prefix EFS. For 66012FC, for example, search on EFS66012FC.
  • Page 149: File Module Hardware Problems

    I for Asynchronous Replication J for SCM L for HSM AK for NDMP v The element nnnn is a 4 digit message number v The element x indicates the severity of the error. The value x can be: A for Action: GUI error messages. The user must perform a specific action. C for Critical: A critical error occurred which must be corrected by the user or system administrator.
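The message format described above (a component code, a 4-digit message number nnnn, and a severity letter x) can be illustrated with a message such as EFSSG1000I, which appears in the lsstoragesystem example in this guide. The following is a sketch under assumptions: the EFS prefix and the all-letter component code are inferred from that example, and only the severity letters A and C are mapped because they are the only ones listed in this excerpt.

```python
import re

# Assumed layout: EFS + component letters + 4-digit number + severity letter.
EVENT_ID = re.compile(r"^EFS(?P<component>[A-Z]+)(?P<number>\d{4})(?P<severity>[A-Z])$")

# Only the severities listed in this excerpt; the full guide lists more.
SEVERITIES = {"A": "Action", "C": "Critical"}


def parse_event_id(event_id: str):
    """Split an event ID into component, message number, and severity."""
    match = EVENT_ID.match(event_id)
    if match is None:
        return None  # not in the assumed event ID layout
    severity = match.group("severity")
    return {
        "component": match.group("component"),
        "number": int(match.group("number")),
        "severity": severity,
        "severity_name": SEVERITIES.get(severity, "not listed in this excerpt"),
    }


print(parse_event_id("EFSSG1000I"))
```

An identifier that does not fit this layout, such as the error code EFS66012FC, returns None here and is handled by the error code tables instead.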
  • Page 150: Removing And Replacing Parts For The 2073-720

    162 “Installing the operator information panel assembly” on page 163 “Removing the hot-swap drive backplane” on page “Installing the hot-swap drive backplane” on page “Removing the 240 VA safety cover” on page 127
  • Page 151: Removing The Cover

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 152: Installing The Cover

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 153 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 154: Removing The Bezel

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 155: Installing The Bezel

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 156 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 157 Screw Safety cover 1. Line up and insert the tabs on the bottom of the safety cover into the slots on the system board. 2. Slide the safety cover toward the back of the file module until it is secure. 3.
  • Page 158: Configuration Cable

    2. To disconnect the SAS signal cables, make sure that you first disconnect the power cable, and then the signal cable and configuration cable.
  • Page 159: Hard Disk Drive Cable Connection

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 160 Statement 2 CAUTION: When you are replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery that is recommended by the manufacturer. If your system has a module that contains a lithium battery, replace it only with the same module type made by the same manufacturer.
  • Page 161: Removing The Battery

    In the United States, IBM has established a return process for reuse, recycling, or proper disposal of used IBM sealed lead acid, nickel cadmium, nickel metal hydride, and other battery packs from IBM Equipment. For information on proper disposal of these batteries, contact IBM at 1-800-426-4333.
  • Page 162 For proper collection and treatment, contact your local IBM representative.
  • Page 163 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 164: Installing The Battery

    For more information, see the IBM Environmental Notices and User's Guide on the IBM Documentation CD. To install the replacement battery, complete the following steps: Procedure 1. Follow any special handling and installation instructions that come with the replacement battery.
  • Page 165: Removing The Air Baffle

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 166: Installing The Air Baffle

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 167 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 168: Removing The Fan Bracket

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 169: Installing The Fan Bracket

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 170 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 171: Installing A Pci Riser-Card Assembly

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 172: Removing A Pci Adapter From A Pci Riser-Card Assembly

    5. Carefully grasp the adapter by its top edge or upper corners, and pull the adapter from the PCI expansion slot. 6. If you are instructed to return the adapter, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you.
  • Page 173: Inserting The Adapter Into The Pci Connector

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 174 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 175 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 176 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 177: Removing A Hot-Swap Hard Disk Drive

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 178: Installing A Hot-Swap Hard Disk Drive

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 179 About this task To remove the DVD drive, complete the following steps. Release tab Procedure 1. Read the Safety information and “Installation guidelines” on page 94. Follow the procedure in “Removing a file module and disconnecting power” on page 92 to suspend the file module from the cluster and shut it down, and then disconnect all power cords and external cables.
  • Page 180 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 181 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 182: Locations Of The Dimm Connectors On The System Board

    Figure 68. Locations of the DIMM connectors on the system board To install a DIMM, complete the following procedure. See Table 54 on page 155 for a listing of the eight DIMM slots populated with the memory RDIMM.
  • Page 183: Dimm Slots Populated With The Memory

    Table 54. DIMM slots populated with the memory RDIMM Processor Memory Channel DIMM Slot Number 1 - 16GB RDIMM 2 - 2GB RDIMM 4 - 16GB RDIMM 5 - 2GB RDIMM 12 - 16GB RDIMM 11 - 2GB RDIMM 9 - 16GB RDIMM 8 - 2GB RDIMM 13 - 16GB RDIMM 14 - 2GB RDIMM...
  • Page 184 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 185 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 186 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 187: Removing A Hot-Swap Ac Power Supply

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 188 Hazardous voltage, current, and energy levels are present inside any component that has this label attached. There are no serviceable parts inside these components. If you suspect a problem with one of these parts, contact a service technician.
  • Page 189 Attention: During normal operation, each power-supply bay must have either a power supply or power-supply filler installed for proper cooling. To install a hot-swap ac power supply, complete the following steps: Procedure 1. Read the Safety information and “Installation guidelines” on page 94. 2.
  • Page 190 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 191 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 192 The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 193: Removing The Hot-Swap Drive Backplane

    The following procedure is for a Tier 1 customer replaceable unit (CRU). Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. Service agreements can be purchased so that you can ask IBM to replace these units.
  • Page 194: Installing The Hot-Swap Drive Backplane

    Removing a microprocessor and heat sink IBM authorized service providers can remove and replace a microprocessor and heat sink in the file module. The following procedure is for a field replaceable unit (FRU).
  • Page 195 About this task Attention: v Always use the microprocessor installation tool to remove a microprocessor. Failing to use the microprocessor installation tool may damage the microprocessor sockets on the system board. Any damage to the microprocessor sockets may require replacing the system board. v Microprocessors are to be removed only by trained service technicians.
  • Page 196 Note: If you are replacing a microprocessor, use the empty installation tool that comes with the CRU to remove the microprocessor. a. Twist the handle on the microprocessor tool counterclockwise so that it is in the open position. Handle Installation tool
  • Page 197 b. Align the installation tool with the alignment pins on the microprocessor socket and lower the tool on the microprocessor. The installation tool rests flush on the socket only if aligned correctly. Installation tool Alignment Microprocessor pins c. Twist the handle on the installation tool clockwise. Handle Installation tool...
  • Page 198 The air baffle must be installed to provide proper system cooling.
    - If you have to replace the microprocessor, call IBM Remote Technical Support for service.
    - If the thermal-grease protective cover (for example, a plastic cap or tape liner) is removed from the heat sink, do not touch the thermal grease on the bottom of the heat sink or set down the heat sink.
  • Page 199 6. Open the microprocessor socket release levers and retainer: a. Identify which release lever is labeled as the first release lever to open and open it. b.
  • Page 200 Twist the handle on the microprocessor tool counterclockwise to insert the microprocessor into the socket. The microprocessor is keyed to ensure that the microprocessor is installed correctly. The microprocessor rests flush on the socket only if properly installed.
  • Page 201 Attention:
    - Do not press the microprocessor into the socket.
    - Make sure that the microprocessor is oriented and aligned correctly in the socket before you try to close the microprocessor retainer.
    - Do not touch the thermal material on the bottom of the heat sink or on top of the microprocessor.
  • Page 202 Removing and replacing the thermal grease IBM authorized service providers must replace the thermal grease when the heat sink has been removed from the top of a microprocessor in the file module and the
  • Page 203 heat sink is going to be reused or when debris is found in the grease. The following procedure is for a field replaceable unit (FRU). FRUs must be installed only by trained service technicians. About this task The thermal grease must be replaced whenever the heat sink has been removed from the top of the microprocessor and is going to be reused or when debris is found in the grease.
  • Page 204 Removing a heat-sink retention module IBM authorized service providers can remove and replace a heat-sink retention module in the file module. The following procedure is for a field replaceable unit (FRU). FRUs must be installed only by trained service technicians.
  • Page 205 Removing the system board IBM authorized service providers can remove and replace the system board in the file module. The following procedure is for a field replaceable unit (FRU). FRUs must be installed only by trained service technicians.
  • Page 206 (see “Removing a microprocessor and heat sink” on page 166). 12. Pull out and lift up the pin and the thumbscrews on each side of the system board.
  • Page 207 Installing the system board IBM authorized service providers can remove and replace the system board in the file module. The following procedure is for a field replaceable unit (FRU). FRUs must be installed only by trained service technicians.
  • Page 208 1. Align the system board at an angle, as shown in the illustration; then, rotate and lower it flat and slide it back toward the rear of the file module. Make sure that the rear connectors extend through the rear of the chassis.
  • Page 209 MT-M SN label on the front of the file module. About this task The ASU package is part of the Storwize V7000 Unified code. ASU is available to authorized service personnel from the command-line interface (CLI) on the file module.
  • Page 210: How To Reset/Reboot Server IMM Interface

    IBM Advanced Settings Utility version 3.62.71B Licensed Materials - Property of IBM (C) Copyright IBM Corp. 2007-2010 All Rights Reserved Try to connect to the primary node to get nodes number. Connected via IPMI device driver (KCS interface) Connected to primary node.
  • Page 211: Management Node Role Failover Procedures

    Table 55. Default logical devices and physical port locations for a 2073-720 file module
    Logical Ethernet device name | Device description | Physical location information
    mgmtsl0_0 | Internal connection between the file modules | Port 1 - Built-In xSeries Ethernet Port
    mgmtsl0_1 | Internal connection between the file modules | Port 2 - Built-In xSeries Ethernet Port
    ethXsl0_0 | 1-Gbps Public Network | Port 3 - Built-In xSeries Ethernet Port...
  • Page 212: Hostname And Service IP Reference

    Note: If you run the startmgtsrv command from the node that is becoming active, you first need to run the setcluster command to set the cluster
  • Page 213 If you see the following error message when running the command, wait until the initialization has completed before running setcluster again: IBM SONAS management service is starting up EFSSG0654I The Management Service is starting up. After you run the startmgtsrv command, the system displays information that is similar to the following example: [yourlogon@yourmachine.mgmt002st001 ~]# startmgtsrv...
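The wait-and-retry advice above can be sketched in Python. This is only an illustration under stated assumptions: `run_command` is a hypothetical callable that runs `setcluster` by whatever means you use and returns its text output, and the attempt count and delay are arbitrary. Only the message ID `EFSSG0654I` comes from the text above.

```python
import time

# Message ID quoted in the text above for "The Management Service is starting up."
STARTING_UP_CODE = "EFSSG0654I"


def run_when_ready(run_command, attempts=10, delay_s=30.0, sleep=time.sleep):
    """Call run_command() until its output no longer reports that the
    management service is starting up, then return that output.

    run_command is a hypothetical callable supplied by the caller; it is
    expected to run setcluster and return the command output as a string.
    """
    for attempt in range(attempts):
        output = run_command()
        if STARTING_UP_CODE not in output:
            return output
        if attempt < attempts - 1:
            sleep(delay_s)  # wait for initialization before trying again
    raise TimeoutError("management service is still starting up")


# Demonstration with canned outputs instead of a real command.
canned = iter([
    "EFSSG0654I The Management Service is starting up.",
    "command completed",
])
print(run_when_ready(lambda: next(canned), sleep=lambda seconds: None))
```

Passing `sleep` as a parameter keeps the helper easy to exercise without real delays.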
  • Page 214 9. Using the GUI event log, follow the troubleshooting documentation against the file module with the failed management node role to isolate the software or hardware problem that might have caused this issue.
  • Page 215: Checking CTDB Health

    (CTDB) on each file module. About this task CTDB checks the health status of the Storwize V7000 Unified file modules, scanning elements such as storage access, General Parallel File System (GPFS), networking, Common Internet File System (CIFS) shares, and Network File System (NFS) exports.
  • Page 216: Management GUI Showing CTDB Status For

    “Checking the GPFS file system mount on each file module” on page 189.
    - Refer to the information in the "Troubleshooting the System x3650 server" topic in the IBM Storwize V7000 Unified Information Center to determine if any additional hardware problems might be causing the “unhealthy” CTDB status.
  • Page 217: Checking The GPFS File System Mount On Each File Module

    System (GPFS) file system mounts on IBM Storwize V7000 Unified file modules. About this task A GPFS file system that is not mounted on a Storwize V7000 Unified file module can cause the clustered trivial database (CTDB) status to be 'UNHEALTHY'. The...
  • Page 218: Resolving Problems With Missing Mounted File Systems

    To identify and resolve problems in file system mounts, perform this procedure: 1. To identify all the currently created file systems on the Storwize V7000 Unified system, log in as the admin user, then enter the lsfs -r command from the...
  • Page 219: Resolving Stale NFS File Systems

    If file systems remain unmounted, contact IBM support. Resolving stale NFS file systems You can resolve problems with stale NFS file systems on Storwize V7000 Unified file modules. A file module might have the file system mounted, but the file system remains inaccessible due to a stale NFS file handle.
  • Page 220: Resolving The "Missing SRV Record In DNS" Error

    Refer to these topics in the IBM Storwize V7000 Unified Information Center “Planning for user authentication”, “Verifying the authentication configuration”, “Establishing user and group mapping for client access”, and “chkauth”. If you cannot resolve the issue, contact the authentication server administrator to validate or reestablish your account.
  • Page 221: Trouble Accessing Exports When Authentication Server And Client Storwize V7000 Unified Configurations Are Correct

    V7000 Unified configurations are correct About this task If you cannot access an export and the server and Storwize V7000 Unified configurations are correct, it could be for one of the following reasons:
    - If Storwize V7000 Unified authentication is configured against an LDAP server, the user entries are case-sensitive when you access exports.
  • Page 222: Checking Client Access

    Use the cfgidmap command to import the ID map XML file. The XML file must be in the /ftdc/files folder. 5. Try to access the data on the subordinate Storwize V7000 Unified system after the import operation is successfully completed.
  • Page 223: Checking Network Interface Availability

    DNS server for Storwize V7000 Unified. Ideally, these IP addresses should be the same as the addresses that are configured on the Storwize V7000 Unified cluster itself. To check this, issue the lsnw CLI command.
  • Page 224: Recovering A GPFS File System

    3. Issue the chkfs file_system_name -v | tee /ftdc/chkfs_fs_name.log1 command to capture the output to a file. Review the output file for errors and save it for IBM support to investigate any problems. If the file contains a TSM ERROR message, perform the following steps: a.
  • Page 225: Resolving An ANS1267E Error

    Issue the chkfs file_system_name command again. Review the new output file for errors and save it for IBM support to investigate any problems. It is expected that the file contains Lost blocks were found messages. It is normal to have some missing file system blocks. If the only errors that are reported are missing blocks, no further repair is needed.
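Reviewing the saved chkfs output for the two message types named in this procedure (TSM ERROR and Lost blocks were found) can be sketched as a simple text scan. This is a minimal illustration, not an IBM tool: the function name and the sample log text are hypothetical, and real chkfs output will look different. It only flags the presence of the two strings and does not judge whether further repair is needed.

```python
def summarize_chkfs_log(text: str) -> dict:
    """Report whether the saved chkfs output contains the two message
    types that the procedure asks you to look for."""
    lines = text.splitlines()
    return {
        "tsm_error": any("TSM ERROR" in line for line in lines),
        "lost_blocks": any("Lost blocks were found" in line for line in lines),
    }


# Hypothetical excerpt for illustration only; not real chkfs output.
sample = "checking file system\nLost blocks were found\n"
print(summarize_chkfs_log(sample))
```

To use it on a saved file, read the file into a string first, for example with `open(path).read()`, and keep the original file for IBM support as the procedure describes.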
  • Page 226: Resolving Network Errors

    The issue should be resolved after the reboot and within five minutes after the file module displays Host State OK again. Error for “The mount state of the file system /ibm/Filesystem_Name changed to error level” About this task If the command lshealth -i gpfs_fs -r returns “The mount state of the file...
  • Page 227 either different or missing, when providing the VLAN ID to a management network bond, as well as to a shared data network bond. Unless you have intentionally configured your switching network to support this unique case, where the VLAN ID for the management network and data network are not the same and you are confident about how this will be routed from the clustered system to your switch, you might incur unpredictable routing path behavior and even network connectivity loss.
  • Page 228: Resolving Full Condition For GPFS File System

    If there is no storage space available, contact IBM support. Analyzing GPFS logs Use this procedure when reviewing GPFS log entries. About this task Note: Contact IBM support if you want to analyze GPFS log entries.
  • Page 229: Synchronizing Time On The File Modules

    Kerberos tickets, for example, can expire and then no one can access the cluster. For the Storwize V7000 Unified file module, the ntpq -p command shows which server is used for synchronization, any peers, and a set of data about their status.
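In standard `ntpq -p` output, the peer currently selected for synchronization is marked with a leading `*`. The following Python sketch picks that peer out of captured output. The function name is hypothetical, and the sample text uses made-up host names and values purely for illustration.

```python
def selected_sync_source(ntpq_output: str):
    """Return the remote marked with '*' in ntpq -p output (the peer
    currently selected for synchronization), or None if there is none."""
    for line in ntpq_output.splitlines():
        if line.startswith("*"):
            return line[1:].split()[0]
    return None


# Illustrative sample only; host names and numbers are invented.
SAMPLE = """\
     remote           refid      st t when poll reach   delay   offset  jitter
==============================================================================
*ntp1.example.com .GPS.            1 u   33   64  377    0.512    0.021   0.010
+ntp2.example.com 192.0.2.10       2 u   12   64  377    0.734   -0.105   0.032
"""
print(selected_sync_source(SAMPLE))
```

If the function returns None, no peer is currently selected, which is worth investigating before time drift affects authentication.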
  • Page 231: Chapter 5. Control Enclosure

    You cannot manage a system by using the 10 Gbps Ethernet ports. You can perform almost all of the configuration, troubleshooting, recovery, and maintenance of the storage system from within the Storwize V7000 Unified management GUI or the CLI commands that are running on the Storwize V7000 file modules.
  • Page 232 Use the service assistant in the following situations:
    - When you cannot access the system from the management GUI and you cannot access the Storwize V7000 Unified to run the recommended actions.
    - When the recommended action directs you to use the service assistant.
  • Page 233: Storage System Command-Line Interface

    For a full description of the storage system commands and how to start an SSH command-line session, see the “Command-line interface” topic in the “Reference” section of the Storwize V7000 Unified Information Center. When to use the storage system CLI The storage system CLI is intended for use by advanced users who are confident at using a command-line interface.
  • Page 234: Service Command-Line Interface

    Accessing the storage system CLI Follow the steps that are described in the “Command-line interface” topic in the “Reference” section of the Storwize V7000 Unified Information Center to initialize and use a CLI session. Service command-line interface Use the service command-line interface (CLI) to manage a node canister in a control enclosure by using the task commands and information commands.
  • Page 235
    - When you cannot connect to a node canister in a control enclosure using the service assistant and you want to see the status of the node.
    - When you do not know, or cannot use, the service IP address for the node canister in the control enclosure and must set the address.
  • Page 236 USB flash drive and note the IP address after the -gw switch. Make sure this IP address is the gateway for this subnet. If an IP address is needed then check this with your 1 Gbps Ethernet administrator.
  • Page 237 You should be able to access the management GUI or CLI from a computer, which is on a different subnet or different Ethernet switch to the Storwize V7000 Unified system. The link to the management GUI from the InitTool.exe panel should now work.
  • Page 238 255.255.255.0. If the command is run on the lower canister, the default value is 192.168.70.122 subnet mask: 255.255.255.0. If the node canister is active in a system, the superuser password for the system is reset; otherwise, the superuser password is reset on the node canister.
  • Page 239 Use this command when you are unable to log on to the system because you have forgotten the superuser password and you want to reset it. Attention: Run this command only when instructed by IBM support. Running this command directly on a Storwize V7000 can affect your I/O operations on the file modules.
  • Page 240 Install software command: Use this command to install a specific update package on the node canister. Attention: Run this command only when instructed by IBM support. Running this command directly on a Storwize V7000 can affect your I/O operations on the file modules.
  • Page 241 Note: The reference to cluster is not the same as the file system cluster on the Storwize V7000 file modules. Attention: Run this command only when instructed by IBM support. Running this command directly on a Storwize V7000 can affect your I/O operations on the file modules.
  • Page 242: Modules

    -mask The IPv4 subnet for Ethernet port 1 on the system. -consolip The management IPv4 address of the Storwize V7000 Unified system. Description This command is only supported in the satask.txt file on a USB flash drive. It calls the svctask chsystemip command if the USB flash drive is inserted in the configuration node canister; otherwise, it blinks the amber identify LED of the node canister that is the configuration node.
  • Page 243: Starting Statistics Collection

    Query status command: Use this command to determine the current service state of the node canister. Syntax ►► sainfo getstatus ►◄ Parameters None. Description This command writes the output from each node canister to the USB flash drive. This command calls the sainfo lsservicenodes command, the sainfo lsservicestatus command, and the sainfo lsservicerecommendation command.
  • Page 244: Statistic Collection For Volumes For Individual Nodes

    The elapsed time is added to the cumulative counter. Indicates the worst read response time in microseconds for each volume since the last time statistics were collected. This value is reset to zero after each statistics collection sample.
  • Page 245: Statistic Collection For Volumes That Are Used

    Indicates the total number of fixed or unfixed overlapping writes. When all nodes in all clusters are running Storwize V7000 Unified version 4.3.1, this records the total number of write I/O requests received by the Global Mirror feature on the primary that have overlapped. When any nodes in either cluster are running Storwize V7000 Unified versions earlier than 4.3.1, this...
  • Page 246: Statistic Collection For Nodes

    Table 61 describes the node information that is reported for each node.
    Table 61. Statistic collection for nodes
    Statistic name | Description
    cluster_id | Indicates the name of the cluster.
    cluster | Indicates the name of the cluster.
  • Page 247: Cache Statistics Collection For Volumes And Volume Copies

    Table 61. Statistic collection for nodes (continued) busy - Indicates the total CPU average core busy milliseconds since the node was reset. This statistic reports the amount of the time the processor has spent polling while waiting for work versus actually doing work. This statistic accumulates from zero.
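Because the busy statistic accumulates from zero after a node reset, a utilization figure for one collection interval has to be derived from the difference between two samples. The following Python sketch shows that arithmetic. It assumes the interval length between the two samples is known in milliseconds and that the value is already averaged across cores, as the description suggests; the function name and sample numbers are hypothetical.

```python
def busy_fraction(prev_busy_ms: float, curr_busy_ms: float, interval_ms: float) -> float:
    """Estimate the fraction of one collection interval that the CPU was busy,
    given two samples of a cumulative busy-milliseconds counter."""
    if interval_ms <= 0:
        raise ValueError("interval_ms must be positive")
    if curr_busy_ms < prev_busy_ms:
        # The counter restarted from zero (node reset), so only the busy
        # time accumulated since the reset is known.
        delta_ms = curr_busy_ms
    else:
        delta_ms = curr_busy_ms - prev_busy_ms
    return min(delta_ms / interval_ms, 1.0)


# Invented sample values: 3000 ms busy during a 60000 ms interval.
print(busy_fraction(1000, 4000, 60000))
```

The reset branch is a deliberate simplification: after a node reset the true busy time for the interval cannot be recovered from the counter alone.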
  • Page 248 Average non-cumulative fullness Max non-cumulative fullness Min non-cumulative Destage Target dtav IOs capped Average 9999, non-cumulative Destage Target dtmx IOs, non-cumulative Destage Target dtmn IOs, non-cumulative
  • Page 249 Table 62. Cache statistics collection for volumes and volume copies (continued) Statistics for Statistics for Statistics for Statistics for Statistics for volume volume the Node Cache volume volume cache copy cache Overall statistics for Units and Statistic Acronym cache copy cache partition partition Cache...
  • Page 250
    Owner Remote Credit Queue Time | Average µs, non-cumulative
    Non-Owner Remote Credit Queue Time | Average µs, non-cumulative
    Admin Remote Credit Queue Time | Average µs, non-cumulative
    Cdcb Queue Time | Average µs, non-cumulative
  • Page 251: Statistic Collection For Volume Cache Per

    Table 62. Cache statistics collection for volumes and volume copies (continued) Statistics for Statistics for Statistics for Statistics for Statistics for volume volume the Node Cache volume volume cache copy cache Overall statistics for Units and Statistic Acronym cache copy cache partition partition Cache...
  • Page 252: XML Statistics For An IP Partnership Port

    Indicates the bytes retransmitted to other nodes in other clusters by the IP partnership driver. iprt Indicates the average round-trip time in microseconds for the IP partnership link since the last statistics collection period.
  • Page 253 Table 64. XML statistics for an IP Partnership port (continued) Statistic name Description iprx Indicates the bytes received from other nodes in other clusters by the IP partnership driver. ipsz Indicates the average size (in bytes) of data that are being transmitted by the IP partnership driver since the last statistics collection period.
  • Page 254: Event Reporting

    Event reporting process The following methods are used to notify you and the IBM Support Center of a new event:
    - If you enabled Simple Network Management Protocol (SNMP), an SNMP trap is sent to an SNMP manager that is configured by the customer.
  • Page 255: Description Of Data Fields For The Event Log

    Viewing the event log You can view the event log by using the management GUI or the command-line interface (CLI). About this task You can view the event log by using the Monitoring > Events options in the management GUI. The event log contains many entries. You can, however, select only the type of information that you need.
  • Page 256: Event Notifications

    Event notifications Storwize V7000 Unified can use Simple Network Management Protocol (SNMP) traps, syslog messages, emails and Call Homes to notify you and IBM Remote Technical Support when significant events are detected. Any combination of these notification methods can be used simultaneously. Notifications are normally sent immediately after an event is raised.
  • Page 257: Power-On Self-Test

    Table 66. Notification levels Notification level Description Error Error notification is sent to indicate a problem that must be corrected as soon as possible. This notification indicates a serious problem with the system. For example, the event that is being reported could indicate a loss of redundancy in the system, and it is possible that another failure could result in loss of access to data.
  • Page 258: Viewing Logs And Traces

    You can view information about collecting log files or you can view examples of a configuration dump, error log, or featurization log. To do this, click Reference in the left pane of the IBM online information, and then expand the Logs and traces section.
  • Page 259 If power to a node canister fails, the node canister uses battery power to write cache and state data to its boot drive. Note: Storwize V7000 Gen2 expansion canisters do not cache volume data or store state information in volatile memory. Therefore, expansion canisters do not require battery power.
  • Page 260: Battery Operation For Storwize V7000 Unified Gen1 Control Enclosures

    The batteries within the control enclosure provide the power to write the cache and state data to a local drive. Note: Storwize V7000 Unified expansion canisters do not cache volume data or store state information in volatile memory. Therefore, expansion canisters do not require battery power.
  • Page 261 Design parameters Consider the following important design parameters:
    - The design life of the battery in the Storwize V7000 Unified is five years of service after one year on the shelf.
    - No periodic learning mode or reconditioning cycle occurs in the battery of this product.
  • Page 262 2 critical saves or 10 brown outs. Preventing this maintenance cycle from occurring increases the risk that the system accumulates a sufficient number of power outages to cause the remaining battery to be discounted when calculating whether...
  • Page 263: Understanding The Medium Errors And Bad Blocks

    Understanding the medium errors and bad blocks A storage system returns a medium error response to a host when it is unable to successfully read a block. The Storwize V7000 Unified response to a host read follows this behavior. The volume virtualization that is provided extends the time when a medium error is returned to a host.
  • Page 264: Resolving A Problem

    The management GUI provides extensive facilities to help you troubleshoot and correct problems on your system. You can connect to and manage a Storwize V7000 Unified system as soon as you have completed the USB initialization.
  • Page 265: Problem: Management IP Address Unknown

    Page not found or similar error, this information might help you resolve the issue. The connection information differs, depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table:
  • Page 266: Storwize V7000 Unified Gen1 Model Numbers

    Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 71. Storwize V7000 Unified Gen2 model numbers...
  • Page 267
    - Ping the management address to see if the Ethernet network permits the connection. If the ping fails, check the Ethernet network configuration to see if there is a routing or a firewall issue. Ensure that the Ethernet network configuration is compatible with the gateway and subnet or prefix settings. Ensure that you did not use the Ethernet address of another device as the management address.
  • Page 268: Problem: Unable To Log On To The Management GUI
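One part of the compatibility check above, whether the gateway falls inside the subnet implied by the management address and its mask or prefix, can be sketched with Python's standard `ipaddress` module. The function name and the example addresses (taken from documentation-reserved ranges) are illustrative assumptions; this does not replace checking routing and firewall rules on the network itself.

```python
import ipaddress


def gateway_in_subnet(address: str, mask_or_prefix: str, gateway: str) -> bool:
    """Return True if the gateway lies in the subnet defined by the
    management address and its subnet mask (IPv4) or prefix length."""
    network = ipaddress.ip_network(f"{address}/{mask_or_prefix}", strict=False)
    return ipaddress.ip_address(gateway) in network


# Example values from documentation-reserved ranges, not real settings.
print(gateway_in_subnet("192.0.2.20", "255.255.255.0", "192.0.2.1"))
print(gateway_in_subnet("192.0.2.20", "24", "198.51.100.1"))
```

A False result suggests that the gateway, subnet mask, or management address was entered incorrectly.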

    Use this information if your attempt to create a clustered system has failed. This information varies depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 72. Storwize V7000 Unified Gen1 model numbers...
  • Page 269: Storwize V7000 Unified Gen2 Model Numbers

    Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 73. Storwize V7000 Unified Gen2 model numbers...
  • Page 270: Problem: Node Canister Service IP Address Unknown

    You can use several methods to determine the service address of a node canister. The methods of determining the service address differ, depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 74. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 271: Storwize V7000 Unified Gen2 Model Numbers

    Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 75. Storwize V7000 Unified Gen2 model numbers...
  • Page 272: Default Service IP Addresses

    Some types of errors can prevent nodes from communicating with each other; in that event, it might be necessary to point your browser directly at the service assistant of the node that requires administering, rather than change the current node in the service assistant.
  • Page 273: Problem: Cannot Connect To The Service Assistant

    If you are unable to find the service address of the node using the management GUI or service assistant, you can also use a USB flash drive to find it. For more information, see “Procedure: Getting node canister and system information using a USB flash drive”...
  • Page 274: Problem: Management GUI Or Service Assistant Does Not Display Correctly

    Use this procedure if you receive errors to determine if your SAS cabling is valid. The procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 78. Storwize V7000 Unified Gen1 model numbers Machine type/model...
  • Page 275: Storwize V7000 Unified Gen2 Model Numbers

    Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 79. Storwize V7000 Unified Gen2 model numbers...
  • Page 276: Problem: New Expansion Enclosure Not Detected

    Ensure that each SAS cable is fully inserted. See the topic about installing SAS cables in the IBM Storwize V7000 Gen2 Quick Installation Guide. Problem: Storwize V7000 Gen1 SAS cabling not valid This topic provides information to be aware of if you receive errors that indicate the SAS cabling is not valid.
  • Page 277: Problem: Control Enclosure Not Detected

    Problem: Control enclosure not detected If a control enclosure is not detected by the system, this procedure can help you resolve the problem. When installing a new control enclosure, use the Add Enclosures wizard in the management GUI. To access this wizard, select Monitoring > System. On the Systems page, select Actions >...
  • Page 278: Procedure: Resetting Superuser Password

    The password procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 80. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 279: Procedure: Identifying Which Enclosure Or Canister To Service

    Procedure: Resetting the superuser password for Storwize V7000 Gen2 The primary method for resetting the superuser password is to change the password as you log in, with the link on the log-in page. You can also access the service assistant from the technician port to change the password. If the password reset function is enabled, the log-in page displays a link for resetting the password.
  • Page 280: Storwize V7000 Unified Gen1 Model Numbers

    About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 82. Storwize V7000 Unified Gen1 model numbers Machine type/model Description 2076-112 Storwize V7000 Unified control enclosure for up to 12 3.5-inch (8.89 cm) drives 2076-124 Storwize V7000 Unified control enclosure for up to 24 2.5-inch (6.35...
  • Page 281: Storwize V7000 Unified Gen2 Model Numbers

    The model type and serial number of the enclosure are found at the bottom of the left bezel. Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 84. Storwize V7000 Unified Gen2 model numbers...
  • Page 282: Procedure: Checking The Status Of Your System

    The Home page shows a table of node errors that exist on the node canister and a table of node details for the current node. The node errors are shown in priority order.
  • Page 283: Procedure: Getting Node Canister And System Information Using A USB Flash Drive

    Use this procedure to determine the system status using the LED indicators on the system. The procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 85. Storwize V7000 Unified Gen1 model numbers Machine type/model...
  • Page 284: Storwize V7000 Unified Gen2 Model Numbers

    Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 86. Storwize V7000 Unified Gen2 model numbers...
  • Page 285: Understanding The Power Supply Unit LEDs

    Table 87. LED state descriptions used in the Storwize V7000 2076-524 enclosure (continued) State description Detail Flashing The LED turns on and off at a frequency of 2 Hz: It is on for 250 ms, then off for 250 ms, then repeats. Flashing fast The LED turns on and off at a frequency of 4 Hz: It is on for 125 ms, then off for 125 ms, then repeats.
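The on and off times in these rows follow directly from the flash frequency if the LED spends equal time on and off. The following Python sketch reproduces that arithmetic. The equal on/off split is an assumption that matches both rows above, and the function name is hypothetical.

```python
def flash_phase_ms(frequency_hz: float) -> float:
    """Return how long the LED stays on (and then off) in milliseconds,
    assuming equal on and off time within each flash cycle."""
    if frequency_hz <= 0:
        raise ValueError("frequency_hz must be positive")
    period_ms = 1000.0 / frequency_hz  # one full on/off cycle
    return period_ms / 2.0


print(flash_phase_ms(2))  # "Flashing" state
print(flash_phase_ms(4))  # "Flashing fast" state
```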
  • Page 286: Understanding The Node Canister Status LEDs

    Restart the node canister, as described in “Procedure: Reseating a Storwize V7000 Gen2 node canister” on page 278. Flashing fast: The node canister is doing a self test during start-up. Wait for the canister to complete its start-up sequence.
  • Page 287 Otherwise, there might be a fault in the node canister, enclosure midplane, or power supply units. The Storwize V7000 Unified software is running, and the node canister is participating in the system. The canister must not be removed.
  • Page 288 GUI or service assistant to turn off the Identify function, then check the node canister status LEDs again. The Storwize V7000 Unified software is not running. The BIOS might have detected a fault. It is safe to remove or reseat the canister.
  • Page 289: Understanding The Node Canister Battery Status LEDs

    Power Fault (amber) status (green) (green) The Storwize V7000 Unified software is running but there might be an error alert in the event log, such as error code 550. The canister must not be removed. If possible, go to the management GUI and run the fix procedure for the error alerts listed there.
  • Page 290 263 shows the LEDs on the power supply unit for the 2076-112 or 2076-124. The LEDs on the power supply units for the 2076-312 and 2076-324 are similar, but they are not shown here.
  • Page 291: Leds On The Power Supply Units Of The

    Figure 73. LEDs on the power supply units of the control enclosure Table 91. Power-supply unit LEDs (column headings: Power supply, failure, failure, failure, Status, Action) Status: Communication failure between the power supply unit and the enclosure chassis. Action: Replace the power supply unit. If failure is still present, replace the enclosure chassis.
  • Page 292: Power Leds

    There is no power to the canister. Try reseating the canister. Go to “Procedure: Reseating a node canister” on page 278. If the state persists, follow the hardware replacement procedures for the parts in the following order: node canister, enclosure chassis.
  • Page 293: Leds On The Node Canisters

    Table 92. Power LEDs (continued) Power LED status Description Slow flashing (1...: Power is available, but the canister is in standby mode. Try to start the node canister by reseating it. Go to “Procedure: Reseating a node canister” on page 278.
  • Page 294: Control Enclosure Battery Leds

    Either the battery is charging or a maintenance discharge is in process. Nonrecoverable battery fault. Replace the battery. If replacing the battery does not fix the issue, replace the power supply unit.
  • Page 295: Procedure: Finding The Status Of Ethernet Connections

    Use this procedure to find the status of Ethernet connections when you cannot connect. This procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 95. Storwize V7000 Unified Gen1 model numbers Machine type/model...
  • Page 296 1. Verify that each end of the cable is securely connected. 2. Verify that the port on the Ethernet switch or hub is configured correctly. 3. Connect the cable to a different port on your Ethernet network.
  • Page 297: Procedure: Finding The Status Of Storwize V7000 Gen2 Sas Connections

    About this task Ensure that the Storwize V7000 Unified machine code is active on the node before you begin this procedure. To determine if the machine code is active, see “Procedure: Understanding the Storwize V7000 Gen2 system status from the LEDs”...
  • Page 298: Procedure: Removing System Data From A Node Canister

    2. Use the service assistant node action to hold the node in service state. 3. Use the Manage System option to remove the system data from the node. 4. Repeat steps 1 through 3 on the second node canister in the enclosure.
  • Page 299: Procedure: Fixing Node Errors

    5. On one node, open the service assistant Configure Enclosure and select the Reset System ID option. This action causes the system to reset. Procedure: Fixing node errors To fix node errors that are detected by node canisters in your system, use this procedure.
  • Page 300: Procedure: Initializing A Clustered System With A Usb Flash Drive Without Using The Initialization Tool

    For other command options, see “Create system command” on page 213. 4. Save the file to a USB flash drive. 5. Plug the USB flash drive into a USB port on a control canister.
  • Page 301: Procedure: Initializing A Clustered System Using The Service Assistant

    Use this procedure to initialize a clustered system using the service assistant. This procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 97. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 302: Storwize V7000 Unified Gen2 Model Numbers

    Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 98. Storwize V7000 Unified Gen2 model numbers Machine type/model Description 2076-524 Storwize V7000 Unified control enclosure, with up to 24 2.5-inch (6.35 cm) drives 2076-12F Storwize V7000 Unified expansion enclosure for up to 12 3.5-inch (8.89...
  • Page 303: Procedure: Accessing The Service Assistant From The Technician Port

    Procedure: Initializing a Storwize V7000 Gen1 system using the service assistant To initialize a Storwize V7000 Gen1 system using the service assistant rather than the USB flash drive, use this procedure. About this task Note: The service assistant gives you the option to create a clustered system only if the node state is candidate.
  • Page 304: Procedure: Accessing A Storwize V7000 Gen1 Canister Using A Directly Attached Ethernet Cable

    1. Connect one end of an Ethernet cable to Ethernet port 1 of the node canister. Note: A cross-over Ethernet cable is not required. 2. Connect the other end of the Ethernet cable directly to the Ethernet port on a personal computer that has a web browser installed.
  • Page 305: Problem: Reseating A Node Canister

    Use this procedure to reseat a node canister. The procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 99. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 306 6. Grasp the canister and pull it out 2 or 3 inches. 7. Push the canister back into the slot until the handle starts to move. 8. Finish inserting the canister by closing the handle until the locking catch clicks into place.
  • Page 307: Procedure: Removing A Storwize V7000 Gen2 Node Canister

    9. Verify that the cables were not displaced. 10. Verify that the LEDs are on. Results Procedure: Removing a Storwize V7000 Gen2 node canister Follow this procedure to remove a node canister. About this task Attention: Before a node canister can be removed it must be powered off or in service state;...
  • Page 308: Procedure: Powering Off Your System

    7. As you pay attention to the number scale, slide the canister out of the slot. Procedure: Powering off your system You must power off your Storwize V7000 Unified system in order to service it, or to permit other maintenance actions in your data center. To turn off the Storwize V7000 Unified system, see “Turning off the system”...
  • Page 309: Procedure: Powering On The Storwize V7000 Gen2 System

    Fault is off. If a canister is not ready, refer to the “Procedure: Understanding the system status using the LEDs” topic in the troubleshooting section of the Storwize V7000 Unified information center. Procedure: Powering off a Storwize V7000 Gen2 control...
  • Page 310: Procedure: Powering Off A Storwize V7000 Gen2 Node Canister

    While a node canister is powered off, some volumes can become inaccessible. Refer to “Procedure: Understanding Storwize V7000 Gen2 volume dependencies” on page 286 to determine whether it is appropriate to continue this procedure.
  • Page 311: Procedure: Collecting Information For Support

    The status LEDs on the canister indicate that the node is powered off. Procedure: Collecting information for support IBM support might ask you to collect trace files and dump files from your system to help them resolve a problem. Typically, you perform this task from the Storwize V7000 Unified management GUI.
  • Page 312: Storwize V7000 Unified Gen1 Model Numbers

    About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 101. Storwize V7000 Unified Gen1 model numbers Machine type/model Description 2076-112 Storwize V7000 Unified control enclosure for up to 12 3.5-inch (8.89 cm) drives 2076-124 Storwize V7000 Unified control enclosure for up to 24 2.5-inch (6.35...
  • Page 313: Procedure: Fcoe Host-Linking

    Verify that Storwize V7000 Unified and host get an fcid on FCF. If not, check the VLAN configuration. b. Verify that Storwize V7000 Unified and host port are part of a zone and that zone is currently in force.
  • Page 314: Procedure: Understanding Storwize V7000 Gen2 Volume Dependencies

    If a control enclosure has only one node canister online, access to a volume depends on the online node canister if the volume is stored partially or wholly on an array that uses drives in the control enclosure or its expansion enclosures.
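The dependency rule in this excerpt can be expressed as a small check. The sketch below is purely illustrative: the data model (dictionaries that map volumes to arrays, and arrays to the control enclosure whose drives they use) and every name in it are hypothetical, and it is not output from, or a substitute for, the system's own dependency reporting. It assumes that drives in an expansion enclosure are attributed to the control enclosure they are attached to.

```python
from typing import Dict, List, Set


def dependent_volumes(
    volume_arrays: Dict[str, List[str]],
    array_control_enclosure: Dict[str, str],
    single_node_enclosures: Set[str],
) -> List[str]:
    """Return volumes that depend on the one remaining online node canister."""
    result = []
    for volume, arrays in volume_arrays.items():
        # A volume is dependent if any array holding part of it uses drives
        # behind a control enclosure that has only one node canister online.
        if any(array_control_enclosure[a] in single_node_enclosures for a in arrays):
            result.append(volume)
    return sorted(result)


# Hypothetical example data.
volumes = {"vol0": ["array0"], "vol1": ["array0", "array1"], "vol2": ["array2"]}
arrays = {"array0": "enc1", "array1": "enc2", "array2": "enc2"}
print(dependent_volumes(volumes, arrays, {"enc1"}))
```

Note that a volume stored only partially on an affected array still counts as dependent, which is why the check uses `any` rather than `all`.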
  • Page 315: Storwize V7000 Replaceable Units

    SFP transceivers, canisters, power supply units, battery assemblies, and enclosure chassis. The parts list varies, depending on the generation of your control enclosure model. Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 103. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 316: Storwize V7000 2076-524 Gen2 Replaceable Units

    Table 104. Storwize V7000 Unified Gen2 model numbers Machine type/model Description 2076-524 Storwize V7000 Unified control enclosure, with up to 24 2.5-inch (6.35 cm) drives 2076-12F Storwize V7000 Unified expansion enclosure for up to 12 3.5-inch (8.89 cm) drives 2076-24F Storwize V7000 Unified expansion enclosure for 2.5-inch drives...
  • Page 317: Expansion Enclosure Replaceable Units

    Table 105. Control enclosure replaceable units (continued) Part number Part name CRU or FRU Notes 64P8473 4-port 8 Gbps Fibre Channel host interface adapter (Notes: No SFPs) 00AR316 4-port 10 Gbps Ethernet host interface adapter (Notes: No SFPs) 00WY984 4-port 16 Gbps Fibre Channel host interface adapter (Notes: No SFPs)...
  • Page 318: Drive Replaceable Units

    LFF HDD - 6 TB NL 12 Gbps SAS (Notes: Requires system software version 7.4 or later.) Table 108. Cable replaceable units Part number Part name CRU or FRU Notes Optical 39M5699 1 m FC cable
  • Page 319: Storwize V7000 2076-1Xx And 2076-3Xx Gen1 Replaceable Units

    Table 108. Cable replaceable units (continued) Part number Part name CRU or FRU Notes 39M5700 5 m FC cable 39M5701 25 m FC cable 41V2120 10 m OM3 FC cable 00AR272 0.6 m 12 Gbps SAS Cable (mini SAS HD... Notes: For connecting expansion enclosures.
  • Page 320: Replaceable Units

    2.8 m power cord (South Africa), 39M5144, Customer replaced; 2.8 m power cord (Switzerland), 39M5158, Customer replaced; 2.8 m power cord (Chile), 39M5165, Customer replaced; 2.8 m power cord (Israel), 39M5172, Customer replaced
  • Page 321 Table 109. Replaceable units (continued) Columns: Part; Part number; Applicable models; FRU or customer replaced. 2.8 m power cord (Group 1 including the United States), 39M5081, Customer replaced; 2.8 m power cord (Argentina), 39M5068, Customer replaced; 2.8 m power cord (China), 39M5206, Customer replaced...
  • Page 322: Replacing Parts

    I/O operations, go to the management GUI and follow the fix procedures. Initiating the replacement actions without the assistance of the fix procedures can result in loss of data or loss of access to data.
  • Page 323: Preparing To Remove And Replace Parts

    Before you remove and replace parts, you must be aware of all safety issues. Before you begin First, read the safety precautions in the IBM Systems Safety Notices. These guidelines help you safely work with the Storwize V7000 Unified. Replacing a node canister Remove and replace a node canister.
  • Page 324 Do not remove a node canister unless directed to do so by a service procedure. To replace the node canister, perform the following steps: Procedure 1. Read the safety information to which “Preparing to remove and replace parts” on page 295 refers.
  • Page 325: Rear Of Node Canisters That Shows The

    2. Confirm that you know which canister to replace. Go to “Procedure: Identifying which Storwize V7000 Gen1 enclosure or canister to service” on page 254. 3. Record which data cables are plugged into the specific ports of the node canister. The cables must be inserted back into the same ports after the replacement is complete;...
  • Page 326: Replacing A Fan Module

    8. Replace the node canister into the canister slot until it stops.
  • Page 327: Replacing An Expansion Canister

    9. Finish inserting the node canister by closing its release lever so that the orange catch engages the enclosure. If the enclosure is powered and the canister is correctly installed, the canister starts automatically. Remove the canister and repeat the procedure from step 5 on page 298, if the canister is not correctly installed.
  • Page 328: Removing And Replacing The Storwize V7000 Gen2 Expansion Canister

    I/O operations, go to the management GUI and follow the fix procedures. Initiating the replacement actions without the assistance of the fix procedures can result in loss of data or loss of access to data.
  • Page 329: Rear Of Expansion Canisters That Shows The

    When you replace hardware components that are located in the back of the system, be careful not to inadvertently disturb or remove any cables that you are not instructed to remove. Be aware of the following canister LED states: • If the power LED is on, do not remove an expansion canister unless directed to do so by a service procedure.
  • Page 330: Replacing An Sfp Transceiver

    When you replace hardware components that are located in the back of the system, be careful not to inadvertently disturb or remove any cables that you are not instructed to remove.
  • Page 331: Sfp Transceiver

    Important: For correct operation, use the correct SFP transceivers with each adapter card. The topic “Storwize V7000 2076-524 Gen2 replaceable units” identifies the suitable IBM parts. • Use only 8 Gbps SFP transceivers in the 8 Gbps Fibre Channel adapter cards.
  • Page 332 Important: Always check that the SFP transceiver that you replace matches the SFP transceiver that you remove. 4. Push the new SFP transceiver into the aperture and ensure that it is securely pushed home. The SFP transceiver usually locks into place without having to...
  • Page 333: Replacing A Power Supply Unit For A Control Enclosure

    swing the release handle until it locks flush with the SFP transceiver. Figure 85 illustrates an SFP transceiver and its release handle. Figure 85. SFP transceiver 5. Reconnect the optical cable. 6. Confirm that the error is now fixed. Either mark the error as fixed or restart the node depending on the failure indication that you originally noted.
  • Page 334: Removing The Power Supply Unit

    You can replace either of the two 764 watt hot-swap redundant power supplies in the control enclosure. These redundant power supplies operate in parallel, one continuing to power the canister if the other fails.
  • Page 335 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 336 Power supply unit 1 is top side up, and power supply unit 2 is inverted. a. Depress the black locking catch from the side with the colored sticker as shown in Figure 87 on page 309.
  • Page 337: Directions For Lifting The Handle On The Power

    Figure 87. Directions for lifting the handle on the power supply unit b. Grip the handle to pull the power supply out of the enclosure as shown in Figure 88. Figure 88. Using the handle to remove a power supply unit 6.
  • Page 338: Replacing A Power Supply Unit For An Expansion Enclosure

    4. On the left side of the power supply, press the orange release tab to the right just enough to release the handle (no more than 6 mm [0.25 in.]) as you rotate the handle downward.
  • Page 339: Removing The Power Supply Unit From The Left Side Of The Expansion Enclosure

    5. Using the handle, gently slide the power supply out of the enclosure, as shown in Figure 89. Figure 89. Removing the power supply unit from the left side of the expansion enclosure 6. Hold the new power supply so that the handle is fully extended. Slide the power supply into the enclosure until it stops.
  • Page 340 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 341 Attention: A powered-on enclosure must not have a power supply removed for more than five minutes because the cooling does not function correctly with an empty slot. Ensure that you have read and understood all these instructions and have the replacement available, and unpacked, before you remove the existing power supply.
  • Page 342: Directions For Lifting The Handle On The Power

    6. Insert the replacement power supply unit into the enclosure with the handle pointing towards the center of the enclosure. Insert the unit in the same orientation as the one that you removed.
  • Page 343: Replacing The Battery In A Node Canister

    7. Push the power supply unit back into the enclosure until the handle starts to move. 8. Finish inserting the power supply unit in the enclosure by closing the handle until the locking catch clicks into place. 9. Reattach the power cable and cable retention bracket. 10.
  • Page 344: Replacing A Battery In A Power Supply Unit

    13. When the canister is back online, check the event log for any new events that might indicate a problem with the reassembly. Replacing a battery in a power supply unit Remove and replace the battery in a control enclosure power-supply unit.
  • Page 345 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 346 2. Follow the removal steps of the procedure for replacing a power-supply unit. Go to “Replacing a Storwize V7000 Gen1 power supply unit for a control enclosure” on page 306. 3. Remove the battery, as shown in Figure 93 on page 319.
  • Page 347: Removing The Battery From The Control

    Figure 93. Removing the battery from the control enclosure power-supply unit a. Press the catch to release the handle 1 . b. Lift the handle on the battery 2 . c. Lift the battery out of the power supply unit 3 . 4.
  • Page 348: Releasing The Cable Retention Bracket

    About this task The status of the drive must be such that it is not a spare or a member. The status is shown in Pools > Internal Storage in the management GUI.
  • Page 349 Attention: • Do not replace a drive unless the drive fault LED is on or you are instructed to do so by a fix procedure. • If the drive is a member of an array, go to the management GUI and follow the fix procedures.
  • Page 350: Unlocking And Removing A 3.5-Inch Drive From Its Slot

    The process can take a few minutes. Replacing a 3.5-inch drive assembly or blank carrier This topic describes how to replace a 3.5-inch drive assembly or blank carrier.
  • Page 351: Unlocking The 3.5 Inch Drive

    About this task Attention: If your drive is configured for use, go to the management GUI and follow the fix procedures. Initiating the replacement actions without the assistance of the fix procedures results in loss of data or loss of access to data. Attention: Do not leave a drive slot empty.
  • Page 352: Replacing A 2.5 Inch Drive Assembly Or Blank Carrier

    Do not leave a drive slot empty for extended periods. Do not remove a drive assembly or a blank filler without having a replacement drive or a blank filler with which to replace it. Procedure To prepare to replace a drive assembly, complete the following steps.
  • Page 353 1. Read the safety information in “Preparing to remove and replace parts” on page 295. 2. Locate the slot that contains the drive assembly that you want to replace. a. Refer to “Procedure: Identifying which Storwize V7000 Gen2 enclosure or canister to service”...
  • Page 354: Unlocking And Removing A 2.5-Inch Drive From Its Slot

    The process can take a few minutes. Replacing a 2.5-inch drive assembly or blank carrier This topic describes how to replace a 2.5-inch drive assembly or blank carrier.
  • Page 355: Unlocking The 2.5 Inch Drive

    About this task Attention: If your drive is configured for use, go to the management GUI and follow the fix procedures. Initiating the replacement actions without the assistance of the fix procedures results in loss of data or loss of access to data. Attention: Do not leave a drive slot empty.
  • Page 356: Replacing Enclosure End Caps

    2. Grasp the end cap by the blue touch point and pull it until the bottom edge of the end cap is clear of the bottom tab on the chassis flange.
  • Page 357: Replacing A Sas Cable To An Expansion Enclosure

    3. Lift the end cap off the chassis flange. 4. Fit the slot on the top of the new end cap over the tab on the top of the chassis flange. 5. Rotate the end cap down until it snaps into place. Ensure that the inside surface of the end cap is flush with the chassis.
  • Page 358 3. The connector is released and slides out of the port. 4. Repeat steps 2 and 3 on the other end of the SAS cable. 5. To connect the replacement expansion enclosure attachment SAS cable, connect each end to the vacated ports.
  • Page 359: Proper Orientation For Sas Cable Connector

    Attention: When inserting a SAS connector into a SAS port, ensure that the orientation of the connector matches the orientation of the port before pushing the connector into the port. • The cable connector and socket are keyed and it is important that you have proper alignment of the keys when the cable is inserted.
  • Page 360: Replacing A Control Enclosure Chassis

    Remove and replace a control enclosure chassis. This procedure only applies to Storwize V7000 Gen1 control enclosure models. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 110. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 361: Storwize V7000 Unified Gen2 Model Numbers

    Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 111. Storwize V7000 Unified Gen2 model numbers Machine type/model Description 2076-524 Storwize V7000 Unified control enclosure, with up to 24 2.5-inch (6.35 cm) drives 2076-12F Storwize V7000 Unified expansion enclosure for up to 12 3.5-inch (8.89...
  • Page 362 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 363 Attention: Perform this procedure only if instructed to do so by a service action or the IBM support center. If you have a single control enclosure, this procedure requires that you shut down your system to replace the control enclosure. If you...
  • Page 364 Using the left end cap that you removed preserves the model and serial number identification. 21. Reinstall the drives in the new enclosure. The drives must be inserted back into the same location from which they were removed on the old enclosure.
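Because each drive must go back into the slot it was removed from, it can help to record the slot-to-drive layout before removal and compare it after reinstalling. A minimal sketch of such a check, assuming you note each drive's serial number by hand; the function name and the serial numbers are hypothetical, and this is not an IBM utility:

```python
from typing import Dict, List


def misplaced_drives(before: Dict[int, str], after: Dict[int, str]) -> List[int]:
    """Return the slot numbers whose drive differs from the recorded layout."""
    slots = sorted(set(before) | set(after))
    # A slot is flagged if it holds a different drive than recorded, or if it
    # is empty when it should be populated (or populated when it was empty).
    return [slot for slot in slots if before.get(slot) != after.get(slot)]


recorded = {1: "SN-A", 2: "SN-B", 3: "SN-C"}     # noted before removing drives
reinstalled = {1: "SN-A", 2: "SN-C", 3: "SN-B"}  # observed after reinstalling
print(misplaced_drives(recorded, reinstalled))
```

An empty result means every slot holds the drive that was recorded for it.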
  • Page 365 If you still do not have a full set of values, contact IBM support. After you modify the configuration, the node attempts to restart.
  • Page 366 The procedures for replacing a control enclosure chassis are different from those procedures for replacing an expansion enclosure chassis. To replace an expansion enclosure chassis, see “Replacing an expansion enclosure chassis” on page 347.
  • Page 367 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 368 Attention: Perform this procedure only if instructed to do so by a service action or the IBM support center. If you have a single control enclosure, this procedure requires that you shut down your system to replace the control enclosure. If you...
  • Page 369 Dependent volume names that start with IFS are file volumes that are used by the file modules to provide file systems. Turn off these file modules. See the procedure “Turning off the system”. 5. If the I/O group is still online, shut down the I/O group by using the control enclosure CLI.
  • Page 370 If any of the node copy values are all zeroes, connect the service assistant to the other node canister and configure the enclosure there. If you still do not have a full set of values, contact IBM support.
  • Page 371: Replacing An Expansion Enclosure Chassis

    Remove and replace an expansion enclosure chassis. This procedure only applies to Storwize V7000 Gen1 enclosure models. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 112. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 372: Storwize V7000 Unified Gen2 Model Numbers

    Storwize V7000 Unified expansion enclosure for 3.5-inch drives 2076-224 Storwize V7000 Unified expansion enclosure for 2.5-inch drives Storwize V7000 Unified Gen2 refers to the newer generation of enclosures in the following table: Table 113. Storwize V7000 Unified Gen2 model numbers...
  • Page 373 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 374 13. Replace the end caps. Use the new right end cap and use the left end cap that you removed in step 8. Using the left end cap that you removed preserves the model and serial number identification.
  • Page 375 14. Reinstall drives in the new enclosure. You must insert the drives back into the same location from which they were removed on the old enclosure. 15. Reinstall the canisters (and drives) in the enclosure. 16. Install the power supply units. 17.
  • Page 376 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 377 Attention: If your system is powered on and performing I/O operations, go to the management GUI and follow the fix procedures. Performing the replacement actions without the assistance of the fix procedures can result in loss of data or access to data. Even though many of the parts are hot-swappable, these procedures are intended to be used only when your system is not up and running and performing I/O operations.
  • Page 378: Replacing A Storwize V7000 Gen2 Enclosure Midplane

    22. Go to the management GUI to use the fix procedure to change the machine type and model and serial number in the expansion enclosure. Replacing a Storwize V7000 Gen2 enclosure midplane A trained service provider must replace the midplane assembly of a Storwize V7000 Gen2 enclosure.
  • Page 379 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 380 Attention: • The enclosure midplane must be replaced only by a trained service provider. Perform this procedure only if instructed to do so by a service action or the IBM support center. • When you replace hardware components that are in the back of the system, be careful not to inadvertently disturb or remove any cables that you are not instructed to remove.
  • Page 381 Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: • If IBM supplied a power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 382 Attention: The control enclosure must be replaced only by a trained service provider. Complete this procedure only if instructed to do so by a service action or the IBM support center. If you have a single control enclosure, this procedure requires that you shut down your system to replace the control enclosure midplane assembly.
  • Page 383: Bottom Enclosure Screws

    9. Remove the two power supplies from the enclosure. Refer to “Replacing a Storwize V7000 Gen2 power supply unit for a control enclosure” on page 305 for guidance. 10. Remove the node canisters from the enclosure. Label them to indicate what canister came from each canister slot.
  • Page 384: Right-Side Enclosure Screws

    19. Remove the midplane assembly from the chassis by rotating up the midplane assembly to about 45°, then withdraw the midplane assembly from the front of the enclosure. Figure 107 on page 357 shows the midplane assembly at a 45 degree angle.
  • Page 385: Angled Midplane Assembly

    Figure 107. Angled midplane assembly 20. Unpack the replacement midplane assembly. Grasp the midplane assembly with two hands to hold the assembly at a 45° angle. 21. Insert the tabs on the midplane assembly into the tab holes in the enclosure and rotate down the front of the assembly.
  • Page 386 If any of the node copy values are all zeros, connect the service assistant to the other node canister and configure the enclosure there. If you still do not have a full set of values, contact IBM support.
  • Page 387: Removing A Vertical Style Hard Disk Drive

    Before you begin Three persons are required at step 11 on page 360. About this task Attention: To prevent data loss, you must shut down the system before you begin the procedure to replace an expansion enclosure midplane assembly. The expansion enclosure midplane assembly must be replaced only by a trained service provider.
  • Page 388: Removing A Horizontal Style Hard Disk Drive

    361). Remove the three screws that are near the front and the screw that is near the middle. Label these screws to indicate the location from which they are removed and place them aside.
  • Page 389: Removing The Screws Of An Expansion Enclosure Assembly

    Figure 110. Removing the screws of an expansion enclosure assembly 13. Turn the enclosure top side up and place it on a flat surface. 14. Remove the three screws and one screw-pin on the right side that secure the midplane assembly to the enclosure (see Figure 110). Label the screws to indicate the location from which they are removed and place them aside.
  • Page 390: Replacing The Support Rails

    Remove and replace the support rails. The procedure differs, depending on the generation of your control enclosure model. About this task Storwize V7000 Unified Gen1 refers to the enclosure models in the following table: Table 114. Storwize V7000 Unified Gen1 model numbers Machine...
  • Page 391 Before you begin Three persons are required at step 7 About this task Follow all safety precautions when completing this procedure. Procedure To replace the support rails, complete the following steps. 1. Identify the enclosure mounted on the rails being replaced. Follow the steps in “Procedure: Identifying which Storwize V7000 Gen2 enclosure or canister to service”...
  • Page 392: Opening Rear Hinge Bracket Of Mounting Rail

    10. At the front of the rack, hold onto the rail and open the front hinge bracket. 11. Compress the rail against its spring to shorten it, then remove it from inside the rack (Figure 112 on page 365).
  • Page 393: Compressing Rail For Removal From Rack

    Figure 112. Compressing rail for removal from rack 12. Repeat steps 9 on page 363 to 11 on page 364 on the right support rail. 13. Install the new support rails at the rack position that is recorded at step 8 on page 363 by following the instructions in Step 6.
  • Page 394 9. At the rear of the rack, remove the securing M5 screw from the bottom hole of the rear bracket of the rail, then open the rear hinge bracket (Figure 113 on page 367).
  • Page 395: Opening Rear Hinge Bracket Of Mounting Rail

    Figure 113. Opening rear hinge bracket of mounting rail 10. At the front of the rack, hold onto the rail and open the front hinge bracket. 11. Compress the rail against its spring to shorten it, then remove it from inside the rack (Figure 114 on page 368).
  • Page 396: Compressing Rail For Removal From Rack

    The system starts. 18. After the system is online, use the management GUI to verify that the system is correct. Replacing the Storwize V7000 Gen1 support rails You can replace the support rails.
  • Page 397: Replacing Node Canister Memory Modules

    Procedure To replace the support rails, complete the following steps: 1. Remove the enclosure. 2. Record the location of the rail assembly in the rack cabinet. 3. Working from the back of the rack cabinet, remove the clamping screw 1 from the rail assembly on both sides of the rack cabinet.
  • Page 398: Installing A Storwize V7000 2076-524 Node

    12. Reconnect the cables to the canister, ensuring cables go into the same ports from which they were removed in step 1. 13. When the canister is back online, check the event log for new events, particularly events that relate to hardware changes.
  • Page 399: Replacing A Host Interface Adapter

    Important: For correct operation, use the correct SFP transceivers with each adapter card. The topic “Storwize V7000 2076-524 Gen2 replaceable units” identifies the suitable IBM parts. - Use only 8 Gbps SFP transceivers in the 8 Gbps Fibre Channel adapter cards.
  • Page 400: Removing The Host Interface Adapter

    9. Maintain alignment while applying pressure to the top edge of the host interface adapter opposite the connecting edge to push the host interface adapter into the connector 4 and 5.
  • Page 401: Installing The Host Interface Adapter

    Figure 118. Installing the host interface adapter 10. Check that the host interface adapter is installed squarely in its slot. If the small tab of the mounting bracket is not positioned correctly, repeat steps 5 on page 371 onward to install the adapter correctly. 11.
  • Page 402: Replacing A Cmos Battery

    2. Open the canister and remove the lid as described in “Procedure: Removing and replacing the lid of a Storwize V7000 Gen2 node canister” on page 285. 3. Locate the CMOS battery inside the node canister. See Figure 119 on page 375.
  • Page 403: General Storage System Procedures

    Procedure: SAN problem determination About this task SAN failures might cause Storwize V7000 Unified volumes to be inaccessible to host systems. Failures can be caused by SAN configuration changes or by hardware failures in SAN components. The following list identifies some of the hardware that might cause failures:...
  • Page 404: Iscsi Performance Analysis And Tuning

    Procedure 1. Verify that the power is turned on to all switches and storage controllers that the Storwize V7000 Unified system uses, and that they are not reporting any hardware failures. If problems are found, resolve those problems before you proceed further.
  • Page 405 Each port is assigned to one CPU, and by balancing the login, one can maximize CPU utilization and achieve better performance. Ideally, configure subnets equal to the number of iSCSI ports on the Storwize V7000 Unified node. Configure each port of a node with an IP on a different subnet and keep it the same for other nodes.
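The subnet-per-port layout described above can be sketched as follows. This is only an illustration: the 192.168.0.0/16 base network, the /24 subnet size, and the rule that the host part equals the node number are all assumptions, not values from the manual.

```python
import ipaddress


def plan_iscsi_ips(node_count, ports_per_node, base="192.168.0.0/16"):
    """Return {(node, port): ip} with port p of every node on subnet p.

    The base network and /24 subnet size are hypothetical choices.
    """
    # Carve one /24 subnet per iSCSI port number out of the base network.
    all_subnets = ipaddress.ip_network(base).subnets(new_prefix=24)
    subnets = [next(all_subnets) for _ in range(ports_per_node)]

    plan = {}
    for node in range(1, node_count + 1):
        for port in range(1, ports_per_node + 1):
            subnet = subnets[port - 1]
            # Same port number -> same subnet on every node;
            # the host part is the node number, so nodes never collide.
            plan[(node, port)] = str(subnet.network_address + node)
    return plan


if __name__ == "__main__":
    for (node, port), ip in sorted(plan_iscsi_ips(2, 4).items()):
        print(f"node {node} port {port}: {ip}")
```

Keeping the port-to-subnet mapping identical on every node is what lets host logins spread evenly across ports, and therefore across the CPUs that serve them.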
  • Page 406: Fibre Channel Link Failures

    You do not need to enable PFC on the Storwize V7000 Unified system. Storwize V7000 Unified reads the data center bridging exchange (DCBx) packet and enables PFC for iSCSI automatically if it is enabled on the switch. In the lsportip command output, the fields lossless_iscsi and lossless_iscsi6 show [on/off] depending on whether PFC is enabled or not for iSCSI on the system.
  • Page 407: Ethernet Iscsi Host-Link Problems

    Removing the wrong SFP transceiver might result in loss of data access. 4. Contact the IBM Support Center for assistance in replacing the node canister. Ethernet iSCSI host-link problems If you are having problems attaching to the Ethernet hosts, your problem might be related to the network, the Storwize V7000 Unified system, or the host.
  • Page 408: Recover System Procedure

    Turning on the system, located in the Information Center, to power the file modules back on. Contact IBM Remote Technical support if the health indicator in the management GUI does not turn back to green within 30 minutes. They can assist you with recovering the file modules so that access to the file systems can be restored.
  • Page 409 Attention: - Run service actions only when directed by the fix procedures. If used inappropriately, service actions can cause loss of access to data or even data loss. Before you attempt to recover a storage system, investigate the cause of the failure and attempt to resolve those issues by using other fix procedures.
  • Page 410: When To Run The Recover System Procedure

    Attention: If you experience failures at any time while running the recover system procedure, call the IBM Support Center. Do not attempt to do further recovery actions, because these actions might prevent support from restoring the system to an operational status.
  • Page 411 Note: If, after you resolve all these scenarios, half or more of the nodes are reporting node error 578, it is appropriate to run the recovery procedure. Call the IBM Support Center for further assistance. – For any nodes that are reporting a node error 550, ensure that all the missing hardware that is identified by these errors is powered on and connected without faults.
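The "half or more of the nodes" condition in the note above is simple counting, sketched below. The input mapping of node names to error codes is a hypothetical structure; the manual does not define how the codes are collected, and the result does not replace the call to the IBM Support Center.

```python
def recovery_may_be_appropriate(node_errors):
    """Return True if half or more of the nodes report node error 578.

    node_errors maps a node name to its current node error code,
    or to None when the node reports no error (hypothetical input).
    """
    if not node_errors:
        return False
    reporting_578 = sum(1 for code in node_errors.values() if code == 578)
    # "Half or more" expressed without floating-point division.
    return 2 * reporting_578 >= len(node_errors)
```

For a two-node control enclosure, one node reporting 578 already satisfies the condition, because one of two is exactly half.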
  • Page 412: Removing System Information For Node Canisters With Error Code 550 Or Error Code 578 Using The Service Assistant

    Do not run the recovery procedure on different node canisters in the same system. Before you begin Note: Ensure that the web browser is not blocking pop-up windows. If it is, progress windows cannot open.
  • Page 413 Before you begin this procedure, read the recover system procedure introductory information; see “Recover system procedure” on page 380. About this task Attention: This service action has serious implications if not completed properly. If at any time an error is encountered not covered by this procedure, stop and call the support center.
  • Page 414: Recovering From Offline Volumes Using The Cli

    Complete the following steps to recover an offline volume after the recovery procedure has completed: 1. Delete all IBM FlashCopy function mappings and Metro Mirror or Global Mirror relationships that use the offline volumes. 2. Run the recovervdisk or recovervdiskbysystem command. (This will only bring the volume back online so that you can attempt to deal with the data loss.)
  • Page 415: What To Check After Running The System Recovery

    Refer to “What to check after running the system recovery” for what to do with volumes that have been corrupted by the loss of data from the write-cache. 4. Recreate all FlashCopy mappings and Metro Mirror or Global Mirror relationships that use the volumes. What to check after running the system recovery Several tasks must be completed before you use the system.
  • Page 416 Before using the file volumes that are used by GPFS on the file modules to provide Network Attached Storage (NAS), complete the following task: - Contact IBM support for assistance with recovering the GPFS quorum state so that access to files as NAS can be restored.
  • Page 417: Backing Up And Restoring The System Configuration

    Contact the IBM support center to help you prepare the Storwize V7000 Unified system to restore the system configuration on the control enclosure.
  • Page 418: Backing Up The System Configuration Using The Cli

    1. Before you begin, hardware recovery must be complete. The following hardware must be operational: hosts, Storwize V7000 Unified enclosures, internal flash drives and expansion enclosures (if applicable), the Ethernet network, the SAN fabric, and any external storage systems (if applicable).
  • Page 419: Files Created By The Backup Process

    data that you wrote to the volumes is not backed up. Any application that uses the volumes on the system as storage must use the appropriate backup methods to back up its application data. You must regularly back up your configuration data and your application data to avoid data loss, such as after any significant changes to the system configuration.
  • Page 420: Restoring The System Configuration

    MDisks and the array will be re-created and configured. If there are multiple storage enclosures involved, the arrays and MDisks will be restored on the proper enclosures based on the enclosure IDs.
  • Page 421 If you do not understand the instructions to run the CLI commands, see the command-line interface reference information. To restore your configuration data, follow these steps: Procedure 1. Verify that all nodes are available as candidate nodes before you run this recovery procedure.
  • Page 422: Deleting Backup Configuration Files Using The Cli

    If you find errors, correct the condition that caused the errors and reissue the command. You must correct all errors before you can proceed to step 12. - If you need assistance, contact the IBM Support Center. 12. Issue the following CLI command to restore the configuration:...
  • Page 423 Procedure 1. Issue the following command to log on to the system: plink -i ssh_private_key_file superuser@control_enclosure_management_ip where ssh_private_key_file is the name of the SSH private key file for the superuser and control_enclosure_management_ip is the IP address or DNS name of the system from which you want to delete the configuration. 2.
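As an illustration of step 1, the sketch below assembles the plink command line from its two placeholders. The function name is hypothetical and the code only builds the argument list, it does not run it.

```python
import shlex


def plink_login_command(ssh_private_key_file, control_enclosure_management_ip):
    """Build the plink argument list shown in step 1 (hypothetical helper)."""
    return [
        "plink",
        "-i", ssh_private_key_file,  # SSH private key file for the superuser
        f"superuser@{control_enclosure_management_ip}",  # IP address or DNS name
    ]


if __name__ == "__main__":
    # The key file name and address below are made-up example values.
    print(shlex.join(plink_login_command("superuser.ppk", "192.0.2.10")))
```

Passing the arguments as a list rather than as one string avoids quoting problems when the key file path contains spaces.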
  • Page 425: Chapter 6. Call Home And Remote Support

    IBM Support can contact the system administrator in case of any issues. Testing a call home connection Use this information to test a call home connection to IBM support. From the block-level storage system If call home actions fail, perform the following steps: 1.
  • Page 426: Establishing A Lights-On Aos Connection

    Enter the customer name, the case number (use the PMR number), and the geography. f. Talk to the IBM authorized servicer at the customer site to make sure that the servicer is ready to establish the link before you submit the form.
  • Page 427 For example, click Active. Active mode gives full remote access. Monitor mode restricts the IBM support representative to a view of the console, where the representative can offer guidance on what actions you might take to analyze and correct the problem.
  • Page 429: Chapter 7. Recovery Procedures

    If the chrootpwd command does not work, continue to the next step and finish the lost root password recovery procedure. chrootpwd [-c { clusterID | clusterName }] [-p password] [-v] © Copyright IBM Corp. 2011, 2015...
  • Page 430: Resetting The Nas Ssh Key For Configuration Communications

    SCSI protocol. Before you begin During the USB initialization of the Storwize V7000 Unified system, one of the node canisters in the control enclosure creates a public/private key pair to use for ssh. The node canister stores the public key and writes the private key to the USB flash drive memory.
  • Page 431: Working With Nfs Clients That Fail To Mount Nfs Shares After A Client Ip Change

    You are prompted for the Storwize V7000 superuser password. 5. Log on to the Storwize V7000 Unified management CLI as admin via the management IP and run the following command to register the new NAS SSH key: chstoragesystem --sonasprivkey /tmp/NAS.ppk...
  • Page 432: File Module-Related Issues

    This section covers the recovery procedures related to file module issues. Restoring System x firmware (BIOS) settings During critical repair actions such as the replacement of a system planar in an IBM Storwize V7000 Unified file module, you might have to reset the System x firmware.
  • Page 433 5. Scroll down to select the USB cable, then press Enter. 6. Turn on the affected file module. 7. From the IBM System x Server Firmware screen, press F1 to set up the firmware. A few seconds after the IBM System x Server Firmware screen is displayed,...
  • Page 434: Recovering From File Systems That Are Offline After The Volumes Came Back Online

    The system now reboots. During the reboot, the Storwize V7000 Unified code automatically modifies the configuration of the System x firmware (BIOS) to change the default settings to the required settings. Recovering from file systems that are offline after the volumes...
  • Page 435: Recovering From An Nfsd Service Error

    The multipath -ll command verifies that all storage devices are either active or not active. The following output shows that all storage devices are active. [root@yourmachine.mgmt001st001 ~]# multipath -ll array1_sas_89360007 (360001ff070e9c0000000001989360007) fm-0 IBM,2073-720 [size=3.1T][features=1 queue_if_no_path][hwhandler=0][rw] \_ round-robin 0 [prio=50][active] \_ 6:0:0:0 sdb 8:16 [active][ready]...
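Checking this output by eye is error-prone on systems with many paths. The sketch below scans text in the layout shown above and lists devices whose path is not reported as [active][ready]. It assumes the bracketed two-state format of the sample; other multipath versions print a different layout, and the function name is hypothetical.

```python
import re

# Matches a path line such as:  \_ 6:0:0:0 sdb 8:16 [active][ready]
PATH_LINE = re.compile(
    r"\\_\s+(\d+:\d+:\d+:\d+)\s+(\S+)\s+\d+:\d+\s+\[(\w+)\]\[(\w+)\]"
)


def inactive_paths(multipath_output):
    """Return device names whose path state is not [active][ready]."""
    bad = []
    for line in multipath_output.splitlines():
        match = PATH_LINE.search(line)
        # Path-group lines ("\_ round-robin 0 ...") do not match and are skipped.
        if match and (match.group(3), match.group(4)) != ("active", "ready"):
            bad.append(match.group(2))
    return bad
```

An empty result means every path line that was recognized is active and ready. It does not prove that all expected devices are present in the output.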
  • Page 436: Recovering From An Httpd Service Error

    Issue the sc service http start command. 2. When you complete the service action, refer to “Health status and recovery” on page 62. Recovering from an sshd_data service error Use this procedure to recover from an sshd_data service error.
  • Page 437: Recovering From An Sshd_Int Service Error

    About this task This recovery procedure starts the sshd_data service when it is down. Procedure 1. Log in as a CLI user with privileged authority. 2. Issue the service sc sshd_data start command. 3. If the problem persists, restart the node. 4.
  • Page 438: Control Enclosure-Related Issues

    About this task Procedure To run the fix procedures, perform the following steps: 1. Log in to the Storwize V7000 Unified management GUI. 2. Go to Monitoring > Events and click the Block tab. 3. Run any Next recommended action.
  • Page 439: Recovering From A 1001 Error Code

    Free the unusable blocks in the compressed volumes If you cannot increase the storage pool capacity then contact IBM Remote Technical Support to help you. Recovering from a 1001 error code A 1001 error code indicates that the Storwize V7000 control enclosure has automatically performed a recovery.
  • Page 440 You can immediately remount any remaining unmounted file systems without waiting for IBM support to tell you that it is safe for you to re-enable the control enclosure CLI. Note: The management GUI can become very slow when the control enclosure CLI is restricted, so the following procedure shows how to use the management CLI to check if the file systems are mounted.
  • Page 441 CLI command to check if all of your file volumes that should be online are online. Note that the names of file volumes are the same as the names of the disks. For example [kd52v6h.ibm]$ lsvdisk id name IO_group_id IO_group_name status mdisk_grp_id mdisk_grp_name capacity type FC_id FC_name RC_id RC_name vdisk_UID fc_map_count copy_count fast_write_state 0 IFS1350385068630 0 io_grp0 online 1 meta1 100.00GB striped...
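The online check can be scripted against captured lsvdisk output, as in the sketch below. It assumes that the first non-empty line is the header row and that every column up to and including status is filled in on each row, as it is in the example above. The function name is hypothetical.

```python
def offline_volumes(lsvdisk_output):
    """Return the names of volumes whose status column is not 'online'."""
    lines = [line for line in lsvdisk_output.splitlines() if line.strip()]
    if not lines:
        return []
    header = lines[0].split()
    name_col = header.index("name")
    status_col = header.index("status")

    result = []
    for row in lines[1:]:
        fields = row.split()
        # Later columns can be empty, so only the leading columns are trusted.
        if len(fields) > status_col and fields[status_col] != "online":
            result.append(fields[name_col])
    return result
```

Because the names of file volumes match the names of the disks, the returned names can be compared directly with the list of disks that the file systems use.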
  • Page 442: Restoring Data

    5. Log back on to the Storwize V7000 Unified CLI. Wait until both nodes show OK in the Connection status column of the output from the CLI command: lsnode -r 6. Resume the file module back into the cluster using the CLI command: resumenode <node name>...
  • Page 443: Restoring Tivoli Storage Manager Data

    Site A by using the rmtask CLI command. Restoring Tivoli Storage Manager data The Storwize V7000 Unified system contains a Tivoli Storage Manager client that works with your Tivoli Storage Manager server system to perform high-speed data backup and recovery operations.
  • Page 444: Upgrade Recovery

    2. After each recommended fix, restart the upgrade by issuing the applysoftware command again. If the action fails, try the next recommended action. 3. If the recommended actions fail to resolve the issue, call the IBM Support Center. Table 119. Upgrade error codes from using the applysoftware command and recommended...
  • Page 445 Table 119. Upgrade error codes from using the applysoftware command and recommended actions (continued). Columns: Error Code, The applysoftware command explanation, Action. EFSSG4101A: The applysoftware command returned required parameter not specified. EFSSG4102: The software package does not exist. Action: Verify that the file actually...
  • Page 446 EFSSG4159: The system is in an unhealthy state and the upgrade cannot start. Action: See Chapter 3, “Getting started troubleshooting,” on page 47. Determine if the system has issues.
  • Page 447: Upgrade Error Codes And Recommended Actions

    2. After each recommended fix, restart the upgrade by issuing the applysoftware command again. If the action fails, try the next recommended action. 3. If the recommended actions fail to resolve the issue, call the IBM Support Center. Table 120. Upgrade error codes and recommended actions...
  • Page 448 2. Attempt to remove the backup by typing rmtask StartBackupTSM. 3. Contact IBM Remote Technical Support. 01A6: Unable to install CNCSM callbacks. Action: Contact IBM Remote Technical Support. 01A7: Internal vital product data (VPD) error. Action: Contact IBM Remote Technical Support. 01A8 Check the health of management 1.
  • Page 449 Table 120. Upgrade error codes and recommended actions (continued). Columns: Error Code, Explanation, Action. 01A9: Unable to stop performance collection daemon. Action: Contact IBM Remote Technical Support. 01AB: Internal upgrade error in node_setup_system. Action: Contact IBM Remote Technical Support. 01B1 Management node replication 1.
  • Page 450 ... stop. Stop asynchronous replication and continue with the upgrade. Action: 1. Stop asynchronous replication by typing stoprepl gpfs0 --kill. Asynchronous replication is considered active if in RUNNING or KILLING state. 2. Contact IBM Remote Technical Support.
  • Page 451 ... sonas_update_yum. Action: Contact IBM Remote Technical Support. 01C7: Unable to get list of cluster nodes. Action: Contact IBM Remote Technical Support. 01C8: Failed while running cnrsscconfig. Action: Contact IBM Remote Technical Support. 01C9: Unable to install CIM configuration. Action: Contact IBM Remote Technical Support. 01CA: Unable to get name of cluster.
  • Page 452 ... Failed. Action: Contact IBM Remote Technical Support. 01E3: mmchfs Failed. Action: Contact IBM Remote Technical Support. 01E4: Disable HSM failed. Action: Contact IBM Remote Technical Support. 01E5: Enable HSM failed. Action: Contact IBM Remote Technical Support.
  • Page 453 2. Restart the node, and then ping the node. 3. Check the network connections and correct them, if required. 4. Contact IBM Remote Technical Support. 01E8: Unable to apply firmware to... Action: Contact IBM Remote Technical Support.
  • Page 454 Database replication suspend or resume error. Action: Contact IBM Remote Technical Support. 0522: Unable to clean the CTDB configuration file. Action: Contact IBM Remote Technical Support. 0523: Unable to upgrade Samba packages. Action: Contact IBM Remote Technical Support.
  • Page 455: Chapter 8. Troubleshooting Compressed File Systems

    429 Storage pool is full and the file system pool is offline, but no additional storage is available to add to the pool. Action: Contact IBM Remote Technical Support or your service representative.
  • Page 456: Recovery Procedure: Increase Capacity Of The Storage Pool

    In the Preset field, select the RAID configuration for the storage you are configuring. c. Select Optimize for capacity to configure all available capacity. d. Verify the configuration and click Next.
  • Page 457: Recovery Procedure: Adding Additional Capacity For Offline Compressed File Systems

    e. Click Expand an existing pool and select the storage pool that is used for compression. 4. Click Finish. Allocate storage from available external storage: The system supports adding external storage systems to provide additional capacity and virtualization. If your environment has external storage systems, you can increase capacity to the storage pool by completing these steps: 1.
  • Page 458 Note: If you are unfamiliar with managing spare goals and spare disks, contact IBM support for guidance. Increasing capacity in this way is meant only as a short term solution to this problem. Further provisioning to permanently resolve capacity constraints can be conducted with the help of IBM service personnel who might recommend that additional drives be added to your system.
  • Page 459 Click OK. To add additional drives to the system, complete these steps: a. Acquire additional drives from IBM or your vendor. b. Install drives into available drive slots on the enclosure. See “Installing a hot-swap hard disk drive” on page 149.
  • Page 460: Monitoring File System Compression

    In most cases, data does not have the same compression rate because it is constantly changing over the course of its life cycle. Incompressible data or data that does not compress well can be added to a file
  • Page 461 system, which impacts compression rates. The system default for the contingency threshold is 80% of the physical capacity, which provides 20% contingency capacity for the storage pool and is adequate for most environments. For example, if an administrator has a storage pool with 10 TB of physical storage and sets the threshold to 80%, only 8 TB out of the physical 10 TB are available in the pool.
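The threshold arithmetic in the example above can be written out as a short sketch. The function names are hypothetical, and only the 80% default and the 10 TB example come from the text.

```python
def usable_capacity_tb(physical_tb, threshold_percent=80):
    """Capacity available in the pool before the contingency threshold is hit."""
    return physical_tb * threshold_percent / 100


def contingency_capacity_tb(physical_tb, threshold_percent=80):
    """Capacity held back as contingency above the threshold."""
    return physical_tb - usable_capacity_tb(physical_tb, threshold_percent)


if __name__ == "__main__":
    print(usable_capacity_tb(10))       # 8.0 TB available, as in the example
    print(contingency_capacity_tb(10))  # 2.0 TB held as contingency
```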
  • Page 463: Appendix. Accessibility Features For Ibm Storwize V7000 Unified

    Accessibility features These are the major accessibility features for the Storwize V7000 Unified: - You can use screen-reader software and a digital speech synthesizer to hear what is displayed on the screen. HTML documents have been tested using JAWS version 15.0.
  • Page 465: Notices

    Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead.
  • Page 466 The materials at those websites are not part of the materials for this IBM product and use of those websites is at your own risk. IBM may use or distribute any of the information you provide in any way it believes appropriate without incurring any obligation to you.
  • Page 467: Trademarks

    IBM, therefore, cannot guarantee or imply reliability, serviceability, or function of these programs. The sample programs are provided "AS IS", without warranty of any kind. IBM shall not be liable for any damages arising out of your use of the sample programs.
  • Page 468: Industry Canada Compliance Statement

    Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. Attention: This is an EN 55022 Class A product. In a domestic environment this product might cause radio interference in which case the user might be required to take adequate measures.
  • Page 469: People's Republic Of China Class A Statement

    Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung der IBM verändert bzw.
  • Page 470: Taiwan Class A Compliance Statement

    This explains the Japan Voluntary Control Council for Interference (VCCI) statement. Japan Electronics and Information Technology Industries Association Statement This explains the Japan Electronics and Information Technology Industries Association (JEITA) statement for less than or equal to 20 A per phase.
  • Page 471: Korean Communications Commission Class A Statement

    This explains the JEITA statement for greater than 20 A per phase. Korean Communications Commission Class A Statement This explains the Korean Communications Commission (KCC) statement. Russia Electromagnetic Interference Class A Statement This statement explains the Russia Electromagnetic Interference (EMI) statement. Notices...
  • Page 474 IBM® Printed in USA GA32-1057-14...
