«Кулинарная книга» по PowerHA


Обновился великолепный Redbok посвящённый HACMP/PowerHA

PowerHA for AIX Cookbook

В этой книге вы найдёте множество советов по приготовлению кластера высокой доступности. Простые рецепты, доступные ингридиенты, множество понятных иллюстраций. Что ещё надо для счастливого администрирования? Cool

Abstract

This IBM® Redbooks® publication will help you install, tailor, and configure the new PowerHA™ Version 5.5, and understand new and improved features such as WebSMIT gateway, non-disruptive migrations, C-SPOC enhancements, and Disaster Recovery (DR) configurations, such as GLVM in asynchronous mode.

This publication provides a broad understanding of the PowerHA and PowerHA Extended Distance (PowerHA/XD) architecture. If you plan to install, migrate, or administer a high availability cluster, this book is right for you. Disaster recovery elements and how PowerHA fulfills these necessities are also presented in detail.

This cookbook is designed to help AIX® professionals that are seeking a comprehensive and task-oriented guide for developing the knowledge and skills required for PowerHA cluster design and implementation as well as for daily system administration. It is designed to provide a combination of theory and practical experience.

This book will be especially useful for system administrators currently running PowerHA or PowerHA Extended Distance (XD) clusters who might want to consolidate their environment and move to new PowerHA Version 5.5.

TOC
Part 1. Introduction
Chapter 1. Introduction to PowerHA for AIX
Chapter 2. High availability components
Part 2. Planning, installation, and migration
Chapter 3. Planning
Chapter 4. Installation and configuration
Chapter 5. Migrating a cluster to PowerHA V5.5
Part 3. Cluster administration
Chapter 6. Cluster maintenance
Chapter 7. Cluster management
Chapter 8. Cluster security
Part 4. Advanced topics (with examples)
Chapter 9. Virtualization and PowerHA
Chapter 10. Extending resource group capabilities
Chapter 11. Customizing events
Chapter 12. Storage related considerations
Chapter 13. Networking considerations
Part 5. Disaster recovery
Chapter 14. PowerHA Extended Distance concepts and planning
Chapter 15. PowerHA with cross-site LVM mirroring
Chapter 16. PowerHA/XD and SVC copy services
Chapter 17. GLVM concepts and configuration
Appendix A. Paper planning worksheets

Content

Part 1. Introduction
Chapter 1. Introduction to PowerHA for AIX
1.1 What is PowerHA for AIX
1.1.1 High availability
1.1.2 Cluster multi-processing
1.2 Availability solutions: An overview
1.2.1 Downtime
1.2.2 Single point of failure (SPOF)
1.3 History and evolution
1.3.1 HACMP Version 5 Release 4
1.3.2 HACMP Version 5 Release 4.1
1.3.3 PowerHA Version 5 Release 5
1.4 High availability terminology and concepts
1.4.1 Terminology
1.4.2 Concepts
1.5 High availability versus fault tolerance
1.5.1 Fault-tolerant systems
1.5.2 High availability systems
1.6 Software planning
1.6.1 AIX level and related requirements
1.6.2 Licensing
1.7 PowerHA software installation
1.7.1 Checking for prerequisites
1.7.2 New installation
1.7.3 Installing PowerHA
Chapter 2. High availability components
2.1 PowerHA configuration data
2.2 Software components
2.3 Cluster topology
2.3.1 RSCT and PowerHA heartbeating
2.3.2 Heartbeat over IP aliases
2.3.3 TCP/IP networks
2.3.4 IP address takeover mechanisms
2.3.5 Persistent IP label or address
2.3.6 Device based or serial networks
2.3.7 Network modules
2.3.8 Clients
2.3.9 Network security considerations
2.4 Resources and resource groups
2.4.1 Definitions
2.4.2 Resources
2.4.3 NFS
2.4.4 Application servers
2.4.5 Application monitors
2.4.6 Communication adapters and links
2.4.7 Tape resources
2.4.8 Fast connect resources
2.4.9 Workload Manager integration
2.4.10 Resource groups
2.5 Plug-ins
2.6 Features (HACMP 5.1, 5.2 and 5.3)
2.6.1 New features
2.6.2 Features no longer supported
2.7 Limits
2.8 Storage characteristics
2.8.1 Shared LVM
2.8.2 Non-concurrent access mode
2.8.3 Concurrent access mode
2.8.4 Enhanced concurrent mode volume groups
2.8.5 Fast disk takeover
2.9 Shared storage configuration
2.9.1 Shared LVM requirements
2.9.2 Non-concurrent, enhanced concurrent, and concurrent
Part 2. Planning, installation, and migration
Chapter 3. Planning
3.1 High availability planning
3.2 Planning for PowerHA
3.2.1 Planning strategy
3.2.2 Planning tools
3.3 Getting started
3.3.1 Current environment
3.3.2 Addressing single points of failure
3.3.3 Initial cluster design
3.3.4 Completing the cluster overview planning worksheet
3.4 Planning cluster hardware
3.4.1 Overview of cluster hardware
3.4.2 Completing the cluster hardware planning worksheet
3.5 Planning cluster software
3.5.1 AIX and RSCT levels
3.5.2 Virtual LAN and SCSI support
3.5.3 Required AIX filesets
3.5.4 AIX security filesets
3.5.5 PowerHA filesets
3.5.6 AIX files altered by PowerHA
3.5.7 Application software
3.5.8 Licensing
3.5.9 Completing the software planning worksheet
3.6 Operating system considerations
3.7 Planning security
3.7.1 Cluster security
3.7.2 User administration
3.7.3 HACMP group
3.7.4 PowerHA ports
3.7.5 Planning for PoweHA file collections
3.8 Planning cluster networks
3.8.1 Terminology
3.8.2 General network considerations
3.8.3 IP Address Takeover planning
3.8.4 Heartbeating over aliases
3.8.5 Non-IP network planning
3.8.6 Planning RS232 serial networks
3.8.7 Planning disk heartbeating
3.8.8 Additional network planning considerations
3.8.9 Completing the network planning worksheets
3.9 Planning storage requirements
3.9.1 Internal disks
3.9.2 Shared disks
3.9.3 Enhanced Concurrent Mode (ECM) volume groups
3.9.4 Shared logical volumes
3.9.5 Fast disk takeover
3.9.6 Completing the storage planning worksheets
3.10 Application planning
3.10.1 Application servers
3.10.2 Application monitoring
3.10.3 Availability analysis tool
3.10.4 Applications integrated with PowerHA
3.10.5 Completing the application planning worksheets
3.11 Planning for resource groups
3.11.1 Resource group attributes
3.11.2 Completing the planning worksheet
3.12 Detailed cluster design
3.13 Developing a cluster test plan
3.13.1 Custom test plan
3.13.2 Cluster Test Tool
3.14 Developing a PowerHA installation plan
3.15 Backing up the cluster configuration
3.16 Documenting the cluster
3.16.1 Exporting a cluster definition file using SMIT
3.16.2 Creating a cluster definition file from a snapshot using SMIT
3.16.3 Creating a configuration report
3.17 Change and problem management
3.18 Planning tools
3.18.1 Cluster diagram
3.18.2 Online Planning Worksheets
3.18.3 Paper planning worksheets
Chapter 4. Installation and configuration
4.1 Basic steps to implement a PowerHA cluster
4.2 Configuring PowerHA
4.2.1 General considerations for the configuration method
4.2.2 Standard Configuration Path: The two-Node Cluster Configuration
Assistant
4.2.3 Using Extended Configuration Path and C-SPOC
4.3 Installing and configuring WebSMIT
4.3.1 The PowerHA related SMIT panels and their structure
4.3.2 Installing a Web server with the IBM HTTP Server code
4.3.3 Installing WebSMIT
4.3.4 Configuring WebSMIT
4.3.5 Starting WebSMIT
4.3.6 Registering clusters with the WebSMIT gateway
4.3.7 Accessing WebSMIT pages and add clusters
4.3.8 Introduction into WebSMIT
4.3.9 WebSMIT monitoring
Chapter 5. Migrating a cluster to PowerHA V5.5
5.1 Identifying the migration path
5.1.1 Migration methods
5.1.2 Supported migration paths
5.2 Prerequisites
5.3 Considerations
5.4 General migration steps
5.5 Scenarios tested
5.5.1 Scenario 1: Non-disruptive upgrade (NDU) from HACMP 5.4.1 to
PowerHA 5.5
5.5.2 Scenario 2: Rolling migration from HACMP 5.3 to PowerHA5.5
5.5.3 Scenario 3: Snapshot upgrade from HACMP 5.3 to PowerHA5.5
5.5.4 Scenario 4: Offline upgrade from HACMP 5.3 to PowerHA 5.5
5.6 Post-migration steps
5.7 Troubleshooting a failed migration
5.7.1 Backing out of a failed migration
5.7.2 Reviewing the cluster version in the HACMP ODM
5.7.3 Troubleshooting a stalled snapshot application
5.7.4 DARE error during synchronization
5.7.5 Error: config_too_long during migration
Part 3. Cluster administration
Chapter 6. Cluster maintenance
6.1 Change control and testing
6.1.1 Scope
6.1.2 Test cluster
6.2 Starting and stopping the cluster
6.2.1 Cluster Services
6.2.2 Starting cluster services
6.2.3 Stopping cluster services
6.3 Resource group and application management
6.3.1 Bringing a resource group offline using SMIT
6.3.2 Bringing a resource group online using SMIT
6.3.3 Moving a resource group using SMIT
6.3.4 Suspending and resuming application monitoring
6.4 Scenarios
6.4.1 PCI hot-plug replacement of a NIC
6.4.2 Fixes
6.4.3 Storage
6.4.4 Applications
6.5 Cluster Test Tool
6.5.1 Custom testing
6.5.2 Test duration
6.5.3 Considerations
6.5.4 Automated testing
6.5.5 Custom testing
Chapter 7. Cluster management
7.1 C-SPOC
7.1.1 C-SPOC in general
7.1.2 C-SPOC SMIT menu
7.2 File collections
7.2.1 Predefined file collections
7.2.2 Managing file collections
7.3 User administration
7.3.1 C-SPOC user and group administration
7.3.2 Password management
7.4 Shared storage management
7.4.1 Updating LVM components
7.4.2 C-SPOC Logical Volume Manager
7.4.3 C-SPOC Concurrent Logical Volume Management
7.4.4 C-SPOC Physical Volume Management
7.4.5 Examples
7.4.6 C-SPOC command line interface (CLI)
7.5 Time synchronization
7.6 Cluster verification and synchronization
7.6.1 Cluster verification and synchronization using SMIT
7.6.2 Dynamic cluster reconfiguration: DARE
7.6.3 Verification log files
7.6.4 Running automatically corrective actions during verification
7.6.5 Automatic cluster verification
7.7 Monitoring PowerHA
7.7.1 Cluster status checking utilities
7.7.2 Cluster status and services checking utilities
7.7.3 Topology information commands
7.7.4 Resource group information commands
7.7.5 Log files
7.7.6 Error notification
7.7.7 Application monitoring
7.7.8 Measuring application availability
Chapter 8. Cluster security
8.1 Cluster security and the clcomd daemon
8.1.1 The /usr/es/sbin/cluster/etc/rhosts file
8.1.2 Disabling the Cluster Communication daemon
8.1.3 Additional cluster security features
8.1.4 Cluster communication over VPN
8.2 Using encrypted inter-node communication
8.2.1 Encryption key management
8.2.2 Setting up message authentication and encryption
8.2.3 Troubleshooting message authentication and encryption
8.2.4 Checking the current message authentication settings
8.3 Secure remote command execution
8.4 WebSmit security
8.4.1 Secure WebSMIT communication
8.4.2 User authentication
8.4.3 Access to WebSMIT
8.4.4 Access to WebSMIT panels
8.5 PowerHA and firewalls
8.6 RSCT security
8.6.1 RSCT and PowerHA
8.6.2 Cluster Security Services (CtSec) overview
8.6.3 Mechanism abstraction layer (MAL)
8.6.4 Mechanism pluggable modules (MPM)
8.6.5 Host-based authentication with ctcasd
8.6.6 Identity mapping service and RMC access control lists
Part 4. Advanced topics (with examples)
Chapter 9. Virtualization and PowerHA

9.1 Virtualization
9.2 Virtual I/O Server
9.3 DLPAR and application provisioning
9.3.1 Requirements
9.3.2 Application provisioning
9.3.3 Configuring DLPAR to PowerHA
9.3.4 Troubleshooting HMC verification errors
9.3.5 Test cluster configuration
9.3.6 Test results
9.4 Live Partition Mobility
9.5 Workload Partitions
9.5.1 Relationships
9.5.2 Planning for Highly Available WPARs
9.5.3 Resource Groups and WPARs
Chapter 10. Extending resource group capabilities
10.1 Settling time
10.2 Node distribution policy
10.2.1 Configuring a RG node-based distribution policy
10.2.2 Node-based distribution scenario
10.3 Dynamic node priority (DNP)
10.3.1 Configuring the dynamic node priority policy
10.3.2 Changing an existing resource group to use DNP policy
10.3.3 How dynamic node priority functions
10.4 Delayed fallback timer
10.5 Resource group dependencies
10.5.1 Resource group parent/child dependency
10.5.2 Resource group location dependency
10.5.3 Combining various dependency relationships
10.5.4 Displaying resource group dependencies
Chapter 11. Customizing events
11.1 Overview of cluster events
11.2 Writing scripts for custom events
11.3 Pre-event and post-event commands
11.3.1 Parallel processed resource groups and usage of pre-event and
post-event scripts
11.3.2 Configuring pre-event or post-event scripts
11.4 Automatic error notification
11.4.1 Disk monitoring consideration
11.4.2 Setting up automatic error notification
11.4.3 Listing automatic error notification
11.4.4 Removing automatic error notification
11.4.5 Using error notification
11.4.6 Customizing event duration
11.4.7 Defining new events
Chapter 12. Storage related considerations
12.1 Volume group types
12.1.1 Enhanced concurrent
12.1.2 Non-concurrent
12.1.3 Concurrent
12.2 Disk reservations
12.3 Forced varyon of volume groups
12.4 Fast disk takeover
12.5 Prerequisites
12.5.1 How fast disk takeover works
12.5.2 Enabling fast disk takeover
12.6 Disk heartbeat
12.6.1 Overview
12.6.2 Prerequisites
12.6.3 Performance considerations
12.6.4 Configuring traditional disk heartbeat
12.6.5 Configuring multi-node disk heartbeat
12.6.6 Testing disk heartbeat connectivity
12.6.7 Monitoring disk heartbeat
12.7 Fast failure detection
Chapter 13. Networking considerations
13.1 EtherChannel
13.1.1 Implementing EtherChannel in a PowerHA environment
13.1.2 Configuration procedures
13.2 Distribution preference for service IP aliases
13.2.1 Configuring service IP distribution policy
13.2.2 Lab experiences with service IP distribution policy
13.3 Site specific service IP labels
13.4 Understanding the netmon.cf file
13.4.1 New netmon.cf format for VIO environments
13.4.2 Implications
13.5 Understanding the clhosts file
13.6 Understanding the clinfo.rc file
Part 5. Disaster recovery
Chapter 14. PowerHA Extended Distance concepts and planning

14.1 Disaster recovery considerations
14.2 PowerHA/XD components
14.2.1 PowerHA/XD Metro Mirror integration feature
14.2.2 Implications
14.3 PowerHA/XD SVC Global Mirror
14.4 PowerHA GLVM
14.5 Locating additional information
Chapter 15. PowerHA with cross-site LVM mirroring
15.1 Cross-site LVM mirroring introduction
15.1.1 Comparison
15.1.2 Requirements
15.2 Infrastructure considerations
15.3 Configuring cross-site LVM mirroring
15.3.1 Configuring the cross-site LVM cluster
15.3.2 Configuring cluster sites
15.3.3 Configuring cross-site LVM mirroring site dependencies
15.3.4 Configuring volume groups with cross-site LVM mirror
15.3.5 Resource groups and cross-site LVM mirroring
15.4 Testing cross-site LVM mirroring
15.4.1 Verifying the cluster
15.4.2 Tested scenarios
15.5 Maintaining cross-site LVM
Chapter 16. PowerHA/XD and SVC copy services
16.1 Scenario description
16.2 implementing a PowerHA/XD SVC configuration
16.2.1 PowerHA/XD SVC prerequisites overview
16.2.2 Installing PowerHA/XD for SVC
16.2.3 Configuring PowerHA/XD for SVC
Chapter 17. GLVM concepts and configuration
17.1 PowerHA/XD GLVM
17.1.1 Definitions and concepts
17.1.2 Configuring Synchronous GLVM with PowerHA/XD
17.2 Converting from GLVM in synchronous mode to asynchronous mode
17.2.1 Migration steps
17.2.2 Test primary site failure
17.3 Migration: Logic for going from HAGEO to GLVM
17.3.1 Install GLVM filesets and configure GLVM
17.3.2 Performance considerations
17.3.3 Troubleshooting
17.4 Steps for migrating from HAGEO to GLVM
Part 6. Appendixes
Appendix A. Paper planning worksheets
Two-node cluster configuration assistant
TCP/IP network planning worksheets
TCP/IP network interface worksheet
Fibre Channel Disks Worksheets
Shared volume group and file system worksheet
NFS-Exported file system or directory worksheet
Application worksheet
Application server worksheet
Application monitor worksheet (custom)
Resource group worksheet
Cluster events worksheet
Cluster file collections worksheet

Скачать

Оставьте комментарий