Final Research Draft
Chapter 18 Disaster Recovery Planning
The Nature of Disaster
Natural Disasters
Earthquakes
Floods, storms, fires
Regional events
Man-Made Disasters
Fires
Acts of terrorism
Bombings/explosions
Power outages
Network/utility/infrastructure failures
Hardware/software failures
Strikes/picketing
Theft/vandalism
Understand System Resilience and Fault Tolerance
Fault Tolerance and System Resilience
Protecting Hard Drives
Protecting Servers
Protecting Power Sources
Trusted Recovery
Quality of Service
overview
Fault Tolerance and System Resilience
Single point of failure (SPOF)
Fault tolerance
System resilience
Protecting Hard Drives
RAID-0
RAID-1
RAID-5
RAID-10
Hardware vs. software
Hot swapping vs. cold swapping
Protecting Servers
Failover clusters
Load balancing
Scalability
Replication between members
Protecting Power Sources
UPS
Spike, sag, surge, brownout
Transient
Generators
Trusted Recovery
Assurance after failure or crash
Fail-secure, fail-open
Preparation
System recovery
Reboot into non-privileged state, restore all affected files to pre-failure settings/values
Manual recovery, automated recovery
Automated recovery without undue loss
Function recovery
Quality of Service
Bandwidth
Latency
Jitter
Packet loss
Interference
Prioritization
Recovery Strategy
Business Unit and Functional Priorities
Crisis Management
Emergency Communications
Workgroup Recovery
Alternate Processing Sites
Mutual Assistance Agreements
Database Recovery
overview
Business Unit and Functional Priorities
Prioritization
Mission critical business functions/units
Detailed ordered list of business processes
Priority based on:
Risk
Cost assessment
Mean time to recovery (MTTR)
Maximum tolerable outage (MTO)
Recovery objectives
Crisis Management
Mitigate with disaster recovery plan
Training on disaster recovery procedures
Train and document to counter panic
Crisis training
Emergency Communications
Internal and external
Keep outside informed of recovery process
Support recovery through internal communications
Alternatives in the event of infrastructure collapse during major disasters
Workgroup Recovery
Each department needs to be recovered
Restore worker’s ability to perform work tasks
DRP is not IT only
May require numerous strategies
Independent recovery of work divisions
Alternate Processing Sites
Cold site
Hot site
Warm site
Mobile site
Service bureaus
Cloud computing
Mutual Assistance Agreements
Reciprocal agreements
Difficult to enforce
Requires close proximity
Confidentiality concerns
Database Recovery
Electronic vaulting
Remote journaling
Remote mirroring
Recovery Plan Development
Emergency response
Personnel and communications
Assessment
Backups and offsite storage (see next slide)
Software escrow arrangements
External communications
Utilities
Logistics and supplies
Recovery vs. restoration
Training, awareness, and documentation
Backups and Offsite Storage
Full, incremental, differential
Onsite and offsite
Media rotation schemes
Backup tape formats
Disk to disk backup
Best practices
Tape rotation
Testing and Maintenance
Read-through test
Structured walk-through
Simulation test
Parallel test
Full-interruption test
Maintenance
Conclusion
Read the Exam Essentials
Review the chapter
Perform the Written Labs
Answer the Review Questions