Transcript Implementing Oracle9i data Guard For Higher Availability
Presentation #32050
Implementing Oracle9i Data Guard For Higher Availability
By Daniel T. Liu First American Real Estate Solutions Date:
Thursday, November 14, 2002 @ 2:30 PM - 3:30 PM
Room:
Marriott Salon 2
Agenda
• • • • •
Introduction General Concepts and Architecture Process Architecture Data Protection Mode Data Guard Implementation
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 2
Agenda
• • • •
Role Management Data Guard Implementation Tips Summary Q & A
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 3
Introduction
• •
Challenges for Database Higher Availability
– – – –
Database Size Database Backup Time Database Recovery Time 24 x 7 Uptime High Availability Options
–
Oracle Advanced Replication
– – – –
Oracle Real Application Clusters (RAC) Oracle Data Guard (Standby Database) EMC Time Finder Quest SharePlex
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 4
Introduction
•
History of Standby Database
–
Oracle7.3: First Release of Standby Database
–
Oracle8i: Automatic shipping and application of redo logs
–
Oracle9i Release 1: Protection mode
–
Oracle9i Release 2: Logical standby database
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 5
General Concepts and Architecture
• • •
Data Guard Components Data Guard Roles Data Guard Interfaces
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 6
Data Guard Components
• • • • • •
Primary Database Standby Database
–
Physical Standby Database
–
Logical Standby Database (9iR2 only) Log Transport Services Network Configuration Log Apply Services Data Guard Broker
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 7
CLI GUI Online Redo Logs Primary Database Data Guard Broker Local Archived Logs Data Guard Broker Standby Database Log Transport Services
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES
Remote Archived Logs Log Apply Services
8
Data Guard Roles
• •
A database can operate in one of the two mutually exclusive roles: Failover
–
One of the standby databases takes the primary database role Switchover
–
In Oracle9i, primary and standby database can continue to alternate roles
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 9
Data Guard Interface
• • •
SQL*Plus and SQL Statements SQL> alter database commit to switchover to physical standby; Data Guard Broker GUI
–
Data Guard Manager Data Guard Broker Command-Line Interface $ dgmgrl DGMGRL for Solaris: Version 9.2.0.1.0 Production.
(c) Copyright 2002 Oracle Corporation. All rights reserved.
Welcome to DGMGRL, type "help" for information.
DGMGRL>
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 10
Process Architecture
•
Physical/Logical Standby Processes
–
LGWR (Log Writer) process
–
ARCH (Archiver) process
–
LNS (LGWR Network Server) process
–
RFS (Remote File Server) process
–
MRP (Managed Recovery) process
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 11
Process Architecture
•
Physical/Logical Standby Processes
–
FAL (Fetch Archive Log) Client process
–
FAL (Fetch Archive Log) Server process
–
LSP (Logical Standby) process
–
PX (Parallel Execution) process
–
DMON (Data Guard Broker Monitor) process
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 12
Physical Standby Processes Architecture
Primary Database LGWR Online Redo Logs ARCH LNS Local Archived Logs FAL Server DMON SYNC ASYNC DMON RFS
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES
Physical standby Database ARCH Standby Redo Logs FAL Client/ MRP Remote Archived Logs
13
Logical Standby Processes Architecture
DMON Primary Database LGWR Online Redo Logs ARCH LNS SYNC ASYNC Local Archived Logs DMON RFS Remote Archived Logs
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES
Logical Standby Database PX PX Applying Group LSP0 PX PX Mining Group
14
Data Protection Mode
•
Oracle9i Release 1 has four data protection modes Mode Guaranteed Instant Rapid Delayed Log Writing Process
LGWR LGWR LGWR ARCH
Network Trans Mode
SYNC SYNC ASYNC ASYNC
Disk Write Option
AFFIRM AFFIRM NOAFFIRM NOAFFIRM
Redo Log Reception Option
Standby redo logs Standby redo logs Standby redo logs Archived redo logs
Failure Resolution Option
Protect Unprotect Unprotect Unprotect OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 15
Data Protection Mode
•
Oracle9i Release 2 has three data protection modes Mode Log Writing Process Network Trans Mode Disk Write Option Redo Log Reception Option Supported on
LGWR SYNC
Maximum Protection Maximum Availability
LGWR
Maximum Performance
LGWR or ARCH SYNC SYNC or ASYNC AFFIRM AFFIRM NOAFFIRM Standby redo logs are required Standby redo logs Standby redo logs Physical standby databases Physical and logical standby databases Physical and logical standby databases OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 16
Data Guard Implementation • The following shows how to set up Data Guard in this given environment:
1. The production database name is
prod
.
2. One primary database instance called
prod_01
on host
server_01
; one physical standby database instance called
prod_02
on host
server_02
.
3. Listener
listener_prod_01
is on host
server_01
, and pointed by TNS entry
prod_01
.
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 17
Data Guard Implementation
4. Listener
listener_prod_02
is on host
server_02
, and pointed by TNS entry
prod_02
.
5. The purpose of TNS entry
prod_01
and
prod_02
are used for LGWR/ARCH process to ship redo logs to the standby site, and for FAL process to fetch redo logs from the primary site.
6. Since Data Guard broker is not used here, we set
dg_broker_start
to false.
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 18
Data Guard Implementation
7. The protection mode is set to best performance. Therefore, only local archive destination (log_archive_dest_1) is set to mandatory; the standby archive destination (log_archive_dest_2) is set to optional for LGWR process, with network transmission method of asynchronous and disk write option of no affirm.
8. The standby site is not using standby online redo logs. Therefore, the redo log reception option is archived logs.
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 19
Data Guard Configuration Primary Site: Standby Site:
Database Name:
prod
Primary Server:
server_01
Primary Instance Name:
prod_01
Primary Listener:
listener_prod_01
/var/opt/oracle/oratab entry: Database Name:
prod
Standby Server:
server_02
Standby Instance name:
prod_02
Standby Listener:
listener_prod_02
/var/opt/oracle/oratab entry:
prod_01
:/u01/app/oracle/product/9.2.0:N
prod_02
:/u01/app/oracle/product/9.2.0:N OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 20
Data Guard Configuration
Primary init.ora file: db_name = prod #drs_start = false # for 9.0.1 dg_broker_start = false # for 9.2.0 #fal_server =
“prod_02”
#fal_client =
“prod_01”
log_archive_dest_1 = ’location=/u02/arch/prod/ mandatory’ log_archive_format = prod_%s.arc log_archive_start= true standby_archive_dest = ’/u02/arch/prod’ log_archive_dest_2 = ’service=
prod_02
optional repoen=60 lgwr async noaffirm’ log_archive_dest_state_2 = enable Standby init.ora file: db_name = prod #drs_start = false # for 9.0.1 dg_broker_start = false # for 9.2.0 fal_server =
“prod_01”
fal_client =
“prod_02”
log_archive_dest_1 = ’location=/u02/arch/prod/ mandatory’ log_archive_format = prod_%s.arc log_archive_start= true standby_archive_dest = ’/u02/arch/prod’ #log_archive_dest_2 = ’service=
prod_01
optional repoen=60 lgwr async noaffirm’ #log_archive_dest_state_2 = enable OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 21
Data Guard Configuration
Primary listener.ora file:
Listener_prod_01
(address_list = (address = (protocol = tcp) (host =
server_01
) (port = 1522) ) ) sid_list_listener_prod_01 (sid_list = (sid_desc = (oracle_home=/u01/app/oracle/product/9.2) (sid_name =
prod_01
) ) ) Standby listener.ora file:
Listener_prod_02
(address_list = (address = (protocol = tcp) (host =
server_02
) (port = 1522) ) ) sid_list_listener_prod_02 (sid_list = (sid_desc = (oracle_home=/u01/app/oracle/product/9.2) (sid_name =
prod_02
) ) ) OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 22
Data Guard Configuration
Primary tnsnames.ora file: Primary tnsnames.ora file:
Prod
= (description = (address = (protocol = tcp) (host =
server_01
) (port = 1522) (connect_data = (sid =
prod_01
)))
Prod_01
= (description = (address = (protocol = tcp) (host = server_01) (port = 1522) (connect_data = (sid =
prod_01
)))
Prod_02
= (description = (address = (protocol = tcp) (host = server_02) (port = 1522) (connect_data = (sid =
prod_02
)))
Prod
= (description = (address = (protocol = tcp) (host =
server_01
) (port = 1522) (connect_data = (sid =
prod_01
)))
Prod_01
= (description = (address = (protocol = tcp) (host = server_01) (port = 1522) (connect_data = (sid =
prod_01
)))
Prod_02
= (description = (address = (protocol = tcp) (host = server_02) (port = 1522) (connect_data = (sid =
prod_02
))) OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 23
Steps to Set up a Data Guard Environment
• Step 1: The preparation • Step 2: Backup the primary database datafiles • Step 3: Create the physical standby database control file • Step 4: Transfer the datafiles and control file to the standby site OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 24
Steps to Set up a Data Guard Environment
• Step 5: Start the listeners on both primary and standby site • Step 6: Start the standby database • Step 7: Place the standby database in managed recovery mode • Step 8: Monitor the Log Transport Services and Log Apply Services OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 25
Step 1: The Preparation
• Plan the Data Guard environment.
– Understand the business requirements and demands – Decide the number of standby databases (logical/physical) – Decide file system layout for standby (if feasible, prefer identical layout of primary database) – Ensure primary database is in ARCHIVELOG mode – Choose the proper redo log size and groups – Choose the proper protection mode OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 26
Step 1: The Preparation
• Setup the init.ora file for both primary and standby databases.
• Setup the listener.ora file for both primary and standby databases.
• Setup tnsnames.ora file for both primary and standby sites.
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 27
Step 2: Backup The Primary Database File
• Shut down the primary database.
• Backup the primary database datafiles.
$ cp /u02/oradata/prod/* /u03/backup/prod/ • Startup the primary database with the modified p-file.
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 28
Step 3: Create The Physical Standby Database Control File
• Ensure the primary database is in ARCHIVELOG mode and automatic archiving is enabled.
• Issue the following command to create the standby control file: SQL> alter database create standby controlfile as ‘/u01/app/oracle/admin/prod/ctl/st bycf.ctl’; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 29
Step 4: Transfer The Datafiles and Control File to The Standby Site
• Transfer the datafiles.
$ rcp /u03/backup/prod/* server_02:/u02/oradata/prod • Transfer the standby control file $ rcp /u01/app/oracle/admin/prod/ctl/stbycf.c
tl server_02:/u01/app/oracle/admin/prod/ct l/control01.ctl OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 30
Step 5: Start The Listeners on Both Primary and Standby Site
• Start the primary database listener.
$ lsnrctl start listener_prod_01 • Start the standby database listener.
$ lsnrctl start listener_prod_02 OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 31
Step 6: Start The Standby Database
• Set the correct Oracle environment.
• Connect as sysdba.
SQL> connect /as sysdba; • Bring the database in nomount mode first.
SQL> startup nomount; • Mount the standby database.
SQL> alter database mount standby database; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 32
Step 7: Place the Standby Database in Managed Recovery Mode
• Issue the following command to bring the standby database in managed recover mode.
SQL> alter database recover managed standby database disconnect from session; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 33
Step 8: Monitor the Log Transport Services and Log Apply Services
• Issue a few log switches on the primary database.
SQL> alter system switch logfile; • Confirm the log files received on the standby archive destination.
• Check the standby alert log file to see if the new logs have applied to the standby database.
Media Recovery Log /u02/arch/prod/prod_1482.arc
• Monitor the managed recovery.
SQL> select process, status, thread#, sequence#, block#, blocks from v$managed_standby; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 34
Failover
• Failover is only performed as a result of an unplanned outage of the primary database. • During a failover, the standby database (
prod_02
) becomes the new primary database. It is possible to have data loss.
• The old primary (
prod_01
) has to be discarded.
• Need to re-create a new standby database.
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 35
Forced Failover
• In 9.0.1, since you do not have Standby Redo Log files, you issue the following command on the standby site to activate the new primary database: SQL> alter database activate standby database; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 36
Graceful Failover
• In 9.2.0, you can gracefully Failover even without standby redo log files. Issue the following command on the standby site to Failover to a new primary database.
SQL> alter database recover managed standby database skip standby logfiles; • This will apply all available redo and make the standby available to become a Primary. Complete the operation by switching the standby over to the primary role with the following command: SQL> alter database commit to switchover to primary; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 37
Switchover
• A switchover operation is a planned operation.
• All the archive logs required bringing the standby to the primary’s point in time need to be available. • The old standby database (
prod_02
) becomes the new primary, and the old primary (
prod_01
) becomes the new standby database.
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 38
Switchover Step
• Step 1: Switchover preparation for formal primary database • Step 2: Shutdown the primary database and bring up as the new standby database • Step 3: Switchover preparation for the former standby database • Step 4: Shutdown the standby database and bring up as the new primary database OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 39
Switchover Step
• Step 5: Add Temp Tablespace • Step 6: Put the new standby database in managed recovery mode • Step 7: Change TNS Entry for the New Primary Database OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 40
Step 1: Switchover preparation for formal primary database
• End all activities on the primary and standby database • Check primary database switchover status SQL> select database_role, switchover_status from v$database; DATABASE_ROLE SWITCHOVER_STATUS -------------- --------------- PRIMARY TO STANDBY • Initiate the switchover operation on the primary database SQL> alter database commit to switchover to physical standby; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 41
Step 2: Shutdown the primary database and bring up as the new standby database
• Shutdown the primary database normally SQL> shutdown normal; • Modify the former primary database’s initialization file – fal_server = “prod_02” – fal_client = “prod_01” – Remove parameters log_archive_dest_2 and log_archive_dest_state_2. Or, just defer it if you like • Bring the former primary database in mount mode SQL> startup nomount; SQL> alter database mount standby database; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 42
Step 3: Switchover preparation for the former standby database
• Check standby database switchover status SQL> select database_role, switchover_status from v$database; DATABASE_ROLE SWITCHOVER_STATUS -------------- ----------------- PHYSICAL STANDBY TO PRIMARY • Initiate the switchover operation on the standby database SQL> alter database commit to switchover to physical primary; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 43
Step 4: Shutdown the standby database and bring up as the new primary database
• Shutdown the standby database SQL> shutdown normal; • Modify the former standby database’s initialization file – fal_server = “prod_01” – fal_client = “prod_02” – Add parameters log_archive_dest_2 and log_archive_dest_state_2 • Bring up the former standby database as the new primary database SQL> startup; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 44
Step 5: Add Temp Tablespace
• Issue the following command to add TEMP tablespace SQL> ALTER TABLESPACE TEMP ADD TEMPFILE '/u02/oradata/prod/temp01.dbf' SIZE 3072M reuse AUTOEXTEND OFF; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 45
Step 6: Put the new standby database in managed recovery mode
• Issue the following command on the new standby database.
SQL> alter database recover managed standby database disconnect; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 46
Step 7: Change TNS Entry for the New Primary Database
• Change the TNS entry on all application hosts to point to the new primary
Prod
= (description = (address = (protocol = tcp) (host =
server_02
) (port = 1522) (connect_data = (sid =
prod_02
)) ) OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 47
Implementation Tips
• Tip #1: Primary Online Redo Logs • Tip #2: Standby Online Redo Logs vs. Standby Archived Redo logs • Tip #3: Enforce Logging • Tip #4: RMAN Backup • Tip #5: Disable Log Transport Services When Standby Database is down OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 48
Implementation Tips
• Tip #6: Standby Database Upgrade • Tip #7: Data Guard Broker • Tip #8: Using ‘Delay’ Option to Protect Logical/Physical Corruptions • Tip #9: Always Monitor Log Apply Services and Check alert.log File for Errors OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 49
Tip #1: Primary Online Redo Logs
• The number of redo groups – Fewest groups possible – Guarantee a recycled group is always available • The size of redo logs – Determined by the amount of transaction – Small (50 MB): less standby database lag time – Medain (200 MB) – Large (1 GB): less log switches • The best way to determine if the current configuration is satisfactory is to examine the contents of the log writer process’s trace file and the database’s alert log. OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 50
Tip #2: Standby Online Redo Logs vs. Standby Archived Redo logs Standby Online Redo Logs Standby Archived Redo Logs 1.
2.
3.
4.
5.
Pre-allocated files Can place on raw devices Can be duplexed for more protection Improve redo data availability No Data Loss capable 1.
2.
No extra ARCH process Reduce lag time
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 51
Tip #3: Enforce Logging
• A new feature in Oracle9i Release 2 • Force redo log to be generated for individual database objects set to NOLOGGING. This is required for a no data loss strategy. • Here is the SQL command to set FORCE LOGGING: SQL> select force_logging from v$database; FORCE_LOGGING ------------- NO SQL> alter database force logging; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 52
Tip #4: RMAN Backup
• A failover operation reset logs for the new primary. If you use RMAN to backup your database, you need to create a new incarnation of the target database. Otherwise, your RMAN backup will fail.
RMAN> reset database; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 53
Tip #5: Disable Log Transport Services When Standby Database is down
• When a standby database or host is down for maintenance, it is advisable to temporarily disable the log transport services for that site • It may temporarily freeze the primary database during heavy transactions • Issue this command on the primary database before bring down the standby database: SQL> alter system set log_archive_dest_state_2 = defer; • When the standby database is up again, issue: SQL> alter system set log_archive_dest_state_2 = enable; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 54
Tip #6: Standby Database Upgrade
• Steps to upgrade standby database to newer database version: – Step 1: Shutdown both primary and standby databases – Step 2: Install Oracle9i software on both primary and standby hosts – Step 3: Upgrade the primary database – Step 4: Rebuild standby database from the upgraded primary OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 55
Tip #7: Data Guard Broker
• Oracle9i R1 broker configuration supported only one primary site and one physical standby site. The first release of broker is not so user friendly with limited features.
• Oracle9i R2 broker has made great improvements. The new configuration now support up to nine standby sites (including logical standby database). Both Data Guard Manager and CLI support switchover and failover operations. OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 56
Tip #8: Using ‘Delay’ Option to Protect Logical/Physical Corruptions
• You may utilize the delay option (if you have multiple standby sites) to prevent physical/logical corruption of your primary. For instance, your standby #1 may not have ‘Delay’ on to be your disaster recovery standby database. However, you may opt to implement a delay of minutes or hours on your standby #2 to allow recover from a possible physical or logical corruption on your primary database.
SQL> alter database recover managed standby database delay 5 disconnect; OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 57
Tip #9: Always Monitor Log Apply Services and Check alert.log File for Errors
• Always Check alert.log and trace file on both primary and standby databases.
• See the following web site for standby monitoring scripts to check alert.log: http://www.dbazine.com/liu2.html
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 58
Summary
• • • • • •
High Availability Options Data Guard Concept and Architecture Data Protection Mode Data Guard Implementation Steps Role Management Implementation Tips
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 59
References
Oracle9i, Data Guard Concepts and Administration. Release 1 (9.0.1); Oracle9i, Data Guard Concepts and Administration. Release 2 (9.2); Oracle9i, Data Guard Broker. Release 2 (9.2); Oracle Metalink Support; Top DBA Shell Scripts for Monitoring Database, Daniel T. Liu; DBAZine; I would also like to acknowledge the assistance of Bob Polak of the Allants Groups, Larry Barry, Ann Collins, Archana Sharma and Husam Tomeh of FARES, and Larry Carpenter, Joseph Meeks of Oracle Corporation. OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 60
Thanks For Coming !!
Daniel Liu Contact Information Phone: (714) 701-3346 Email: [email protected]
Email: [email protected]
Company Web Site: http://www.firstam.com
OracleWorld 2002, Paper #32050, Daniel T. Liu, FARES 61