Oracle数据库数据恢复、性能优化»论坛 › Oracle › Oracle数据库管理 › rac instance recovery

351 积分	0 好友	8 主题

发消息

rac instance recovery

1^#

发表于 2012-5-8 10:16:40 | 查看: 3258| 回复: 1

（1）Instance Failure detected by Cluster Manager and GCS
（2）Reconfiguration of GES resources (enqueues); global resource directory is frozen，During the first phase of recovery, Global Enqueue Services (GES) remasters the enqueues.
（3）Reconfiguration of GCS resources; involves redistribution among surviving instances，The Global Cache Services (GCS) remasters its resources.
（4）One of the surviving instances becomes the “recovering instance”
（5）SMON process of recovering instance starts first pass of redo log read of the failed instance’s redo log thread
（6）SMON finds BWR (block written records) in the redo and removes them as their PI is already written to disk
（7）SMON prepares recovery set of the blocks modified by the failed instance but not written to disk
（8）Entries in the recovery list are sorted by first dirty SCN
（9）SMON informs each block’s master node to take ownership of the block for recovery
（10）Second pass of log read begins.
（11）Redo is applied to the data files.
（12）Global Resource Directory is unfrozen

以上是rac instance recovery的步骤，有几点不明白的：

1.”Reconfiguration of GCS resources“这个应该指的是remaster block资源吧，比如把以crash instance为master node的block remaster到其他节点上，那“Reconfiguration of GES resources (enqueues)”这个是什么意思，具体会做些什么？

2.第9步是指每个block都由其主节点负责恢复吗？比如block a和block b原来的主节点是instance a的，instance a崩溃后block a的主节点变为instance b，block b的主节点变为instance c了，在实例恢复时由instance b来恢复block a，instance c来恢复block b，是这个意思吗？

3.在rac中reconfiguration和remaster是不是一个意思来的，发现这两个术语经常在同一个场合使用，具体请见第2和第3步。

[ 本帖最后由 gdpr-dba 于 2012-5-8 10:31 编辑 ]

分享0