10gR2 RAC(十六)rac进程 - jony413

公告

（二）、相关进程
查看v$bgprocess这个视图查看所有后台进程的说明

1、clusterware
a、CSS（cluster synchronization services）
init.cssd  ocssd.bin
ocssd这个进程是clusterware中最关键的进程，如果这个进程出现异常，会导致系统重启。这个进程提供CSS服务，通过多种心跳机制，实时监控集群健康状态，提供脑裂保护等基础集群服务功能。
CSS--Manages the cluster configuration by controlling which nodes are members of the cluster and by notifying members when a node joins or leaves the cluster. If you are using third-party clusterware, then the css process interfaces with your clusterware to manage node membership information.
b、CRS（cluster ready services）
init.crsd  crsd.bin
crsd是实现高可用性（HA）的主要进程，提供CRS服务，监控应用层资源，并在这些资源运行异常时进行干预，包括关闭、重启进程或转移服务。
CRS--The primary program for managing high availability operations within a cluster. Anything that the crs process manages is known as a cluster resource which could be a database, an instance, a service, a Listener, a virtual IP (VIP) address, an application process, and so on. The crs process manages cluster resources based on the resource's configuration information that is stored in the OCR. This includes start, stop, monitor and failover operations. The crs process generates events when a resource status changes. When you have installed Oracle RAC, crs monitors the Oracle instance, Listener, and so on, and automatically restarts these components when a failure occurs. By default, the crs process makes five attempts to restart a resource and then does not make further restart attempts if the resource does not restart.
c、EVM (Event Management)
init.evmd        evmd.bin       evmlogger.bin
evmd进程负责发布CRS产生的各种事件，还负责crs和css之间的通信。
EVM--A background process that publishes events that crs creates.
d、ONS（oracle notification service）
ons
ONS--A publish and subscribe service for communicating Fast Application Notification (FAN) events.
e、PCM（process monitor daemon）
oprocd  oprocd.bin
在非linux平台，用来检测节点的process Hang，提供“IO隔离”功能；linux平台利用hangcheck-timer模块来实现“IO隔离”。
OPROCD--This process is locked in memory to monitor the cluster and provide I/O fencing. OPROCD performs its check, stops running, and if the wake up is beyond the expected time, then OPROCD resets the processor and reboots the node. An OPROCD failure results in Oracle Clusterware restarting the node. OPROCD uses the hangcheck timer on Linux platforms.
f、RACG
racgimon
racgimon进程负责检查数据库健康状态，负责service的启动、停止、故障转移。
RACG--Extends clusterware to support Oracle-specific requirements and complex resources. Runs server callout scripts when FAN events occur.

2、rac database
（1）实例进程
ora_cjq0_rac1作业队列
ora_j000_rac1

ora_qmnc_rac1高级队列
ora_q001_rac1

ora_mman_rac1 自动内存管理

ora_mmon_rac1 管理监视器，AWR数据收集
ora_mmnl_rac1

ora_d000_rac1 dispatcher进程
ora_s000_rac1

ora_psp0_rac1 process spawner进程

ora_pmon_rac1 主要进程
ora_dbw0_rac1
ora_lgwr_rac1
ora_ckpt_rac1
ora_smon_rac1
ora_reco_rac1
（2）ASM实例进程
ora_asmb_rac1
ora_rbal_rac1
ora_o001_rac1
（3）RAC进程
ora_lms0_rac1   cache fusion的主要进程，负责数据块在实例内的传递，对应GCS
ora_lms1_rac1   (global cache service）服务，进程数量由gcs_server_processes控制
                        默认2，取值范围0-9
ora_lmd0_rac1   提供GES服务，在各实例间协调对数据库的访问顺序，保证数据的一致性，负责锁管理
ra_lck0_rac1    负责non_cache fusion资源的同步访问
ora_lmon_rac1   各实例的lmon进程会定期互相通信，检查集群中各节点的监控状态，当              某个节点出现故障，负责集群重构，grd恢复等操作，提供CGS服务。
ora_diag_rac1    监控实例的健康状态，在出现错误的时候收集诊断数据，记录到alert日志
ora_pz99_rac1

* PZ (PQ slaves used for global Views) are RAC Parallel Server Slave processes, but they are not normal parallel slave processes, PZnn processes (starting at 99) are used to query GV$ views which is done using Parallel Execution on all instances, if more than one PZ process is needed, then PZ98, PZ97,... (in that order) are created automatically.
* O00 (ASM slave processes) A group of slave processes establish connections to the ASM instance. Through this connection pool database processes can send messages to the ASM instance. For example opening a file sends the open request to the ASM instance via a slave. However slaves are not used for long running operations such as creating a file. The use slave (pool) connections eliminate the overhead of  logging into the ASM instance for short requests
* x000 - Slave used to expell disks after diskgroup reconfiguration

posted on 2013-01-20 10:49 jony413 阅读(288) 评论(0) 收藏举报

刷新页面返回顶部