当secondary机器由于某种原因磁盘出错,dg被deport掉,会导致VVR配置出错,这种情况下,需要做下列操作(不一定都必须,但是都要检查)
1. import dg:
vxdg import datadg
2. start all replicated volume:
vxvol start <vol_name>
这样你在secondary机器上用vxprint检查,所有的rvg、rlink、volume都应该为ENABLED、ACTIVE状态,如果有DISABLE类似状态,请再检查并修复。
[root@secondary datadg]# vxprint
Disk group: datadg
TY NAME ASSOC KSTATE LENGTH PLOFFS STATE TUTIL0 PUTIL0
dg datadg datadg - - - - - -
dm datadg01 sdb - 4094608 - - - -
dm datadg02 sdc - 4094608 - - - -
rv primary_secondary_rvg - ENABLED - - ACTIVE - -
rl rlk_primary_primary_secondary_ primary_secondary_rvg CONNECT - - ACTIVE - -
v vol1lv primary_secondary_rvg ENABLED 1048576 - ACTIVE - -
pl vol1lv-01 vol1lv ENABLED 1048576 - ACTIVE - -
sd datadg01-01 vol1lv-01 ENABLED 1048576 0 - - -
pl vol1lv-02 vol1lv ENABLED 1048576 - ACTIVE - -
sd datadg02-01 vol1lv-02 ENABLED 1048576 0 - - -
pl vol1lv-03 vol1lv ENABLED LOGONLY - ACTIVE - -
sd datadg01-05 vol1lv-03 ENABLED 32 LOG - - -
pl vol1lv-04 vol1lv ENABLED LOGONLY - ACTIVE - -
sd datadg02-05 vol1lv-04 ENABLED 32 LOG - - -
v vol2lv primary_secondary_rvg ENABLED 1048576 - ACTIVE - -
pl vol2lv-01 vol2lv ENABLED 1048576 - ACTIVE - -
sd datadg01-02 vol2lv-01 ENABLED 1048576 0 - - -
pl vol2lv-02 vol2lv ENABLED 1048576 - ACTIVE - -
sd datadg02-02 vol2lv-02 ENABLED 1048576 0 - - -
pl vol2lv-03 vol2lv ENABLED LOGONLY - ACTIVE - -
sd datadg01-04 vol2lv-03 ENABLED 32 LOG - - -
pl vol2lv-04 vol2lv ENABLED LOGONLY - ACTIVE - -
sd datadg02-04 vol2lv-04 ENABLED 32 LOG - - -
v srllv primary_secondary_rvg ENABLED 262144 SRL ACTIVE - -
pl srllv-01 srllv ENABLED 262144 - ACTIVE - -
sd datadg01-03 srllv-01 ENABLED 262144 0 - - -
pl srllv-02 srllv ENABLED 262144 - ACTIVE - -
sd datadg02-03 srllv-02 ENABLED 262144 0 - - -
3. 在需要recover的机器执行recover操作:
root@secondary datadg]# vxrecover –s
这步执行之后应该就ok,可以用下面的命令检查。下面是在执行vxrecover之前和之后的vxprint结果显示比较:
[root@secondary datadg]# vxprint -lP
Disk group: datadg
Rlink: rlk_primary_primary_secondary_
info: timeout=500 packet_size=8400 rid=0.1100
latency_high_mark=10000 latency_low_mark=9950
bandwidth_limit=none
state: state=ACTIVE
synchronous=off latencyprot=off srlprot=autodcm
assoc: rvg=primary_secondary_rvg
remote_host=172.111.100.10 IP_addr=172.111.100.10 port=4145
remote_dg=datadg
remote_dg_dgid=1265752821.7.localhost.localdomain
remote_rvg_version=unknown
remote_rlink=rlk_secondary_primary_secondar
remote_rlink_rid=0.1106
local_host=172.111.100.20 IP_addr=172.111.100.20 port=4145
protocol: UDP/IP
flags: write disabled attached consistent disconnected needs_recovery
[root@secondary datadg]# vxrecover -s
[root@secondary datadg]# vxprint -lP
Disk group: datadg
Rlink: rlk_primary_primary_secondary_
info: timeout=500 packet_size=8400 rid=0.1100
latency_high_mark=10000 latency_low_mark=9950
bandwidth_limit=none
state: state=ACTIVE
synchronous=off latencyprot=off srlprot=autodcm
assoc: rvg=primary_secondary_rvg
remote_host=172.111.100.10 IP_addr=172.111.100.10 port=4145
remote_dg=datadg
remote_dg_dgid=1265752821.7.localhost.localdomain
remote_rvg_version=21
remote_rlink=rlk_secondary_primary_secondar
remote_rlink_rid=0.1106
local_host=172.111.100.20 IP_addr=172.111.100.20 port=4145
protocol: UDP/IP
flags: write enabled attached consistent connected
[root@secondary datadg]# vxrlink -g datadg verify rlk_primary_primary_secondary_
RLINK REMOTE HOST LOCAL HOST STATUS STATE
rlk_primary_primary_secondary_ 172.111.100.10 172.111.100.20 OK ACTIVE
Full Text