主存储域hosted_storage中master逻辑卷异常损坏的处理方法

master是ovirt主存储域hosted_storage中的一个特殊的逻辑卷(块存储)。该逻辑卷只能用于SPM主机使用,将该逻辑卷格式化为ext3文件系统并挂载到指定目录,用于存储管理台下发给SPM主机的任务信息。

该逻辑卷一旦损毁或文件系统损坏,则管理台无法在数据中心中选择出SPM主机。存储域全部为红色异常状态,整个环境将无法使用。

master 逻辑卷的文件系统损坏

当逻辑卷的文件系统损坏时,可以直接修复该文件系统即可。

操作步骤如下:

  • 查看当前的系统分区情况
 [root@host71 ~]# df -h
 Filesystem                         Size Used Avail Use% Mounted on
 devtmpfs                          16G  0 16G 0% /dev
 tmpfs                            16G 4.0K 16G 1% /dev/shm
 tmpfs                            16G 370M 16G 3% /run
 tmpfs                            16G  0 16G 0% /sys/fs/cgroup
 /dev/mapper/chost-cHost--node--V5.1.4.202004231650+1    15G 12G 2.2G 85% /
 /dev/sda1                          2.9G 259M 2.5G 10% /boot
 /dev/mapper/chost-var                    15G 1.1G 13G 8% /var
 /dev/mapper/chost-tmp                   976M 2.6M 907M 1% /tmp
 /dev/mapper/chost-home                   976M 2.6M 907M 1% /home
 /dev/mapper/chost-data                   737G 242M 699G 1% /data
 /dev/mapper/chost-var_log                  20G 341M 19G 2% /var/log
 /dev/mapper/chost-var_crash                 9.8G 37M 9.2G 1% /var/crash
 /dev/mapper/chost-var_log_audit               2.0G 41M 1.8G 3% /var/log/audit
 192.168.15.65:/home/nfs                  865G 304G 561G 36% /rhev/data-center/mnt/192.168.15.65:_home_nfs
 192.168.15.72:/nfs/exports                 459G 200G 236G 46% /rhev/data-center/mnt/192.168.15.72:_nfs_exports
 /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master 976M 1.3M 924M 1% /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master                   # master 逻辑卷挂载的分区目录
 tmpfs                            3.2G  0 3.2G 0% /run/user/0


  • 查看master逻辑卷的文件系统是否异常
[root@host71 ~]# mount | grep master
/dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master on /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master type ext3 (ro,relatime,seclabel,data=ordered)   # 文件系统变为ro异常
  •  卸载 master 逻辑卷
 [root@host71 vdsm]# umount /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master
 ​
  •  检测 master 的文件系统
 [root@host71 vdsm]# fsck -p /dev/79064813-e96d-41b0-8e84-aac569008b8c/master 
 fsck from util-linux 2.23.2
 /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master: clean, 13/65536 files, 12646/262144 blocks
 
  •  修复 master 逻辑卷的文件系统(如果master逻辑卷文件系统损坏,该命令会修复文件系统 )
[root@host71 vdsm]# fsck /dev/79064813-e96d-41b0-8e84-aac569008b8c/master
 fsck from util-linux 2.23.2
 e2fsck 1.42.9 (28-Dec-2013)
 /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master: clean, 13/65536 files, 12646/262144 blocks
 
  •  对 master 逻辑卷进行挂载
 [root@host71 ~]# mount /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/mastercha
  • 查看管理台是否能够选出SPM主机

master 逻辑卷磁盘损坏无法修复

当磁盘损坏无法修复时,可以重新创建一个master 逻辑卷。

  • 管理台停止 ovirt-engine 服务
 [root@engine60 ~]# systemctl stop ovirt-engine
  • 在 SPM 主机中设置 HA 的模式为全局维护模式
 [root@host71 ~]# hosted-engine --set-maintenance --mode=global
  • 查看当前SPM的分区挂载状况
 [root@host71 ~]# df -h
 Filesystem                         Size Used Avail Use% Mounted on
 devtmpfs                          16G  0 16G 0% /dev
 tmpfs                            16G 4.0K 16G 1% /dev/shm
 tmpfs                            16G 186M 16G 2% /run
 tmpfs                            16G  0 16G 0% /sys/fs/cgroup
 /dev/mapper/chost-cHost--node--V5.1.4.202004231650+1    15G 12G 2.2G 85% /
 /dev/sda1                          2.9G 259M 2.5G 10% /boot
 /dev/mapper/chost-var                    15G 1.1G 13G 8% /var
 /dev/mapper/chost-tmp                   976M 2.6M 907M 1% /tmp
 /dev/mapper/chost-home                   976M 2.6M 907M 1% /home
 /dev/mapper/chost-data                   737G 242M 699G 1% /data
 /dev/mapper/chost-var_log                  20G 319M 19G 2% /var/log
 /dev/mapper/chost-var_crash                 9.8G 37M 9.2G 1% /var/crash
 /dev/mapper/chost-var_log_audit               2.0G 39M 1.8G 3% /var/log/audit
 192.168.15.65:/home/nfs                  865G 302G 564G 35% /rhev/data-center/mnt/192.168.15.65:_home_nfs
 192.168.15.72:/nfs/exports                 459G 200G 236G 46% /rhev/data-center/mnt/192.168.15.72:_nfs_exports
 tmpfs                            3.2G  0 3.2G 0% /run/user/0
 /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master 976M 1.4M 924M 1% /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master  # 重点
  • 查看 master 逻辑卷所在分区下的文件内容
 [root@host71 ~]# ll /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master
 total 24
 drwx------. 3 root root 16384 Apr 29 2020 lost+found
 drwxr-xr-x. 2 vdsm kvm 4096 Nov 10 10:16 tasks
 drwxr-xr-x. 2 vdsm kvm 4096 Apr 29 2020 vms
  • 卸载 master 逻辑卷
 [root@host71 ~]# umount /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master
  • 再次查看 master 逻辑卷是否已卸载
 [root@host71 ~]# df -h
 Filesystem                     Size Used Avail Use% Mounted on
 devtmpfs                       16G  0 16G 0% /dev
 tmpfs                        16G 4.0K 16G 1% /dev/shm
 tmpfs                        16G 186M 16G 2% /run
 tmpfs                        16G  0 16G 0% /sys/fs/cgroup
 /dev/mapper/chost-cHost--node--V5.1.4.202004231650+1 15G 12G 2.2G 85% /
 /dev/sda1                      2.9G 259M 2.5G 10% /boot
 /dev/mapper/chost-var                15G 1.1G 13G 8% /var
 /dev/mapper/chost-tmp                976M 2.6M 907M 1% /tmp
 /dev/mapper/chost-home               976M 2.6M 907M 1% /home
 /dev/mapper/chost-data               737G 242M 699G 1% /data
 /dev/mapper/chost-var_log              20G 319M 19G 2% /var/log
 /dev/mapper/chost-var_crash             9.8G 37M 9.2G 1% /var/crash
 /dev/mapper/chost-var_log_audit           2.0G 39M 1.8G 3% /var/log/audit
 192.168.15.65:/home/nfs               865G 302G 564G 35% /rhev/data-center/mnt/192.168.15.65:_home_nfs
 192.168.15.72:/nfs/exports             459G 200G 236G 46% /rhev/data-center/mnt/192.168.15.72:_nfs_exports
 tmpfs                        3.2G  0 3.2G 0% /run/user/0
  • 重命名 master 逻辑卷为 master-right
 [root@host71 ~]# lvrename /dev/79064813-e96d-41b0-8e84-aac569008b8c/master /dev/79064813-e96d-41b0-8e84-aac569008b8c/master-right
  Renamed "master" to "master-right" in volume group "79064813-e96d-41b0-8e84-aac569008b8c"
  • 创建新的master 逻辑卷
 [root@host71 ~]# lvcreate -L 1G -n master 79064813-e96d-41b0-8e84-aac569008b8c
  Logical volume "master" created.
  • 对 master 逻辑卷设置 ext3 文件系统
 [root@host71 ~]# mkfs.ext3 /dev/79064813-e96d-41b0-8e84-aac569008b8c/master
 mke2fs 1.42.9 (28-Dec-2013)
 Filesystem label=
 OS type: Linux
 Block size=4096 (log=2)
 Fragment size=4096 (log=2)
 Stride=0 blocks, Stripe width=0 blocks
 65536 inodes, 262144 blocks
 13107 blocks (5.00%) reserved for the super user
 First data block=0
 Maximum filesystem blocks=268435456
 8 block groups
 32768 blocks per group, 32768 fragments per group
 8192 inodes per group
 Superblock backups stored on blocks: 
  32768, 98304, 163840, 229376
 ​
 Allocating group tables: done              
 Writing inode tables: done              
 Creating journal (8192 blocks): done
 Writing superblocks and filesystem accounting information: done
  • 将新建的 master 逻辑卷挂载到 /mnt 目录下
 [root@host71 ~]# mount /dev/79064813-e96d-41b0-8e84-aac569008b8c/master /mnt/
  • 在 /mnt 目录下创建相关的目录
 [root@host71 mnt]# mkdir tasks vms
 [root@host71 mnt]# ll
 total 24
 drwx------. 2 root root 16384 Nov 10 10:27 lost+found
 drwxr-xr-x. 2 root root 4096 Nov 10 10:28 tasks
 drwxr-xr-x. 2 root root 4096 Nov 10 10:28 vms
 [root@host71 mnt]# chown vdsm:kvm tasks/ vms/
 [root@host71 mnt]# ll
 total 24
 drwx------. 2 root root 16384 Nov 10 10:27 lost+found
 drwxr-xr-x. 2 vdsm kvm 4096 Nov 10 10:28 tasks
 drwxr-xr-x. 2 vdsm kvm 4096 Nov 10 10:28 vms
  • 退出 /mnt 目录并卸载 /mnt 目录
 [root@host71 mnt]# cd
 [root@host71 ~]# umount /mnt/
  • 对 master 逻辑卷进行挂载
 [root@host71 ~]# mount /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master
  • 取消HA的全局维护模式
 [root@host71 ~]# hosted-engine --set-maintenance --mode=none
  • 等待管理台虚机启动,并在管理台中开启 ovirt-engine 服务
[root@engine60 ~]# systemctl start ovirt-engine
[root@engine60 ~]# systemctl status ovirt-engine
● ovirt-engine.service - oVirt Engine
   Loaded: loaded (/usr/lib/systemd/system/ovirt-engine.service; enabled; vendor preset: disabled)
   Active: active (running) since Wed 2021-11-10 10:35:39 CST; 1s ago
 Main PID: 56883 (ovirt-engine.py)
   CGroup: /system.slice/ovirt-engine.service
           ├─56883 /usr/bin/python /usr/share/ovirt-engine/services/ovirt-engine/ovirt-engine.py --redirect-output --systemd=notify start
           └─56924 ovirt-engine -server -XX:+TieredCompilation -Xms1955M -Xmx1955M -Xss1M -Djava.awt.headless=true -Dsun.rmi.dgc.client.gcInterval=3600000 -Dsun.rmi.dgc....

Nov 10 10:35:34 engine60.com systemd[1]: Starting oVirt Engine...
Nov 10 10:35:35 engine60.com ovirt-engine.py[56883]: 2021-11-10 10:35:35,086+0800 ovirt-engine: INFO _detectJBossVersion:195 Detecting JBoss version. Running: /...terval=36
Nov 10 10:35:39 engine60.com ovirt-engine.py[56883]: 2021-11-10 10:35:39,674+0800 ovirt-engine: INFO _detectJBossVersion:215 Return code: 0,  | stdout: '[u'Wild...err: '[]'
Nov 10 10:35:39 engine60.com systemd[1]: Started oVirt Engine.
Hint: Some lines were ellipsized, use -l to show in full.

  • 访问管理台是否可以访问,并查看主机列表中是否选出SPM主机,同时查看存储域是否为激活状态。如果存储域正常则表示创建的 master 逻辑卷已生效。
0 条回复 A 作者 M 管理员 E
    所有的伟大,都源于一个勇敢的开始!
欢迎您,新朋友,感谢参与互动!欢迎您 {{author}},您在本站有{{commentsCount}}条评论