master是ovirt主存储域hosted_storage中的一个特殊的逻辑卷(块存储)。该逻辑卷只能用于SPM主机使用,将该逻辑卷格式化为ext3文件系统并挂载到指定目录,用于存储管理台下发给SPM主机的任务信息。
该逻辑卷一旦损毁或文件系统损坏,则管理台无法在数据中心中选择出SPM主机。存储域全部为红色异常状态,整个环境将无法使用。
master 逻辑卷的文件系统损坏
当逻辑卷的文件系统损坏时,可以直接修复该文件系统即可。
操作步骤如下:
- 查看当前的系统分区情况
[root@host71 ~]# df -h Filesystem Size Used Avail Use% Mounted on devtmpfs 16G 0 16G 0% /dev tmpfs 16G 4.0K 16G 1% /dev/shm tmpfs 16G 370M 16G 3% /run tmpfs 16G 0 16G 0% /sys/fs/cgroup /dev/mapper/chost-cHost--node--V5.1.4.202004231650+1 15G 12G 2.2G 85% / /dev/sda1 2.9G 259M 2.5G 10% /boot /dev/mapper/chost-var 15G 1.1G 13G 8% /var /dev/mapper/chost-tmp 976M 2.6M 907M 1% /tmp /dev/mapper/chost-home 976M 2.6M 907M 1% /home /dev/mapper/chost-data 737G 242M 699G 1% /data /dev/mapper/chost-var_log 20G 341M 19G 2% /var/log /dev/mapper/chost-var_crash 9.8G 37M 9.2G 1% /var/crash /dev/mapper/chost-var_log_audit 2.0G 41M 1.8G 3% /var/log/audit 192.168.15.65:/home/nfs 865G 304G 561G 36% /rhev/data-center/mnt/192.168.15.65:_home_nfs 192.168.15.72:/nfs/exports 459G 200G 236G 46% /rhev/data-center/mnt/192.168.15.72:_nfs_exports /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master 976M 1.3M 924M 1% /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master # master 逻辑卷挂载的分区目录 tmpfs 3.2G 0 3.2G 0% /run/user/0
- 查看master逻辑卷的文件系统是否异常
[root@host71 ~]# mount | grep master /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master on /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master type ext3 (ro,relatime,seclabel,data=ordered) # 文件系统变为ro异常
- 卸载 master 逻辑卷
[root@host71 vdsm]# umount /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master
- 检测 master 的文件系统
[root@host71 vdsm]# fsck -p /dev/79064813-e96d-41b0-8e84-aac569008b8c/master fsck from util-linux 2.23.2 /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master: clean, 13/65536 files, 12646/262144 blocks
- 修复 master 逻辑卷的文件系统(如果master逻辑卷文件系统损坏,该命令会修复文件系统 )
[root@host71 vdsm]# fsck /dev/79064813-e96d-41b0-8e84-aac569008b8c/master fsck from util-linux 2.23.2 e2fsck 1.42.9 (28-Dec-2013) /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master: clean, 13/65536 files, 12646/262144 blocks
- 对 master 逻辑卷进行挂载
[root@host71 ~]# mount /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/mastercha
- 查看管理台是否能够选出SPM主机
master 逻辑卷磁盘损坏无法修复
当磁盘损坏无法修复时,可以重新创建一个master 逻辑卷。
- 管理台停止 ovirt-engine 服务
[root@engine60 ~]# systemctl stop ovirt-engine
- 在 SPM 主机中设置 HA 的模式为全局维护模式
[root@host71 ~]# hosted-engine --set-maintenance --mode=global
- 查看当前SPM的分区挂载状况
[root@host71 ~]# df -h Filesystem Size Used Avail Use% Mounted on devtmpfs 16G 0 16G 0% /dev tmpfs 16G 4.0K 16G 1% /dev/shm tmpfs 16G 186M 16G 2% /run tmpfs 16G 0 16G 0% /sys/fs/cgroup /dev/mapper/chost-cHost--node--V5.1.4.202004231650+1 15G 12G 2.2G 85% / /dev/sda1 2.9G 259M 2.5G 10% /boot /dev/mapper/chost-var 15G 1.1G 13G 8% /var /dev/mapper/chost-tmp 976M 2.6M 907M 1% /tmp /dev/mapper/chost-home 976M 2.6M 907M 1% /home /dev/mapper/chost-data 737G 242M 699G 1% /data /dev/mapper/chost-var_log 20G 319M 19G 2% /var/log /dev/mapper/chost-var_crash 9.8G 37M 9.2G 1% /var/crash /dev/mapper/chost-var_log_audit 2.0G 39M 1.8G 3% /var/log/audit 192.168.15.65:/home/nfs 865G 302G 564G 35% /rhev/data-center/mnt/192.168.15.65:_home_nfs 192.168.15.72:/nfs/exports 459G 200G 236G 46% /rhev/data-center/mnt/192.168.15.72:_nfs_exports tmpfs 3.2G 0 3.2G 0% /run/user/0 /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master 976M 1.4M 924M 1% /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master # 重点
- 查看 master 逻辑卷所在分区下的文件内容
[root@host71 ~]# ll /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master total 24 drwx------. 3 root root 16384 Apr 29 2020 lost+found drwxr-xr-x. 2 vdsm kvm 4096 Nov 10 10:16 tasks drwxr-xr-x. 2 vdsm kvm 4096 Apr 29 2020 vms
- 卸载 master 逻辑卷
[root@host71 ~]# umount /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master
- 再次查看 master 逻辑卷是否已卸载
[root@host71 ~]# df -h Filesystem Size Used Avail Use% Mounted on devtmpfs 16G 0 16G 0% /dev tmpfs 16G 4.0K 16G 1% /dev/shm tmpfs 16G 186M 16G 2% /run tmpfs 16G 0 16G 0% /sys/fs/cgroup /dev/mapper/chost-cHost--node--V5.1.4.202004231650+1 15G 12G 2.2G 85% / /dev/sda1 2.9G 259M 2.5G 10% /boot /dev/mapper/chost-var 15G 1.1G 13G 8% /var /dev/mapper/chost-tmp 976M 2.6M 907M 1% /tmp /dev/mapper/chost-home 976M 2.6M 907M 1% /home /dev/mapper/chost-data 737G 242M 699G 1% /data /dev/mapper/chost-var_log 20G 319M 19G 2% /var/log /dev/mapper/chost-var_crash 9.8G 37M 9.2G 1% /var/crash /dev/mapper/chost-var_log_audit 2.0G 39M 1.8G 3% /var/log/audit 192.168.15.65:/home/nfs 865G 302G 564G 35% /rhev/data-center/mnt/192.168.15.65:_home_nfs 192.168.15.72:/nfs/exports 459G 200G 236G 46% /rhev/data-center/mnt/192.168.15.72:_nfs_exports tmpfs 3.2G 0 3.2G 0% /run/user/0
- 重命名 master 逻辑卷为 master-right
[root@host71 ~]# lvrename /dev/79064813-e96d-41b0-8e84-aac569008b8c/master /dev/79064813-e96d-41b0-8e84-aac569008b8c/master-right Renamed "master" to "master-right" in volume group "79064813-e96d-41b0-8e84-aac569008b8c"
- 创建新的master 逻辑卷
[root@host71 ~]# lvcreate -L 1G -n master 79064813-e96d-41b0-8e84-aac569008b8c Logical volume "master" created.
- 对 master 逻辑卷设置 ext3 文件系统
[root@host71 ~]# mkfs.ext3 /dev/79064813-e96d-41b0-8e84-aac569008b8c/master mke2fs 1.42.9 (28-Dec-2013) Filesystem label= OS type: Linux Block size=4096 (log=2) Fragment size=4096 (log=2) Stride=0 blocks, Stripe width=0 blocks 65536 inodes, 262144 blocks 13107 blocks (5.00%) reserved for the super user First data block=0 Maximum filesystem blocks=268435456 8 block groups 32768 blocks per group, 32768 fragments per group 8192 inodes per group Superblock backups stored on blocks: 32768, 98304, 163840, 229376 Allocating group tables: done Writing inode tables: done Creating journal (8192 blocks): done Writing superblocks and filesystem accounting information: done
- 将新建的 master 逻辑卷挂载到 /mnt 目录下
[root@host71 ~]# mount /dev/79064813-e96d-41b0-8e84-aac569008b8c/master /mnt/
- 在 /mnt 目录下创建相关的目录
[root@host71 mnt]# mkdir tasks vms [root@host71 mnt]# ll total 24 drwx------. 2 root root 16384 Nov 10 10:27 lost+found drwxr-xr-x. 2 root root 4096 Nov 10 10:28 tasks drwxr-xr-x. 2 root root 4096 Nov 10 10:28 vms [root@host71 mnt]# chown vdsm:kvm tasks/ vms/ [root@host71 mnt]# ll total 24 drwx------. 2 root root 16384 Nov 10 10:27 lost+found drwxr-xr-x. 2 vdsm kvm 4096 Nov 10 10:28 tasks drwxr-xr-x. 2 vdsm kvm 4096 Nov 10 10:28 vms
- 退出 /mnt 目录并卸载 /mnt 目录
[root@host71 mnt]# cd [root@host71 ~]# umount /mnt/
- 对 master 逻辑卷进行挂载
[root@host71 ~]# mount /dev/mapper/79064813--e96d--41b0--8e84--aac569008b8c-master /rhev/data-center/mnt/blockSD/79064813-e96d-41b0-8e84-aac569008b8c/master
- 取消HA的全局维护模式
[root@host71 ~]# hosted-engine --set-maintenance --mode=none
- 等待管理台虚机启动,并在管理台中开启 ovirt-engine 服务
[root@engine60 ~]# systemctl start ovirt-engine [root@engine60 ~]# systemctl status ovirt-engine ● ovirt-engine.service - oVirt Engine Loaded: loaded (/usr/lib/systemd/system/ovirt-engine.service; enabled; vendor preset: disabled) Active: active (running) since Wed 2021-11-10 10:35:39 CST; 1s ago Main PID: 56883 (ovirt-engine.py) CGroup: /system.slice/ovirt-engine.service ├─56883 /usr/bin/python /usr/share/ovirt-engine/services/ovirt-engine/ovirt-engine.py --redirect-output --systemd=notify start └─56924 ovirt-engine -server -XX:+TieredCompilation -Xms1955M -Xmx1955M -Xss1M -Djava.awt.headless=true -Dsun.rmi.dgc.client.gcInterval=3600000 -Dsun.rmi.dgc.... Nov 10 10:35:34 engine60.com systemd[1]: Starting oVirt Engine... Nov 10 10:35:35 engine60.com ovirt-engine.py[56883]: 2021-11-10 10:35:35,086+0800 ovirt-engine: INFO _detectJBossVersion:195 Detecting JBoss version. Running: /...terval=36 Nov 10 10:35:39 engine60.com ovirt-engine.py[56883]: 2021-11-10 10:35:39,674+0800 ovirt-engine: INFO _detectJBossVersion:215 Return code: 0, | stdout: '[u'Wild...err: '[]' Nov 10 10:35:39 engine60.com systemd[1]: Started oVirt Engine. Hint: Some lines were ellipsized, use -l to show in full.
- 访问管理台是否可以访问,并查看主机列表中是否选出SPM主机,同时查看存储域是否为激活状态。如果存储域正常则表示创建的 master 逻辑卷已生效。