配置:MS7020(V1.5.12T04P08_NAS.01P01)
问题简述:升级好SAN/NAS一体化版本后,在SAN的NAS向导中启用“NAS控制台”提示“NAS系统已停止运行,请联系厂商技术支持人员处理”,提示截图如下:
登录NAS虚拟机底层,pcs查看资源服务状态,如下:
●[root@nas ~]#pcs status ●Cluster name: my_cluster ●Stack: corosync ●Current DC: node1 (version 1.1.23-1.el7_9.1-9acf116022) - partition with quorum ●Last updated: Thu Jun 23 13:45:27 2022 ●Last change: Sun Jun 19 16:07:03 2022 by root via cibadmin on node0 ● ●2 nodes configured ●8 resource instances configured ● ●Online: [ node0 node1 ] ● ●Full list of resources: ● ●Resource Group: nas_group ●nas_shared_metadata (ocf::heartbeat:Filesystem): Stopped ●nas_samba (ocf::heartbeat:smbserver): Stopped ●fence_xvm_node0 (stonith:fence_xvm): Started node1 ●fence_xvm_node1 (stonith:fence_xvm): Started node0 ●Resource Group: nas_tomcat_service ●nas_syslun (ocf::heartbeat:Filesystem):Stopped ●nas_tomcat (ocf::heartbeat:tomcat):Stopped ●nas_ip (ocf::heartbeat:IPaddr2):Stopped ●nas_pool_test (ocf::heartbeat:zpool):Stopped ● ●Failed Resource Actions: ●*nas_shared_metadata_start_0 on node0 'unknown error' (1): call=10, status=complete, exitreason='Couldn'tmount filesystem /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_22222222-22222222 on /nasgui/metadata', ●last-rc-change='Thu Jun 23 12:36:21 2022', queued=0ms, exec=190ms ●* nas_syslun_start_0 on node0 'unknown error' (1): call=38, status=complete,exitreason='Couldn't mount filesystem /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_11111111-11111111 on /nasgui/share', ●last-rc-change='Thu Jun 23 12:36:27 2022', queued=0ms, exec=176ms ●* nas_shared_metadata_start_0 on node1 'unknown error' (1): call=36, status=complete, exitreason='Couldn't mount filesystem /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_22222222-22222222 on /nasgui/metadata', ●last-rc-change='Thu Jun 23 12:36:29 2022', queued=0ms, exec=137ms ●* nas_syslun_start_0 on node1 'unknown error' (1): call=40, status=complete, exitreason='Couldn't mount filesystem /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_11111111-11111111 on /nasgui/share', ●last-rc-change='Thu Jun 23 12:36:35 2022', queued=0ms, exec=118ms ●* nas_pool_test_start_0 on node1 'unknown error' (1): call=37, status=complete, exitreason='', ●last-rc-change='Thu Jun 23 12:36:29 2022', queued=0ms, exec=497ms ● ●Daemon Status: ●corosync: active/disabled ●pacemaker: active/disabled ●pcsd: active/enabled |
集群资源状态异常,提示元数据和共享LUN挂载失败,系统挂载点有残留信息;
1、登录到NAS底层检查/dev/sda3是否挂载在/nasgui/storage/路径下,且检查一下该路径下是否有残留文件,将/dev/sd3挂载分区的数据给删掉,如下所示:
[root@00-b3-42-01-72-da ~]# ssh192.168.122.100 #密码:Sw!c6tSP root@192.168.122.100's password: Last login: Thu Jun 23 13:56:08 2022 -bash: warning: setlocale: LC_TIME: cannot change locale (en_US.UTF-8) [root@nas ~]# df -h Filesystem Size Used Avail Use% Mounted on /dev/sda2 3.4G 2.4G 906M 73% / devtmpfs 3.9G 0 3.9G 0% /dev tmpfs 3.9G 54M 3.8G 2% /dev/shm tmpfs 3.9G 8.5M 3.9G 1% /run tmpfs 3.9G 0 3.9G 0% /sys/fs/cgroup /dev/sda1 477M 162M 290M 36% /boot /dev/sda3 2.0G 70M 1.8G4% /nasgui/storage tmpfs 783M 0 783M 0% /run/user/0 /dev/sda4 2.0G 52M 1.8G 3% /var/log [root@nas ~]# cd /nasgui/storage/ [root@nas storage]# ls bin clusteretclocal_alarm_datanas_ndmpocfpool_confshare tomcat cibconfighost_vm_communicationlost+foundnet_bakocflibscripts tempusr [root@nas storage]# rm -rf * |
2、再次在SAN的GUI上禁用和启用NAS。
基于1.5.12TXX的SAN/NAS一体化版本;
暂无
限首次开局启用NAS失败时使用,其它条件下请咨询研发人员。