4 Replies Latest reply: Nov 28, 2012 2:58 PM by 976257 RSS

    Server won't rejoin pool in VM Manager 3.1.1 after reboot.

    976257
      Hello all,
      I am running Oracle VM Server and manager 3.1.1 and in the course of setting it up, I had to manually reboot the server. Now since iIhave restarted them - they are on two separate pieces of hardware- I can see in the manager, the server, but it is under "unassigned servers" and not in the "Server pool "it was in previous to the reboot. If I click "rediscover servers" it appears i nthe Pool for a second and then reverts back to "unassigned servers". There is an exclamation point on the icon and that is how I got to the message with the error

      "Error OVMEVT_003500D_003 Active data was not found. Cluster service is probably not running."

      I have rebooted it twice thinking it might resync or the services would restart on a reboot. It did not. Is the solution to ssh in and restart the cluster service manually on the server itself? what command would I issue to do that?

      In addition, I have rebooted it before and it has successfully rejoined the server pool each time.
      thank you in advance.
        • 1. Re: Server won't rejoin pool in VM Manager 3.1.1 after reboot.
          user12273962
          Sounds like to me that you have some corruption in the server pool. Create a new server pool. Present it the server and then try adding the server to the new server pool.

          You can also try ssh to the VM server and ocfs.fsck the server ppol mount point. I've had to do it a time of two for other reasons. If you're using NFS for the server pool... It is almost easier to create a new NFS share. Create a new pool and just add the server to the new pool. At least creating a new pool will tell you if something is wrong with the existing server pool filesystem.
          • 2. Re: Server won't rejoin pool in VM Manager 3.1.1 after reboot.
            976257
            thanks, I tried that and it does use NFS. I was able to delete the pool and recrete a new one, but when I try and re-add the server to the new pool, It gives me this :

            Failed
            Description: Add Server RDSOVM01 to Server Pool RDSOVMPool
            Created By: admin
            Duration: 87ms
            Start Time: Nov 28, 2012 1:55:44 pm
            End Time: Nov 28, 2012 1:55:44 pm
            Message: (11/28/2012 01:55:44:833 PM) OVMAPI_4010E Attempt to send command: create_server_pool to server: RDSOVM01 failed. OVMAPI_4004E Server Failed Command: create_server_pool RDSOVMPool 0004fb0000020000e87d5b634fb0d9ea 192.168.0.11 0 RDSOVM01 192.168.0.246 master,xen,utility, Status: org.apache.xmlrpc.XmlRpcException: exceptions.Exception:Server already a member of pool: 0004fb000002000010376918eba0afda Wed Nov 28 13:55:44 EST 2012 Wed Nov 28 13:55:44 EST 2012


            am I missing another step to remove the server from the original pool. Did I mess up be deleting the pool before I removed the server?
            thanks again.
            • 3. Re: Server won't rejoin pool in VM Manager 3.1.1 after reboot.
              user12273962
              I don't think you messed up. I've seen the VM Manager do some wierd things like this before. Give it a little bit and try it again. It seems like the VM Manager has to "catch up" from time to time.
              • 4. Re: Server won't rejoin pool in VM Manager 3.1.1 after reboot.
                976257
                Will do and I will post the results. I appreciate the help.