3 Replies Latest reply: Jan 8, 2013 7:25 AM by user12273962 RSS

    Live migration suddenly won't work on 3.1.1

    Terry Phelps
      I've done many live migrations over the last few months, with no problems. Suddenly, they don't work any more. Here's what I see:

      I start the migration from the Manager.
      The VM immediately disappears from the list of VMs on the source servers, and appears on the destination server.
      The job shows "in progress", and it NEVER completes.
      The "% complete" for the job never says anything but ZERO.
      If I look at the 'details' on the 'in progress' migration job, it says:

      Job Construction Phase
      ----------------------
      begin()
      Appended operation 'Bridge Configure Operation' to object '0004fb00002000005c945b4212271249 (network.BondPort (2) in oravm3.acbl.net)'.
      Appended operation 'Virtual Machine Migrate' to object '0004fb000006000066c8e49bc5ab54b0 (jiplcm01)'.
      commit()
      Completed Step: COMMIT

      Objects and Operations
      ----------------------
      Object (IN_USE): [Server] e2:a3:70:c6:67:89:e1:11:bb:8e:e4:1f:13:eb:92:b2 (oravm3.acbl.net)
      Object (IN_USE): [BondPort] 0004fb00002000005c945b4212271249 (network.BondPort (2) in oravm3.acbl.net)
      Operation: Bridge Configure Operation
      Object (IN_USE): [Server] 92:0f:60:b4:84:91:e1:11:aa:cb:e4:1f:13:eb:d2:3a (oravm2.acbl.net)
      Object (IN_USE): [VirtualMachine] 0004fb000006000066c8e49bc5ab54b0 (jiplcm01)
      Operation: Virtual Machine Migrate

      Job Running Phase at 13:10 on Wed, Jan 2, 2013
      ----------------------------------------------
      Job Participants: [92:0f:60:b4:84:91:e1:11:aa:cb:e4:1f:13:eb:d2:3a (oravm2.acbl.net)]


      Actioner
      --------
      Starting operation 'Bridge Configure Operation' on object '0004fb00002000005c945b4212271249 (network.BondPort (2) in oravm3.acbl.net)'
      Bridge [0004fb001018c4c] already exists (and should exist) on interface [bond1] on server [oravm3.acbl.net]; skipping bridge creation
      Completed operation 'Bridge Configure Operation' completed with direction ==> DONE
      Starting operation 'Virtual Machine Migrate' on object '0004fb000006000066c8e49bc5ab54b0 (jiplcm01)'
      Job failed commit (internal) due to Caught during invoke method: java.net.SocketException: Socket closed
      Wed Jan 02 13:11:36 EST 2013
      com.oracle.odof.exception.InternalException: Caught during invoke method: java.net.SocketException: Socket closed
      Wed Jan 02 13:11:36 EST 2013
      at com.oracle.odof.OdofExchange.invokeMethod(OdofExchange.java:956)
      at com.oracle.ovm.mgr.api.job.InternalJobProxy.objectCommitter(Unknown Source)
      at com.oracle.ovm.mgr.api.job.JobImpl.internalJobCommit(JobImpl.java:281)
      at com.oracle.ovm.mgr.api.job.JobImpl.commit(JobImpl.java:651)
      at com.oracle.ovm.mgr.faces.model.JobEO$CommitWork.run(JobEO.java:233)
      at weblogic.work.j2ee.J2EEWorkManager$WorkWithListener.run(J2EEWorkManager.java:183)
      at weblogic.work.ExecuteThread.execute(ExecuteThread.java:209)
      at weblogic.work.ExecuteThread.run(ExecuteThread.java:178)
      Caused by: java.net.SocketException: Socket closed
      at java.net.SocketInputStream.socketRead0(Native Method)
      at java.net.SocketInputStream.read(SocketInputStream.java:129)
      at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
      at java.io.BufferedInputStream.read(BufferedInputStream.java:237)
      at java.io.ObjectInputStream$PeekInputStream.peek(ObjectInputStream.java:2248)
      at java.io.ObjectInputStream$BlockDataInputStream.peek(ObjectInputStream.java:2541)
      at java.io.ObjectInputStream$BlockDataInputStream.peekByte(ObjectInputStream.java:2551)
      at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1296)
      at java.io.ObjectInputStream.readObject(ObjectInputStream.java:350)
      at com.oracle.odof.io.AbstractSocket.receive(AbstractSocket.java:220)
      at com.oracle.odof.io.AbstractSocket.receive(AbstractSocket.java:173)
      at com.oracle.odof.OdofExchange.send(OdofExchange.java:473)
      at com.oracle.odof.OdofExchange.send(OdofExchange.java:427)
      at com.oracle.odof.OdofExchange.invokeMethod(OdofExchange.java:938)
      ... 7 more

      Anyone have any idea what the problem is? What can I do to gather useful information?