1 Reply Latest reply: Jan 14, 2013 9:54 AM by 984731

    OVM Server 3.1.1 errors

    968543
      I am running OVM 3.1.1 in a two-server cluster. On one of the servers, the following errors appear very frequently in /var/log/ovs-agent.log, whereas the second OVM Server logs no errors. I also noticed in OVM Manager that on the "repository" tab, everything in the repository has a "lock" sign on it.

      I am trying to create a clone from one of the templates, but the job just stays "in progress" at 0% for ages and nothing happens.

      I have no idea where or why these errors are generated. Any help would be much appreciated.

      Thanks

      [2012-10-16 16:17:15 5378] ERROR (monitor:42) Error in monitor process: Lock file /etc/ovs-agent/db/server failed: timeout occured.
      Traceback (most recent call last):
        File "/usr/lib64/python2.4/site-packages/agent/monitor.py", line 30, in serve_forever
          saved_cluster_state = read_item(LOCAL_SERVER_DB, "cluster_state")
        File "/usr/lib64/python2.4/site-packages/agent/db.py", line 76, in read_item
          db = AgentDB(db_name, db_home)
        File "/usr/lib64/python2.4/site-packages/agent/db.py", line 31, in __init__
          self.lock.acquire(wait=10, delay=0.1)
        File "/usr/lib64/python2.4/site-packages/agent/utils/filelock.py", line 54, in acquire
          raise LockError("Lock file %s failed: timeout occured." % self.filename)
      LockError: Lock file /etc/ovs-agent/db/server failed: timeout occured.
      [2012-10-16 16:17:15 5375] ERROR (remaster:312) Error in remaster process: Lock file /etc/ovs-agent/db/server failed: timeout occured.
      Traceback (most recent call last):
        File "/usr/lib64/python2.4/site-packages/agent/remaster.py", line 310, in serve_forever
          remaster()
        File "/usr/lib64/python2.4/site-packages/agent/remaster.py", line 275, in remaster
          d = dump_db(LOCAL_SERVER_DB)
        File "/usr/lib64/python2.4/site-packages/agent/db.py", line 101, in dump_db
          db = AgentDB(db_name, db_home)
        File "/usr/lib64/python2.4/site-packages/agent/db.py", line 31, in __init__
          self.lock.acquire(wait=10, delay=0.1)
        File "/usr/lib64/python2.4/site-packages/agent/utils/filelock.py", line 54, in acquire
          raise LockError("Lock file %s failed: timeout occured." % self.filename)
      LockError: Lock file /etc/ovs-agent/db/server failed: timeout occured.
      [2012-10-16 16:17:20 5385] ERROR (stats:265) Error in stat process: Lock file /etc/ovs-agent/db/server failed: timeout occured.
      Traceback (most recent call last):
        File "/usr/lib64/python2.4/site-packages/agent/stats.py", line 254, in serve_forever
          if is_clustered():
        File "/usr/lib64/python2.4/site-packages/agent/serverpool.py", line 507, in is_clustered
          return (get_membership_state() == MEMBERSHIP_STATE_CLUSTERED and
        File "/usr/lib64/python2.4/site-packages/agent/serverpool.py", line 403, in get_membership_state
          if get_cluster_flag():
        File "/usr/lib64/python2.4/site-packages/agent/serverpool.py", line 388, in get_cluster_flag
          return read_item(LOCAL_SERVER_DB, "clustered")
        File "/usr/lib64/python2.4/site-packages/agent/db.py", line 76, in read_item
          db = AgentDB(db_name, db_home)
        File "/usr/lib64/python2.4/site-packages/agent/db.py", line 31, in __init__
          self.lock.acquire(wait=10, delay=0.1)
        File "/usr/lib64/python2.4/site-packages/agent/utils/filelock.py", line 54, in acquire
          raise LockError("Lock file %s failed: timeout occured." % self.filename)
      LockError: Lock file /etc/ovs-agent/db/server failed: timeout occured.

      Edited by: user10070298 on Oct 15, 2012 10:20 PM
        • 1. Re: OVM Server 3.1.1 errors
          984731
          Did you get any resolution to this problem?

          I'm getting the same errors and I can't find anything on MOS that indicates what it could be. Same setup as yours: two servers in the cluster pool, and one is throwing errors while the other is not.
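
All three tracebacks in the original post fail at the same call, self.lock.acquire(wait=10, delay=0.1), so the monitor, remaster and stats processes each gave up after waiting about 10 seconds for a lock on /etc/ovs-agent/db/server. The agent's filelock.py is not shown in this thread, so the following is only a minimal sketch of how a polling lock with those two parameters might behave. The class body, the ".lock" suffix and the use of O_EXCL are assumptions for illustration, not the ovs-agent implementation.

```python
import os
import time


class LockError(Exception):
    """Raised when the lock is not obtained within the wait window."""


class FileLock:
    """Hypothetical polling file lock; NOT the real agent/utils/filelock.py."""

    def __init__(self, filename):
        self.filename = filename
        self.lockfile = filename + ".lock"  # assumed naming convention
        self.fd = None

    def acquire(self, wait=10, delay=0.1):
        """Retry every `delay` seconds until `wait` seconds have elapsed."""
        deadline = time.monotonic() + wait
        while True:
            try:
                # O_CREAT | O_EXCL fails if the lock file already exists,
                # so only one process can create it at a time.
                self.fd = os.open(
                    self.lockfile, os.O_CREAT | os.O_EXCL | os.O_WRONLY
                )
                return
            except FileExistsError:
                if time.monotonic() >= deadline:
                    # Message format copied from the log in this thread.
                    raise LockError(
                        "Lock file %s failed: timeout occured." % self.filename
                    )
                time.sleep(delay)

    def release(self):
        """Close and remove the lock file so other processes can proceed."""
        if self.fd is not None:
            os.close(self.fd)
            os.unlink(self.lockfile)
            self.fd = None
```

In a scheme like this one, a holder that never calls release() (for example because it hung or was killed) would make every later acquire() time out in the same way as the log entries above. Whether that is what happened on the affected server cannot be determined from the thread.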