One Coherence application instance hangs while the other is active

3483854 Member Posts: 2
edited Jun 27, 2017 11:35AM in Coherence Support

JAVA LIBRARIES: coherence- and coherence-common-12.3.1.jar


I have several different applications deployed to Tomcat whose instances communicate with each other through Coherence, using a clustering address, without any problems. There is, however, one specific application deployed on Tomcat that is configured in a very similar fashion to the others but, unlike them, fails to communicate.

The issue is that one instance always seems to hang on Coherence and never returns a response. The load balancer is smart enough to determine that it is no longer functioning and redirects traffic to the functioning instance.

If I stop the instance that is not hung, the hung instance begins to function. However, when the other instance is restarted, it in turn hangs on Coherence requests.

Investigating the instance that hangs, I see threads like the following build up, each waiting for a response that never arrives.

"http-bio-9011-exec-33" #137 daemon prio=5 os_prio=0 tid=0x00002b04a8089000 nid=0x578 in Object.wait() [0x00002b046a623000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - locked <0x00000000e8bda788> (a
        at com.tangosol.coherence.component.util.daemon.queueProcessor.service.Grid.poll(Grid.CDB:24)
        at com.tangosol.coherence.component.util.daemon.queueProcessor.service.Grid.poll(Grid.CDB:11)
        at com.tangosol.coherence.component.util.daemon.queueProcessor.service.grid.ReplicatedCache.requestIssue(ReplicatedCache.CDB:8)
        at com.tangosol.coherence.component.util.daemon.queueProcessor.service.grid.ReplicatedCache.updateResource(ReplicatedCache.CDB:38)
        at com.tangosol.coherence.component.util.CacheHandler.put(CacheHandler.CDB:11)
        at com.tangosol.coherence.component.util.CacheHandler.put(CacheHandler.CDB:1)
        at com.tangosol.coherence.component.util.SafeNamedCache.put(SafeNamedCache.CDB:1)


Because one instance hangs and not the other, it seems reasonable to assume that they are communicating to some extent. Is there a good way to determine what is happening, whether the cause is environmental or something else?
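One way to watch the build-up described above without reading full thread dumps by hand is to filter the JVM's own thread information for threads waiting inside a given frame. The following is only a sketch using the standard `java.lang.management` API; the class `StuckThreadFinder` and its methods are hypothetical names, and it assumes the code runs inside the affected Tomcat JVM (for example from a diagnostic page). Running `jstack` against the process gives the same information from outside.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Coherence or Tomcat.
public class StuckThreadFinder {

    // True if any frame belongs to a class starting with classPrefix
    // and has the given method name.
    static boolean hasFrame(StackTraceElement[] stack, String classPrefix, String method) {
        for (StackTraceElement frame : stack) {
            if (frame.getClassName().startsWith(classPrefix)
                    && frame.getMethodName().equals(method)) {
                return true;
            }
        }
        return false;
    }

    // Names of threads in this JVM that are WAITING or TIMED_WAITING
    // with a matching frame somewhere on their stack.
    static List<String> findWaitingIn(String classPrefix, String method) {
        List<String> names = new ArrayList<>();
        ThreadInfo[] infos = ManagementFactory.getThreadMXBean().dumpAllThreads(false, false);
        for (ThreadInfo info : infos) {
            Thread.State state = info.getThreadState();
            boolean waiting = state == Thread.State.WAITING
                    || state == Thread.State.TIMED_WAITING;
            if (waiting && hasFrame(info.getStackTrace(), classPrefix, method)) {
                names.add(info.getThreadName());
            }
        }
        return names;
    }

    public static void main(String[] args) {
        // The prefix and method name are taken from the stack trace in the post.
        List<String> stuck = findWaitingIn("com.tangosol.coherence.", "poll");
        System.out.println(stuck.size() + " thread(s) waiting in poll(): " + stuck);
    }
}
```

Sampling this count a few times, a minute or so apart, would show whether the waiting threads keep growing or eventually drain.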


  • 3483854
    3483854 Member Posts: 2
    edited Jun 27, 2017 11:35AM

    With a bit of investigation I was able to figure out how to handle the problem. We were incorrectly passing an array object to one of the methods that was being cached. Because these were singleton arrays, the problem was not immediately apparent. However, on the other server, when it failed to retrieve the cached object and then tried to save it, Coherence would hang. Simply using a list instead of the array fixed the issue.
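For anyone hitting the same symptom, here is a minimal sketch of one reason an array can make a cache lookup fail where a list succeeds. It assumes the array ended up as (part of) the cache key, which the post does not confirm. A plain `HashMap` stands in for the Coherence cache, and the class name `ArrayKeyDemo` and the key `"user-42"` are made up for illustration. Java arrays inherit identity-based `equals` and `hashCode` from `Object`, while `List` implementations compare by content, so two arrays with the same elements are different keys.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Hypothetical demo; a HashMap stands in for the real cache.
public class ArrayKeyDemo {

    // Store under one array, look up with another array of equal content.
    // Arrays use identity equals/hashCode, so the lookup misses.
    static String lookupWithArrayKey() {
        Map<Object, String> cache = new HashMap<>();
        cache.put(new String[] {"user-42"}, "cached result");
        return cache.get(new String[] {"user-42"});
    }

    // Same experiment with lists, which compare element by element.
    static String lookupWithListKey() {
        Map<Object, String> cache = new HashMap<>();
        cache.put(Arrays.asList("user-42"), "cached result");
        return cache.get(Arrays.asList("user-42"));
    }

    public static void main(String[] args) {
        System.out.println("array key hit: " + (lookupWithArrayKey() != null)); // false
        System.out.println("list key hit:  " + (lookupWithListKey() != null));  // true
    }
}
```

This only illustrates the failed lookup on the second server; it does not by itself explain why the subsequent `put` hung.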

This discussion has been closed.