- **Milestone**: future --> 4.5.0


---

** [tickets:#240] cpsv : checkpoint apis fail with try again continuously when 
multinode applications on try to invoke the api at the same time (70 nodes)**

**Status:** fixed
**Milestone:** 4.5.0
**Created:** Thu May 16, 2013 06:21 AM UTC by A V Mahesh (AVM)
**Last Updated:** Wed Oct 01, 2014 01:34 PM UTC
**Owner:** A V Mahesh (AVM)

>From : http://devel.opensaf.org/ticket/2954

The issue is seen on SLES 70 node VM setup. Changeset 3855 


Two ckpt applications are running on each node on all the 70 nodes. One of the 
application that is running on the SC-1 creates an asynchronous collocated 
checkpoint. The rest of the applications across the cluster tries to open the 
same checkpoint. When try again is returned to the application, the application 
waits for 500ms before retrying the api. Some applications continuously get try 
again and after 3 minutes the application exits (application specific timeout).


This issue is reproducible only in a 70 node cluster. Traces are available and 
huge.





---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Don't Limit Your Business. Reach for the Cloud.
GigeNET's Cloud Solutions provide you with the tools and support that
you need to offload your IT needs and focus on growing your business.
Configured For All Businesses. Start Your Cloud Today.
https://www.gigenetcloud.com/
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to