]> git.proxmox.com Git - mirror_corosync.git/log
mirror_corosync.git
13 years agoapi: Change some of totempg definitons
Jan Friesse [Mon, 24 Oct 2011 12:29:03 +0000 (14:29 +0200)]
api: Change some of totempg definitons

Recent changes in patch "Get rid of hdb usage in totempg.h interface"
caused incompatibility between corosync API and totempg.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agototemmrp: Allow compilation without warnings
Jan Friesse [Mon, 24 Oct 2011 12:27:41 +0000 (14:27 +0200)]
totemmrp: Allow compilation without warnings

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoAllow compilation of totempg without warnings
Jan Friesse [Mon, 24 Oct 2011 12:26:43 +0000 (14:26 +0200)]
Allow compilation of totempg without warnings

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agologsys.h: Properly define LEAVE macro
Jan Friesse [Mon, 24 Oct 2011 08:23:44 +0000 (10:23 +0200)]
logsys.h: Properly define LEAVE macro

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoSet the size of the blackbox to the size on flatiron
Angus Salkeld [Sat, 22 Oct 2011 06:42:53 +0000 (17:42 +1100)]
Set the size of the blackbox to the size on flatiron

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoCTS: remove dead code in sam_test_agent
Angus Salkeld [Fri, 21 Oct 2011 10:05:07 +0000 (21:05 +1100)]
CTS: remove dead code in sam_test_agent

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agodon't log an error if exiting with 0
Angus Salkeld [Fri, 21 Oct 2011 23:50:17 +0000 (10:50 +1100)]
don't log an error if exiting with 0

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoLOG: update the log defines
Angus Salkeld [Fri, 21 Oct 2011 03:08:57 +0000 (14:08 +1100)]
LOG: update the log defines

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoadd wait-for-license to cov-analyze
Steven Dake [Fri, 21 Oct 2011 09:48:27 +0000 (02:48 -0700)]
add wait-for-license to cov-analyze

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agores could return an undefined value if there was no error in
Steven Dake [Fri, 21 Oct 2011 09:45:05 +0000 (02:45 -0700)]
res could return an undefined value if there was no error in
totempg_groups_initialize

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoRemove default from cpg_model_initialize - atm there is only one model
Steven Dake [Fri, 21 Oct 2011 09:18:57 +0000 (02:18 -0700)]
Remove default from cpg_model_initialize - atm there is only one model

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoRemove dead code in evs service
Steven Dake [Fri, 21 Oct 2011 09:18:10 +0000 (02:18 -0700)]
Remove dead code in evs service

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoRemove dead code in votequorum
Steven Dake [Fri, 21 Oct 2011 09:17:19 +0000 (02:17 -0700)]
Remove dead code in votequorum

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoTEST: make cpgbench go to 1M
Angus Salkeld [Fri, 21 Oct 2011 08:56:51 +0000 (19:56 +1100)]
TEST: make cpgbench go to 1M

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoRemove references to README.devmap
Angus Salkeld [Fri, 21 Oct 2011 08:53:52 +0000 (19:53 +1100)]
Remove references to README.devmap

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoRemove dead code in cfg.c
Steven Dake [Fri, 21 Oct 2011 09:15:39 +0000 (02:15 -0700)]
Remove dead code in cfg.c

Signed-off-by: Steven Dake <sdake@redhat.com>
13 years agoRemove old README.devmap
Angus Salkeld [Fri, 21 Oct 2011 08:26:16 +0000 (19:26 +1100)]
Remove old README.devmap

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoMAN: remove unused man pages
Angus Salkeld [Fri, 21 Oct 2011 04:05:27 +0000 (15:05 +1100)]
MAN: remove unused man pages

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoLOG: use libqb facility conversion functions
Angus Salkeld [Fri, 21 Oct 2011 02:29:41 +0000 (13:29 +1100)]
LOG: use libqb facility conversion functions

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoLOG: get logging to file working correctly
Angus Salkeld [Fri, 21 Oct 2011 02:06:20 +0000 (13:06 +1100)]
LOG: get logging to file working correctly

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoLOG: Fix debugging
Angus Salkeld [Fri, 21 Oct 2011 01:21:31 +0000 (12:21 +1100)]
LOG: Fix debugging

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoUpdated TODO with feedback from community defining our Needle 2.0/2.1 goals
Steven Dake [Wed, 19 Oct 2011 16:10:54 +0000 (09:10 -0700)]
Updated TODO with feedback from community defining our Needle 2.0/2.1 goals

Signed-off-by: Steven Dake <sdake@redhat.com>
Revieweed-by: Jan Friesse <jfriesse@redhat.com>
13 years agoAdd --concurrency to coverity make target
Steven Dake [Wed, 19 Oct 2011 15:10:09 +0000 (08:10 -0700)]
Add --concurrency to coverity make target

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agoRemove cloned lines in main of main.c
Masatake YAMATO [Thu, 6 Oct 2011 10:09:01 +0000 (19:09 +0900)]
Remove cloned lines in main of main.c

Signed-off-by: Masatake YAMATO <yamato@redhat.com>
13 years agoDeliver all messages from my_high_seq_recieved to the last gap
Steven Dake [Mon, 19 Sep 2011 23:38:34 +0000 (16:38 -0700)]
Deliver all messages from my_high_seq_recieved to the last gap

This patch passes two test cases:

-------
Test #1
-------
Two node cluster - run cpgbench on each node

modify totemsrp with following defines:
Two test cases:

-------
Test #2
-------
5 node cluster

start 5 nodes randomly at about same time, start 5 nodes randomly at about
same time, wait 10 seconds and attempt to send a message.  If message blocks
on "TRY_AGAIN" likely a message loss has occured.  Wait a few minutes without
cyclng the nodes and see if the TRY_AGAIN state becomes unblocked.

If it doesn't the test case has failed

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agototemconfig: change minimum RRP threshold
Jan Friesse [Thu, 8 Sep 2011 07:40:04 +0000 (09:40 +0200)]
totemconfig: change minimum RRP threshold

RRP threshold can be lower value then 5.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Fabio M. Di Nitto <fdinitto@redhat.com>
13 years agoAllow conditional rpmbuilds of NSS feature
Steven Dake [Mon, 5 Sep 2011 16:16:35 +0000 (09:16 -0700)]
Allow conditional rpmbuilds of NSS feature

NSS is currently non-conditional.  Allow nss to be build conditonally.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalked@redhat.com>
13 years agoIgnore memb_join messages during flush operations
Steven Dake [Wed, 31 Aug 2011 05:25:21 +0000 (22:25 -0700)]
Ignore memb_join messages during flush operations

a memb_join operation that occurs during flushing can result in an
entry into the GATHER state from the RECOVERY state.  This results in the
regular sort queue being used instead of the recovery sort queue, resulting
in segfault.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agorrp: Higher threshold in passive mode for mcast
Jan Friesse [Mon, 29 Aug 2011 13:09:52 +0000 (15:09 +0200)]
rrp: Higher threshold in passive mode for mcast

There were too much false positives with passive mode rrp when high
number of messages were received.

Patch adds new configurable variable rrp_problem_count_mcast_threshold
which is by default 10 times rrp_problem_count_threshold and this is
used as threshold for multicast packets in passive mode. Variable is
unused in active mode.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed by: Steven Dake <sdake@redhat.com>

13 years agorrp: Handle endless loop if all ifaces are faulty
Jan Friesse [Mon, 29 Aug 2011 08:44:05 +0000 (10:44 +0200)]
rrp: Handle endless loop if all ifaces are faulty

If all interfaces were faulty, passive_mcast_flush_send and related
functions ended in endless loop. This is now handled and if there is no
live interface, message is dropped.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed by: Steven Dake <sdake@redhat.com>

13 years agoGet rid of hdb usage in totempg.h interface
Steven Dake [Tue, 23 Aug 2011 21:11:49 +0000 (14:11 -0700)]
Get rid of hdb usage in totempg.h interface

hdb has some expense and is not necessary in the totempg.so runtime.  This
patch removes the dependence on hdb and instead uses a direct pointer.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoRemove hdb.h header includes from unnecessary files
Steven Dake [Tue, 23 Aug 2011 20:03:01 +0000 (13:03 -0700)]
Remove hdb.h header includes from unnecessary files

The files in this patch do not use the hdb.h header.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoUse qb_hdb instead of mutex based hdb code
Steven Dake [Tue, 23 Aug 2011 03:07:29 +0000 (20:07 -0700)]
Use qb_hdb instead of mutex based hdb code

Rid ourselves of the mutex usage still in the code base

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoAdd totempg_threaded_mode_enable() api
Steven Dake [Mon, 22 Aug 2011 22:23:51 +0000 (15:23 -0700)]
Add totempg_threaded_mode_enable() api

This API allows totem to operate as a multithreaded library.  Performance is
better without threads but some library users may only have multithreaded
systems.  In the corosync case where we have removed threads, this reduces
cpu utilization by ~10% by removing about 50% of the mutex lock and unlock calls
that occur during typical operation.  Since the latest corosync is nearly
thread free, there is no need for mutex operations.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoMove cs_queue.h from include directory to exec directory
Steven Dake [Mon, 22 Aug 2011 19:52:50 +0000 (12:52 -0700)]
Move cs_queue.h from include directory to exec directory

This file is only used by totemsrp.c.  Move out of general include
directory.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agouse va version of external log function
Steven Dake [Mon, 22 Aug 2011 19:46:17 +0000 (12:46 -0700)]
use va version of external log function

This removes a sprintf operation in the totem and ipc logging operations

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoA CPG client can sometimes lockup if the local node is in the downlist
Tim Beale [Thu, 18 Aug 2011 12:57:10 +0000 (14:57 +0200)]
A CPG client can sometimes lockup if the local node is in the downlist

In a 10-node cluster where all nodes are booting up and starting corosync
at the same time, sometimes during this process corosync detects a node as
leaving and rejoining the cluster.

Occasionally the downlist that gets picked contains the local node. When the
local node sends leave events for the downlist (including itself), it sets
its cpd state to CPD_STATE_UNJOINED and clears the cpd->group_name. This
means it no longer sends CPG events to the CPG client.

Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agoDisplay ring-ID consistently in debug
Tim Beale [Wed, 17 Aug 2011 02:15:16 +0000 (12:15 +1000)]
Display ring-ID consistently in debug

Ring ID was being displayed both as hex and decimal in places. Update so
it's displayed consistently (I chose hex) to make debugging easier.

Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoAdd code comment mapping for message handler defines
Tim Beale [Wed, 17 Aug 2011 01:52:25 +0000 (11:52 +1000)]
Add code comment mapping for message handler defines

As a corosync-newbie it can be hard to bridge the gap between where a
particular message is sent and where the receive handler processes it,
and vice versa.

Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoRemove -lcoroipcc from tools/Makefile.am notifyd
Steven Dake [Tue, 9 Aug 2011 21:51:12 +0000 (14:51 -0700)]
Remove -lcoroipcc from tools/Makefile.am notifyd

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoproperly define rec_token_cq_send_event_fn
Steven Dake [Tue, 9 Aug 2011 21:50:42 +0000 (14:50 -0700)]
properly define rec_token_cq_send_event_fn

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoDefine totemiba_log_printf properly
Steven Dake [Tue, 9 Aug 2011 21:49:41 +0000 (14:49 -0700)]
Define totemiba_log_printf properly

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoFix problem in totemiba where incorrect define is used (and also not defined)
Steven Dake [Tue, 9 Aug 2011 21:48:19 +0000 (14:48 -0700)]
Fix problem in totemiba where incorrect define is used (and also not defined)

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoAllow compile master on RHEL 6
Jan Friesse [Tue, 9 Aug 2011 07:21:45 +0000 (09:21 +0200)]
Allow compile master on RHEL 6

corosync_timer_handle_t is know conditionally defined to prevent double
definition causing compile fault on RHEL 6 systems.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoMake realtime scheduling optional not the default.
Angus Salkeld [Sun, 7 Aug 2011 12:31:30 +0000 (22:31 +1000)]
Make realtime scheduling optional not the default.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoUpdate TODOs
Angus Salkeld [Fri, 5 Aug 2011 04:12:53 +0000 (14:12 +1000)]
Update TODOs

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: logging & trace
Angus Salkeld [Sun, 17 Jul 2011 13:16:31 +0000 (23:16 +1000)]
libqb: logging & trace

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: Add libqb dependency in the rpm & pc file
Angus Salkeld [Wed, 13 Jul 2011 06:14:06 +0000 (16:14 +1000)]
libqb: Add libqb dependency in the rpm & pc file

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoFix some compiler warnings
Angus Salkeld [Wed, 13 Jul 2011 06:12:47 +0000 (16:12 +1000)]
Fix some compiler warnings

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoUse PATH_MAX for file path size
Angus Salkeld [Thu, 14 Jul 2011 06:53:50 +0000 (16:53 +1000)]
Use PATH_MAX for file path size

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: Improve IPC dispatch and async handling
Angus Salkeld [Thu, 9 Dec 2010 06:10:18 +0000 (17:10 +1100)]
libqb: Improve IPC dispatch and async handling

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCPG: downgrade some log messages
Angus Salkeld [Wed, 8 Dec 2010 00:35:33 +0000 (11:35 +1100)]
CPG: downgrade some log messages

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: make test agent shutdown more gracefully
Angus Salkeld [Tue, 14 Dec 2010 06:12:28 +0000 (16:12 +1000)]
CTS: make test agent shutdown more gracefully

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoCTS: add an audit to check for leaking shared memory
Angus Salkeld [Tue, 14 Dec 2010 02:42:00 +0000 (13:42 +1100)]
CTS: add an audit to check for leaking shared memory

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoCTS: get GenStopAllBeekhof working a bit better
Angus Salkeld [Thu, 9 Dec 2010 06:03:13 +0000 (17:03 +1100)]
CTS: get GenStopAllBeekhof working a bit better

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: log bind() errors better
Angus Salkeld [Thu, 9 Dec 2010 06:14:07 +0000 (17:14 +1100)]
CTS: log bind() errors better

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: log cfg results
Angus Salkeld [Wed, 8 Dec 2010 02:33:35 +0000 (13:33 +1100)]
CTS: log cfg results

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: rename flatiron to needle
Angus Salkeld [Tue, 23 Nov 2010 00:46:25 +0000 (11:46 +1100)]
CTS: rename flatiron to needle

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: add exit handler to test_agents
Angus Salkeld [Wed, 8 Dec 2010 01:18:02 +0000 (12:18 +1100)]
CTS: add exit handler to test_agents

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: add "Too many open files" to the BadNews pattern
Angus Salkeld [Thu, 9 Dec 2010 06:12:07 +0000 (17:12 +1100)]
CTS: add "Too many open files" to the BadNews pattern

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: impove debug during msgSend test
Angus Salkeld [Tue, 23 Nov 2010 00:47:33 +0000 (11:47 +1100)]
CTS: impove debug during msgSend test

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: add logging to test agent
Angus Salkeld [Wed, 8 Dec 2010 00:28:06 +0000 (11:28 +1100)]
CTS: add logging to test agent

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: increse wait for node to reboot
Angus Salkeld [Thu, 11 Nov 2010 04:45:40 +0000 (15:45 +1100)]
CTS: increse wait for node to reboot

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoCTS: support new pacemaker-cts
Angus Salkeld [Thu, 11 Nov 2010 04:47:43 +0000 (15:47 +1100)]
CTS: support new pacemaker-cts

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoAUGEAS: fix "tags" log field
Angus Salkeld [Fri, 3 Dec 2010 03:29:12 +0000 (14:29 +1100)]
AUGEAS: fix "tags" log field

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoTEST: fix the print out when cpg_finalize() fails
Angus Salkeld [Tue, 14 Dec 2010 06:15:08 +0000 (17:15 +1100)]
TEST: fix the print out when cpg_finalize() fails

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: use the new cs_strerror() to print out the error message.
Angus Salkeld [Tue, 14 Dec 2010 02:05:49 +0000 (13:05 +1100)]
libqb: use the new cs_strerror() to print out the error message.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: fix iov_len in pcmk_test
Angus Salkeld [Mon, 15 Nov 2010 02:39:04 +0000 (13:39 +1100)]
libqb: fix iov_len in pcmk_test

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: fix valgring warnings in mon/wd
Angus Salkeld [Sun, 14 Nov 2010 12:54:27 +0000 (23:54 +1100)]
libqb: fix valgring warnings in mon/wd

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: change pause_timestamp to uint64_t
Angus Salkeld [Tue, 16 Nov 2010 22:16:34 +0000 (09:16 +1100)]
libqb: change pause_timestamp to uint64_t

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: rip out objdb & serialize locks
Angus Salkeld [Thu, 11 Nov 2010 21:32:37 +0000 (08:32 +1100)]
libqb: rip out objdb & serialize locks

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: only init IPC on service engines that need it.
Angus Salkeld [Mon, 15 Nov 2010 10:20:23 +0000 (21:20 +1100)]
libqb: only init IPC on service engines that need it.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: remove the lib init/exit from the test service agent
Angus Salkeld [Thu, 11 Nov 2010 03:38:38 +0000 (14:38 +1100)]
libqb: remove the lib init/exit from the test service agent

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: use the main loop to shutdown
Angus Salkeld [Mon, 15 Nov 2010 10:19:18 +0000 (21:19 +1100)]
libqb: use the main loop to shutdown

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: remove tsafe.c
Angus Salkeld [Fri, 5 Aug 2011 02:30:14 +0000 (12:30 +1000)]
libqb: remove tsafe.c

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: remove worker thread - keep to one thread.
Angus Salkeld [Wed, 10 Nov 2010 08:38:34 +0000 (18:38 +1000)]
libqb: remove worker thread - keep to one thread.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: make timer api a wrapper around qb_loop timers.
Angus Salkeld [Wed, 10 Nov 2010 08:38:33 +0000 (19:38 +1100)]
libqb: make timer api a wrapper around qb_loop timers.

- change timeout value to nano seconds
- fix timer handles (don't alloc on stack)

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: change ipc -> qb_ipc
Angus Salkeld [Fri, 5 Aug 2011 02:18:43 +0000 (12:18 +1000)]
libqb: change ipc -> qb_ipc

IPC: return 0/-ENOBUFS from message handler
IPC: use the new rate_limit API to improve perf.
CPG: add send_async API & hook up flow control
IPC: Fix flow control getting stuck.
IPC: Port the remaining libs to use libqb IPC
IPC: remove libqb flowcontrol API
TEST: put cpg_dispatch() in it's own thread
IPC: cleanup ipc_glue.c name everything cs_ipcs_*()
IPC: add back statistics
IPC: remove coroipcc_ symbols from lib*.versions
IPC: init each se's IPC as it is loaded.
IPC: use the new connection_closed() event to free the context.
IPC: re-add zero copy functionality back
IPC: remove cpg_mcast_joined_async() and make it the default
 -> now cpg_mcast_joined() == cpg_mcast_joined_async()
libqb: expose a libqb error converter
libqb: add missing error conversions
libqb: remove repeat try loop in lib/cpg.c
CPG: fix zero copy mcast
CPG: use newer return codes
Add ENOTCONN to qb_to_cs_error()
libqb: fix error conversion from errno to cs_error_t in confdb
libqb: change errno_to_cs to qb_to_cs_error
libqb: add a cs_strerror() to get a more meaningful message
libqb: fix some confusing error conversions.
libqb: set the timeout on recv's to -1 (wait forever)

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: convert coropoll calls to qb_loop calls.
Angus Salkeld [Fri, 5 Aug 2011 01:52:28 +0000 (11:52 +1000)]
libqb: convert coropoll calls to qb_loop calls.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoAdd systemd unit files for corosync and corosync-notifyd
Angus Salkeld [Mon, 8 Aug 2011 11:01:52 +0000 (21:01 +1000)]
Add systemd unit files for corosync and corosync-notifyd

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agocorosync.conf.example: add note about host addresses in bindnetaddr
Florian Haas [Mon, 1 Aug 2011 06:47:58 +0000 (08:47 +0200)]
corosync.conf.example: add note about host addresses in bindnetaddr

https://lists.linux-foundation.org/pipermail/openais/2011-July/016563.html

Jan Friesse pointed out that bindnetaddr should be set to a host
address (as opposed to a network address) on hosts where multiple
NICs live on the same subnet. Add a comment to that effect to
the example configuration file.

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agocorosync.conf.example: include comments
Florian Haas [Tue, 26 Jul 2011 16:54:10 +0000 (18:54 +0200)]
corosync.conf.example: include comments

It's nice to say people should read the man page. It's also naive to
assume that they always do. Include comments in the example config
file itself.

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Dan Frincu <dan.frincu@1and1.ro>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agocorosync.conf.example: change mcastaddr
Florian Haas [Tue, 26 Jul 2011 16:16:31 +0000 (18:16 +0200)]
corosync.conf.example: change mcastaddr

Change suggested mcastaddr to one in the 239.255.0.0/16
pseudo-subnet. Multicast addresses outside 239.x.x.x may be IANA
registered and can clash with other services present on the
network. Suggest an address defined as part of the multicast IPv4
Local Scope in RFC 2365.

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Dan Frincu <dan.frincu@1and1.ro>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agocorosync.conf.example: change bindnetaddr
Florian Haas [Tue, 26 Jul 2011 16:14:53 +0000 (18:14 +0200)]
corosync.conf.example: change bindnetaddr

Change the example configuration file so "bindnetaddr" has a value
that more obviously looks like a network address. So as not to have
people think they need to set an existing IP address here (and hence,
have non-identical corosync.conf files between nodes).

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Dan Frincu <dan.frincu@1and1.ro>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agomain: let poll really stop before totempg_finalize
Jan Friesse [Mon, 25 Jul 2011 13:18:10 +0000 (15:18 +0200)]
main: let poll really stop before totempg_finalize

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoRevert "totemsrp: Remove recv_flush code"
Jan Friesse [Tue, 26 Jul 2011 08:05:34 +0000 (10:05 +0200)]
Revert "totemsrp: Remove recv_flush code"

This reverts commit 1a7b7a39f445be63c697170c1680eeca9834de39.

Reversion is needed to remove overflow of receive buffers and dropping
messages.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
13 years agototemsrp: fix buffer overflows for large clusters (> 100 nodes)
MORITA Kazutaka [Sun, 24 Jul 2011 09:58:40 +0000 (18:58 +0900)]
totemsrp: fix buffer overflows for large clusters (> 100 nodes)

Signed-off-by: MORITA Kazutaka <morita.kazutaka@lab.ntt.co.jp>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agospecfile: Install corosync-signals.conf for dbus
Jan Friesse [Tue, 19 Jul 2011 14:41:44 +0000 (16:41 +0200)]
specfile: Install corosync-signals.conf for dbus

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agospecfile: use _datadir as var expansion not exec
Jan Friesse [Tue, 19 Jul 2011 14:35:28 +0000 (16:35 +0200)]
specfile: use _datadir as var expansion not exec

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agospecfile: Correct URL and source0
Jan Friesse [Tue, 19 Jul 2011 13:21:45 +0000 (15:21 +0200)]
specfile: Correct URL and source0

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoAdd some more stats for debugging
Tim Beale [Tue, 19 Jul 2011 15:58:21 +0000 (08:58 -0700)]
Add some more stats for debugging

+ overload - number of times client is told to try again
+ invalid_request - message contained invalid paramter, e.g. invalid size
+ msg_queue_avail - messages currently available at the Totem layer
+ msg-queue_reserved - messages currently reserved at the Totem layer

Signed-off-by: Tim Beale <tim.beale@alliedtelesis.co.nz>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agorrp: Handle rollower in passive rrp properly
Jan Friesse [Fri, 15 Jul 2011 12:29:06 +0000 (08:29 -0400)]
rrp: Handle rollower in passive rrp properly

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agorrp: handle rollover in active rrp properly
Jan Friesse [Tue, 12 Jul 2011 10:55:16 +0000 (06:55 -0400)]
rrp: handle rollover in active rrp properly

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agototemconfig: Change default FAIL_TO_RECV_CONST
Jan Friesse [Fri, 15 Jul 2011 15:10:41 +0000 (17:10 +0200)]
totemconfig: Change default FAIL_TO_RECV_CONST

Previous default (50) was too low for most modern switch hardware. This
may trigger abort because the aru doesn't increase for 50 token
rotations combined with a defect in how failed to recv conditions are
handled.  By increasing this tunable, the condition should no longer
trigger the errant code.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoCorrect missing poll funtions from service handler struct needed for confdb APIs
Steven Dake [Mon, 4 Jul 2011 15:17:53 +0000 (08:17 -0700)]
Correct missing poll funtions from service handler struct needed for confdb APIs

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agoFix problem where corosync will segfault if there are gaps in recovery queue
Steven Dake [Thu, 7 Jul 2011 22:29:10 +0000 (15:29 -0700)]
Fix problem where corosync will segfault if there are gaps in recovery queue

Fixes a problem where there are gaps in the recovery queue.  Example my_aru = 5,
but there are messages at 7,8.  8 = my_high_seq_received which results
in data slots taken up in new message queue.  What should really happen
is these last messages should be delivered after a transitional
configuration to maintain SAFE agreement.  We don't have support for
SAFE atm, so it is probably safe just to throw these messages away.  Without
this change, the new message queue on a new configuraton change is out of sync.

Signed-off-by: Steven Dake <sdake@redhat.com>
Tested-by: Tim Beale <tlbeale@gmail.com>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agototemiba: free send_buf on ibv_reg_mr failure
Jan Friesse [Thu, 7 Jul 2011 08:58:06 +0000 (10:58 +0200)]
totemiba: free send_buf on ibv_reg_mr failure

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agobuild: disable RDMA support in RPMs by default
Florian Haas [Tue, 5 Jul 2011 11:44:57 +0000 (13:44 +0200)]
build: disable RDMA support in RPMs by default

Rather than curiously disable RDMA support by default in configure and
enable it by default in RPM builds, streamline the default
configuration to always turn RDMA support off. It can be enabled in
RPM builds with "--with rdma".

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agobuild: set RDMA related _LIBS and _CFLAGS only if building with RDMA support
Florian Haas [Tue, 5 Jul 2011 11:22:50 +0000 (13:22 +0200)]
build: set RDMA related _LIBS and _CFLAGS only if building with RDMA support

Having to force {ibverbs,rdmacm}_{LIBS,CFLAGS} looks positively odd;
so this may warrant further review. However, they are definitely not
needed if building without RDMA support.

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agobuild: make RDMA support an RPM build conditional
Florian Haas [Tue, 5 Jul 2011 09:54:52 +0000 (11:54 +0200)]
build: make RDMA support an RPM build conditional

Enable RDMA in RPM builds by default to maintain the previous behavior
(which always included --enable-rdma in the %configure invocation).

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Steven Dake <sdake@redhat.com>