]> git.proxmox.com Git - mirror_corosync.git/log
mirror_corosync.git
12 years agoLOG: cleanup logging resources at exit
Angus Salkeld [Thu, 10 Nov 2011 22:05:08 +0000 (09:05 +1100)]
LOG: cleanup logging resources at exit

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoClean up the poll loop resourses on exit
Angus Salkeld [Thu, 10 Nov 2011 21:13:08 +0000 (08:13 +1100)]
Clean up the poll loop resourses on exit

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoAdd calls to missing object_find_destroy() to fix mem leaks
Angus Salkeld [Thu, 10 Nov 2011 21:12:13 +0000 (08:12 +1100)]
Add calls to missing object_find_destroy() to fix mem leaks

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoFree mem allocated by getaddrinfo
Angus Salkeld [Thu, 10 Nov 2011 21:11:17 +0000 (08:11 +1100)]
Free mem allocated by getaddrinfo

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoSend one confchg event per CPG group to CPG client
Yunkai Zhang [Fri, 28 Oct 2011 07:56:39 +0000 (15:56 +0800)]
Send one confchg event per CPG group to CPG client

We found that sheepdog will receive more than one confchg msg when
network partition occur.  For example, suppose the cluster has 4
nodes: N1, N2, N3, N4,  and they form a single-ring initially. After a
while, network partition occur, the single-ring divide into two
sub-ring: ring(N1, N2, N3) and ring(N4). The sheepdog in the ring(N4)
will receive the following confchg messages in turn:
Memb: N2,N3,N4  Left:N1         Joined:null
memb: N3,N4     Left:N2         Joined:null
memb: N4        Left:N3         Joined:null

This patch will fixed this bug, and the client will only receive one
confchg event in this case:
memb: N4        Left:N1,N2,N3   Joined:null

Signed-off-by: Yunkai Zhang <qiushu.zyk@taobao.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
12 years agoAdding support for dynamic membership with UDPU transport
Anton Jouline [Tue, 25 Oct 2011 00:54:24 +0000 (17:54 -0700)]
Adding support for dynamic membership with UDPU transport

Add a new object called totem.interface.dynamic to allow creation/deletion
of new child objects using the corosync-objctl utility:

to add new member:
linux#  corosync-objctl -c totem.interface.dynamic.10-211-55-12

to delete an existing member:
linux#  corosync-objctl -d totem.interface.dynamic.10-211-55-12

Corosync will dynamically add these members to the configuration and start
communicating with those nodes.

Signed-off-by: Anton Jouline <anton.jouline@cbsinteractive.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoRemove unused buf and len variables in log_printf
Jan Friesse [Tue, 25 Oct 2011 12:50:18 +0000 (14:50 +0200)]
Remove unused buf and len variables in log_printf

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoapi: Change some of totempg definitons
Jan Friesse [Mon, 24 Oct 2011 12:29:03 +0000 (14:29 +0200)]
api: Change some of totempg definitons

Recent changes in patch "Get rid of hdb usage in totempg.h interface"
caused incompatibility between corosync API and totempg.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agototemmrp: Allow compilation without warnings
Jan Friesse [Mon, 24 Oct 2011 12:27:41 +0000 (14:27 +0200)]
totemmrp: Allow compilation without warnings

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoAllow compilation of totempg without warnings
Jan Friesse [Mon, 24 Oct 2011 12:26:43 +0000 (14:26 +0200)]
Allow compilation of totempg without warnings

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agologsys.h: Properly define LEAVE macro
Jan Friesse [Mon, 24 Oct 2011 08:23:44 +0000 (10:23 +0200)]
logsys.h: Properly define LEAVE macro

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
12 years agoSet the size of the blackbox to the size on flatiron
Angus Salkeld [Sat, 22 Oct 2011 06:42:53 +0000 (17:42 +1100)]
Set the size of the blackbox to the size on flatiron

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoCTS: remove dead code in sam_test_agent
Angus Salkeld [Fri, 21 Oct 2011 10:05:07 +0000 (21:05 +1100)]
CTS: remove dead code in sam_test_agent

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agodon't log an error if exiting with 0
Angus Salkeld [Fri, 21 Oct 2011 23:50:17 +0000 (10:50 +1100)]
don't log an error if exiting with 0

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoLOG: update the log defines
Angus Salkeld [Fri, 21 Oct 2011 03:08:57 +0000 (14:08 +1100)]
LOG: update the log defines

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoadd wait-for-license to cov-analyze
Steven Dake [Fri, 21 Oct 2011 09:48:27 +0000 (02:48 -0700)]
add wait-for-license to cov-analyze

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
12 years agores could return an undefined value if there was no error in
Steven Dake [Fri, 21 Oct 2011 09:45:05 +0000 (02:45 -0700)]
res could return an undefined value if there was no error in
totempg_groups_initialize

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
12 years agoRemove default from cpg_model_initialize - atm there is only one model
Steven Dake [Fri, 21 Oct 2011 09:18:57 +0000 (02:18 -0700)]
Remove default from cpg_model_initialize - atm there is only one model

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
12 years agoRemove dead code in evs service
Steven Dake [Fri, 21 Oct 2011 09:18:10 +0000 (02:18 -0700)]
Remove dead code in evs service

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
12 years agoRemove dead code in votequorum
Steven Dake [Fri, 21 Oct 2011 09:17:19 +0000 (02:17 -0700)]
Remove dead code in votequorum

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
12 years agoTEST: make cpgbench go to 1M
Angus Salkeld [Fri, 21 Oct 2011 08:56:51 +0000 (19:56 +1100)]
TEST: make cpgbench go to 1M

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoRemove references to README.devmap
Angus Salkeld [Fri, 21 Oct 2011 08:53:52 +0000 (19:53 +1100)]
Remove references to README.devmap

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoRemove dead code in cfg.c
Steven Dake [Fri, 21 Oct 2011 09:15:39 +0000 (02:15 -0700)]
Remove dead code in cfg.c

Signed-off-by: Steven Dake <sdake@redhat.com>
12 years agoRemove old README.devmap
Angus Salkeld [Fri, 21 Oct 2011 08:26:16 +0000 (19:26 +1100)]
Remove old README.devmap

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoMAN: remove unused man pages
Angus Salkeld [Fri, 21 Oct 2011 04:05:27 +0000 (15:05 +1100)]
MAN: remove unused man pages

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoLOG: use libqb facility conversion functions
Angus Salkeld [Fri, 21 Oct 2011 02:29:41 +0000 (13:29 +1100)]
LOG: use libqb facility conversion functions

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoLOG: get logging to file working correctly
Angus Salkeld [Fri, 21 Oct 2011 02:06:20 +0000 (13:06 +1100)]
LOG: get logging to file working correctly

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoLOG: Fix debugging
Angus Salkeld [Fri, 21 Oct 2011 01:21:31 +0000 (12:21 +1100)]
LOG: Fix debugging

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
12 years agoUpdated TODO with feedback from community defining our Needle 2.0/2.1 goals
Steven Dake [Wed, 19 Oct 2011 16:10:54 +0000 (09:10 -0700)]
Updated TODO with feedback from community defining our Needle 2.0/2.1 goals

Signed-off-by: Steven Dake <sdake@redhat.com>
Revieweed-by: Jan Friesse <jfriesse@redhat.com>
12 years agoAdd --concurrency to coverity make target
Steven Dake [Wed, 19 Oct 2011 15:10:09 +0000 (08:10 -0700)]
Add --concurrency to coverity make target

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
12 years agoRemove cloned lines in main of main.c
Masatake YAMATO [Thu, 6 Oct 2011 10:09:01 +0000 (19:09 +0900)]
Remove cloned lines in main of main.c

Signed-off-by: Masatake YAMATO <yamato@redhat.com>
13 years agoDeliver all messages from my_high_seq_recieved to the last gap
Steven Dake [Mon, 19 Sep 2011 23:38:34 +0000 (16:38 -0700)]
Deliver all messages from my_high_seq_recieved to the last gap

This patch passes two test cases:

-------
Test #1
-------
Two node cluster - run cpgbench on each node

modify totemsrp with following defines:
Two test cases:

-------
Test #2
-------
5 node cluster

start 5 nodes randomly at about same time, start 5 nodes randomly at about
same time, wait 10 seconds and attempt to send a message.  If message blocks
on "TRY_AGAIN" likely a message loss has occured.  Wait a few minutes without
cyclng the nodes and see if the TRY_AGAIN state becomes unblocked.

If it doesn't the test case has failed

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agototemconfig: change minimum RRP threshold
Jan Friesse [Thu, 8 Sep 2011 07:40:04 +0000 (09:40 +0200)]
totemconfig: change minimum RRP threshold

RRP threshold can be lower value then 5.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Fabio M. Di Nitto <fdinitto@redhat.com>
13 years agoAllow conditional rpmbuilds of NSS feature
Steven Dake [Mon, 5 Sep 2011 16:16:35 +0000 (09:16 -0700)]
Allow conditional rpmbuilds of NSS feature

NSS is currently non-conditional.  Allow nss to be build conditonally.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalked@redhat.com>
13 years agoIgnore memb_join messages during flush operations
Steven Dake [Wed, 31 Aug 2011 05:25:21 +0000 (22:25 -0700)]
Ignore memb_join messages during flush operations

a memb_join operation that occurs during flushing can result in an
entry into the GATHER state from the RECOVERY state.  This results in the
regular sort queue being used instead of the recovery sort queue, resulting
in segfault.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agorrp: Higher threshold in passive mode for mcast
Jan Friesse [Mon, 29 Aug 2011 13:09:52 +0000 (15:09 +0200)]
rrp: Higher threshold in passive mode for mcast

There were too much false positives with passive mode rrp when high
number of messages were received.

Patch adds new configurable variable rrp_problem_count_mcast_threshold
which is by default 10 times rrp_problem_count_threshold and this is
used as threshold for multicast packets in passive mode. Variable is
unused in active mode.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed by: Steven Dake <sdake@redhat.com>

13 years agorrp: Handle endless loop if all ifaces are faulty
Jan Friesse [Mon, 29 Aug 2011 08:44:05 +0000 (10:44 +0200)]
rrp: Handle endless loop if all ifaces are faulty

If all interfaces were faulty, passive_mcast_flush_send and related
functions ended in endless loop. This is now handled and if there is no
live interface, message is dropped.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed by: Steven Dake <sdake@redhat.com>

13 years agoGet rid of hdb usage in totempg.h interface
Steven Dake [Tue, 23 Aug 2011 21:11:49 +0000 (14:11 -0700)]
Get rid of hdb usage in totempg.h interface

hdb has some expense and is not necessary in the totempg.so runtime.  This
patch removes the dependence on hdb and instead uses a direct pointer.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoRemove hdb.h header includes from unnecessary files
Steven Dake [Tue, 23 Aug 2011 20:03:01 +0000 (13:03 -0700)]
Remove hdb.h header includes from unnecessary files

The files in this patch do not use the hdb.h header.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoUse qb_hdb instead of mutex based hdb code
Steven Dake [Tue, 23 Aug 2011 03:07:29 +0000 (20:07 -0700)]
Use qb_hdb instead of mutex based hdb code

Rid ourselves of the mutex usage still in the code base

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoAdd totempg_threaded_mode_enable() api
Steven Dake [Mon, 22 Aug 2011 22:23:51 +0000 (15:23 -0700)]
Add totempg_threaded_mode_enable() api

This API allows totem to operate as a multithreaded library.  Performance is
better without threads but some library users may only have multithreaded
systems.  In the corosync case where we have removed threads, this reduces
cpu utilization by ~10% by removing about 50% of the mutex lock and unlock calls
that occur during typical operation.  Since the latest corosync is nearly
thread free, there is no need for mutex operations.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoMove cs_queue.h from include directory to exec directory
Steven Dake [Mon, 22 Aug 2011 19:52:50 +0000 (12:52 -0700)]
Move cs_queue.h from include directory to exec directory

This file is only used by totemsrp.c.  Move out of general include
directory.

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agouse va version of external log function
Steven Dake [Mon, 22 Aug 2011 19:46:17 +0000 (12:46 -0700)]
use va version of external log function

This removes a sprintf operation in the totem and ipc logging operations

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoA CPG client can sometimes lockup if the local node is in the downlist
Tim Beale [Thu, 18 Aug 2011 12:57:10 +0000 (14:57 +0200)]
A CPG client can sometimes lockup if the local node is in the downlist

In a 10-node cluster where all nodes are booting up and starting corosync
at the same time, sometimes during this process corosync detects a node as
leaving and rejoining the cluster.

Occasionally the downlist that gets picked contains the local node. When the
local node sends leave events for the downlist (including itself), it sets
its cpd state to CPD_STATE_UNJOINED and clears the cpd->group_name. This
means it no longer sends CPG events to the CPG client.

Reviewed-by: Jan Friesse <jfriesse@redhat.com>
13 years agoDisplay ring-ID consistently in debug
Tim Beale [Wed, 17 Aug 2011 02:15:16 +0000 (12:15 +1000)]
Display ring-ID consistently in debug

Ring ID was being displayed both as hex and decimal in places. Update so
it's displayed consistently (I chose hex) to make debugging easier.

Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoAdd code comment mapping for message handler defines
Tim Beale [Wed, 17 Aug 2011 01:52:25 +0000 (11:52 +1000)]
Add code comment mapping for message handler defines

As a corosync-newbie it can be hard to bridge the gap between where a
particular message is sent and where the receive handler processes it,
and vice versa.

Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoRemove -lcoroipcc from tools/Makefile.am notifyd
Steven Dake [Tue, 9 Aug 2011 21:51:12 +0000 (14:51 -0700)]
Remove -lcoroipcc from tools/Makefile.am notifyd

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoproperly define rec_token_cq_send_event_fn
Steven Dake [Tue, 9 Aug 2011 21:50:42 +0000 (14:50 -0700)]
properly define rec_token_cq_send_event_fn

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoDefine totemiba_log_printf properly
Steven Dake [Tue, 9 Aug 2011 21:49:41 +0000 (14:49 -0700)]
Define totemiba_log_printf properly

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoFix problem in totemiba where incorrect define is used (and also not defined)
Steven Dake [Tue, 9 Aug 2011 21:48:19 +0000 (14:48 -0700)]
Fix problem in totemiba where incorrect define is used (and also not defined)

Signed-off-by: Steven Dake <sdake@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoAllow compile master on RHEL 6
Jan Friesse [Tue, 9 Aug 2011 07:21:45 +0000 (09:21 +0200)]
Allow compile master on RHEL 6

corosync_timer_handle_t is know conditionally defined to prevent double
definition causing compile fault on RHEL 6 systems.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoMake realtime scheduling optional not the default.
Angus Salkeld [Sun, 7 Aug 2011 12:31:30 +0000 (22:31 +1000)]
Make realtime scheduling optional not the default.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoUpdate TODOs
Angus Salkeld [Fri, 5 Aug 2011 04:12:53 +0000 (14:12 +1000)]
Update TODOs

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: logging & trace
Angus Salkeld [Sun, 17 Jul 2011 13:16:31 +0000 (23:16 +1000)]
libqb: logging & trace

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: Add libqb dependency in the rpm & pc file
Angus Salkeld [Wed, 13 Jul 2011 06:14:06 +0000 (16:14 +1000)]
libqb: Add libqb dependency in the rpm & pc file

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoFix some compiler warnings
Angus Salkeld [Wed, 13 Jul 2011 06:12:47 +0000 (16:12 +1000)]
Fix some compiler warnings

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoUse PATH_MAX for file path size
Angus Salkeld [Thu, 14 Jul 2011 06:53:50 +0000 (16:53 +1000)]
Use PATH_MAX for file path size

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: Improve IPC dispatch and async handling
Angus Salkeld [Thu, 9 Dec 2010 06:10:18 +0000 (17:10 +1100)]
libqb: Improve IPC dispatch and async handling

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCPG: downgrade some log messages
Angus Salkeld [Wed, 8 Dec 2010 00:35:33 +0000 (11:35 +1100)]
CPG: downgrade some log messages

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: make test agent shutdown more gracefully
Angus Salkeld [Tue, 14 Dec 2010 06:12:28 +0000 (16:12 +1000)]
CTS: make test agent shutdown more gracefully

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoCTS: add an audit to check for leaking shared memory
Angus Salkeld [Tue, 14 Dec 2010 02:42:00 +0000 (13:42 +1100)]
CTS: add an audit to check for leaking shared memory

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoCTS: get GenStopAllBeekhof working a bit better
Angus Salkeld [Thu, 9 Dec 2010 06:03:13 +0000 (17:03 +1100)]
CTS: get GenStopAllBeekhof working a bit better

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: log bind() errors better
Angus Salkeld [Thu, 9 Dec 2010 06:14:07 +0000 (17:14 +1100)]
CTS: log bind() errors better

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: log cfg results
Angus Salkeld [Wed, 8 Dec 2010 02:33:35 +0000 (13:33 +1100)]
CTS: log cfg results

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: rename flatiron to needle
Angus Salkeld [Tue, 23 Nov 2010 00:46:25 +0000 (11:46 +1100)]
CTS: rename flatiron to needle

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: add exit handler to test_agents
Angus Salkeld [Wed, 8 Dec 2010 01:18:02 +0000 (12:18 +1100)]
CTS: add exit handler to test_agents

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: add "Too many open files" to the BadNews pattern
Angus Salkeld [Thu, 9 Dec 2010 06:12:07 +0000 (17:12 +1100)]
CTS: add "Too many open files" to the BadNews pattern

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: impove debug during msgSend test
Angus Salkeld [Tue, 23 Nov 2010 00:47:33 +0000 (11:47 +1100)]
CTS: impove debug during msgSend test

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: add logging to test agent
Angus Salkeld [Wed, 8 Dec 2010 00:28:06 +0000 (11:28 +1100)]
CTS: add logging to test agent

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoCTS: increse wait for node to reboot
Angus Salkeld [Thu, 11 Nov 2010 04:45:40 +0000 (15:45 +1100)]
CTS: increse wait for node to reboot

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoCTS: support new pacemaker-cts
Angus Salkeld [Thu, 11 Nov 2010 04:47:43 +0000 (15:47 +1100)]
CTS: support new pacemaker-cts

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoAUGEAS: fix "tags" log field
Angus Salkeld [Fri, 3 Dec 2010 03:29:12 +0000 (14:29 +1100)]
AUGEAS: fix "tags" log field

Reviewed-by: Steven Dake <sdake@redhat.com>
Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
13 years agoTEST: fix the print out when cpg_finalize() fails
Angus Salkeld [Tue, 14 Dec 2010 06:15:08 +0000 (17:15 +1100)]
TEST: fix the print out when cpg_finalize() fails

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: use the new cs_strerror() to print out the error message.
Angus Salkeld [Tue, 14 Dec 2010 02:05:49 +0000 (13:05 +1100)]
libqb: use the new cs_strerror() to print out the error message.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: fix iov_len in pcmk_test
Angus Salkeld [Mon, 15 Nov 2010 02:39:04 +0000 (13:39 +1100)]
libqb: fix iov_len in pcmk_test

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: fix valgring warnings in mon/wd
Angus Salkeld [Sun, 14 Nov 2010 12:54:27 +0000 (23:54 +1100)]
libqb: fix valgring warnings in mon/wd

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: change pause_timestamp to uint64_t
Angus Salkeld [Tue, 16 Nov 2010 22:16:34 +0000 (09:16 +1100)]
libqb: change pause_timestamp to uint64_t

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: rip out objdb & serialize locks
Angus Salkeld [Thu, 11 Nov 2010 21:32:37 +0000 (08:32 +1100)]
libqb: rip out objdb & serialize locks

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: only init IPC on service engines that need it.
Angus Salkeld [Mon, 15 Nov 2010 10:20:23 +0000 (21:20 +1100)]
libqb: only init IPC on service engines that need it.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: remove the lib init/exit from the test service agent
Angus Salkeld [Thu, 11 Nov 2010 03:38:38 +0000 (14:38 +1100)]
libqb: remove the lib init/exit from the test service agent

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: use the main loop to shutdown
Angus Salkeld [Mon, 15 Nov 2010 10:19:18 +0000 (21:19 +1100)]
libqb: use the main loop to shutdown

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: remove tsafe.c
Angus Salkeld [Fri, 5 Aug 2011 02:30:14 +0000 (12:30 +1000)]
libqb: remove tsafe.c

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: remove worker thread - keep to one thread.
Angus Salkeld [Wed, 10 Nov 2010 08:38:34 +0000 (18:38 +1000)]
libqb: remove worker thread - keep to one thread.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: make timer api a wrapper around qb_loop timers.
Angus Salkeld [Wed, 10 Nov 2010 08:38:33 +0000 (19:38 +1100)]
libqb: make timer api a wrapper around qb_loop timers.

- change timeout value to nano seconds
- fix timer handles (don't alloc on stack)

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: change ipc -> qb_ipc
Angus Salkeld [Fri, 5 Aug 2011 02:18:43 +0000 (12:18 +1000)]
libqb: change ipc -> qb_ipc

IPC: return 0/-ENOBUFS from message handler
IPC: use the new rate_limit API to improve perf.
CPG: add send_async API & hook up flow control
IPC: Fix flow control getting stuck.
IPC: Port the remaining libs to use libqb IPC
IPC: remove libqb flowcontrol API
TEST: put cpg_dispatch() in it's own thread
IPC: cleanup ipc_glue.c name everything cs_ipcs_*()
IPC: add back statistics
IPC: remove coroipcc_ symbols from lib*.versions
IPC: init each se's IPC as it is loaded.
IPC: use the new connection_closed() event to free the context.
IPC: re-add zero copy functionality back
IPC: remove cpg_mcast_joined_async() and make it the default
 -> now cpg_mcast_joined() == cpg_mcast_joined_async()
libqb: expose a libqb error converter
libqb: add missing error conversions
libqb: remove repeat try loop in lib/cpg.c
CPG: fix zero copy mcast
CPG: use newer return codes
Add ENOTCONN to qb_to_cs_error()
libqb: fix error conversion from errno to cs_error_t in confdb
libqb: change errno_to_cs to qb_to_cs_error
libqb: add a cs_strerror() to get a more meaningful message
libqb: fix some confusing error conversions.
libqb: set the timeout on recv's to -1 (wait forever)

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agolibqb: convert coropoll calls to qb_loop calls.
Angus Salkeld [Fri, 5 Aug 2011 01:52:28 +0000 (11:52 +1000)]
libqb: convert coropoll calls to qb_loop calls.

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoAdd systemd unit files for corosync and corosync-notifyd
Angus Salkeld [Mon, 8 Aug 2011 11:01:52 +0000 (21:01 +1000)]
Add systemd unit files for corosync and corosync-notifyd

Signed-off-by: Angus Salkeld <asalkeld@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agocorosync.conf.example: add note about host addresses in bindnetaddr
Florian Haas [Mon, 1 Aug 2011 06:47:58 +0000 (08:47 +0200)]
corosync.conf.example: add note about host addresses in bindnetaddr

https://lists.linux-foundation.org/pipermail/openais/2011-July/016563.html

Jan Friesse pointed out that bindnetaddr should be set to a host
address (as opposed to a network address) on hosts where multiple
NICs live on the same subnet. Add a comment to that effect to
the example configuration file.

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agocorosync.conf.example: include comments
Florian Haas [Tue, 26 Jul 2011 16:54:10 +0000 (18:54 +0200)]
corosync.conf.example: include comments

It's nice to say people should read the man page. It's also naive to
assume that they always do. Include comments in the example config
file itself.

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Dan Frincu <dan.frincu@1and1.ro>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agocorosync.conf.example: change mcastaddr
Florian Haas [Tue, 26 Jul 2011 16:16:31 +0000 (18:16 +0200)]
corosync.conf.example: change mcastaddr

Change suggested mcastaddr to one in the 239.255.0.0/16
pseudo-subnet. Multicast addresses outside 239.x.x.x may be IANA
registered and can clash with other services present on the
network. Suggest an address defined as part of the multicast IPv4
Local Scope in RFC 2365.

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Dan Frincu <dan.frincu@1and1.ro>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agocorosync.conf.example: change bindnetaddr
Florian Haas [Tue, 26 Jul 2011 16:14:53 +0000 (18:14 +0200)]
corosync.conf.example: change bindnetaddr

Change the example configuration file so "bindnetaddr" has a value
that more obviously looks like a network address. So as not to have
people think they need to set an existing IP address here (and hence,
have non-identical corosync.conf files between nodes).

Signed-off-by: Florian Haas <florian.haas@linbit.com>
Reviewed-by: Dan Frincu <dan.frincu@1and1.ro>
Reviewed-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agomain: let poll really stop before totempg_finalize
Jan Friesse [Mon, 25 Jul 2011 13:18:10 +0000 (15:18 +0200)]
main: let poll really stop before totempg_finalize

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoRevert "totemsrp: Remove recv_flush code"
Jan Friesse [Tue, 26 Jul 2011 08:05:34 +0000 (10:05 +0200)]
Revert "totemsrp: Remove recv_flush code"

This reverts commit 1a7b7a39f445be63c697170c1680eeca9834de39.

Reversion is needed to remove overflow of receive buffers and dropping
messages.

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
13 years agototemsrp: fix buffer overflows for large clusters (> 100 nodes)
MORITA Kazutaka [Sun, 24 Jul 2011 09:58:40 +0000 (18:58 +0900)]
totemsrp: fix buffer overflows for large clusters (> 100 nodes)

Signed-off-by: MORITA Kazutaka <morita.kazutaka@lab.ntt.co.jp>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agospecfile: Install corosync-signals.conf for dbus
Jan Friesse [Tue, 19 Jul 2011 14:41:44 +0000 (16:41 +0200)]
specfile: Install corosync-signals.conf for dbus

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agospecfile: use _datadir as var expansion not exec
Jan Friesse [Tue, 19 Jul 2011 14:35:28 +0000 (16:35 +0200)]
specfile: use _datadir as var expansion not exec

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agospecfile: Correct URL and source0
Jan Friesse [Tue, 19 Jul 2011 13:21:45 +0000 (15:21 +0200)]
specfile: Correct URL and source0

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agoAdd some more stats for debugging
Tim Beale [Tue, 19 Jul 2011 15:58:21 +0000 (08:58 -0700)]
Add some more stats for debugging

+ overload - number of times client is told to try again
+ invalid_request - message contained invalid paramter, e.g. invalid size
+ msg_queue_avail - messages currently available at the Totem layer
+ msg-queue_reserved - messages currently reserved at the Totem layer

Signed-off-by: Tim Beale <tim.beale@alliedtelesis.co.nz>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agorrp: Handle rollower in passive rrp properly
Jan Friesse [Fri, 15 Jul 2011 12:29:06 +0000 (08:29 -0400)]
rrp: Handle rollower in passive rrp properly

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>
13 years agorrp: handle rollover in active rrp properly
Jan Friesse [Tue, 12 Jul 2011 10:55:16 +0000 (06:55 -0400)]
rrp: handle rollover in active rrp properly

Signed-off-by: Jan Friesse <jfriesse@redhat.com>
Reviewed-by: Steven Dake <sdake@redhat.com>