]>
git.proxmox.com Git - pve-ha-manager.git/log
Dietmar Maurer [Fri, 27 Mar 2015 07:31:13 +0000 (08:31 +0100)]
add timestamp to manager status
Dietmar Maurer [Fri, 27 Mar 2015 05:56:51 +0000 (06:56 +0100)]
update lrm status on each iteration
Dietmar Maurer [Fri, 27 Mar 2015 05:50:45 +0000 (06:50 +0100)]
update_lrm_status: add a time stamp
Dietmar Maurer [Fri, 27 Mar 2015 05:49:19 +0000 (06:49 +0100)]
cleanup lrm startup code
Dietmar Maurer [Fri, 27 Mar 2015 05:32:04 +0000 (06:32 +0100)]
depend on qemu-server
Dietmar Maurer [Fri, 27 Mar 2015 05:28:50 +0000 (06:28 +0100)]
improve docu
Dietmar Maurer [Thu, 26 Mar 2015 16:17:49 +0000 (17:17 +0100)]
remove dead code
Dietmar Maurer [Thu, 26 Mar 2015 15:47:18 +0000 (16:47 +0100)]
add another test
Dietmar Maurer [Thu, 26 Mar 2015 15:39:56 +0000 (16:39 +0100)]
add another test case
Dietmar Maurer [Thu, 26 Mar 2015 12:23:20 +0000 (13:23 +0100)]
bump version 0.5-1
Dietmar Maurer [Thu, 26 Mar 2015 12:01:27 +0000 (13:01 +0100)]
implement migrate
Dietmar Maurer [Thu, 26 Mar 2015 11:50:47 +0000 (12:50 +0100)]
implement change_service_location
Dietmar Maurer [Thu, 26 Mar 2015 09:43:06 +0000 (10:43 +0100)]
lrm: fix stop timeout
Dietmar Maurer [Thu, 26 Mar 2015 09:21:02 +0000 (10:21 +0100)]
fix service dependencies
So that we can shutdown without triggering the watchdog. It is also
important to depend on syslog.service (else logs gets lost)
Dietmar Maurer [Thu, 26 Mar 2015 07:08:58 +0000 (08:08 +0100)]
assume lrm mode 'active' by default
Dietmar Maurer [Thu, 26 Mar 2015 07:01:38 +0000 (08:01 +0100)]
log errors when writing lrm status
And correctly write status once at daemon startup (we need to wait for quorum)
Dietmar Maurer [Thu, 26 Mar 2015 06:26:24 +0000 (07:26 +0100)]
write lrm mode into lrm status file
LRM is normally in 'active' mode, but can be set to 'reboot', 'shutdown' or 'restart'.
We use this to freeze services, so that we can safely reboot a node, or restart
the LRM.
Dietmar Maurer [Wed, 25 Mar 2015 12:59:47 +0000 (13:59 +0100)]
bump version to 0.4-1
Dietmar Maurer [Wed, 25 Mar 2015 12:09:28 +0000 (13:09 +0100)]
increase fence_delay to 60 seconds
To match the watchdog timeout.
Dietmar Maurer [Wed, 25 Mar 2015 12:04:28 +0000 (13:04 +0100)]
remove dead code
Dietmar Maurer [Wed, 25 Mar 2015 12:00:09 +0000 (13:00 +0100)]
fix failover after master crash with pending fence action
Also include a test case for that.
Dietmar Maurer [Wed, 25 Mar 2015 08:06:16 +0000 (09:06 +0100)]
add README for regresstion test
The idea is to describe each test shortly, so that it is easier
to understand the purpose.
Dietmar Maurer [Wed, 25 Mar 2015 08:01:59 +0000 (09:01 +0100)]
re-enable ha-tester (run regression tests)
Dietmar Maurer [Wed, 25 Mar 2015 07:58:18 +0000 (08:58 +0100)]
remove stale tests
Dietmar Maurer [Wed, 25 Mar 2015 07:51:57 +0000 (08:51 +0100)]
fix regression test environment
Dietmar Maurer [Wed, 25 Mar 2015 07:49:48 +0000 (08:49 +0100)]
move exec_resource_agent() to PVE::HA::Sim::Env
so that we can reuse it with regression tests
Dietmar Maurer [Wed, 25 Mar 2015 07:48:29 +0000 (08:48 +0100)]
LRM do not use time(), improve logging
Dietmar Maurer [Wed, 25 Mar 2015 07:46:22 +0000 (08:46 +0100)]
add a hack to support regression tests (can_fork())
Dietmar Maurer [Wed, 25 Mar 2015 07:44:36 +0000 (08:44 +0100)]
use separate binary for regression tester
Dietmar Maurer [Wed, 25 Mar 2015 07:42:27 +0000 (08:42 +0100)]
use $haenv->get_time() instead of time()
Dietmar Maurer [Wed, 25 Mar 2015 07:40:51 +0000 (08:40 +0100)]
improve logging
sort keys so that we get ordered logs with regression tests.
Dietmar Maurer [Tue, 24 Mar 2015 06:11:01 +0000 (07:11 +0100)]
ha-manager: new 'status' command to print crm manager status
Dietmar Maurer [Tue, 24 Mar 2015 05:55:47 +0000 (06:55 +0100)]
remove newlines from log messages
Dietmar Maurer [Tue, 24 Mar 2015 05:54:42 +0000 (06:54 +0100)]
remove stale service from manager state
Dietmar Maurer [Tue, 17 Mar 2015 07:04:54 +0000 (08:04 +0100)]
pve-ha-simulator: write log file
Dietmar Maurer [Tue, 17 Mar 2015 06:48:31 +0000 (07:48 +0100)]
remove test_ha_agent_lock
We need to use the normal get_ha_agent_lock
Dietmar Maurer [Mon, 16 Mar 2015 12:03:46 +0000 (13:03 +0100)]
bump version to 0.3-1
Dietmar Maurer [Mon, 16 Mar 2015 12:01:03 +0000 (13:01 +0100)]
really activate softdog
We can set options in /lib/modprobe.d/aliases.conf
Dietmar Maurer [Mon, 16 Mar 2015 11:16:57 +0000 (12:16 +0100)]
correctly count active services
Dietmar Maurer [Mon, 16 Mar 2015 10:35:02 +0000 (11:35 +0100)]
implement fence_delay to avoid immediate fencing
Dietmar Maurer [Mon, 16 Mar 2015 09:48:56 +0000 (10:48 +0100)]
reset watchdog with poweroff
Dietmar Maurer [Mon, 16 Mar 2015 07:13:36 +0000 (08:13 +0100)]
release agent lock if requested by master
Dietmar Maurer [Mon, 16 Mar 2015 06:09:38 +0000 (07:09 +0100)]
use option nofailback for default simulator groups
So that one can migrate VMs to other nodes (else the get moved back immediately).
Dietmar Maurer [Sat, 14 Mar 2015 09:06:16 +0000 (10:06 +0100)]
always run regression tests
Dietmar Maurer [Sat, 14 Mar 2015 09:00:43 +0000 (10:00 +0100)]
add simple test for failover
Dietmar Maurer [Sat, 14 Mar 2015 08:54:47 +0000 (09:54 +0100)]
sort nodes inside priority groups to get deterministic results
Dietmar Maurer [Fri, 13 Mar 2015 15:18:35 +0000 (16:18 +0100)]
do not restart watchdog-mux.socket
Dietmar Maurer [Fri, 13 Mar 2015 15:14:26 +0000 (16:14 +0100)]
use systemd condition to avoid daemon startup when ha is disabled
Dietmar Maurer [Fri, 13 Mar 2015 14:37:58 +0000 (15:37 +0100)]
lrm: introduce disabled state
Dietmar Maurer [Fri, 13 Mar 2015 14:24:38 +0000 (15:24 +0100)]
implement service_config_exists() - test if ha is enabled
Dietmar Maurer [Fri, 13 Mar 2015 09:54:32 +0000 (10:54 +0100)]
add upload target
Dietmar Maurer [Fri, 13 Mar 2015 09:03:30 +0000 (10:03 +0100)]
add migrate and relocate commands to cli tool
Dietmar Maurer [Fri, 13 Mar 2015 08:27:03 +0000 (09:27 +0100)]
bump version to 0.2-1
Dietmar Maurer [Fri, 13 Mar 2015 07:36:07 +0000 (08:36 +0100)]
add Restart=no to make it more clear
This service should not be re-started, because a restart would clear
the watchdoh timer!
Dietmar Maurer [Fri, 13 Mar 2015 07:31:35 +0000 (08:31 +0100)]
add enable/disable to cli tool
Dietmar Maurer [Fri, 13 Mar 2015 05:57:39 +0000 (06:57 +0100)]
improve API for resources
Dietmar Maurer [Thu, 12 Mar 2015 11:20:18 +0000 (12:20 +0100)]
fix clean target
Dietmar Maurer [Thu, 12 Mar 2015 11:17:34 +0000 (12:17 +0100)]
start API for groups
Dietmar Maurer [Thu, 12 Mar 2015 10:40:57 +0000 (11:40 +0100)]
start implementing an API
Dietmar Maurer [Thu, 12 Mar 2015 09:51:58 +0000 (10:51 +0100)]
pve lrm: implement start/stop
Dietmar Maurer [Thu, 12 Mar 2015 08:39:20 +0000 (09:39 +0100)]
pass service configuration to exec_resource_agent
Dietmar Maurer [Thu, 12 Mar 2015 07:45:19 +0000 (08:45 +0100)]
do not restart watchdog-mux on upgrade
Dietmar Maurer [Thu, 12 Mar 2015 06:08:19 +0000 (07:08 +0100)]
allow to shutdown CRM/LRM
Dietmar Maurer [Thu, 12 Mar 2015 05:55:19 +0000 (06:55 +0100)]
add command line tool
Dietmar Maurer [Wed, 11 Mar 2015 11:31:35 +0000 (12:31 +0100)]
add uuid when adding service first time, log warnings
Dietmar Maurer [Wed, 11 Mar 2015 11:06:25 +0000 (12:06 +0100)]
add service description for pve-ha-lrm
Dietmar Maurer [Wed, 11 Mar 2015 10:37:41 +0000 (11:37 +0100)]
add service state
Dietmar Maurer [Wed, 11 Mar 2015 10:13:46 +0000 (11:13 +0100)]
use boolean compare, only show get_lock messages once
Dietmar Maurer [Wed, 11 Mar 2015 10:00:55 +0000 (11:00 +0100)]
use new PVE::HA:Config
Dietmar Maurer [Wed, 11 Mar 2015 08:29:30 +0000 (09:29 +0100)]
add new class to handle configuration
Dietmar Maurer [Tue, 10 Mar 2015 16:22:59 +0000 (17:22 +0100)]
add parser for resource configuration
Dietmar Maurer [Tue, 10 Mar 2015 14:43:38 +0000 (15:43 +0100)]
set backlog for watchdog-mux.socket
Dietmar Maurer [Tue, 10 Mar 2015 14:23:50 +0000 (15:23 +0100)]
disable OOM killer for watchdog-mux
Dietmar Maurer [Tue, 10 Mar 2015 14:14:19 +0000 (15:14 +0100)]
conflict with watchdog packages
Dietmar Maurer [Tue, 10 Mar 2015 11:31:50 +0000 (12:31 +0100)]
implement get_node_info
Dietmar Maurer [Tue, 10 Mar 2015 11:00:40 +0000 (12:00 +0100)]
mark dead code
Dietmar Maurer [Tue, 10 Mar 2015 10:52:18 +0000 (11:52 +0100)]
use new watchdog-mux.sock socket
Dietmar Maurer [Tue, 10 Mar 2015 10:24:06 +0000 (11:24 +0100)]
add systemd service for pve-ha-crm
Dietmar Maurer [Tue, 10 Mar 2015 10:00:41 +0000 (11:00 +0100)]
implement watchdog update logic
Dietmar Maurer [Tue, 10 Mar 2015 09:46:44 +0000 (10:46 +0100)]
implement magic_close for clients
Dietmar Maurer [Tue, 10 Mar 2015 09:32:48 +0000 (10:32 +0100)]
/watchdog-mux: add signal handling
Dietmar Maurer [Mon, 9 Mar 2015 16:00:47 +0000 (17:00 +0100)]
use new wd_client_t data structure to store info about clients
Dietmar Maurer [Mon, 9 Mar 2015 15:20:20 +0000 (16:20 +0100)]
update watchdog timer each second
Dietmar Maurer [Mon, 9 Mar 2015 11:50:18 +0000 (12:50 +0100)]
add service description, fix socket mode
Dietmar Maurer [Mon, 9 Mar 2015 11:11:46 +0000 (12:11 +0100)]
install systemd files for watchdog-mux
Dietmar Maurer [Mon, 9 Mar 2015 11:00:04 +0000 (12:00 +0100)]
remove dead code fro Makefile
Dietmar Maurer [Mon, 9 Mar 2015 10:59:14 +0000 (11:59 +0100)]
run verifyapi before install
Dietmar Maurer [Mon, 9 Mar 2015 10:49:32 +0000 (11:49 +0100)]
avoid perl warning
Dietmar Maurer [Mon, 9 Mar 2015 09:47:19 +0000 (10:47 +0100)]
restructure directory layout for dpkg-buildpackage
Dietmar Maurer [Mon, 9 Mar 2015 08:58:16 +0000 (09:58 +0100)]
fixup watchdog to cleanly compile with systemd
Dietmar Maurer [Mon, 9 Mar 2015 08:35:43 +0000 (09:35 +0100)]
rename watchdog.c to watchdog-mux.c
Dietmar Maurer [Mon, 9 Mar 2015 08:34:21 +0000 (09:34 +0100)]
make simulator package architecture independent
Dietmar Maurer [Mon, 9 Mar 2015 08:30:20 +0000 (09:30 +0100)]
add missing Makefile
Dietmar Maurer [Mon, 9 Mar 2015 08:26:37 +0000 (09:26 +0100)]
make watchdog systemd aware
Dietmar Maurer [Sat, 21 Feb 2015 14:52:10 +0000 (15:52 +0100)]
add experimental watchdog daemon code
I plan to make that a systemd service.
Dietmar Maurer [Sat, 21 Feb 2015 12:42:06 +0000 (13:42 +0100)]
add thoughts about watchdog implementation
Dietmar Maurer [Sat, 21 Feb 2015 09:26:38 +0000 (10:26 +0100)]
add note about two_node cluster option
Dietmar Maurer [Fri, 20 Feb 2015 10:45:37 +0000 (11:45 +0100)]
implement watchdog - first try
Dietmar Maurer [Fri, 20 Feb 2015 05:17:55 +0000 (06:17 +0100)]
implement PVE cfs lock
Dietmar Maurer [Thu, 19 Feb 2015 08:34:26 +0000 (09:34 +0100)]
fix typo