git.proxmox.com Git - pve-ha-manager.git/log
pve-ha-manager.git
Dietmar Maurer [Fri, 27 Mar 2015 05:28:50 +0000 (06:28 +0100)]
improve docu

Dietmar Maurer [Thu, 26 Mar 2015 16:17:49 +0000 (17:17 +0100)]
remove dead code

Dietmar Maurer [Thu, 26 Mar 2015 15:47:18 +0000 (16:47 +0100)]
add another test

Dietmar Maurer [Thu, 26 Mar 2015 15:39:56 +0000 (16:39 +0100)]
add another test case

Dietmar Maurer [Thu, 26 Mar 2015 12:23:20 +0000 (13:23 +0100)]
bump version 0.5-1

Dietmar Maurer [Thu, 26 Mar 2015 12:01:27 +0000 (13:01 +0100)]
implement migrate

Dietmar Maurer [Thu, 26 Mar 2015 11:50:47 +0000 (12:50 +0100)]
implement change_service_location

Dietmar Maurer [Thu, 26 Mar 2015 09:43:06 +0000 (10:43 +0100)]
lrm: fix stop timeout

Dietmar Maurer [Thu, 26 Mar 2015 09:21:02 +0000 (10:21 +0100)]
fix service dependencies

So that we can shut down without triggering the watchdog. It is also
important to depend on syslog.service (else logs get lost).

Dietmar Maurer [Thu, 26 Mar 2015 07:08:58 +0000 (08:08 +0100)]
assume lrm mode 'active' by default

Dietmar Maurer [Thu, 26 Mar 2015 07:01:38 +0000 (08:01 +0100)]
log errors when writing lrm status

And correctly write status once at daemon startup (we need to wait for quorum)

Dietmar Maurer [Thu, 26 Mar 2015 06:26:24 +0000 (07:26 +0100)]
write lrm mode into lrm status file

LRM is normally in 'active' mode, but can be set to 'reboot', 'shutdown' or 'restart'.
We use this to freeze services, so that we can safely reboot a node, or restart
the LRM.
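
The mode check described above can be sketched roughly as follows. This is an
illustration only: the function and constant names are hypothetical and not
taken from pve-ha-manager, and it assumes that every mode other than 'active'
freezes services.

```python
# Hypothetical sketch: names are illustrative and not taken from pve-ha-manager.
LRM_MODES = ("active", "reboot", "shutdown", "restart")


def services_frozen(lrm_mode="active"):
    """Return True if services should be frozen for the given LRM mode.

    Assumption: every mode other than 'active' freezes services.
    """
    if lrm_mode not in LRM_MODES:
        raise ValueError("unknown LRM mode: %r" % (lrm_mode,))
    return lrm_mode != "active"
```

Defaulting the argument to 'active' mirrors the later commit that assumes
lrm mode 'active' when nothing else is recorded.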

Dietmar Maurer [Wed, 25 Mar 2015 12:59:47 +0000 (13:59 +0100)]
bump version to 0.4-1

Dietmar Maurer [Wed, 25 Mar 2015 12:09:28 +0000 (13:09 +0100)]
increase fence_delay to 60 seconds

To match the watchdog timeout.
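
A minimal sketch of how such a delay might gate fencing. Only the 60 second
value comes from the commit message; the function name, its parameters and the
way the delay is evaluated are assumptions made for illustration.

```python
FENCE_DELAY = 60  # seconds; the value named in the commit message


def may_fence(offline_since, now, fence_delay=FENCE_DELAY):
    """Return True once a node has been offline for at least fence_delay seconds.

    Hypothetical helper: both timestamps are plain seconds from the same clock.
    """
    return (now - offline_since) >= fence_delay
```

Waiting at least as long as the watchdog timeout gives the failed node's
watchdog time to expire before its services are recovered elsewhere.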

Dietmar Maurer [Wed, 25 Mar 2015 12:04:28 +0000 (13:04 +0100)]
remove dead code

Dietmar Maurer [Wed, 25 Mar 2015 12:00:09 +0000 (13:00 +0100)]
fix failover after master crash with pending fence action

Also include a test case for that.

Dietmar Maurer [Wed, 25 Mar 2015 08:06:16 +0000 (09:06 +0100)]
add README for regression test

The idea is to describe each test briefly, so that it is easier
to understand its purpose.

Dietmar Maurer [Wed, 25 Mar 2015 08:01:59 +0000 (09:01 +0100)]
re-enable ha-tester (run regression tests)

Dietmar Maurer [Wed, 25 Mar 2015 07:58:18 +0000 (08:58 +0100)]
remove stale tests

Dietmar Maurer [Wed, 25 Mar 2015 07:51:57 +0000 (08:51 +0100)]
fix regression test environment

Dietmar Maurer [Wed, 25 Mar 2015 07:49:48 +0000 (08:49 +0100)]
move exec_resource_agent() to PVE::HA::Sim::Env

so that we can reuse it with regression tests

Dietmar Maurer [Wed, 25 Mar 2015 07:48:29 +0000 (08:48 +0100)]
LRM do not use time(), improve logging

Dietmar Maurer [Wed, 25 Mar 2015 07:46:22 +0000 (08:46 +0100)]
add a hack to support regression tests (can_fork())

Dietmar Maurer [Wed, 25 Mar 2015 07:44:36 +0000 (08:44 +0100)]
use separate binary for regression tester

Dietmar Maurer [Wed, 25 Mar 2015 07:42:27 +0000 (08:42 +0100)]
use $haenv->get_time() instead of time()

Dietmar Maurer [Wed, 25 Mar 2015 07:40:51 +0000 (08:40 +0100)]
improve logging

sort keys so that we get ordered logs with regression tests.

Dietmar Maurer [Tue, 24 Mar 2015 06:11:01 +0000 (07:11 +0100)]
ha-manager: new 'status' command to print crm manager status

Dietmar Maurer [Tue, 24 Mar 2015 05:55:47 +0000 (06:55 +0100)]
remove newlines from log messages

Dietmar Maurer [Tue, 24 Mar 2015 05:54:42 +0000 (06:54 +0100)]
remove stale service from manager state

Dietmar Maurer [Tue, 17 Mar 2015 07:04:54 +0000 (08:04 +0100)]
pve-ha-simulator: write log file

Dietmar Maurer [Tue, 17 Mar 2015 06:48:31 +0000 (07:48 +0100)]
remove test_ha_agent_lock

We need to use the normal get_ha_agent_lock

Dietmar Maurer [Mon, 16 Mar 2015 12:03:46 +0000 (13:03 +0100)]
bump version to 0.3-1

Dietmar Maurer [Mon, 16 Mar 2015 12:01:03 +0000 (13:01 +0100)]
really activate softdog

We can set options in /lib/modprobe.d/aliases.conf

Dietmar Maurer [Mon, 16 Mar 2015 11:16:57 +0000 (12:16 +0100)]
correctly count active services

Dietmar Maurer [Mon, 16 Mar 2015 10:35:02 +0000 (11:35 +0100)]
implement fence_delay to avoid immediate fencing

Dietmar Maurer [Mon, 16 Mar 2015 09:48:56 +0000 (10:48 +0100)]
reset watchdog with poweroff

Dietmar Maurer [Mon, 16 Mar 2015 07:13:36 +0000 (08:13 +0100)]
release agent lock if requested by master

Dietmar Maurer [Mon, 16 Mar 2015 06:09:38 +0000 (07:09 +0100)]
use option nofailback for default simulator groups

So that one can migrate VMs to other nodes (else they get moved back immediately).
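
The effect of nofailback can be illustrated with a small node-selection
sketch. Everything here is hypothetical: the function name, the priority map
and the selection rule are assumptions for illustration, not the manager's
actual code.

```python
def select_node(current, online_nodes, priority, nofailback=False):
    """Pick the node a service should run on.

    Hypothetical model: 'priority' maps node name to a number, and higher
    numbers are preferred. Nodes not in the map are not group members.
    """
    candidates = [n for n in online_nodes if n in priority]
    if not candidates:
        return None
    # With nofailback, a service stays where it is while that node is usable.
    if nofailback and current in candidates:
        return current
    top = max(priority[n] for n in candidates)
    # Sort nodes of equal priority so that the result is deterministic.
    best = sorted(n for n in candidates if priority[n] == top)
    return current if current in best else best[0]
```

With priorities {"node1": 2, "node2": 1} and a service manually migrated to
node2, the default call selects node1 again, while nofailback=True keeps it on
node2.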

Dietmar Maurer [Sat, 14 Mar 2015 09:06:16 +0000 (10:06 +0100)]
always run regression tests

Dietmar Maurer [Sat, 14 Mar 2015 09:00:43 +0000 (10:00 +0100)]
add simple test for failover

Dietmar Maurer [Sat, 14 Mar 2015 08:54:47 +0000 (09:54 +0100)]
sort nodes inside priority groups to get deterministic results

Dietmar Maurer [Fri, 13 Mar 2015 15:18:35 +0000 (16:18 +0100)]
do not restart watchdog-mux.socket

Dietmar Maurer [Fri, 13 Mar 2015 15:14:26 +0000 (16:14 +0100)]
use systemd condition to avoid daemon startup when ha is disabled

Dietmar Maurer [Fri, 13 Mar 2015 14:37:58 +0000 (15:37 +0100)]
lrm: introduce disabled state

Dietmar Maurer [Fri, 13 Mar 2015 14:24:38 +0000 (15:24 +0100)]
implement service_config_exists() - test if ha is enabled

Dietmar Maurer [Fri, 13 Mar 2015 09:54:32 +0000 (10:54 +0100)]
add upload target

Dietmar Maurer [Fri, 13 Mar 2015 09:03:30 +0000 (10:03 +0100)]
add migrate and relocate commands to cli tool

Dietmar Maurer [Fri, 13 Mar 2015 08:27:03 +0000 (09:27 +0100)]
bump version to 0.2-1

Dietmar Maurer [Fri, 13 Mar 2015 07:36:07 +0000 (08:36 +0100)]
add Restart=no to make it more clear

This service should not be restarted, because a restart would clear
the watchdog timer!

Dietmar Maurer [Fri, 13 Mar 2015 07:31:35 +0000 (08:31 +0100)]
add enable/disable to cli tool

Dietmar Maurer [Fri, 13 Mar 2015 05:57:39 +0000 (06:57 +0100)]
improve API for resources

Dietmar Maurer [Thu, 12 Mar 2015 11:20:18 +0000 (12:20 +0100)]
fix clean target

Dietmar Maurer [Thu, 12 Mar 2015 11:17:34 +0000 (12:17 +0100)]
start API for groups

Dietmar Maurer [Thu, 12 Mar 2015 10:40:57 +0000 (11:40 +0100)]
start implementing an API

Dietmar Maurer [Thu, 12 Mar 2015 09:51:58 +0000 (10:51 +0100)]
pve lrm: implement start/stop

Dietmar Maurer [Thu, 12 Mar 2015 08:39:20 +0000 (09:39 +0100)]
pass service configuration to exec_resource_agent

Dietmar Maurer [Thu, 12 Mar 2015 07:45:19 +0000 (08:45 +0100)]
do not restart watchdog-mux on upgrade

Dietmar Maurer [Thu, 12 Mar 2015 06:08:19 +0000 (07:08 +0100)]
allow to shutdown CRM/LRM

Dietmar Maurer [Thu, 12 Mar 2015 05:55:19 +0000 (06:55 +0100)]
add command line tool

Dietmar Maurer [Wed, 11 Mar 2015 11:31:35 +0000 (12:31 +0100)]
add uuid when adding service first time, log warnings

Dietmar Maurer [Wed, 11 Mar 2015 11:06:25 +0000 (12:06 +0100)]
add service description for pve-ha-lrm

Dietmar Maurer [Wed, 11 Mar 2015 10:37:41 +0000 (11:37 +0100)]
add service state

Dietmar Maurer [Wed, 11 Mar 2015 10:13:46 +0000 (11:13 +0100)]
use boolean compare, only show get_lock messages once

Dietmar Maurer [Wed, 11 Mar 2015 10:00:55 +0000 (11:00 +0100)]
use new PVE::HA::Config

Dietmar Maurer [Wed, 11 Mar 2015 08:29:30 +0000 (09:29 +0100)]
add new class to handle configuration

Dietmar Maurer [Tue, 10 Mar 2015 16:22:59 +0000 (17:22 +0100)]
add parser for resource configuration

Dietmar Maurer [Tue, 10 Mar 2015 14:43:38 +0000 (15:43 +0100)]
set backlog for watchdog-mux.socket

Dietmar Maurer [Tue, 10 Mar 2015 14:23:50 +0000 (15:23 +0100)]
disable OOM killer for watchdog-mux

Dietmar Maurer [Tue, 10 Mar 2015 14:14:19 +0000 (15:14 +0100)]
conflict with watchdog packages

Dietmar Maurer [Tue, 10 Mar 2015 11:31:50 +0000 (12:31 +0100)]
implement get_node_info

Dietmar Maurer [Tue, 10 Mar 2015 11:00:40 +0000 (12:00 +0100)]
mark dead code

Dietmar Maurer [Tue, 10 Mar 2015 10:52:18 +0000 (11:52 +0100)]
use new watchdog-mux.sock socket

Dietmar Maurer [Tue, 10 Mar 2015 10:24:06 +0000 (11:24 +0100)]
add systemd service for pve-ha-crm

Dietmar Maurer [Tue, 10 Mar 2015 10:00:41 +0000 (11:00 +0100)]
implement watchdog update logic

Dietmar Maurer [Tue, 10 Mar 2015 09:46:44 +0000 (10:46 +0100)]
implement magic_close for clients

Dietmar Maurer [Tue, 10 Mar 2015 09:32:48 +0000 (10:32 +0100)]
/watchdog-mux: add signal handling

Dietmar Maurer [Mon, 9 Mar 2015 16:00:47 +0000 (17:00 +0100)]
use new wd_client_t data structure to store info about clients

Dietmar Maurer [Mon, 9 Mar 2015 15:20:20 +0000 (16:20 +0100)]
update watchdog timer each second

Dietmar Maurer [Mon, 9 Mar 2015 11:50:18 +0000 (12:50 +0100)]
add service description, fix socket mode

Dietmar Maurer [Mon, 9 Mar 2015 11:11:46 +0000 (12:11 +0100)]
install systemd files for watchdog-mux

Dietmar Maurer [Mon, 9 Mar 2015 11:00:04 +0000 (12:00 +0100)]
remove dead code from Makefile

Dietmar Maurer [Mon, 9 Mar 2015 10:59:14 +0000 (11:59 +0100)]
run verifyapi before install

Dietmar Maurer [Mon, 9 Mar 2015 10:49:32 +0000 (11:49 +0100)]
avoid perl warning

Dietmar Maurer [Mon, 9 Mar 2015 09:47:19 +0000 (10:47 +0100)]
restructure directory layout for dpkg-buildpackage

Dietmar Maurer [Mon, 9 Mar 2015 08:58:16 +0000 (09:58 +0100)]
fixup watchdog to cleanly compile with systemd

Dietmar Maurer [Mon, 9 Mar 2015 08:35:43 +0000 (09:35 +0100)]
rename watchdog.c to watchdog-mux.c

Dietmar Maurer [Mon, 9 Mar 2015 08:34:21 +0000 (09:34 +0100)]
make simulator package architecture independent

Dietmar Maurer [Mon, 9 Mar 2015 08:30:20 +0000 (09:30 +0100)]
add missing Makefile

Dietmar Maurer [Mon, 9 Mar 2015 08:26:37 +0000 (09:26 +0100)]
make watchdog systemd aware

Dietmar Maurer [Sat, 21 Feb 2015 14:52:10 +0000 (15:52 +0100)]
add experimental watchdog daemon code

I plan to make that a systemd service.

Dietmar Maurer [Sat, 21 Feb 2015 12:42:06 +0000 (13:42 +0100)]
add thoughts about watchdog implementation

Dietmar Maurer [Sat, 21 Feb 2015 09:26:38 +0000 (10:26 +0100)]
add note about two_node cluster option

Dietmar Maurer [Fri, 20 Feb 2015 10:45:37 +0000 (11:45 +0100)]
implement watchdog - first try

Dietmar Maurer [Fri, 20 Feb 2015 05:17:55 +0000 (06:17 +0100)]
implement PVE cfs lock

Dietmar Maurer [Thu, 19 Feb 2015 08:34:26 +0000 (09:34 +0100)]
fix typo

Dietmar Maurer [Thu, 19 Feb 2015 08:33:33 +0000 (09:33 +0100)]
add missing control file for simulator package

Dietmar Maurer [Thu, 19 Feb 2015 08:12:36 +0000 (09:12 +0100)]
add correct lib path for pve-ha-simulator

Dietmar Maurer [Thu, 19 Feb 2015 07:59:01 +0000 (08:59 +0100)]
split simulator into separate package

Dietmar Maurer [Wed, 18 Feb 2015 10:40:35 +0000 (11:40 +0100)]
depend on pve-cluster and newer perl version

Dietmar Maurer [Wed, 18 Feb 2015 10:30:48 +0000 (11:30 +0100)]
create debian package