Thomas Lamprecht [Mon, 30 Jul 2018 12:31:00 +0000 (14:31 +0200)]
api/join: avoid using an IPv6 address as worker task ID
We used the hostname of the node over which we joined a cluster as
worker ID, which is then encoded in its task UPID - a unique ID with
encoded information, separated by colons.
While this is no problem for normal hostnames or IPv4 addresses, the
hostname can also be an IPv6 address - which itself contains
colons. This throws off the upid_decode method.
While, from a user POV, the best solution would probably be to
connect to the join peer and query the cluster name there, it is much
simpler to just omit the ID and avoid such problems.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
pve-cluster is not a big project with too many dependencies, so
autotools was a bit of an overkill for it.
Drop it, and the ./configure step in general, and just use a plain
Makefile - in combination with pkg-config - like we do in our other
projects.
Build time gets reduced quite a bit - albeit it was never that long
anyway:
(old autotools) time make deb
make deb 12.96s user 1.78s system 94% cpu 15.543 total
(new plain makefile) time make deb
make deb 9.40s user 1.14s system 100% cpu 10.465 total
A third less time needed here, and with parallel compilation I can
shave off another 1.5 seconds, so almost half of the original
time.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
fix #1843: always free certificate file after reading it
Reading https://$host:8006/nodes repeatedly leads to pveproxy keeping
a file descriptor open for each node certificate in the cluster,
eventually reaching its NOFILE limit.
to work around the case that we may possibly get into the
uninterruptible D state.
While this may still happen, it now happens in a fork, and we can
return an error to our caller.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
We call this in pve-cluster.service as ExecStartPost. We prefix it
with '-' to tell systemd that it should ignore non-zero exit codes,
but if the command hangs (e.g., on IO) systemd kills it after a
timeout (90 seconds by default), which then doesn't get ignored - the
unit is put into failure state and stopped.
We specifically do not want this to happen, so wrap the updatecerts
call in run_with_timeout and give it a maximum of 30 seconds to
finish.
Reviewed-by: Stoiko Ivanov <s.ivanov@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
IPCConst.pm depends on cfs-ipc-ops.h. Additionally, since
the header is the "input" and IPCConst.pm.awk is the
generator, use the header as the main direct dependency and
add the generator as a secondary dependency afterwards
(thus we have to swap the awk parameters).
Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>
Thomas Lamprecht [Fri, 18 May 2018 10:37:25 +0000 (12:37 +0200)]
use constants for IPC request types
Add a simple header with the constants as defines.
Use a simple awk script to translate this into a Perl module with the
constants exported. awk is far easier to understand and maintain than
h2ph or h2xs, and their output is quite a mess for such a trivial
thing, IMO.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
fix #1743: cluster create: default to ring0_addr for bindnet0
Else, if a separate network address was passed for ring0_addr but no
bindnet0 address was set, we used the wrong fallback.
Do not fall back to $local_node_ip, but always to $ring0_addr, which
itself falls back to the local node IP.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
pmxcfs: only exit parent when successfully started
Since systemd expects the parent to exit only when the service has
actually started, we need to wait for the child to reach the point
where it starts the FUSE loop, then signal the parent to exit and
write the PID file.
Without this, we had an issue where the ExecStartPost hook (which
runs pvecm updatecerts) did not run reliably, even though it is
necessary to set up the nodes/ dir in /etc/pve and generate the SSL
certificates.
This could also affect every service which has an
After=pve-cluster
Thomas Lamprecht [Tue, 27 Mar 2018 06:08:37 +0000 (08:08 +0200)]
API/Cluster: autoflush STDOUT for join and create
We're in a forked worker here, so STDOUT isn't connected to a
(pseudo) TTY directly, and perl flushes only when its internal buffer
is full.
Ensure each line gets flushed out to the API client to give
immediate feedback about the operation.
For example, without this our web UI's task viewer won't show
anything for quite a bit of time; you may even get logged out before
the flush from the Perl side happens, which is simply bad UX.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Thu, 29 Mar 2018 09:06:08 +0000 (11:06 +0200)]
pvecm join: also default to resolved IP with use_ssh param
We already switched to this behaviour in pvecm create and pvecm join
(with API), but did not change it for the case when a user requested
to use the old method to join with --use_ssh.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
cluster join: ensure updatecerts gets called on quorate cluster
We moved the start of pve-cluster together with the one of corosync
earlier, before the quorate check.
This meant that the 'pvecm updatecerts --silent' we call from
pve-cluster.service through ExecStartPost exited early, as the node
did not yet have quorum.
So factor the respective code out to the Cluster perl module and
call this function manually after we reached quorum.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
If we are not part of a cluster we do not need to worry about other
members messing with the config. But there may be local contenders,
e.g., two automation script instances started in parallel by mistake,
or two admin (sessions) which start a create or join cluster request
at the same time.
Reuse the local flock for this purpose.
lock_file silences an exception but does not alter it, so we die if
$@ is set, to ensure a worker gets to know that something bad
happened.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
api/cluster: add endpoint to GET cluster join information
Returns all information relevant for joining this cluster over the
currently connected node, securely over the API: address,
fingerprint, totem config section and (not directly needed, but
possibly useful) the cluster configuration digest.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Tue, 12 Dec 2017 16:15:56 +0000 (17:15 +0100)]
api/cluster: create cluster in forked worker
Creating a cluster may take a bit longer: we need to gather random
data for the corosync authkey, restart services and such.
Now that we're exposed in the API, the 30-second response limit from
pveproxy is a big reason to do this. But we also get a nice task log
entry with it.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Mon, 27 Nov 2017 11:53:46 +0000 (12:53 +0100)]
pvecm add: use API by default to join cluster
Default to using the API for the add-node procedure, but allow the
user to manually fall back to the legacy SSH method.
Also fall back if the API detected a not-up-to-date peer; this is
done by checking for the 501 HTTP_NOT_IMPLEMENTED response code.
This could be removed in a later major release, e.g. 6.0.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Mon, 27 Nov 2017 09:55:14 +0000 (10:55 +0100)]
api/cluster: add join endpoint
Add an endpoint to the API which allows joining an existing PVE
cluster using only the API instead of CLI tools (pvecm).
Use a worker, as this operation may take longer than 30 seconds.
With the worker we also get a task log entry/window for a UI for
free, allowing us to give better feedback.
The join helper will be reused by the CLI handler in a later patch.
It is based on the CLI handler's behaviour, but swaps out the SSH
parts with API calls.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Fri, 26 Jan 2018 10:40:29 +0000 (11:40 +0100)]
node add: factor out local joining steps
Factor out the code which finishes the join to a cluster on the
joinee side, after a cluster member approved the join request and
supplied us with the necessary information.
Will be used by API and the SSH join code paths.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Thu, 23 Nov 2017 13:29:37 +0000 (14:29 +0100)]
node add: factor out checks for joining
Factor out the code which checks if the node can join another
cluster. It will be used by the new API endpoint to join a cluster,
but also stays in the CLIHandler, as we keep the old legacy SSH
method for a bit.
This is not a completely 1:1 move, I changed:
* &$error(...) to $error->(...)
* removing a few empty lines, where code was so spread out that those
lines resulted in the opposite of what they intended, i.e., less
readability
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Mon, 22 Jan 2018 09:52:12 +0000 (10:52 +0100)]
avoid harmful '<>' pattern, explicitly read from STDIN
Fixes problems in CLIHandler using the code pattern:
while (my $line = <>) {
...
}
For why this causes only _now_ problems lets first look how <>
behaves:
"The null filehandle <> is special: [...] Input from <> comes either
from standard input, or from each file listed on the command line.
Here's how it works: the first time <> is evaluated, the @ARGV array
is checked, and if it is empty, $ARGV[0] is set to "-" , which when
opened gives you standard input. The @ARGV array is then processed
as a list of filenames." - 'perldoc perlop'
Recent changes in the CLIHandler code changed how we modified @ARGV.
Earlier we assumed that the first argument must be the command and
thus shifted it out of @ARGV; now we can have multiple levels of
(sub)commands. This change also altered how we handle @ARGV: we do
not shift anything out, but go through the arguments until we reach
the final command and copy the rest of @ARGV, as we know that these
must be the command's arguments.
For '<>' this means that @ARGV was still fully populated, and perl
tried to open each element as a file, which naturally failed.
Thus the change in pve-common only exposed this 'dangerous' code
pattern.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Fri, 22 Dec 2017 13:34:32 +0000 (14:34 +0100)]
add pmxcfs restart detection heuristic for IPCC
Allow clean pmxcfs restarts to be fully transparent for the
IPCC-using Perl stack above.
A restart of pmxcfs invalidates the connection cache; we only set the
cached connection to NULL in this case, and the next call to
ipcc_send_rec would connect anew.
Further, such a restart may need quite a bit of time (seconds).
Thus, write a status file to flag a possible restart when pmxcfs
terminates. Delete this flag file once we're up and ready again.
Error case handling is described further below.
If a new connection fails and this flag file exists, then retry
connecting for a certain period (for now, five seconds).
If a cached connection fails, always retry once, as every pmxcfs
restart makes the cached connection invalid even if IPCC would be
fully up and ready again; then also follow the connection-polling
heuristic if the restart flag exists, as new connections do.
We use the monotonic clock to avoid problems if the (system) time
changes and to keep things as easy as possible.
We delete the flag file if an IPCC call could not connect in the grace
period, but only if the file is still the same, i.e., no one else has
deleted and recreated it in the meantime (e.g. a second cfs restart).
This guarantees that IPCC calls try this heuristic only for a limited
time (5 seconds until the first failed one) if the cfs does not
start again.
Further, as the flag resides in /run/... - which is always a tmpfs
(thus in memory and thus cleaned upon reboot) - we cannot run into
leftover flag files on a node reset, e.g. one done by the HA watchdog
for self-fencing.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Wed, 20 Dec 2017 09:35:21 +0000 (10:35 +0100)]
postinst: stop LRM before CRM in workaround
May help in a single-node HA cluster, which, while not usable for
real HA, can still be found in the wild and makes sense as an
"ensure my VM/CT stays started" manager.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
ensure problematic ha service is stopped during update
Add a postinst file which stops the HA services, if running, before
it configures pve-cluster, and starts them again afterwards, if
enabled.
Do this only if the version installed before the upgrade is <= 2.0-3.
dpkg-query has Version and Config-Version; Version is already the new
unpacked version at this point, so we need to check both to catch all
cases.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
datacenter.cfg write: retransform migration property to string
We use parse_property_string in the parser to make life easier for
code working with the migration format, but we did not retransform
it back when writing datacenter.cfg
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
This controlled whether we use reload-or-restart or try-reload-or-restart.
They differ in the following way:
> reload-or-restart - Reload one or more units if possible, otherwise
> start or restart
>
> try-reload-or-restart - Reload one or more units if possible,
> otherwise (re)start if active
Under PVE we normally need a running SSH daemon for a node/cluster to
work; there is no case where it should be stopped, especially not for
this method, which is normally called when setting up or joining a
cluster.
So always use 'reload-or-restart'.
Thomas Lamprecht [Thu, 23 Nov 2017 11:12:05 +0000 (12:12 +0100)]
pvecm: add/delete: local lock & avoid problems on first node addition
cfs_lock is per node, thus we had the possibility of a node-addition
race if the process was started on the same node (e.g. by a
script/ansible/...).
So always request a local lock first, if that is acquired check how
many members currently reside in the cluster and then decide if we
can directly execute the code (single node cluster = no contenders)
or must hold the lock.
One may think that there remains a race when adding a node to single
node cluster, i.e., once the node is added it may itself be a target
for another joining node. But this cannot happen as we only tell the
joining node that it could be added once we already *have* added it
locally.
Besides the defense against a race if two users execute a node
addition to the same node at the same time, this also addresses an
issue where the cluster lock could not be removed after writing the
corosync conf, as pmxcfs and corosync triggered a config reload and
added the new node, which itself did not yet know that it was
accepted into the cluster. Thus, the former single-node cluster
expects two nodes but has only one for now, until the other node has
pulled the config and authkey and started up its cluster stack.
That resulted in a failing removal of the corosync lock, so adding
another node did not work until this lock timed out (~2 minutes).
While node additions are often separated by more than a 2-minute
interval, deployment helpers (or fast admins, for that matter) may
trigger this easily.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Thomas Lamprecht [Fri, 17 Nov 2017 14:04:21 +0000 (15:04 +0100)]
buildsys: autogenerate debug package and clean up unnecessary rules
Don't do manually what the deb helpers do automatically and better.
Autogenerate the debug package; it now includes only the debug
symbols, without effectively duplicating all executables and
libraries.
In the same step, add an install file which installs our sysctl
settings. This is done together as it allows skipping some
intermediate steps; also, the change is not too big, so it should
still be possible to see what's going on.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>