git.proxmox.com Git - pve-cluster.git/log

]> git.proxmox.com Git - pve-cluster.git/log

projects / pve-cluster.git / log

commit | commitdiff | tree

Wolfgang Bumiller [Thu, 9 Nov 2017 11:12:26 +0000 (12:12 +0100)]

deps: we now break pve-ha-manager < 2.0-4

Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Thu, 9 Nov 2017 09:36:51 +0000 (10:36 +0100)]

cfs_lock: subtract sleep time from rest timeout

We take the left-over timeout returned from alarm() and then
sleep for a second, so when continuing the alarm timeout we
we need to subtract that second for consistency.

Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Thu, 9 Nov 2017 08:47:27 +0000 (09:47 +0100)]

cfs_lock: save and restore outer alarm

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Thu, 9 Nov 2017 08:47:26 +0000 (09:47 +0100)]

cfs_lock: always include lockid in error

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Thu, 9 Nov 2017 08:47:25 +0000 (09:47 +0100)]

cfs_lock: swap checks for specific errors with $got_lock

We checked if a specific error was set or, respectively, not set to
know if we got the lock or not.
The check if we may unlock again was negated and thus could lead to
problems, in specific - rather unlikely - cases.

Use the by the previous patch added $got_lock variable, which only
gets set when we really got the lock, instead.

While refactoring for the new variable, set the $noerr parameter of
check_cfs_quorum() as we do not want to die here.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Thu, 9 Nov 2017 08:47:24 +0000 (09:47 +0100)]

cfs_lock: address race where alarm triggers with lock accquired

As mkdir can possibly hang forever we need to enforce a timeout on
it. But this was made in such a way so that a small time window
existed where the lock could be acquired successfully but the alarm
triggered still, leaving around an unused lock for 120 seconds.

Wrap only the mkdir call itself in an alarm and save its result
directly in a $got_lock variable, this minimizes the window as far as
possible from the perl side.

This is also easier to track for humans reading the code and should
cope better against code changes, e.g., it does not breaks just if an
error message typo got corrected a few lines above.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Fabian Grünbichler [Tue, 17 Oct 2017 13:07:09 +0000 (15:07 +0200)]

bump version to 5.0-15

commit | commitdiff | tree

Wolfgang Bumiller [Wed, 11 Oct 2017 12:24:56 +0000 (14:24 +0200)]

cluster: improve error handling when reading files

When querying file contents via IPC we return undef if the
file does not exist, but also on any other error. This is
potentially problematic as the ipcc_send_rec() xs function
returns undef on actual errors as well, while setting $!
(errno).

It's better to die in cases other than ENOENT. Before this,
pvesr would assume an empty replication config and an empty
vm list if pmxcfs wasn't running, which could then clear out
the entire local replication state file.

Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Wed, 11 Oct 2017 12:24:57 +0000 (14:24 +0200)]

cluster: cfs_update: option to die rather than warn

It can be useful to know whether we actually have an empty
vm list or whether the last cfs_update call simply failed.
Previously this only warned.

This way we can avoid a nasty type of race condition. For
instance in pvesr where it's possible that the vm list query
fails while everything else worked (eg. if the pmxcfs was
just starting up, or died between the queries), in which
case it would assume there are no guests and the
purge-old-states step would clear out the entire local state
file.

Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Thu, 21 Sep 2017 12:26:31 +0000 (14:26 +0200)]

bump version to 5.0-14

commit | commitdiff | tree

Thomas Lamprecht [Thu, 21 Sep 2017 12:08:00 +0000 (14:08 +0200)]

cfs-func-plug: use RW lock for safe cached data access

fuse may spawn multiple threads if there are concurrent accesses.

Our virtual files, e.g. ".members", ".rrd", are registered over our
"func" cfs plug which is a bit special.

For each unique virtual file there exists a single cfs_plug_func_t
instance, shared between all threads.
As we directly operated unlocked on members of this structure
parallel accesses raced between each other.
This could result in quite visible problems like a crash after a
double free (Bug 1504) or in less noticeable effects where one thread
may read from an inconsistent, or already freed memory region.

Add a Reader/Writer lock to efficiently address this problem.
Other plugs implement more functions and use a mutex to ensure
consistency and thus do not have this problem.

Fixes: #1504
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Thu, 21 Sep 2017 07:44:29 +0000 (09:44 +0200)]

bump version to 5.0-13

commit | commitdiff | tree

Thomas Lamprecht [Wed, 20 Sep 2017 13:11:05 +0000 (15:11 +0200)]

test: add test for legacy corosync.conf

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 20 Sep 2017 13:11:04 +0000 (15:11 +0200)]

corosync: add atomic_write_conf and cleanup

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 20 Sep 2017 13:11:03 +0000 (15:11 +0200)]

corosync: transform config to allow easier access

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 20 Sep 2017 13:11:02 +0000 (15:11 +0200)]

corosync config parser: move to hash format

The old parser itself was simple and easy but resulted in quite a bit
of headache when changing corosync config sections, especially if
multiple section levelsshould be touched.

Move to a more practical internal format which represents the
corosync configuration in hash

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Wed, 20 Sep 2017 12:17:59 +0000 (14:17 +0200)]

buildsys: remove autogenerated files

commit | commitdiff | tree

Wolfgang Bumiller [Wed, 20 Sep 2017 12:15:31 +0000 (14:15 +0200)]

pvecm addnode: pass code reference correctly

commit | commitdiff | tree

Thomas Lamprecht [Mon, 18 Sep 2017 08:32:53 +0000 (10:32 +0200)]

pvecm: import often needed run_command

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Mon, 18 Sep 2017 08:32:52 +0000 (10:32 +0200)]

pvecm: remove Data::Dumper

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Thu, 14 Sep 2017 07:51:51 +0000 (09:51 +0200)]

fixup: escape @ in double quoted string

commit | commitdiff | tree

Fabian Grünbichler [Wed, 31 May 2017 07:38:00 +0000 (09:38 +0200)]

update SSH Ciphers for Debian Stretch

blowfish, 3des and arcfour are not enabled by default on the
server side anyway.

on most hardware, AES is about 3 times faster than Chacha20
because of hardware accelerated AES, hence the changed order
of preference compared to the default.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>

commit | commitdiff | tree

Alwin Antreich [Wed, 23 Aug 2017 08:49:29 +0000 (10:49 +0200)]

fix #1486 pmxcfs spelling mistake

Signed-off-by: Alwin Antreich <a.antreich@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Thu, 3 Aug 2017 15:11:18 +0000 (17:11 +0200)]

pvecm mtunnel: factor out run command

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Fri, 18 Aug 2017 09:21:18 +0000 (11:21 +0200)]

limit tasklist to the maximal pmxcfs status entry size

We tried to limit the size of the tasklist by including non-running
task only if we have less than 25 entries. A reason, among others,
was that a single status entry in the cfs_status.kvhash is limited to
32 KiB.

The "max. 25 entry" heuristic assumes that entries are small, which
is also the norm. But on failed tasks, e.g. a Qemu VM with a
problematic command line, is far longer than the usual task entry.

This led to a situation where the last 25 task were bigger than
32KiB, so the ipcc call to the pmxcfs failed with EFBIG.
This aborted then every new task run with fork_worker, and could
render a node partially unusable until "/var/log/pve/tasks/active"
got truncated.

To recreate this issue quite fast do:

# qm create 11109 --args "'$(dd if=/dev/urandom bs=1024 count=1 2>/dev/null | base64 -w 0)'"
# while true; do qm start 11109; done

You should see soon a "ipcc_send_rec failed: File too large"
After this all new task fail, even if they could succeed. pvestatd
also fails to broadcast the tasklist now. To get out of this do:

To address this check the length of the serialized list and remove
elements from its end until we do not exceed the size limit anymore.

Current running tasks and chronological newer ones will get
prioritized.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Fri, 23 Jun 2017 08:37:37 +0000 (10:37 +0200)]

cleanup outdated build files/links

All but the ChangeLog file are dead links, the correct and current
ones will get generated by auototools in the build directory, so
remove them here.

This ChangeLog file was unused for quite some years so remove it too.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Dominik Csapak [Mon, 7 Aug 2017 14:04:24 +0000 (16:04 +0200)]

fix #1472: fix rrd file path

upstream rrd-tools changed the syntax for the perl binding,
we now have to supply '-' as the path despite what the documentation
says (it says to supply an empty path, what we did)

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 2 Aug 2017 11:27:30 +0000 (13:27 +0200)]

pvecm delnode: pass code reference correctly

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Tue, 25 Jul 2017 11:45:07 +0000 (13:45 +0200)]

ipcc_send_rec*: include msgid in error

else we often may have no idea which request failed at all...

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Thu, 13 Jul 2017 13:01:02 +0000 (15:01 +0200)]

pvecm: lock corosync config on addition and deletion

This avoids potentiall races which would lead to an inconsistent
corosync config.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Dietmar Maurer [Wed, 12 Jul 2017 10:00:54 +0000 (12:00 +0200)]

bump version to 5.0-12

commit | commitdiff | tree

Thomas Lamprecht [Wed, 12 Jul 2017 09:53:16 +0000 (11:53 +0200)]

ssh_merge_known_hosts: also add entry if current sshkey does not match

this ensures that our current valid SSH keys gets added even if
another key on the same hostname exists already for some reasons.
The code path which handles hashed host names has this behavior
already since the beginning, so let the new non-hashed code act the
same way.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Dietmar Maurer [Mon, 10 Jul 2017 06:54:38 +0000 (08:54 +0200)]

bump version to 5.0-11

commit | commitdiff | tree

Thomas Lamprecht [Thu, 6 Jul 2017 11:19:38 +0000 (13:19 +0200)]

ssh_merge_known_hosts: refactor and simplify

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Thu, 6 Jul 2017 11:19:37 +0000 (13:19 +0200)]

ssh_merge_known_hosts: address auth failure problem

On node addition we create two entries in the cluster-wide known_host
file with our public host key, one with the pve-localhost bound IP
address and one with our nodename.

SSH always lower cases hostnames or their aliases before comparing
them to the known host entry. This is allowed as per RFC 1035,
Section "2.3.3 Character Case" [1].
No problems are caused by this, if known_host entries are not hashed,
as both, the original value and the now specified value can be
compared canonically in an case insensitive matter.

But, if a known_host entry is hashed we have no access to its
original plain text value – and we cannot do a case insensitive
comparison anymore. SSH thus expects that the original value was
transformed to lowercase before hashing. We did not follow this
convention when we added node keys to the clusters known_host file as
we kept the case. This resulted in problems when a user set up nodes
with names containing uppercase letters.[2]

Instead of transforming everything to lowercase on hashing lets omit
hashing known_host entries completely.
To explain why this can be done safely – without security
implications - we need to state the reason why hashing those entries
would gain some security in the first place. It wants to prevent
information leakage for the case an local account gets taken over by
an attacker. If not hashed, the attacker could use the known_host
file to see which other host the user connected to.
This could "just" leak information on what a user does but could also
make it easier to attacked the listed hosts too - e.g. if the user
had an unprotected SSH key which the hosts trust. As there are other
ways to get a list of hosts where an user connected too
(.bash_history, monitoring outgoing traffic, ...) hashing known_host
entries itself provides just a small hurdle of obfuscation in the
case an account got already taken over. And this is the case for an
normal, unprivileged user account.
In the case of PVE hashing the used known_host file brings absolutely
*no* advantage. First, the affected known_host file is located under
/etc/pve/priv where only root has read access. Thus, an attacker
would need to take over root to get the known_hosts in the first
place. If he did take over root all hope is lost one way or another.
Even if known_host was world readable, hashing would not do much.
As and attacker would know that the nodes IPs are entries he could
use /etc/network/interfaces to get the subnet of interest and just
bruteforce all entries until we got all node IPs - he normally would
only need to iterate through 8-16 bit in an IPv4 network.
Even this could be simplified by just port scanning the range for an
open port 8006, to get all PVE nodes in a subnet.
Further /etc/hosts (world readable) often provides the information
which hashing known_hosts tries to hide, as does /etc/pve/.members
(readable by: www-data,root)

So, to summarize, while for an unprivileged user it may add a slight
defense against a information leak it really doesn't for a PVE
systems root/cluster members - all information which it tries to hide
is accessible in various other ways.

Add new entries in plain text, add checks if entries are already
there for the plain text case too. Further use lowercase comparison
as openssh does.
If hashed entries are already there allow them still, but ensure that
a lowercase'd version is saved to avoid authentication failed
problems.

[1]: https://tools.ietf.org/html/rfc1035#section-2.3.3
[2]: https://forum.proxmox.com/threads/35473

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Mon, 26 Jun 2017 12:10:57 +0000 (14:10 +0200)]

add simple corosync config parser self check

Each test reads and parses a config "writes" it again and then
re-parses it.
Then both the parsed hash structures and the raw config get compared
This is cheap and should catch simple regressions in either the
parser or writer, as currently we have no safety net that
modifications on either one didn't cause regressions.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Dietmar Maurer [Thu, 22 Jun 2017 06:29:49 +0000 (08:29 +0200)]

bump version to 5.0-10

commit | commitdiff | tree

Thomas Lamprecht [Tue, 13 Jun 2017 07:25:34 +0000 (09:25 +0200)]

pvecm delnode: prevent deleting current node

Else corosync really delete himself from the cluster which pmxcfs
cannot really handle and this is a bad idea in general.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Tue, 13 Jun 2017 07:25:33 +0000 (09:25 +0200)]

factor out corosync methods to own module

PVE::Cluster is already quite big, the corosync part is ~250 lines
long of 1900 total. Further the corosync part is only needed in a few
specialised places (API2/ClusterConfig and CLI/pvecm).
This speaks for factoring out this part in a separate perl module as
most modules which use Cluster load the corosync parts for no reason.
Further, cluster handling through API may even add more corosync
related methods.

Create a new Corosync perl module and move all relevant methods over.
Method names lost the 'corosync_' prefix, not really needed anymore
as they already lives in the 'Corosync' namespace now.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Fabian Grünbichler [Tue, 13 Jun 2017 13:22:05 +0000 (15:22 +0200)]

pmxcfs: fix segfault in cfs_create_status_msg

it's possible to request a status message for a no longer
existing nodename in a standalone setting (e.g., node was
renamed after pmxcfs was started).

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 6 Jun 2017 08:03:57 +0000 (10:03 +0200)]

add sshinfo_to_command_base

required for rsync's --rsh

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 30 May 2017 13:30:12 +0000 (15:30 +0200)]

mtunnel: add -run-command for insecure pipes

While we still need ssh to initiate the command, the data
can then be sent insecurely to the IP and port the mtunnel
command tells us to connect to.

commit | commitdiff | tree

Wolfgang Bumiller [Thu, 1 Jun 2017 08:06:10 +0000 (10:06 +0200)]

bump version to 5.0-9

commit | commitdiff | tree

Wolfgang Bumiller [Thu, 1 Jun 2017 07:34:34 +0000 (09:34 +0200)]

pmxcfs: don't warn when calling destructors with NULL

Similar to free() & friends, destructors should simply
return in that case.

Fixes a0fce192be37 (pmxcfs: use memdb_tree_entry_free())

commit | commitdiff | tree

Dietmar Maurer [Wed, 31 May 2017 07:11:25 +0000 (09:11 +0200)]

bump version to 5.0-8

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 30 May 2017 13:30:11 +0000 (15:30 +0200)]

sshinfo: add the network cidr

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 16 May 2017 09:32:41 +0000 (11:32 +0200)]

Fix #1383: pmxcfs: use memdb_tree_entry_free()

Use the right destructor instead of g_free(), as it may
contain another data pointer which needs freeing.

commit | commitdiff | tree

Wolfgang Bumiller [Mon, 22 May 2017 08:29:58 +0000 (10:29 +0200)]

Cluster.pm: add get_ssh_info and ssh_info_to_command

To get a node's address info optionally inside a specified
network (eg. migration_network), and a standardized way to
create an SSH command from this info.

commit | commitdiff | tree

Dietmar Maurer [Tue, 9 May 2017 10:43:34 +0000 (12:43 +0200)]

add replication.cfg to observed files

commit | commitdiff | tree

Fabian Grünbichler [Mon, 15 May 2017 12:59:15 +0000 (14:59 +0200)]

bump version to 5.0-7

commit | commitdiff | tree

Dietmar Maurer [Mon, 8 May 2017 09:33:20 +0000 (11:33 +0200)]

Revert "Add storage_replication_network to datacenter.cfg"

This reverts commit 1341b8fe392c4d3e6cc74e6ba4ff68bc32821195.

We want to use the migration network settings instead.

commit | commitdiff | tree

Thomas Lamprecht [Tue, 2 May 2017 09:51:22 +0000 (11:51 +0200)]

pvecm add: fix #1369 - re-allow using hostnames for ringX_addr

If an user passed a hostname as ring0_addr or ring1_addr the check_ip
checked failed as it implicitly assumed IPs even if we allowed a
general address (i.e. IP or hostname) as a format for those
properties.

Fixes: #1369
Reported here: https://forum.proxmox.com/threads/34342/

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Tue, 2 May 2017 09:51:21 +0000 (11:51 +0200)]

remote_node_ip: replace fallback method with new PVE::Network helper

Improve code reuse.

Note that the wantarray check exists in the used helper, so the
return signature of remote_node_ip stayed the same here

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Tue, 2 May 2017 09:51:20 +0000 (11:51 +0200)]

remote_node_ip: use same return signature for both branches

We have two return statements in the remote_node_ip submethod, one
checked if we are in list context and adapt the returning values
accordingly and one just returned a list, independent of the
context.
Adapt the second one and check the context there.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Dietmar Maurer [Wed, 3 May 2017 05:42:44 +0000 (07:42 +0200)]

bump version to 5.0-6

commit | commitdiff | tree

Dominik Csapak [Thu, 13 Apr 2017 09:35:04 +0000 (11:35 +0200)]

remove postinst script

we only executed a 'pvecm updatecerts --silent there', but we do this
already in the systemd service in ExecStartPost, so
this is unnecessary

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Fri, 28 Apr 2017 11:59:04 +0000 (13:59 +0200)]

bump version to 5.0-5

commit | commitdiff | tree

Wolfgang Link [Mon, 24 Apr 2017 15:15:24 +0000 (17:15 +0200)]

Add storage_replication_network to datacenter.cfg

This parameter will define the network fore the storage replication.

commit | commitdiff | tree

Dominik Csapak [Fri, 14 Apr 2017 15:07:46 +0000 (17:07 +0200)]

fix file permission check in chmod

since mode_t has additional bits set for file mode (see stat(2) ),
we have to ignore those, or we never can set the mode

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>

commit | commitdiff | tree

Fabian Grünbichler [Mon, 10 Apr 2017 14:02:15 +0000 (16:02 +0200)]

bump version to 5.0-4

commit | commitdiff | tree

Dominik Csapak [Tue, 4 Apr 2017 12:40:58 +0000 (14:40 +0200)]

change installarchlib to vendorarch

installarchlib is
/usr/lib/<arch>/perl/5.24
which is only a symlink provided by libperl5.24 and not suited
to install files in it directly

vendorarch is
/usr/lib/<arch>/perl5/5.24
which is the correct location for installing arch libraries
(we already use this in librados2-perl)

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>

commit | commitdiff | tree

Dominik Csapak [Tue, 4 Apr 2017 12:40:57 +0000 (14:40 +0200)]

remove autogenerated makefiles

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>

commit | commitdiff | tree

Stefan Priebe [Tue, 4 Apr 2017 14:43:31 +0000 (16:43 +0200)]

implement chown and chmod for user root group www-data and perm 0640

This allows us to use management software for files inside of /etc/pve.
e.g. saltstack which rely on being able to set uid,gid and chmod

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Signed-off-by: Stefan Priebe <s.priebe@profihost.ag>

commit | commitdiff | tree

Dietmar Maurer [Fri, 17 Mar 2017 10:59:40 +0000 (11:59 +0100)]

bump version to 5.0-3

commit | commitdiff | tree

Dietmar Maurer [Fri, 17 Mar 2017 10:58:40 +0000 (11:58 +0100)]

rrd_dump: filter undefined values encodes as 'U'

commit | commitdiff | tree

Fabian Grünbichler [Mon, 13 Mar 2017 12:23:27 +0000 (13:23 +0100)]

bump version to 5.0-2

commit | commitdiff | tree

Fabian Grünbichler [Fri, 10 Mar 2017 12:03:40 +0000 (13:03 +0100)]

bump version to 5.0-1

commit | commitdiff | tree

Fabian Grünbichler [Fri, 10 Mar 2017 12:01:39 +0000 (13:01 +0100)]

buildsys: update make upload target for stretch

commit | commitdiff | tree

Fabian Grünbichler [Fri, 10 Mar 2017 12:00:43 +0000 (13:00 +0100)]

update corosync dependencies

commit | commitdiff | tree

Emmanuel Kasper [Mon, 6 Mar 2017 10:42:30 +0000 (11:42 +0100)]

Require Sys.Audit to read the cluster configuration

Up to now only root could see the corosync cluster config.

Sys.Audit is the same permission required
for reading the HA Config and the HA Resources Config.

commit | commitdiff | tree

Fabian Grünbichler [Tue, 7 Mar 2017 07:40:35 +0000 (08:40 +0100)]

buildsys: reformat (build-)depends

commit | commitdiff | tree

Dominik Csapak [Fri, 3 Mar 2017 08:17:58 +0000 (09:17 +0100)]

use unsigned long for strtoul result

strtoul gives back an unsigned long int, which may or may not be wider
than a guint32 (depending on the platform)

when it is wider, the assignment would parse vmids bigger than 2^32 but
truncate them, giving back an invalid vmid

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>

commit | commitdiff | tree

Dietmar Maurer [Tue, 28 Feb 2017 11:03:30 +0000 (12:03 +0100)]

bump version to 4.0-49

commit | commitdiff | tree

Thomas Lamprecht [Wed, 22 Feb 2017 15:59:11 +0000 (16:59 +0100)]

pvecm add: assert that ringX IPs are available on node add

If 'ringX_addr' parameters are used on adding a node to a cluster
check if those addresses are actually configured on the to-be-added
node. It makes no sense that the address is not or multiple times
configured.

This prevents a node in limbo, waiting for quorum (if it was the
second node in a cluster, even two node would be in the no-quorum
limbo) where manual pmxcfs kills, local starts and manual
configuration edits which may need to get manually synced to other
cluster members are needed.

The check does not cost much and gets only made on node additions, so
assert with our get_local_ip_from_cidr method that the IP is
configured on any interface.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 22 Feb 2017 15:59:10 +0000 (16:59 +0100)]

pvecm add: report all errors found at once

Else only the first error got reported and we had no idea what else
was possible wrong.

I'll also use the $err method more in next commits

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 22 Feb 2017 15:59:09 +0000 (16:59 +0100)]

pvecm addnode: ensure ring1_addr is set if ring 1 is configured

Else the joining node will not be able to work correctly.
Also improve the respective error messages.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 22 Feb 2017 15:59:08 +0000 (16:59 +0100)]

pvecm addnode: ensure ring address isn't already used by cluster

If someone enters the wrong address by accident when adding a node it
may cause havoc in the cluster (meaning a reset of the whole cluster
when HA is used, may even happen more often during the recovery
tries. Also a whole lot of problems get triggered in gneral, even
witouth HA).

Further, user get into a hard to repair situation where a layman may
not be able to fix it by hand even when given directions by an
experienced user.

This is a really bad outcome for such a small and easy to make
mistake, so just make a small check and assert that the requested IPs
are not used by any node on any ring in the cluster configuration.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 22 Feb 2017 15:59:07 +0000 (16:59 +0100)]

pvecm addnode: error out on interactive call

addnode is thought to be used by the `add` command only.
So check if STDIN or STOUT are connected to a tty and exit with an
error message if this is the case.
The force flag allows overwriting this check.

Fixes bug #294

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 22 Feb 2017 15:59:06 +0000 (16:59 +0100)]

pvecm create: remove rrp_mode parameter

I detected a bug where we overwrote the whole $interfaces variable
(and so all interfaces from the corosync config) if the 'rrp_mode'
param was set.

While this would be easy to with by changing the line to
$interfaces .= ...
I removed the whole rrp_mode parameter instead.

As:
a) I've seen no one running into this bug, so this parameter was not
   really used either way.
b) only the 'passive' is supported and works, 'active' has a whole
   lot of problems. If someone really wants it he should edit the
   corosync config file to achieve this

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 22 Feb 2017 15:59:05 +0000 (16:59 +0100)]

pvecm: small cleanup

clvm is was used in 3.4 and earlier, it won't come back anytime soon

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Thomas Lamprecht [Wed, 22 Feb 2017 15:59:04 +0000 (16:59 +0100)]

pvecm add: fix check if corosync alread runs

`corosync-quorumtool` exit with 1 (CS_OK) if corosync runs and is
quorate.
Use `corosync-quorumtool -l` (list nodes) instead, this returns
1 if corosync does not run
0 if corosync runs, independent if a cluster is quorate or not.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Emmanuel Kasper [Thu, 16 Feb 2017 15:34:16 +0000 (16:34 +0100)]

Remove depency to libxml-parser-perl

This xml parser was added to parser RHCM cluster.conf, we don't parse this
anymore.

commit | commitdiff | tree

Thomas Lamprecht [Wed, 8 Feb 2017 10:24:55 +0000 (11:24 +0100)]

buildsys: write control file into build directory

Else the first make {deb,dinstall} from a clean repo fails as we
generated the debian control file to the source debian/ folder.
Just write it directly to the build/debian directory, so we do not
clutter the source directory and build always with the up to date
control file.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 7 Feb 2017 09:50:13 +0000 (10:50 +0100)]

buildsys: don't include autogenerated files

they're usually outdated

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 7 Feb 2017 09:49:49 +0000 (10:49 +0100)]

buildsys: remove gthread dependency

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 7 Feb 2017 09:46:22 +0000 (10:46 +0100)]

make clean: remove *.buildinfo

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 7 Feb 2017 09:45:06 +0000 (10:45 +0100)]

buildsys: shlibs:Depends finds librrd on its own

On stretch we have librrd8 instead of librrd4.

commit | commitdiff | tree

Wolfgang Bumiller [Fri, 3 Feb 2017 15:37:25 +0000 (16:37 +0100)]

buildsys: generate perlapi dependency version

commit | commitdiff | tree

Wolfgang Bumiller [Fri, 3 Feb 2017 15:24:46 +0000 (16:24 +0100)]

buildsys: depend on lsb-base

shipping init scripts without this dependency is considered
an error by lintian in stretch

commit | commitdiff | tree

Wolfgang Bumiller [Tue, 31 Jan 2017 10:15:20 +0000 (11:15 +0100)]

buildsys: make perl find the local IPCC.so

Otherwise the checks depend on an installed version of the
package.

commit | commitdiff | tree

Wolfgang Bumiller [Fri, 3 Feb 2017 15:15:34 +0000 (16:15 +0100)]

buildsys: missing build dependencies

commit | commitdiff | tree

Wolfgang Bumiller [Fri, 3 Feb 2017 14:31:30 +0000 (15:31 +0100)]

buildsys: make job safety and old svn cruft removal

commit | commitdiff | tree

Dietmar Maurer [Thu, 19 Jan 2017 08:51:04 +0000 (09:51 +0100)]

also update PVE/Makefile.in

commit | commitdiff | tree

Dietmar Maurer [Tue, 29 Nov 2016 11:00:57 +0000 (12:00 +0100)]

bump version to 4.0-48

commit | commitdiff | tree

Dietmar Maurer [Tue, 29 Nov 2016 10:51:38 +0000 (11:51 +0100)]

add API class for cluster configuration

Read-only for now. addnode/delnode implementation will follow.

commit | commitdiff | tree

Dietmar Maurer [Tue, 29 Nov 2016 10:21:16 +0000 (11:21 +0100)]

move corosync config helpers to PVE::Cluster

commit | commitdiff | tree

Dietmar Maurer [Tue, 29 Nov 2016 06:44:46 +0000 (07:44 +0100)]

cleanup: delete trailing whitespace

commit | commitdiff | tree

Wolfgang Bumiller [Wed, 9 Nov 2016 08:15:56 +0000 (09:15 +0100)]

Fix #1199: pmxcfs: vmlist cache update condition in rename

rename() wrongly used the vmid filled in by
path_contain_vm_config() as a condition for whether to
update the vmlist cache rather than the returned nodename.

This caused a rename in any folder of a file whose name
was a number followed by '.conf' to remove the corresponding
vmid from the vmlist cache.

commit | commitdiff | tree

Wolfgang Bumiller [Wed, 9 Nov 2016 08:15:55 +0000 (09:15 +0100)]

pmxcfs: cleanup: remove unnecessary checks before free

commit | commitdiff | tree

Wolfgang Bumiller [Wed, 9 Nov 2016 08:15:54 +0000 (09:15 +0100)]

pmxcfs: cleanup

Cluster FS and Tools

RSS Atom