- 16 Jun, 2015 3 commits
-
-
Helga Velroyen authored
Signed-off-by:
Helga Velroyen <helgav@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Helga Velroyen authored
Signed-off-by:
Helga Velroyen <helgav@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Helga Velroyen authored
.. to prepare for the new release. Signed-off-by:
Helga Velroyen <helgav@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
- 15 Jun, 2015 3 commits
-
-
Helga Velroyen authored
* stable-2.14 (no changes) * stable-2.13 Mention migration change in NEWS Move misplaced NEWS entry Add protection against daemons that may already be listening Attempt to cleanup failed migrations using a pidfile Add utility that gets the full command line of a process Introduce socat as a way of doing xl migrations fix typo in UPGRADE notes During start, only update job file locally * stable-2.12 Fix tests for setting (shared) file storage directory Add missing call for setting shared file storage directory Update ganeti-luxid synopsis Update ganeti-mond synopsis Update ganeti-confd synopsis Update copyright statement Signed-off-by:
Helga Velroyen <helgav@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Helga Velroyen authored
* stable-2.13 Mention migration change in NEWS Move misplaced NEWS entry Add protection against daemons that may already be listening Attempt to cleanup failed migrations using a pidfile Add utility that gets the full command line of a process Introduce socat as a way of doing xl migrations fix typo in UPGRADE notes During start, only update job file locally * stable-2.12 Fix tests for setting (shared) file storage directory Add missing call for setting shared file storage directory Update ganeti-luxid synopsis Update ganeti-mond synopsis Update ganeti-confd synopsis Update copyright statement Conflicts: NEWS lib/cmdlib/cluster/verify.py Resolutions: NEWS: keep all released entries lib/cmdlib/cluster/verify.py: put the additional line that caused the confict into lib/cmdlib/cluster/__init__.py Signed-off-by:
Helga Velroyen <helgav@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Hrvoje Ribicic authored
* stable-2.12 Fix tests for setting (shared) file storage directory Add missing call for setting shared file storage directory Update ganeti-luxid synopsis Update ganeti-mond synopsis Update ganeti-confd synopsis Update copyright statement Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
- 11 Jun, 2015 6 commits
-
-
Hrvoje Ribicic authored
This patch adds information about the xl migration change to the NEWS file. Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Hrvoje Ribicic authored
A section meant for 2.13.0 was moved to a 2.12 release, and this patch fixes that. Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Hrvoje Ribicic authored
Should the migration port already be taken, Ganeti will try and start a socat daemon that will immediately die, leaving Ganeti to pipe the migration data into whatever process that happens to be listening. This patch prevents that from happening by checking if the socat daemon started by Ganeti is ready to accept the migration data. Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Hrvoje Ribicic authored
In the case that a listening socat daemon was started but the migration failed on the sending side, the daemon will stay in place and occupy the migration port forever. This patch attempts to remedy this by saving the PID of the daemon, and attempting to kill it when the next migration is started, provided the command line roughly matches our migration workflow. Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Hrvoje Ribicic authored
This patch provides a simple function which fetches the command line of a process given its PID, and some tests for it. It was introduced for safety reasons in introducing socat-based migration to our Xen-handling code. Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Hrvoje Ribicic authored
This patch introduces support for socat as a means of doing xl migrations. The primary reason for doing so is that Ganeti no longer handles SSH key distribution across nodes which are not master candidates. By relying on SSH as the only means of doing migrations, we could only migrate instances off of master candidates. Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
- 10 Jun, 2015 5 commits
-
-
Petr Pudlak authored
- Fix the test for setting file_storage_dir, which didn't check if the value was really set. - Add tests for shared_file_storage_dir, which were missing completely. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Petr Pudlak authored
With the call missing, it wasn't possible to change the directory after cluster initialization. Fixes #1101. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Klaus Aehlig authored
The options --syslog, --no-user-checks, --no-voting, and --yes-do-it were already described in the description. Add them to the synopsis as well. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
The -b option was already described in the description of the daemon, add it to the synopsis as well. While there, sort the synopsis to reflect the order in which the options are described; this is also consistent with the argument order in the synopsis of other Ganeti daemons. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
The options -p, -b, --syslog, and --no-user-check were already described in the description; add them to the synopsis as well. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
- 09 Jun, 2015 1 commit
-
-
Klaus Aehlig authored
Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
- 08 Jun, 2015 3 commits
-
-
Klaus Aehlig authored
The first thing the newly forked job will do anyway is to update its job file as the first op-code is now waiting. It is enough that this information is replicated to all master candidates. Note that a local change of the livelock file will not be helpful to other master candidates, as the livelock has only a meaning on the current node anyway. This seemingly little saving of two replications is significant, however, as it happens under the fork lock by which we avoid two forks at the same time, as this can cause problems with the Haskell runtime. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com> Cherry-picked-from: 7684a501Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Hrvoje Ribicic <riba@google.com>
-
Petr Pudlak authored
.. so it retries correctly when communicating on a non-master node. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Petr Pudlak authored
If a communication failure occurred and the caller was not running on the master node, Transport assumed that this itself was the cause of the error condition. However, for communication with the metadata daemon we need to support non-master nodes as well. Add a parameter that allows to use the class on non-master nodes. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
- 02 Jun, 2015 5 commits
-
-
Klaus Aehlig authored
Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Merge c2607a60 dropped the fix of 4ca31587 instead of following the code move of applyMoveEx. So re-add that patch at the new location of applyMoveEx. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Petr Pudlak authored
* stable-2.14 Suffix bump for 2.14.0 Prepare NEWS for 2.14.0 release Get QuickCheck 2.7 compatibility For instances use the real instance disk template Log iAllocator request at debug level Do not assume all requests are relocation requests In opportunistic locking, update NODE_RES locks after nodes Suffix bump for 2.14.0~rc2 Prepare NEWS for 2.14.0~rc2 * stable-2.13 Increase waiting time to avoid races Avoid watcher interaction in TestFilterRateLimit Avoid races in TestAdHocReasonRateLimit Avoid races in TestFilterRateLimit * stable-2.12 Make WConfD's updateLocksWaiting safe Tests specifying safeUpdateLocksWaiting Provide a repeatable version of updateLocksWaiting Verify that updateLocks is idempotent Always accept no-op requests Allow unconditional failovers off offline nodes Remove now unused variable Fix bug in ssconf comparison, disable it for vcluster QA: test renewing the cluster certificate only QA: Assert equality of ssconf_master_candidate_certs QA: Add more verify steps in renew crypto QA Add a unit test for the Gluster storage type Add missing Gluster storage verification checks Add Gluster type to the node storage reporting set Introduce the Gluster storage type Make xend-config.sxp optional Revision bump to 2.12.4 Update NEWS for 2.12.4 Ignore *.dyn* files generated by GHC 7.8 Replace HFLAGS_NOPROF with HFLAGS_DYNAMIC in the Makefile * stable-2.11 (no news) * stable-2.10 Substitute 'suffix' for 'revision' Check for gnt-cluster before running gnt-cluster upgrade Conflicts: NEWS - keep both 2.14 and 2.15 parts configure.ac - keep 2.15 version src/Ganeti/HTools/Cluster.hs - remove applyMoveEx, which was moved to Ganeti.HTools.Cluster.Moves Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Klaus Aehlig authored
Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Set the release date and describe the fixes since the last release candidate. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
- 01 Jun, 2015 9 commits
-
-
Klaus Aehlig authored
Replace deprecated `printTestCase` by its replacement `counterexample`. Note that commit 077c415a added a CPP-guarded fallback for QuickCheck < 2.7. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
* stable-2.13 (no changes) * stable-2.14 Make WConfD's updateLocksWaiting safe Tests specifying safeUpdateLocksWaiting Provide a repeatable version of updateLocksWaiting Verify that updateLocks is idempotent Always accept no-op requests Allow unconditional failovers off offline nodes Remove now unused variable Fix bug in ssconf comparison, disable it for vcluster QA: test renewing the cluster certificate only QA: Assert equality of ssconf_master_candidate_certs QA: Add more verify steps in renew crypto QA * stable-2.11 (no changes) * stable-2.10 Substitute 'suffix' for 'revision' Conflicts: src/Ganeti/HTools/Node.hs (apply condition at new code location) Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Klaus Aehlig authored
* stable-2.12 Make WConfD's updateLocksWaiting safe Tests specifying safeUpdateLocksWaiting Provide a repeatable version of updateLocksWaiting Verify that updateLocks is idempotent Always accept no-op requests Allow unconditional failovers off offline nodes Remove now unused variable Fix bug in ssconf comparison, disable it for vcluster QA: test renewing the cluster certificate only QA: Assert equality of ssconf_master_candidate_certs QA: Add more verify steps in renew crypto QA * stable-2.11 (no changes) * stable-2.10 Substitute 'suffix' for 'revision' Conflicts: qa/qa_cluster.py src/Ganeti/HTools/Cluster.hs src/Ganeti/HTools/Node.hs (All trivial by taking the union of the changes.) Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Klaus Aehlig authored
* stable-2.11 (no changes) * stable-2.10 Substitute 'suffix' for 'revision' Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Klaus Aehlig authored
...so that the implicit retry on connection failure is handled gracefully. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Add tests that verify the defining properties of safeUpdateLocksWaiting. 1.) If the state contains no pending request by the requester, then updateLocksWaiting and safeUpdateLocksWaiting coincide. 2.) safeUpdateLocksWaiting is idempotent on all states. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
With our timeouts on connections, we have to deal with connections being interrupted at any time. Therefore, we provide a repeatable version of updateLocksWaiting that gracefully ignores requests that have already been recorded. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
...so that it can be repeated, if necessary. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
In order to have update requests repeatable, always accept requests that do not require any change to the state. Note that this is not implied by the current definition, as the request might ask for two locks at different level, and thus the repetition would violate lock order. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
- 29 May, 2015 5 commits
-
-
Klaus Aehlig authored
Normally, we should not place instances on nodes that do not have enough disks. However, there is one exception: if we failover an instance from an offline node, that node can well be secondary of that instance---the fact that it is currently primary proves it has enough disks space. The reason why we have to handle that case special is that if a node is offline, we sometimes cannot determine the amount of disk available, hence the conservative estimation is 0. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Hrvoje Ribicic <riba@google.com>
-
Hrvoje Ribicic authored
* stable-2.10 Substitute 'suffix' for 'revision' Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Klaus Aehlig authored
...instead of assuming there is a single template that fits all instances of the cluster. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Hrvoje Ribicic <riba@google.com>
-
Klaus Aehlig authored
Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Hrvoje Ribicic <riba@google.com>
-
Klaus Aehlig authored
Commit 717ede37 incorrectly assumed that each iallocator request is about a specific instance and a specific disk template. Both assumptions are wrong (consider e.g., a node evacuation request) yielding a type error that, unfortunately, in python is only discovered at run time. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Hrvoje Ribicic <riba@google.com>
-