- 29 Apr, 2014 2 commits
-
-
Petr Pudlak authored
Apparently due to some library bug, forking sometimes fails: The new process is running, but it doesn't start executing. Therefore we retry the attempt several times. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Petr Pudlak authored
In particular, functions for retrying a MonadPlus action: It is repeated until it returns a valid result. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
- 28 Apr, 2014 8 commits
-
-
Klaus Aehlig authored
* stable-2.11 Avoid assertIn Be more explicit about the scope of gnt-cluster upgrade Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Hrvoje Ribicic <riba@google.com>
-
Klaus Aehlig authored
...as it is only present in python version 2.7 and higher. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Add an explicit statement to the documentation that gnt-cluster upgrade only handles the switching between Ganeti versions, not the installation or removal of the Ganeti binaries. Some users got confused about this. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com> Reviewed-by:
Thomas Thrainer <thomasth@google.com>
-
Klaus Aehlig authored
This also avoids writing to hard-coded paths under /tmp Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Apollon Oikonomopoulos authored
Signed-off-by:
Apollon Oikonomopoulos <apoikos@gmail.com> Reviewed-by:
Jose A. Lopes <jabolopes@google.com>
-
Apollon Oikonomopoulos authored
Signed-off-by:
Apollon Oikonomopoulos <apoikos@gmail.com> Reviewed-by:
Jose A. Lopes <jabolopes@google.com>
-
Apollon Oikonomopoulos authored
We add a second category of daemons in daemon-util, those that are started on demand and should not be started by the start_all action, but should be stopped during service shutdown. This category currently includes metad and kvmd. To this end we introduce two more array variables: - ON_DEMAND_DAEMONS: currently holds kvmd and metad, which should only be stopped during stop-all. - ALL_DAEMONS: is the union of DAEMONS and ON_DEMAND_DAEMONS, i.e. the list of all known daemons. We also change the daemon-util test to expect kvmd and metad in the stop daemon list. Signed-off-by:
Apollon Oikonomopoulos <apoikos@gmail.com> Reviewed-by:
Jose A. Lopes <jabolopes@google.com>
-
Petr Pudlak authored
Since _SetDrbdHelper calls methods on self.cfg, it's necessary to update the configuration before it is called. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com> Reviewed-by:
Hrvoje Ribicic <riba@google.com>
-
- 27 Apr, 2014 1 commit
-
-
Hrvoje Ribicic authored
Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
- 25 Apr, 2014 9 commits
-
-
Klaus Aehlig authored
* stable-2.11 Make GlusterFS tests dual-stack-aware Moving release date of 2.11.0 to 25th of April Revision bump for 2.11.0 Prepare NEWS file for release of 2.11.0 Fix Xen instance state * stable-2.10 Fix failed DRBD disk creation cleanup Fix lint errors introduced during cherry-pick Hooking up verification for shared file storage Fix --shared-file-storage-dir option of gnt-cluster modify Clarify default setting of 'metavg' Fix invocation of GetCommandOutput in QA Clean up RunWithLocks Add an exception-trapping thread class Wait for delay to provide interruption information Add an expected block option to RunWithLocks Track if a QA test was blocked by locks Add a RunWithLocks QA utility function * stable-2.9 If Automake version > 1.11, force serial tests Conflicts: NEWS: take all additions configure.ac: ignore suffix bump lib/client/gnt_cluster.py: union of additions lib/cmdlib/cluster.py: union of additions lib/hypervisor/hv_xen.py: union of additions qa/qa_job_utils.py: take version of master Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Hrvoje Ribicic <riba@google.com>
-
Klaus Aehlig authored
Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Klaus Aehlig authored
* stable-2.10 Fix failed DRBD disk creation cleanup Fix lint errors introduced during cherry-pick Hooking up verification for shared file storage Fix --shared-file-storage-dir option of gnt-cluster modify Clarify default setting of 'metavg' Fix invocation of GetCommandOutput in QA Clean up RunWithLocks Add an exception-trapping thread class Wait for delay to provide interruption information Add an expected block option to RunWithLocks Track if a QA test was blocked by locks Add a RunWithLocks QA utility function * stable-2.9 If Automake version > 1.11, force serial tests Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Klaus Aehlig authored
* stable-2.9 If Automake version > 1.11, force serial tests Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Apollon Oikonomopoulos authored
Resolving "localhost" may either return "127.0.0.1" or "::1", so we have to check for both cases in the relevant tests. Signed-off-by:
Apollon Oikonomopoulos <apoikos@gmail.com> Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Petr Pudlak authored
This fixes broken compilation on Debian Jessie (#802). See also http://stackoverflow.com/questions/15820844/ Thanks to Apollon Oikonomopoulos for finding the solution and Klaus Aehlig for helping to do it conditionally. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Klaus Aehlig authored
When forking off jobs, make them inherit the debug level of the parent process (i.e., of luxid). In this way, we can debug jobs in test clusters without cluttering production logs. We pass the debug level through the environment instead of the startup dialog with the parent, as we would like to be able to debug the startup dialog. While there, fix the call of SetupLogging: the debug parameter is supposed to be an int. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
While jobs are still handled by what used to be masterd, the log message should be assigned to the respective job. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Hrvoje Ribicic authored
When creating a DRBD disk, Ganeti reserves minor numbers on a per-node basis. In case of a failed disk creation, these reservations should be released. During the name/uuid refactoring, the invocation of the function that releases the minors was not updated, resulting in no minors being released in case of a DRBD disk creation failure. Ganeti's verification utilities catch this error and rightly complain. This patch fixes the issue by providing the UUID and not the name. Signed-off-by:
Hrvoje Ribicic <riba@google.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
- 24 Apr, 2014 20 commits
-
-
Helga Velroyen authored
Due to unforseen circumstances, we are moving the release of 2.11.0 to tomorrow and thus the NEWS file needs to be updated. Signed-off-by:
Helga Velroyen <helgav@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Marco Casavecchia (aka MCM) authored
This patch adds support for the native aio on KVM hypervisor. Basically, It adds a new HV-KVM optional parameter "disk_aio" that can be set with the followings values: threads (the default for KVM) or native. If not set, it uses the KVM default that is "threads". The patch also updates the man page for gnt-instance. Signed-off-by:
Marco Casavecchia (aka MCM) <marcolinuz@gmail.com> Reviewed-by:
Helga Velroyen <helgav@google.com>
-
Klaus Aehlig authored
Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Petr Pudlak authored
Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Petr Pudlak authored
.. so that they are displayed properly in logs. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Petr Pudlak authored
Otherwise a job that is being started is falsely reported as dead. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Petr Pudlak authored
.. and add a reason trail message. Otherwise failed jobs hang, never finishing. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Klaus Aehlig authored
In this example, the cluster has two nodes and four instances, two with primary on each of the nodes. The most scarce resource on this cluster are (virtual) CPUs and the second node has 3 times the CPU speed of the first one. So distributing the instances 1 and 3 gives a more balanced cluster. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
...as described in doc/design-cpu-speed.rst Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Add a derived parameter for nodes, providing the ratio of virtual CPUs per CPU-speed weighted physical CPU. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Make the htools luxi backend also query for cpu_speed and take the result into account. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Extend the text format by an optional column for each node containing the relative CPU speed, if provided. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Add a function on nodes modifying the CPU speed parameter. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
Add an additional parameter to the representation of a node for the relative CPU speed, initially set to 1. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
In other words, remove "cpu_speed" from all "nodeparams" where it is present, be it cluster, group, or node. Note that upgrading is no problem, as the default value will be used implicitly. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
This parameter will describe the speed of the CPU relative to the speed of a "normal" node in this node group. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
...in order not to have to declare floating point values as VTypeInt and rely on the sloppiness of the JSON specification to not distinguish between integers and floating point numbers. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Klaus Aehlig authored
This document really only talks about CPU speed. Signed-off-by:
Klaus Aehlig <aehlig@google.com> Reviewed-by:
Petr Pudlak <pudlak@google.com>
-
Petr Pudlak authored
In this case, the call trying to acquire a shared lock always succeeds, because the daemon already has an exclusive lock, which falsely reports that the job has died. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-
Petr Pudlak authored
In particular, distinguish the cases when a job could not have been cancelled and when a job has already finished. Signed-off-by:
Petr Pudlak <pudlak@google.com> Reviewed-by:
Klaus Aehlig <aehlig@google.com>
-