- Aug 24, 2009
-
-
Iustin Pop authored
This patch adds support for instance moves in burnin. This means that non-drbd instances finally get support for being moved. Signed-off-by:
Iustin Pop <iustin@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
Michael Hanselmann authored
Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
- Aug 17, 2009
-
-
Michael Hanselmann authored
Replacing the secondary node always replaces it for all disks. Commit 2945fd2d added an error message for this case. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
- Jul 24, 2009
-
-
Stephen Shirley authored
This way the produced table is formatted nicely. Signed-off-by:
Stephen Shirley <diamond@google.com> Signed-off-by:
Guido Trotter <ultrotter@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
- Jul 21, 2009
-
-
Iustin Pop authored
Many burnin steps initialize the batch queue at the beginning and commit it at the end of their operation. This patch moves this code to a decorator, in order to reduce redundant code. Signed-off-by:
Iustin Pop <iustin@google.com> Reviewed-by:
Olivier Tharan <olive@google.com>
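A minimal sketch of the decorator pattern this commit describes, not the burnin code itself. The names `batched`, `ClearQueue`, `CommitQueue` and the `FakeBurner` class are hypothetical and chosen only for illustration.

```python
import functools


def batched(fn):
    """Reset the batch queue before a step and commit it afterwards."""
    @functools.wraps(fn)
    def wrapper(self, *args, **kwargs):
        self.ClearQueue()               # hypothetical: start with an empty queue
        result = fn(self, *args, **kwargs)
        self.CommitQueue()              # hypothetical: submit everything queued
        return result
    return wrapper


class FakeBurner(object):
    """Stand-in for the burnin class; it only records what was committed."""

    def __init__(self):
        self.queue = []
        self.committed = []

    def ClearQueue(self):
        self.queue = []

    def CommitQueue(self):
        self.committed.append(list(self.queue))
        self.queue = []

    @batched
    def BurnStopStart(self, instances):
        # The step body only queues work; the decorator handles init/commit.
        for name in instances:
            self.queue.append(("stop", name))
            self.queue.append(("start", name))
```

In this sketch the queue is committed only when the step returns normally; whether a failing step should still commit is a design choice the commit message does not specify.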
-
Iustin Pop authored
Many burnin steps do a manual check of instance aliveness, via duplicate code. This patch moves this code to a decorator. Signed-off-by:
Iustin Pop <iustin@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
Iustin Pop authored
Some burnin steps are idempotent: e.g. reinstalling an instance (from burnin's point of view) can be done multiple times without any side-effects that would affect later burnin steps. As such, failing the whole burnin process due to a reinstall failure is undesirable. This patch modifies burnin by marking each opcode (in case of individual execution) and job set as retryable or not. Retryable actions will be retried up to a number of times, after which we give up and return failure. One side-effect is that in case of full failure in retryable job sets we lose the original exception (but we do log its string format), so we have a little bit less information in this case. Signed-off-by:
Iustin Pop <iustin@google.com> Reviewed-by:
Guido Trotter <ultrotter@google.com>
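The retry behaviour described above could be sketched roughly as follows. The names `run_with_retries` and `BurninFailure`, and the default of three retries, are assumptions made for illustration; the commit message does not state the actual limit.

```python
class BurninFailure(Exception):
    """Hypothetical exception raised once all retries are exhausted."""


def run_with_retries(fn, retryable, max_retries=3, log=None):
    """Run fn(); if it is marked retryable, retry it on failure.

    Non-retryable actions run once and propagate their original exception.
    Retryable actions log the string form of each error and, if every
    attempt fails, raise a generic failure (the original exception is lost).
    """
    if not retryable:
        return fn()
    for attempt in range(1, max_retries + 2):   # first try plus max_retries
        try:
            return fn()
        except Exception as err:
            if log is not None:
                log("attempt %d failed: %s" % (attempt, err))
    raise BurninFailure("giving up after %d retries" % max_retries)
```

Marking the action rather than the exception as retryable keeps the decision with the caller, who knows whether the step is idempotent.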
-
- Jul 19, 2009
-
-
Iustin Pop authored
A long-standing bug in burnin makes errors during the removal phase (e.g. because an import has failed, or because the initial creation has failed) hide the original error. This patch suppresses removal errors if we are already in ‘has_err’ mode, and otherwise it displays them normally. Signed-off-by:
Iustin Pop <iustin@google.com> Reviewed-by:
Guido Trotter <ultrotter@google.com>
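A rough sketch of the error-handling idea, assuming a `has_err` flag like the one named in the message. The function `burn_with_cleanup` and its callable parameters are hypothetical and not taken from burnin.

```python
def burn_with_cleanup(run_steps, remove_instances, log):
    """Run the burnin steps, then always attempt removal.

    If the steps already failed, a removal error is only logged so that it
    does not hide the original error; otherwise it is raised normally.
    """
    has_err = True
    try:
        run_steps()
        has_err = False
    finally:
        try:
            remove_instances()
        except Exception as err:
            if has_err:
                # The original exception keeps propagating after this block.
                log("removal error suppressed: %s" % err)
            else:
                raise
```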
-
- Jul 07, 2009
-
-
Michael Hanselmann authored
Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
- Jun 26, 2009
-
-
Guido Trotter authored
A few places in the code open a file "manually" rather than using our wrapper function, because they need an array with the lines. Combining the result of utils.ReadFile with splitlines() we get rid of the exceptions. Signed-off-by:
Guido Trotter <ultrotter@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
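As an illustration of the pattern, here is a small sketch using a stand-in `read_file` helper. It is a hypothetical substitute for utils.ReadFile, whose real signature is not shown in this log.

```python
def read_file(path):
    """Hypothetical stand-in for a ReadFile-style wrapper: whole file as a string."""
    with open(path) as fh:
        return fh.read()


def read_lines(path):
    """Return the file's lines without trailing newlines."""
    # splitlines() on the wrapper's result replaces a manual open()/readlines().
    return read_file(path).splitlines()
```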
-
- Jun 08, 2009
-
-
Guido Trotter authored
This way the same constant can represent the default profile also for nic, disk and OS parameters. Signed-off-by:
Guido Trotter <ultrotter@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
- Mar 06, 2009
-
-
Iustin Pop authored
The instance objects did not get a serial_no field. This patch adds a new constant for the field name and uses it for all three cases (cluster, nodes, instances). Reviewed-by: imsnah
-
- Mar 04, 2009
-
-
Iustin Pop authored
This patch makes the cfgupgrade script:
- handle instance changes
- handle disk changes
- handle further cluster fixes
- add configuration checks at the end, in non-dry-run mode
Reviewed-by: ultrotter
-
Iustin Pop authored
This patch makes cfgupgrade work on an empty cluster (i.e. no instances), up to the point that the config file can be converted from 1.2 to 2.0. This is not yet complete, though. Reviewed-by: ultrotter
-
- Feb 16, 2009
-
-
Iustin Pop authored
In rename, we must stop instances under different names in the first and second phases, so we create two different opcodes for this purpose (instead of using the same one twice, which doesn't work). Reviewed-by: imsnah
-
- Feb 10, 2009
-
-
Iustin Pop authored
This patch updates burnin not to use drained nodes (similar to the handling of offline nodes). Reviewed-by: imsnah
-
- Feb 04, 2009
-
-
Iustin Pop authored
This patch adds the framework for, and enables, lockless OpQueryInstances. This means that instances will be shown in ERROR_up or ERROR_down state, even though this is not an error (but just an in-progress job). The framework is implemented as follows:
- the OpQueryInstances, OpQueryNodes and OpQueryExports opcodes take an additional “use_locking” flag which denotes whether to lock or not; this patch only implements this for LUQueryInstances
- the luxi query functions take an additional argument use_locking which is passed to the master daemon, and then passed to the above opcodes
- cli.py exports a new SYNC_OPT command line option which sets this flag to true
- except for gnt-instance list, which uses this option, and for name-only queries (e.g. QueryNodes(fields=["names"])), all other callers set this flag to True
- RAPI also sets the flag to True
The patch was tested with a continuous (0.2s sleep in-between) gnt-instance list during a burnin, and no problems were observed. Reviewed-by: ultrotter
-
- Feb 03, 2009
-
-
Iustin Pop authored
For testing or just in case a device is exported by a bad driver with the 'removable' flag set, this patch adds a flag to lvmstrap that allows it to use these devices too. Reviewed-by: ultrotter
-
- Jan 23, 2009
-
-
Iustin Pop authored
This patch changes the iallocator framework to work with offline nodes and properly export them to plugins. It does this by only exporting the static configuration data for those nodes, and not attempting to parse the runtime data. The patch also fixes bugs in iallocator related to the RpcResult conversion, changes the should_run to admin_up attribute name (as per the internals change), and adds “-I” as a short option for “--iallocator” in gnt-instance, gnt-backup and burnin. Reviewed-by: ultrotter
-
Iustin Pop authored
This patch changes (significantly) the execution model in burnin:
- for all runs, (almost) all instance mods in a single Burn* procedure are done as part of a job; so for example add disk, stop, remove disk, start are no longer done as separate jobs but as a single job consisting of four opcodes
- for parallel runs, all Burn* procedures except the rename (which uses a single target name) run in parallel; before, only the creation was done in parallel
- due to the single-job execution and also parallel execution, the logging messages no longer happen synchronously with the execution, so they are informational rather than an actual execution log
The end result is that burnin now properly tests multi-opcode jobs and also tests all opcodes (except rename) for parallel execution. Note: On a test cluster, parallelization reduces burnin time from 23m to 15m. Reviewed-by: ultrotter
-
- Jan 20, 2009
-
-
Iustin Pop authored
The urllib2 module has very bad error handling. This patch switches to urllib, which is simpler, and we derive a custom class from the FancyURLopener. With this patch, burnin no longer keeps sockets in CLOSE_WAIT state. Reviewed-by: ultrotter
-
- Jan 16, 2009
-
-
Iustin Pop authored
In case we pass --disk-grow 0[,0..] then we should not call GrowDisks as it prints confusing log lines. Reviewed-by: imsnah
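A minimal sketch of the check described above. The helper names `parse_disk_grow` and `should_grow_disks` are hypothetical; only the `--disk-grow` option and its comma-separated form come from the message.

```python
def parse_disk_grow(value):
    """Parse a comma-separated --disk-grow value such as "0,128" into integers."""
    return [int(part) for part in value.split(",")]


def should_grow_disks(growth):
    """Return True only if at least one disk has a non-zero growth amount."""
    return any(size > 0 for size in growth)
```

With this check, a value of `0` or `0,0` skips the grow step entirely, so no log lines are printed for it.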
-
Iustin Pop authored
This patch adds a burnin option to keep instances at the end, so that debugging after a burnin failure is easier. Also, we reorder the command line parsing and client query so that one can use ./tools/burnin --help even on non-ganeti machines. Reviewed-by: ultrotter
-
- Jan 14, 2009
-
-
Iustin Pop authored
Reviewed-by: ultrotter
-
- Jan 13, 2009
-
-
Iustin Pop authored
This is again a copy of the latest 1.2 burnin code related to migration. Reviewed-by: ultrotter
-
Iustin Pop authored
Since we added many more tests in burnin, the output became almost unreadable. This patch changes the output to an indented one, so that the different phases and operations of burnin are more easily understood. Reviewed-by: ultrotter
-
Iustin Pop authored
Traditionally the start/stop test was the last, so move it back to there (added as last option in commit 854). Reviewed-by: amishchenko
-
Iustin Pop authored
This patch adds instance alive checks after most start operations. The check is done in a custom way:
- the instance is expected to have an http server up and running
- and it should serve the '/hostname.txt' resource containing the hostname of the instance
This allows checking that:
- creation is working OK
- start after failover (and in the future migrate) is ok
- rename works correctly
By default, the check is disabled since one needs a custom OS for this check. The patch also fixes a wrong variable name from a previous burnin patch. Reviewed-by: ultrotter
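The check described above can be sketched as follows. This is an illustration written against Python 3's `urllib.request`, not the burnin code of the time; the function names `http_fetch` and `instance_is_alive` and the 10-second timeout are assumptions.

```python
import urllib.request


def http_fetch(url, timeout=10):
    """Fetch a URL and return its body as text (the timeout value is an assumption)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", "replace")


def instance_is_alive(hostname, fetch=http_fetch):
    """Return True if the instance serves its own hostname at /hostname.txt."""
    try:
        body = fetch("http://%s/hostname.txt" % hostname)
    except OSError:
        # Connection refused, timeouts and HTTP errors all count as "not alive".
        return False
    return body.strip() == hostname
```

Passing the fetch function as a parameter keeps the comparison logic testable without a running instance.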
-
- Jan 12, 2009
-
-
Iustin Pop authored
This patch adds testing of add/remove disks and NICs to the burnin. Reviewed-by: imsnah
-
- Jan 09, 2009
-
-
Iustin Pop authored
Reviewed-by: imsnah
-
Iustin Pop authored
This patch updates burnin to the latest replace disks constant, and changes the constants' values to be more accurate. Reviewed-by: imsnah
-
Iustin Pop authored
This patch makes burnin skip the offline nodes in its builtin node selection. It also removes an extra line. Reviewed-by: imsnah
-
- Dec 02, 2008
-
-
Iustin Pop authored
These two operations were missing from burnin. The reboot is done with all valid modes (a new constant is added), and the reinstall is done both with and without specifying the OS (to account for the two code paths in the LU). Reviewed-by: imsnah
-
Iustin Pop authored
This is currently not supported, so don't try to do export/import in this case. Reviewed-by: imsnah
-
Iustin Pop authored
This allows burnin to work with diskless instances (since right now we cannot pass "no disks" to it). Reviewed-by: imsnah
-
- Nov 24, 2008
-
-
Guido Trotter authored
Update the call to CreateInstance used in ImportExport to the new API for multidisk/nic. Reviewed-by: iustinp
-
- Nov 21, 2008
-
-
Iustin Pop authored
This looks better than '0 disk'. Reviewed-by: ultrotter
-
Iustin Pop authored
Reviewed-by: ultrotter
-
- Nov 20, 2008
-
-
Iustin Pop authored
This patch adds support for multi-disk/multi-nic in:
- instance add
- burnin
The start/stop/failover/cluster verify work as expected. Replace disk and grow disk are TODO. There's also a change to gnt-job to allow dictionaries to be listed in gnt-job info. Reviewed-by: imsnah
-