Commits · 76f59a328bd010a982f0621576851a4c041f313d · itminedu / snf-ganeti

Jun 06, 2008
- Forward-port: Fix two problems in QA scripts · 76f59a32
  Michael Hanselmann authored 16 years ago
```
- Failover back to original node in instance failure test
- Exclude secondary node from list of potential nodes in
  replace-disks test

Reviewed-by: iustinp
```
  76f59a32
- Forward-port: Add QA tests for “gnt-instance reboot” · 8a4e8898
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: ultrotter
```
  8a4e8898
- Forward-port: Add QA test for “gnt-instance replace-disks” · 7910e7a5
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
  7910e7a5
- Forward-port: Update gnt-instance and gnt-backup manpages · a53a1b18
  Michael Hanselmann authored 16 years ago
```
- Add --iallocator options
- Small text fixes

Reviewed-by: ultrotter
```
  a53a1b18
- Forward-port: Fix wrong filename in ganeti-watcher manpage · 52da0141
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
  52da0141
- Forward-port: Small codestyle fixes for dumb-allocator · b3a447ef
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
  b3a447ef
- Forward-port: Remove output file if docbook failed · ef267657
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: ultrotter
```
  ef267657
- Forward-port: Alias Dump/Load functions in ganeti.serializer to DumpJson/LoadJson · 228538cf
  Michael Hanselmann authored 16 years ago
```
The remote API will use JSON for the foreseable future, so it's better
to put the serialization format in the function name. We can still
use another serialization format for Ganeti's core.

Reviewed-by: amishchenko, schreiberal
```
  228538cf
- Add line-breaks to gnt-instance manpage · 783583e9
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: ultrotter
```
  783583e9
May 31, 2008

Add check for node memory in instance creation · 49ce1563

Iustin Pop authored 16 years ago

Currently the check for enough memory is done only on instance start
command and failover command. But we also start an instance in instance
create, therefore we need to check this instead of failing to start in
the hypervisor phase.

The patch adds a check for node memory in the case the creation command
specifies that the instance should be started. It is allowed for the
memory to be less than needed if the instance will not be started, in
order to allow migration and other such cases.

Reviewed-by: imsnah

49ce1563

Show cluster hypervisor for gnt-cluster info · 8a12ce45
Iustin Pop authored 16 years ago
```
Author: schreiberal
Reviewed-by: iustinp
```
8a12ce45
Forward-port: Another for gnt-instance modify & HVM parameters · 917f91d5
Iustin Pop authored 16 years ago
```
Another tiny fix. Anybody got a nice brown paper bag I can wear?

Author: schreiberal
Reviewed-by: iustinp
```
917f91d5

Forward-port: make gnt-modify work with new HVM parameters · ec1ba002

Iustin Pop authored 16 years ago

This fixes gnt-instance modify so it actually works with the
new HVM parameters for Ganeti 1.2

Author: schreiberal
Reviewed-by: iustinp

ec1ba002

Forward-port: show only parameters relevant to the instance · a8340917

Iustin Pop authored 16 years ago

This patch modifies the code for "gnt-instance info .." to only display
instance parameters that actually apply to that instance, i.e. for PVM
instances no HVM parameters are shown and vice versa.

Author: schreiberal
Reviewed-by: iustinp

a8340917

Forward-port: patch 4/4 extended HVM features for 1.2 · ca9c49d5
Iustin Pop authored 16 years ago
```
This patch documents the extended HVM features.

Author: schreiberal
Reviewed-by: imsnah
```
ca9c49d5

Forward-port: patch 3/4 extended HVM features for 1.2 · a21dda8b

Iustin Pop authored 16 years ago

This patch adds hypervisor support for the extended HVM features.

Author: schreiberal
Reviewed-by: iustinp

a21dda8b

Forward-port: patch 2/4 extended HVM features for 1.2 · 31a853d2

Iustin Pop authored 16 years ago

This patch adds the commandline extensions and the code to store
and display the extended HVM features.

Author: schreiberal
Reviewed-by: iustinp

31a853d2

May 30, 2008

Complete removal of md/drbd 0.7 code · abdf0113

Iustin Pop authored 16 years ago

This patch removes the last of the md and drbd 0.7 code. Cluster which
have the old device types will be broken if they have this applied.

Reviewed-by: imsnah

abdf0113

LURemoveInstance: fix op.ignore_failures usage · 5c54b832

Iustin Pop authored 16 years ago

Currently: the LURemoveInstance.Exec() method uses the ignore_failures
attribute of the OpRemoveInstance opcode, but it doesn't check for its
existence. The patch adds this attribute to _OP_REQP and to all the
places where this opcode was created.

This attributes is always passed by gnt-instance, but burnin didn't pass
it so it can fail if it enters the 'fail to remove disks' branch of the
method (which is why it was not triggered until now).

Reviewed-by: ultrotter, imsnah

5c54b832

May 29, 2008

Documentation: cleanup of local/remote_raid1 · bd028152

Iustin Pop authored 16 years ago

Since we have removed support for local and remote raid1, update the man
pages and guides to reflect the new situation.

Reviewed-by: imsnah

bd028152

May 24, 2008

Distribute dumb-allocator in examples · 447b2066

Guido Trotter authored 16 years ago

When creating the ganeti tarball the dumb allocator was left out.
Shipping it alongside the other examples.

Reviewed-by: iustinp

447b2066

May 15, 2008

Update command line help and manpages with mandatory options · bdb7d4e8
Michael Hanselmann authored 16 years ago
```
Reviewed-by: ultrotter
```
bdb7d4e8

document cluster verify --no-nsplus1-mem option · 3cf7c9fa

Guido Trotter authored 16 years ago

Add this recently added option to the gnt-cluster man page before
releasing 1.2.4.

Reviewed-by: imsnah

3cf7c9fa

Fix drbd show parser to handle valueless keywords · 63012024

Guido Trotter authored 16 years ago

It turns out in some cases there can exist keywords without an
associated value exported by drbdsetup show. This patch makes the value
part optional in our parser, so that if it's not present the parsing
result will contain an array with just the keyword in it. This is not a
problem since we check all keyword names before accessing their values,
so we won't mistakenly try to access the value of a valueless keyword.

Reviewed-by: iustinp

63012024

Split drbd command creation and execution · 333411a7

Guido Trotter authored 16 years ago

Make _AssembleDisk more similar to _AssembleNet by splitting the
generation of the drbdsetup command and its execution. While not
changing anything this makes it easier to manipulate the command just in
certain cases, which in the future we'll need to do.

Reviewed-by: iustinp

333411a7

May 13, 2008

Small style fixes · 8d59409f
Iustin Pop authored 17 years ago
```
[Trunk version]

Reviwed-by: imsnah
```
8d59409f

Implement node daemon conectivity tests · 9d4bfc96

Iustin Pop authored 17 years ago

This patch adds in gnt-cluster verify checks for inter-node tcp
communication checks on the node daemon port for both the primary and
(if defined) secondary networks.

The output looks like (4-node cluster, one with the secondary interface
down):
* Verifying node node1.example.com
  - ERROR: tcp communication with node 'node3.example.com': failure using the secondary interface(s)
* Verifying node node2.example.com
  - ERROR: tcp communication with node 'node3.example.com': failure using the secondary interface(s)
* Verifying node node3.example.com
  - ERROR: tcp communication with node 'node1.example.com': failure using the secondary interface(s)
  - ERROR: tcp communication with node 'node2.example.com': failure using the secondary interface(s)
  - ERROR: tcp communication with node 'node4.example.com': failure using the secondary interface(s)
* Verifying node node4.example.com
  - ERROR: tcp communication with node 'node3.example.com': failure using the secondary interface(s)

Reviewed-by: imsnah

9d4bfc96

Forward-port changes made to readd in 1.2 · 102b115b

Michael Hanselmann authored 17 years ago

qa_node.py: Fix typo in message
cmdlib.py: Don't add readded node to node list
ganeti-qa.py: Make sure readd isn't done for master node

Reviewed-by: iustinp

102b115b

CLI: retry: remove command opts/args in "gnt-X" · 4e713df6

Iustin Pop authored 17 years ago

This new version of the patch removes only the listing of the usage in
the "gnt-X" list, but keeps the strings in since we'll want to enhance
and use them in "gnt-X $cmd --help".

Reviewed-by: ultrotter

4e713df6

Revert "CLI: remove command opts/args in "gnt-X"" · 9a033156
Iustin Pop authored 17 years ago
```
This reverts commit 976.

Reviewed-by: ultrotter
```
9a033156

CLI: remove command opts/args in "gnt-X" · 57d0151e

Iustin Pop authored 17 years ago

[Forward-port of the 1.2 branch patch]

This patch removes all the parameters and options from the output
"gnt-X" (i.e. the subcommand list for command). This is done in order to
uniformize the output, currently only some parameters are shown and they
are not always consistent (e.g. required versus important parameters).

Reviewed-by: ultrotter

57d0151e

Watcher: do not activate disks for started instances · eee1fa2d

Iustin Pop authored 17 years ago

Currently the watcher runs first the instance startup and then the
boot-id method of disk reactivation. However, irrelevant of the fact
that a node has rebooted or not, if we just started an instance, there's
no need for its disks to be activated again, since the start instance
has done that (if it is at all possible).

The patch modifies the watcher to remember all started instances and not
run activate-disks for them.

Reviewed-by: ultrotter

eee1fa2d

Watcher: do not activate disks for admin_down · 0c0f834d

Iustin Pop authored 17 years ago

Currently the watcher does activate disks (via bootid mechanisms) even
for admin_down instances.  This patch logs and skips over these
instances.

Reviewed-by: ultrotter

0c0f834d

Reduce chance of ssh failures in verify cluster · b544cfe0

Iustin Pop authored 17 years ago

The cluster verify builds a sorted list of nodes and passes that to all
the nodes (in parallel) for ssh checks. This means that for a cluster
with N nodes, there will be approximately N simultaneous connections to
the first node, then to the second node, etc. This, coupled with the
ssh daemon's “MaxStartups” parameter, can create false alarms about ssh
connectivity.

This patch randomizes the node list in the backend (therefore, each node
should have it's own order of ssh-ing to the other nodes) and the chance
of these alarms should be reduced.

Reviewed-by: ultrotter

b544cfe0

May 12, 2008

bdev: always log command output if it failed · 6c896e2f

Iustin Pop authored 17 years ago

Currently many error handling code paths in bdev.py log only
result.fail_reason (i.e. exit code or signal that killed the command)
but not its output. This makes debugging very hard.

The patch changes all places where we only log fail_reason to also log
result.output.

Reviewed-by: ultrotter

6c896e2f

May 10, 2008

DRBD: Fix another bug in diskless activation · ab6cc81c

Iustin Pop authored 17 years ago

DRBD8 requires that we pass ‘--create-device’ to the first command that
wants to activate a new DRBD minor. We do this currently when we run the
“drbdsetup ... disk” command which we run before the network setup.

But if the LVs are missing, we skip the ‘disk’ subcommand and run only
the ‘net’ one, so it might be that the activation fails because the
minor we selected was never created in the first place.

The patch adds the required parameter to the DRBD8._AssembleNet() call.
Since it's a no-op for existing minors, it should not create any
problems (tested and works both with configured and unconfigured
minors).

Reviewed-by: ultrotter

ab6cc81c

May 09, 2008

Remove utils.CheckDaemonAlive and use “xm info” instead · e3e66f02

Michael Hanselmann authored 17 years ago

There are a couple of reasons for doing so:
- /proc is not OS independent, it's only supported by Linux (there are
  emulations on other systems, but those might differ from the way
  Linux represents data).
- Checking a daemon's state doesn't necessarily mean it's usable.
  Connecting to the socket using “xm info” is much safer.
- Reduce code size.

Reviewed-by: iustinp

e3e66f02

May 08, 2008

Improve DRBD8.Open's docstring a bit more · f860ff4e
Guido Trotter authored 17 years ago
```
Reviewed-by: iustinp
```
f860ff4e
Fix comment typo in bdev.py · 7b62772e
Guido Trotter authored 17 years ago
```
Reviewed-by: iustinp
```
7b62772e

Fix DRBD8 diskless assembling · bf25af3b

Iustin Pop authored 17 years ago

The algorithm for attaching to existing DRBD devices is not trivial. It
has four alternatives, and there is a bug in the last one when we have
diskless devices.

The last case (local disk info matches but remote/network configuration
doesn't match) we disconnect from the network and reattach with the
correct info. We do this because correct local device has higher
priority than remote device.

However, the test we use (self._MatchesLocal) can succeed in two cases:
  - we have a disk and it's the same as the one attached
  - we don't have a disk and the drbd is in diskless mode

But this creates problems for the fourth case as when we already have
one diskless DRBD, activating then next one will do:
  - _MatchesLocal? yes, because both config data and system have no
    disks (with the effect that all diskless devices are identical)
  - _MatchesRemote? no, because this disk is configured to its current
    remote peer, not to our new one

The fix is trivial, although the algorithm not: we only allow overriding
the network configuration when the disk information matches and we are
not diskless, by adding the <"local_dev" in info'> test.

Reviewed-by: ultrotter

bf25af3b