Commits · 8a113c7a3e3761dad77f22bf27dc11147854a345 · itminedu / snf-ganeti

Dec 08, 2008

gnt-node modify: add the offline attribute · 3a5ba66a

Iustin Pop authored 16 years ago

This patch changes gnt-node modify and the associated opcode/lu to allow
modification of the node offline attribute.

Setting a node into offline mode automatically demotes it from the
master role.

Reviewed-by: ultrotter

3a5ba66a

Dec 02, 2008

Add cluster candidate pool size parameter · 4b7735f9

Iustin Pop authored 16 years ago

This patch adds a new cluster paramater "candidate_pool_size" which
tracks the desired size of the list of nodes with the master_candidate
flag set.

Reviewed-by: imsnah

4b7735f9

Add a gnt-node modify operation · b31c8676

Iustin Pop authored 16 years ago

This patch adds the OpCode, LogicalUnit and gnt-node command for
modifying node parameters, more specifically the master candidate flag
for a node.

Reviewed-by: imsnah

b31c8676

Nov 25, 2008

Implement support for multi devices changes · 24991749

Iustin Pop authored 16 years ago

This big patch adds support for:
  - changing NIC/disks in the multi-device model
  - adding/removing NICs
  - adding/removing disks

The patch is big and not very nice; the error checking paths are not
very clear.

The biggest problem is that from a simple instance.ATTR=VAL change
(which didn't throw errors before) now we are creating and removing
disks in this LU.

Reviewed-by: imsnah

24991749

Nov 24, 2008

IAllocator: use the right hypervisor · 8cc7e742

Guido Trotter authored 16 years ago

Since the hypervisor is instance dependent we'll get one on instance creation,
and use the one in the instance config on relocation.

Reviewed-by: iustinp

8cc7e742

Nov 20, 2008

Initial multi-disk/multi-nic support · 08db7c5c

Iustin Pop authored 16 years ago

This patch adds support for mult-disk/multi-nic in:
  - instance add
  - burnin

The start/stop/failover/cluster verify work as expected. Replace disk
and grow disk are TODO.

There's also a change gnt-job to allow dictionaries to be listed in
gnt-job info.

Reviewed-by: imsnah

08db7c5c

Oct 16, 2008

Enable gnt-cluster modify to hv/beparams · 779c15bb

Iustin Pop authored 16 years ago

This patch enables the cluster modify to change:
  - enabled hypervisor list
  - hvparams (per hypervisor)
  - beparams (only the default group)

Syntax:
  gnt-cluster modify -B vcpus=3 -H xen-pvm:no_initrd_path

Validation for parameters is somewhat missing - the individual
hypervisors will be checked for syntax and validation, but beparams
doesn't have validation yes (nowhere), it should be added here once we
have a global method (will come soon).

Reviewed-by: imsnah

779c15bb

Oct 14, 2008

grow-disk: wait until resync is completed · 6605411d

Iustin Pop authored 16 years ago

The patch adds a new ‘--no-wait-for-sync’ parameter to grow-disk similar
to the one in instance add, and changes the default to wait.

This is cleaner as at the moment when the command returns, we either
have a fully synced disk or there is an error.

This is a forward-port of rev 1183 on the 1.2 branch.

Reviewed-by: ultrotter

6605411d

Change over to beparams · 338e51e8

Iustin Pop authored 16 years ago

This big patch changes the master code to use the beparams. Errors might
have crept in, but it passes a small burnin.

Reviewed-by: ultrotter

338e51e8

Allow instance info to only query the config file · 57821cac

Iustin Pop authored 16 years ago

This patch adds a new '-s' parameter to ‘gnt-instance info’ that makes
it return only 'static' information. This is much faster, especially for
drbd instances.

This is a forward-port of rev 1570 on the ganeti-1.2 branch, resending
due to some conflicts.

Reviewed-by: imsnah

57821cac

Change gnt-instance modify to the hvparams model · 74409b12
Iustin Pop authored 16 years ago
```
Reviewed-by: imsnah
```
74409b12

Switch instance hypervisor parameters to hvparams · 6785674e

Iustin Pop authored 16 years ago

This big patch changes instance create to the new hvparams structure.
Old parameters are removed, so old jobs or old instances file will break
current clusters.

Reviewed-by: ultrotter

6785674e

Oct 08, 2008

Move the hypervisor attribute to the instances · e69d05fd

Iustin Pop authored 16 years ago

This (big) patch moves the hypervisor type from the cluster to the
instance level; the cluster attribute remains as the default hypervisor,
and will be renamed accordingly in a next patch. The cluster also gains
the ‘enable_hypervisors’ attribute, and instances can be created with
any of the enabled ones (no provision yet for changing that attribute).

The many many changes in the rpc/backend layer are due to the fact that
all backend code read the hypervisor from the local copy of the config,
and now we have to send it (either in the instance object, or as a
separate parameter) for each function.

The node list by default will list the node free/total memory for the
default hypervisor, a new flag to it should exist to select another
hypervisor. Instance list has a new field, hypervisor, that shows the
instance hypervisor. Cluster verify runs for all enabled hypervisor
types.

The new FIXMEs are related to IAllocator, since now the node
total/free/used memory counts are wrong (we can't reliably compute the
free memory).

Reviewed-by: imsnah

e69d05fd

Oct 01, 2008

Add new query to get cluster config values · ae5849b5

Michael Hanselmann authored 16 years ago

This can be used to retrieve certain cluster config values from
within clients.

OpDumpClusterConfig was not used anywhere, hence I'm just reusing
it. The way ConfigWriter.DumpConfig returned the configuration
was not thread-safe, anyway (no deepcopy).

Reviewed-by: iustinp

ae5849b5

Remove last use of utils.RunCmd from the watcher · 5188ab37

Iustin Pop authored 16 years ago

The watcher has one last use of ganeti commands as opposed to sending
requests via luxi. The patch changes this to use the cli functions.

The patch also has two other changes:
  - fix the docstring for OpVerifyDisks (found out while converting
    this)
  - enable stderr logging on the watcher when “-d” is passes

Reviewed-by: imsnah

5188ab37

Sep 29, 2008

Implement job summary in gnt-job list · 60dd1473

Iustin Pop authored 16 years ago

It is not currently possibly to show a summary of the job in the output
of “gnt-job list”. The closes is listing the whole opcode(s), but that
is too verbose. Also, the default output (id, status) is not very
useful, unless one looks for (and knows about) an exact job ID.

The patch adds a “summary” description of a job composed of the list of
OP_ID of the individual opcodes. Moreover, if an opcode has a ‘logical’
target in a certain opcode field (e.g. start instance has the instance
name as the target), then it is included in the formatting also. It's
easier to explain via a sample output:

gnt-job list
ID Status  Summary
1  error   NODE_QUERY
2  success NODE_ADD(gnta2)
3  success CLUSTER_QUERY
4  success NODE_REMOVE(gnta2.example.com)
5  error   NODE_QUERY
6  success NODE_ADD(gnta2)
7  success NODE_QUERY
8  success OS_DIAGNOSE
9  success INSTANCE_CREATE(instance1.example.com)
10 success INSTANCE_REMOVE(instance1.example.com)
11 error   INSTANCE_CREATE(instance1.example.com)
12 success INSTANCE_CREATE(instance1.example.com)
13 success INSTANCE_SHUTDOWN(instance1.example.com)
14 success INSTANCE_ACTIVATE_DISKS(instance1.example.com)
15 error   INSTANCE_CREATE(instance2.example.com)
16 error   INSTANCE_CREATE(instance2.example.com)
17 success INSTANCE_CREATE(instance2.example.com)
18 success INSTANCE_ACTIVATE_DISKS(instance1.example.com)
19 success INSTANCE_ACTIVATE_DISKS(instance2.example.com)
20 success INSTANCE_SHUTDOWN(instance1.example.com)
21 success INSTANCE_SHUTDOWN(instance2.example.com)

This is done by a simple change to the opcode classes, which allows an
opcode to format itself. The additional function is small enough that it
can go in opcodes.py, where it could also be used by a client if needed.

Reviewed-by: imsnah

60dd1473

Sep 01, 2008

Pass the force param to SetInstanceParms · 4300c4b6

Guido Trotter authored 16 years ago

It was already allowed in gnt-instance modify, but ignored.
It will be used to force skipping parameter checks.

This is a forward-port from branches/ganeti-1.2

Original-Reviewed-by: imsnah
Reviewed-by: iustinp

4300c4b6

Aug 29, 2008
- Merge r1536 from branches/ganeti/ganeti-1.2 · 5397e0b7
  Alexander Schreiber authored 16 years ago
```
Add HVM device type flags 2/3

Reviewed-by: ultrotter
```
  5397e0b7
Aug 08, 2008
- Two small style fixes · 0a7bed64
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
  0a7bed64
Jul 30, 2008

Rework master startup/shutdown/failover · b1b6ea87

Iustin Pop authored 16 years ago

This (big) patch reworks the master startup/shutdown and the fixes the
master failover.

What does the patch do?

For master start/stop:
  - remove the old ganeti-master script and its associated man page
  - moves the ip start/stop directly into the backend.(Start|Stop)Master
  - adds start/stop of the master/rapi daemon into these functions,
    selectively based on the start/stop arguments
  - makes the master call via rpc StartMaster(start_daemons=False) to
    the local node so that the master IP is started
  - and finally changes the example init.d script to directly start and
    stop all three daemons, since they do the right thing (depending on
    master/not master role)

For master failover:
  - moves the code from LUMasterFailover into bootstrap.MasterFailover,
    since we need to start/stop the master during this operation and
    thus it can't be executed from the master
  - removes the LUMasterFailover and its associated opcode

Notes: ubuntu's /etc/lsb-base-logging.sh is dumb, so the messages 'not
master' are not seen during startup on non-master nodes.

Reviewed-by: ultrotter

b1b6ea87

Jul 15, 2008

Documentation updates · a7399f66
Iustin Pop authored 16 years ago
```
Reviewed-by: imsnah
```
a7399f66

Rename BaseJO to BaseOpCode · 0e46916d

Iustin Pop authored 16 years ago

Since we don't have for now a job definition object anymore, we rename
this class to BaseOpCode. It's still useful (and not merged with OpCode)
since it holds all the 'pure' logic (no custom field handling, etc.)
whereas OpCode holds opcode specific data (OP_ID handling, etc).

The patch also fixes the module's docstring.

Reviewed-by: imsnah

0e46916d

Jul 09, 2008
- Remove old job queue code · 2467e0d3
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
  2467e0d3
Jun 23, 2008

Fix gnt-cluster “command” and “copyfile” · b3989551

Iustin Pop authored 16 years ago

Since the disabling of forking in the master daemon, the two ssh-based
subcommands were not working anymore. However, there is no need at all
for the commands to be run from the master daemon (permissions to read
the cluster private ssh key notwithstanding), they can be run directly
from the command line utilities.

The patch removes the two opcodes OpRunClusterCommand and
OpClusterCopyFile (and their associated LUs) and changes the code in
‘gnt-cluster’ to query the list of nodes and run directly the SshRunner
over the list. As such, all forking is done from the gnt-cluster script,
and the commands are working again.

Reviewed-by: imsnah

b3989551

Jun 17, 2008

Implement disk grow at LU level · 8729e0d7

Iustin Pop authored 16 years ago

This patch adds a new opcode and LU for growing an instance's disk.

The opcode allows growing only one disk at time, and will throw an error
if the operation fails midway (e.g. on the primary node after it has
been increased on the secondary node). As such, it might actually leave
different sized LVs on different nodes, but this will not create
problems.

Reviewed-by: imsnah

8729e0d7

Jun 12, 2008

Move InitCluster opcode into a single function · a0c9f010

Michael Hanselmann authored 16 years ago

This allows us to initialize a new cluster. The code certainly contains
bugs and hooks aren't implemented yet.

Reviewed-by: iustinp

a0c9f010

May 31, 2008

Forward-port: patch 2/4 extended HVM features for 1.2 · 31a853d2

Iustin Pop authored 16 years ago

This patch adds the commandline extensions and the code to store
and display the extended HVM features.

Author: schreiberal
Reviewed-by: iustinp

31a853d2

Apr 24, 2008

Implement replace secondary via the iallocator · b6e82a65

Iustin Pop authored 16 years ago

This patch implements secondary replace via the iallocator. The new
opcode parameter 'iallocator' behaves like this: if passed, it will
always compute and assign a new secondary, behaving in effect as if the
secondary node has been passed. It conflicts with actually giving the
secondary too.

[Note: not tested with remote_raid1, but the code should behave the
same, we only touch CheckPrereq and we assign a node.]

The patch also adds burnin support for the replace secondary operation;
with this in place, burnin can fully work with auto-assigned nodes.

Reviewed-by: ultrotter

b6e82a65

Apr 23, 2008
- Add gnt-backup remove functionality · 9ac99fda
  Guido Trotter authored 16 years ago
```
This patch also fixes the LUExportInstance Prereq docstring.

Reviewed-by: iustinp
```
  9ac99fda
Apr 16, 2008

Add --readd option to “gnt-node add” · e7c6e02b

Michael Hanselmann authored 16 years ago

This allows us to readd a node after it failed and required a
reinstallation or replacement.

Reviewed-by: iustinp

e7c6e02b

IAllocator part 3: LUCreateInstance changes · 538475ca

Iustin Pop authored 16 years ago

This (final) patch allows the instance's nodes to be selected
automatically based on the passed allocator algorithm.

The patch changes the pnode opcode parameter from required to optional,
now either the pnode or the iallocator must be passed.

A possible improvement could be to organize all the _IAllocator
functions into a separate class, but that can come later and the current
version is functionally ok.

Reviewed-by: ultrotter

538475ca

Allocator framework, 1st part: allocator input generation · d61df03e

Iustin Pop authored 16 years ago

In preparation for the introduction of automatic instance allocator,
this patch adds an allocator simulation opcode, that based on the input
parameters, will return either the input message to the allocator
(implemented) or the result of the allocator run (not yet implemented).

This allows algorithm tests against simulated allocations and the
current cluster state.

The patch adds the following:
  - a function that generates the generic cluster information for the
    allocator
  - a function that generates the 'new instance' information
  - a function that generates the 'replace_secondary' information

These three functions will be used by the allocator framework later to
generate the actual information for the external algorithms. Currently
we just return the json-serialized text.

Reviewed-by: imsnah

d61df03e

Apr 10, 2008

Verify: make skipping checks possible · e54c4c5e

Guido Trotter authored 16 years ago

Add a general way to skip some checks at cluster-verify time and make the N+1
memory redundancy check optional.

Reviewed-by: iustinp

e54c4c5e

Rework the results of OpDiagnoseOS opcode · 1f9430d6

Iustin Pop authored 16 years ago

Currently, the opcode DiagnoseOS is the only opcode that return a
structure of objects.OS (which is a custom class, and not a simple
python object) and furthermore all the processing of OS validity across
nodes is left to the clients of this opcode.

It would be more logical to have this opcode be similar to list
instances/nodes, in the sense that:
  - it should return a table of results
  - the fields in the table should be selectable

This patch does the above. The possible fields are:
  - name (os name)
  - valid (bool representing validity across all nodes)
  - node_status, which is a complicated structure required for ‘gnt-os
    diagnose’

With this patch, gnt-os list becomes a very simple iteration over the
list of results, filtering out non-valid ones. gnt-os diagnose is still
complicated, but no more than before.

The burnin tool has also been modified to work with the modified
results, and is simpler because of this (it only needs to know if an OS
is valid or not, not the per-node details).

Reviewed-by: imsnah

1f9430d6

Add per-opcode results to job processing · 35049ff2

Iustin Pop authored 16 years ago

This patch changes the definition of a job and introduces per-opcode
results.

First, the result and status fields of a job are condensed into a single
'status' attribute. Then, we introduce an opcode status and one result
list, that allow jobs to return values.

The gnt-job script is also modified to allow these new fields to be
queried.

Note that the patch changes the opcode field to op_list, and it changes
its return value from string to a list of (serialized) opcodes.

Reviewed-by: ultrotter

35049ff2

Apr 08, 2008
- Add file_storage_dir,file_driver to OpCreateInstance · dc936b49
  Manuel Franceschini authored 16 years ago
```
Reviewed-by: ultrotter, iustinp
```
  dc936b49
Apr 07, 2008

A small capitalization change (OpCode.LoadOpcode) · 00abdc96

Iustin Pop authored 16 years ago

This small patch fixed the opcodes.OpCode.LoadOpcode capitalization to
what was intented to be (as the comment says): LoadOpCode.

Reviewed-by: ultrotter

00abdc96

Mar 31, 2008
- parms->params Refactoring · 7767bbf5
  Manuel Franceschini authored 16 years ago
```
- Substitute all occurences of name 'parms' with 'params'
- Small codestyle fix

Reviewed-by: ultrotter
```
  7767bbf5
- Add OpSetClusterParams to opcodes · 12515db7
  Manuel Franceschini authored 16 years ago
```
Reviewed-by: iustinp
```
  12515db7
Mar 25, 2008

Remove the add/remove mirror operations · 249069a1

Iustin Pop authored 17 years ago

These two operations are related to md/drbd7 code (remote_raid1). Remove
them as part of the md/drbd7 removal.

Reviewed-by: imsnah

249069a1