Commits · 02f99608dd26b7e91953630870585cd1a431d20b · itminedu / snf-ganeti

Oct 08, 2008

Fix for gnt-cluster init. · 02f99608
Oleksiy Mishchenko authored 16 years ago
```
Reviewed-by: iustinp
```
02f99608

Move the hypervisor attribute to the instances · e69d05fd

Iustin Pop authored 16 years ago

This (big) patch moves the hypervisor type from the cluster to the
instance level; the cluster attribute remains as the default hypervisor,
and will be renamed accordingly in a next patch. The cluster also gains
the ‘enable_hypervisors’ attribute, and instances can be created with
any of the enabled ones (no provision yet for changing that attribute).

The many many changes in the rpc/backend layer are due to the fact that
all backend code read the hypervisor from the local copy of the config,
and now we have to send it (either in the instance object, or as a
separate parameter) for each function.

The node list by default will list the node free/total memory for the
default hypervisor, a new flag to it should exist to select another
hypervisor. Instance list has a new field, hypervisor, that shows the
instance hypervisor. Cluster verify runs for all enabled hypervisor
types.

The new FIXMEs are related to IAllocator, since now the node
total/free/used memory counts are wrong (we can't reliably compute the
free memory).

Reviewed-by: imsnah

e69d05fd

Oct 07, 2008

Updates to the security document · 6884c0ca

Iustin Pop authored 16 years ago

This patch changes formatting and the DRBD shared secret details, and
adds master daemon socket details to the security doc.

Reviewed-by: imsnah

6884c0ca

Move the SECURITY document to the doc/ dir · 73100cf5
Iustin Pop authored 16 years ago
```
Reviewed-by: imsnah
```
73100cf5
Fix formatting in design-2.0-os-interface · 43f30ee6
Iustin Pop authored 16 years ago
```
Reviewed-by: imsnah
```
43f30ee6
Small changes to the index design doc · 109509e4
Iustin Pop authored 16 years ago
```
This is just some additions of not-yet-mentioned docs.

Reviewed-by: ultrotter
```
109509e4
Change default instance reboot type to hard. · bf2fd71e
Alexander Schreiber authored 16 years ago
```
Merged r1777 from branches/ganeti/ganeti-1.2

Reviewed-by: imsnah
```
bf2fd71e
Add new design docs to Makefile.am · 0e861960
Alexander Schreiber authored 16 years ago
```
Reviewed-by: imsnah
```
0e861960

rpc.call_instance_migrate: pass the whole instance · 9f0e6b37

Iustin Pop authored 16 years ago

Currently the call_instance_migrate call only passes the instance name;
we need to pass the whole object for the hypervisor_type changes (all
the other individual instance rpc calls already pass the instance
object).

Reviewed-by: imsnah

9f0e6b37

Slightly change the hypervisor parameter example. · cd55576a
Alexander Schreiber authored 16 years ago
```
Reviewed-by: iustinp
```
cd55576a
Ganeti 2.0 cluster parameters design doc · 132b4ba2
Alexander Schreiber authored 16 years ago
```
Reviewed-by: ultrotter
```
132b4ba2

Implement job 'waiting' status · e92376d7

Iustin Pop authored 16 years ago

Background: when we have multiple jobs in the queue (more than just a
few), many of the jobs (up to the number of threads) will be in state
'running', although many of them could be actually blocked, waiting for
some locks. This is not good, as one cannot easily see what is
happening.

The patch extends the opcode/job possible statuses with another one,
waiting, which shows that the LU is in the acquire locks phase. The
mechanism for doing so is simple, we initialize (in the job queue) the
opcode with OP_STATUS_WAITLOCK, and when the processor is ready to give
control to the LU's Exec, it will call a notifier back into the
_JobQueueWorker that sets the opcode status to OP_STATUS_RUNNING (with
the proper queue locking). Because this mechanism does not save the job,
all opcodes on disk will be in status WAITLOCK and not RUNNING anymore,
so we also change the load sequence to consider WAITLOCK as RUNNING.

With the patch applied, creating in parallel (via burnin) five instances
on a five node cluster shows that only two are executing, while three
are waiting for locks.

Reviewed-by: imsnah

e92376d7

OS Interface design doc · 12222048
Guido Trotter authored 16 years ago
```
Reviewed-by: imsnah
```
12222048
Add .. contents:: marker to design docs · 47eb4b45
Guido Trotter authored 16 years ago
```
Reviewed-by: imsnah
```
47eb4b45

Oct 06, 2008

Implement job auto-archiving · 07cd723a

Iustin Pop authored 16 years ago

This patch adds a new luxi call that implements auto-archiving of jobs
older than a certain age (or -1 for all completed jobs), and the gnt-job
command that makes use of this (with 'all' for -1).

Reviewed-by: imsnah

07cd723a

Add a simple timespec parsing function · 2241e2b9

Iustin Pop authored 16 years ago

This function will be used for auto-archiving jobs via the command line.
The function is pretty simple, we only support up to weeks since months
and higher are not 'precise' entities, and dealing with them would
require us to start using calendar functions.

Reviewed-by: imsnah

2241e2b9

backend.py change to get cluster name from master · 62c9ec92

Iustin Pop authored 16 years ago

Currently there are three function in backend that need the cluster name
in order to instantiate an SshRunner. The patch changes these to get the
cluster name from the master in the rpc call; once the multi-hypervisor
change is implemented, then very few places in which we need the SCR
remain in the backend.

Reviewed-by: killerfoxi, imsnah

62c9ec92

Disable re-reading of config file · 3d3a04bc

Iustin Pop authored 16 years ago

Since the objects read from the config file are passed to the various
threads, it's unsafe to re-read the config file (and throw away
ConfigWriter._config_data). As such, we disable the re-reading of the
file (since now the master is the owner the file, it makes not sense to
re-read it), and any modifications to the file must be done offline,
otherwise they will be overwritten.

Reviewed-by: imsnah

3d3a04bc

RAPI Desing Doc · a72b3711
Oleksiy Mishchenko authored 16 years ago
```
Reviewed-by: iustinp
```
a72b3711

Start implementation of parallel burnin · ec5c88dc

Iustin Pop authored 16 years ago

This patch introduces a simple framework for executing jobs in parallel
in burnin (the ExecJobSet function) and the "--parallel" command line
flag.

The patch also changes the instance creation to run in parallel when the
above flag is given. Error handling/instance removal is currently flacky
with this options if there are errors in the instance creation.

We also modify burnin to reuse a single client.

Reviewed-by: imsnah

ec5c88dc

Fix gnt-job list with empty timestamps · e0ec0ff6

Iustin Pop authored 16 years ago

In case the job object doesn't have a timestamp (which is a separate
issue), the listing should not break. We fix this by changing the
FormatTimstamp function itself to return '?' in case the timestamp
doesn't look good (note that it still can break if non-integers are
returned, but this is unlikely).

Reviewed-by: imsnah

e0ec0ff6

Increase the number of threads to 25 · 1daae384

Iustin Pop authored 16 years ago

Since our locks are not gathered nicely, we can have jobs that are
actually blocking on locks (parallel burnin shows this), so at least we
need to increase the number of threads above the usual number of jobs we
could have in a such a case.

Reviewed-by: imsnah

1daae384

Minor cleanups & typo fixes. · 74bc10e8
Alexander Schreiber authored 16 years ago
```
Reviewed-by: iustinp
```
74bc10e8

Fix SshRunner breakage from the changed API · 6b0469d2

Iustin Pop authored 16 years ago

More places actually use the SshRunner than just the gnt-cluster
commands.

Reviewed-by: ultrotter

6b0469d2

Change SshRunner usage · 56bece1f

Iustin Pop authored 16 years ago

Currently the SshRunner uses a SimpleConfigReader instance, however this
is not best. We change it to use the cluster name directly (and its
constructor now takes this as parameter, instead of SCR), and its
callers are change to pass the name directly.

As a consequence, we can now remove the initialization of SCR in
gnt-cluster (copyfile and command), and instead we query the master for
the cluster name).

Reviewed-by: imsnah

56bece1f

Update document describing cluster security · 4fbe765c
Michael Hanselmann authored 16 years ago
```
It may need further updates, but here's a start.

Reviewed-by: ultrotter
```
4fbe765c

Oct 05, 2008
- Fix ssconf.GetMasterAndMyself · 06dc5b44
  Iustin Pop authored 16 years ago
```
The ssconf migration left this out.

Reviwed-by: imsnah,ultrotter
```
  06dc5b44
Oct 03, 2008
- Fix a mistake in the gnt-backup man page · 56118de5
  Iustin Pop authored 16 years ago
```
The actual location is /export, not /exports.

Reviewed-by: ultrotter
```
  56118de5
Oct 02, 2008

Use docbook2* paths found during configure for actual build · 65dfd777

Michael Hanselmann authored 16 years ago

docbook-wrapper had the names for the docbook2* programs hardcoded. This
patch changes Makefile.am and the wrapper script to pass them via
another argument.

Another issue where rapi.in was built before rapi-resources.sgml is
also fixed.

Reviewed-by: iustinp

65dfd777

Remove references to Twisted framework · ffa1c260
Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
ffa1c260

Oct 01, 2008
- Get rid of ssconf · c259ce64
  Michael Hanselmann authored 16 years ago
```
Remove leftovers from ssconf.

Reviewed-by: iustinp
```
  c259ce64
- Don't pass sstore to LUs anymore · 0b38cf6e
  Michael Hanselmann authored 16 years ago
```
sstore is no longer used in LUs.

Reviewed-by: iustinp
```
  0b38cf6e
- Convert ganeti-master · a42872ff
  Michael Hanselmann authored 16 years ago
```
Use simpleconfig instead of ssconf.

Reviewed-by: iustinp
```
  a42872ff
- Convert ganeti-watcher · 2859b87b
  Michael Hanselmann authored 16 years ago
```
Use RPC calls instead of ssconf.

Reviewed-by: iustinp
```
  2859b87b
- Convert ganeti-noded · 8594f271
  Michael Hanselmann authored 16 years ago
```
Replace ssconf with utility functions.

Reviewed-by: iustinp
```
  8594f271
- Convert gnt-cluster · e00ea635
  Michael Hanselmann authored 16 years ago
```
Replace ssconf with configuration.

Reviewed-by: iustinp
```
  e00ea635
- Convert bootstrap.py · d23ef431
  Michael Hanselmann authored 16 years ago
```
Replace ssconf with configuration.

Reviewed-by: iustinp
```
  d23ef431
- Convert cmdlib.py · d6a02168
  Michael Hanselmann authored 16 years ago
```
Replacing ssconf with configuration. Cluster rename is broken and stays
that way.

Reviewed-by: iustinp
```
  d6a02168
- Convert ssh.py · 7688d0d3
  Michael Hanselmann authored 16 years ago
```
Get rid of ssconf and convert to configuration instead.

Reviewed-by: iustinp
```
  7688d0d3
- Convert rpc.py · eb1328a9
  Michael Hanselmann authored 16 years ago
```
Replacing ssconf with utility functions.

Reviewed-by: iustinp
```
  eb1328a9