Commits · 8e00939c66ebc3f93a4ee34412e2ab5b09e95e0b · itminedu / snf-ganeti

Aug 06, 2008
- Maintain node list in job queue · 8e00939c
  Michael Hanselmann authored 16 years ago
```
The code makes sure not to include the master in the list.

Reviewed-by: iustinp
```
  8e00939c
- masterd: Move job queue into context object · 9113300d
  Michael Hanselmann authored 16 years ago
```
The job queue must be called from cmdlib when adding or removing
nodes to the cluster. Moving it to the context object makes
this possible.

Reviewed-by: iustinp
```
  9113300d
- Clean job queue directories when leaving cluster · f78346f5
  Michael Hanselmann authored 16 years ago
```
Old job files shouldn't be left on nodes removed from a cluster.

Reviewed-by: iustinp
```
  f78346f5
- Use new RPC call in “gnt-node list” · c54784d9
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
  c54784d9
- Implement query for nodes · 02f7fe54
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
  02f7fe54
- Use new query RPC call in “gnt-instance list” · 1f05af2b
  Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
  1f05af2b
- Implement query for instances · ee6c7b94
  Michael Hanselmann authored 16 years ago
```
Queries don't create jobs and are more efficient. Log messages
are not yet stored anywhere.

Reviewed-by: iustinp
```
  ee6c7b94
Aug 05, 2008

jqueue: Replicate jobs to all nodes · 23752136

Michael Hanselmann authored 16 years ago

Newly added nodes are not yet taken care of. Queue locking on
non-master nodes is not yet correct.

Reviewed-by: iustinp

23752136

Aug 04, 2008

jqueue: Use new jstore module · 04ab05ce
Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
04ab05ce
jstore: Add queue helper functions · 8b537bb0
Michael Hanselmann authored 16 years ago
```
This will be used to move common code out of jqueue.

Reviewed-by: iustinp
```
8b537bb0

Implement job submission for scripts · 94428652

Iustin Pop authored 16 years ago

This patch adds the infrastructure for executing a job in background,
instead of foreground, via a new “--submit” option. The behaviour is
that the job ID is printed and the script will immediately exit.

The patch also converts gnt-node list to this model (yes, this will be a
query in the future).

Reviewed-by: imsnah

94428652

Another typo in the install doc · 17dc2da0
Iustin Pop authored 16 years ago
```
Reviewed-by: imsnah
```
17dc2da0
Update the module build section of install doc · e7d2d69b
Iustin Pop authored 16 years ago
```
Reviewed-by: imsnah
```
e7d2d69b

Jul 31, 2008

jqueue: Move assert into decorator · db37da70

Michael Hanselmann authored 16 years ago

This reduces code duplication. A later patch will modify the job queue
a bit more and will need a change of this assert. The assertion is
also removed from all class-internal functions.

Reviewed-by: iustinp

db37da70
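
A minimal sketch of the pattern this commit describes: the precondition check lives in one decorator, public methods are wrapped with it, and class-internal helpers go unchecked. The names `require_open_queue`, `QueueSketch` and `is_open` are hypothetical, and the condition being asserted is an assumption; the commit message does not say what the real assertion checks.

```python
import functools


def require_open_queue(fn):
    """Assert a precondition before running the wrapped method."""
    @functools.wraps(fn)
    def wrapper(self, *args, **kwargs):
        # Hypothetical condition; the real assertion in jqueue may differ.
        assert self.is_open, "Queue should be open"
        return fn(self, *args, **kwargs)
    return wrapper


class QueueSketch:
    def __init__(self):
        self.is_open = False
        self._jobs = []

    def open(self):
        self.is_open = True

    @require_open_queue
    def submit(self, job):
        # Public entry point: guarded by the decorator.
        self._append(job)
        return len(self._jobs)

    def _append(self, job):
        # Class-internal helper: no assertion, the caller already checked.
        self._jobs.append(job)
```

Changing the assertion later then means touching only the decorator body, not every method.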

Split cli.SubmitOpCode in two parts · 0a1e74d9

Iustin Pop authored 16 years ago

The current SubmitOpCode function is not flexible enough to be used for
submitters that don't want to wait for the job to finish.

The patch splits this in two, a SendJob function and a PollJob one, and
the old SubmitOpCode becomes a wrapper. Note that the new SendJob takes
a list of opcodes (and not a single opcode anymore).

Reviewed-by: imsnah

0a1e74d9
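
The split described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the actual cli module: `FakeClient`, its methods, the status strings and the lower-case function names are all hypothetical stand-ins. Only the shape is taken from the commit message: a send step that takes a list of opcodes, a poll step, and a wrapper that combines them for a single opcode.

```python
import time


class FakeClient:
    """Hypothetical stand-in for the master daemon client."""

    def __init__(self):
        self._jobs = {}
        self._next_id = 1

    def submit_job(self, ops):
        job_id = self._next_id
        self._next_id += 1
        self._jobs[job_id] = {"ops": list(ops), "polls": 0}
        return job_id

    def query_job(self, job_id):
        job = self._jobs[job_id]
        job["polls"] += 1
        if job["polls"] < 2:
            # Pretend the job is still running on the first poll.
            return ("running", None)
        return ("success", ["done: %s" % op for op in job["ops"]])


def send_job(ops, client):
    """Submit a list of opcodes and return the job ID immediately."""
    return client.submit_job(ops)


def poll_job(job_id, client, interval=0.0):
    """Wait until the job finishes and return one result per opcode."""
    while True:
        status, results = client.query_job(job_id)
        if status == "success":
            return results
        if status == "error":
            raise RuntimeError("job %s failed" % job_id)
        time.sleep(interval)


def submit_opcode(op, client):
    """Old-style wrapper: send a single opcode and wait for its result."""
    job_id = send_job([op], client)
    return poll_job(job_id, client)[0]
```

A caller that does not want to wait uses `send_job` alone and keeps the returned job ID.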

Allow job queue files to be uploaded through ganeti-noded · afee8008
Michael Hanselmann authored 16 years ago
```
This is needed for job queue replication.

Reviewed-by: iustinp
```
afee8008

Add FileLock utility class · a87b4824

Michael Hanselmann authored 16 years ago

This class is a wrapper around fcntl.flock and abstracts opening and
closing the lockfile. It'll be used for the job queue.

(The patch also removes a duplicate import of tempfile into the unittest)

Reviewed-by: iustinp

a87b4824
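
For illustration, a wrapper of this kind might look like the sketch below. It is not the FileLock class added by the commit: the class name `SimpleFileLock`, its method names and its parameters are assumptions. Only `fcntl.flock` and its `LOCK_*` flags are real standard-library calls.

```python
import fcntl


class SimpleFileLock:
    """Hypothetical wrapper that hides opening/closing of the lock file."""

    def __init__(self, path):
        self._path = path
        self._file = None

    def acquire(self, shared=False, blocking=True):
        # Open the lock file lazily, creating it if needed.
        if self._file is None:
            self._file = open(self._path, "a")
        flags = fcntl.LOCK_SH if shared else fcntl.LOCK_EX
        if not blocking:
            # Raise OSError instead of waiting if the lock is taken.
            flags |= fcntl.LOCK_NB
        fcntl.flock(self._file, flags)

    def release(self):
        if self._file is not None:
            fcntl.flock(self._file, fcntl.LOCK_UN)
            self._file.close()
            self._file = None
```

Because `flock` locks belong to the open file description, two `SimpleFileLock` objects on the same path conflict with each other even inside one process.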

jqueue: Store context in job queue instead of worker pool · 5bdce580

Michael Hanselmann authored 16 years ago

The job queue will need access to the configuration, which is provided
through the context object, to get a list of nodes.

Reviewed-by: iustinp

5bdce580

RAPI Implement DELETE for tags · 15fd9fd5
Oleksiy Mishchenko authored 16 years ago
```
Reviewed-by: imsnah
```
15fd9fd5

First write operation (add tag) for Ganeti RAPI · 441e7cfd

Oleksiy Mishchenko authored 16 years ago

Add instance tag handling, improved error logging.
...oh, yes adopt instance listing for RAPI2!

Reviewed-by: iustinp

441e7cfd

Jul 30, 2008

Fix cluster destroy · 140aa4a8

Iustin Pop authored 16 years ago

With the recent startup/shutdown changes (and with the master daemon in
place), the cluster destroy needs some fixing.

This patch moves the finalization of the destroy out from cmdlib into
bootstrap, so we can nicely shut down the rapi and master daemons.

Reviewed-by: ultrotter

140aa4a8

Xen: remove two end-of-line semicolons · 97efde45
Guido Trotter authored 16 years ago
```
It's python, isn't it?

Reviewed-by: iustinp
```
97efde45

Fix cluster init · b3f1cf6f

Iustin Pop authored 16 years ago

With the recent changes, I forgot the extra parameter to this rpc call.
Also the rpc call needs to be done after we set up the config data, for
the master daemon to be able to start, so we move it after all other
init steps.

Reviewed-by: ultrotter

b3f1cf6f

Make gnt-* commands fail nicely on non-masters · b33e986b

Iustin Pop authored 16 years ago

This patch adds a check that we are on the master after failing to
connect to the socket, and nicely logs the master name.

Reviewed-by: ultrotter

b33e986b

Parallelize LUFailoverInstance · c9e5c064
Guido Trotter authored 16 years ago
```
Reviewed-by: iustinp
```
c9e5c064
ChainOpCode is still BGL-only · 64381ad7
Guido Trotter authored 16 years ago
```
Prevent mistakes with an assert.

Reviewed-by: iustinp
```
64381ad7
Fix a misuse of exc_info in logging.info · 8161a646
Iustin Pop authored 16 years ago
```
This is my fault, sorry.

Reviewed-by: imsnah
```
8161a646

Fix pylint-detected issues · 38206f3c

Iustin Pop authored 16 years ago

This is mostly:
  - whitespace fix (space at EOL in some files, not all, broken
    indentation, etc)
  - variable names overriding others (one is a real bug in there)
  - too-long-lines
  - cleanup of most unused imports (not all)

Reviewed-by: ultrotter

38206f3c

Fix some errors detected by pylint · 3b9e6a30
Iustin Pop authored 16 years ago
```
Reviewed-by: imsnah
```
3b9e6a30

Unify SetupDaemon/SetupLogging · 59f187eb

Iustin Pop authored 16 years ago

The 'old-style' info, error, debug logs do not make much sense. This
patch unifies the SetupLogging and SetupDaemon functions. As a result,
all the commands log to a 'commands.log' file.

The patch also changes the log setup to keep going if there's an error
in setting up the file logging but we're logging to stderr.

Also, burnin now logs to its own file (burnin.log).

Reviewed-by: ultrotter

59f187eb
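
The "keep going" behaviour can be sketched with the standard logging module. The function name `setup_logging_sketch`, its parameters and the logger name are hypothetical and do not reflect the real SetupLogging signature. The sketch only shows the rule stated above: a failure to open the log file is tolerated when stderr logging is enabled.

```python
import logging
import sys


def setup_logging_sketch(logfile, stderr_logging=False, name="sketch"):
    """Configure a logger; tolerate file errors if stderr is in use."""
    logger = logging.getLogger(name)
    # Start from a clean slate so repeated calls do not stack handlers.
    for handler in list(logger.handlers):
        logger.removeHandler(handler)

    if stderr_logging:
        logger.addHandler(logging.StreamHandler(sys.stderr))

    try:
        file_handler = logging.FileHandler(logfile)
    except OSError:
        if not stderr_logging:
            # No other destination for messages, so give up.
            raise
        logger.warning("Cannot log to %s, using stderr only", logfile)
    else:
        logger.addHandler(file_handler)
    return logger
```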

Simplify the log constants and add another one · 9936bd63

Iustin Pop authored 16 years ago

The patch changes the log constants by moving the slash to the end of
the log dir instead of at the beginning of *each* log file name.

It also adds a new LOG_COMMANDS constant (to be used in a next patch).

Reviewed-by: ultrotter

9936bd63
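
As a small illustration of the change, with made-up values: the directory path and `LOG_NODESERVER` are hypothetical examples, while `LOG_COMMANDS` and the `commands.log` file name appear in the commit messages on this page.

```python
# Illustrative values only; the real directory and file names may differ.
LOG_DIR = "/var/log/ganeti/"  # the trailing slash lives here, once

# Each file constant is now just the directory plus a bare file name.
LOG_NODESERVER = LOG_DIR + "node-daemon.log"  # hypothetical example
LOG_COMMANDS = LOG_DIR + "commands.log"
```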

Fix gnt-cluster getmaster · ce7151ae

Iustin Pop authored 16 years ago

This is special in the sense that it can run on any node. As such, we
just instantiate ssconf and read the data from it.

Reviewed-by: ultrotter

ce7151ae

Parallelize {Startup,Shutdown,Reboot}Instance · e873317a
Guido Trotter authored 16 years ago
```
Reviewed-by: iustinp
```
e873317a

Parallelize LUReinstallInstance · 4e0b4d2d

Guido Trotter authored 16 years ago

self.recalculate_locks[locking.LEVEL_NODE] could have any value and
everything would work anyway. We'll use the string 'replace' by
convention because in the future we might want an 'append' mode.

Reviewed-by: iustinp

4e0b4d2d

LogicalUnit._LockInstancesNodes helper function · c4a2fee1

Guido Trotter authored 16 years ago

This function is used to lock instances' primary and secondary nodes
after locking instances themselves.

Reviewed-by: iustinp

c4a2fee1

Make sharing locks possible · 3977a4c1

Guido Trotter authored 16 years ago

LUs can declare which locks they need by populating the
self.needed_locks dictionary, but those locks are always acquired as
exclusive. Make it possible to acquire shared locks as well, by
declaring a particular level as shared in the self.share_locks
dictionary. By default this dictionary is populated so that all locks
are acquired exclusively.

Reviewed-by: iustinp

3977a4c1
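
A rough sketch of how the two dictionaries might interact. The level constants, the class `LogicalUnitSketch` and the helper `plan_acquisitions` are hypothetical, and using booleans as `share_locks` values is an assumption. The sketch only shows the rule from the message: every level defaults to exclusive unless it is marked as shared.

```python
LEVEL_INSTANCE = "instance"
LEVEL_NODE = "node"
LEVELS = [LEVEL_INSTANCE, LEVEL_NODE]


class LogicalUnitSketch:
    def __init__(self):
        # Which lock names are needed at each level.
        self.needed_locks = {}
        # Default: nothing is shared, so all locks are exclusive.
        self.share_locks = {level: False for level in LEVELS}


def plan_acquisitions(lu):
    """Return (level, names, mode) tuples in locking order."""
    plan = []
    for level in LEVELS:
        names = lu.needed_locks.get(level)
        if not names:
            continue
        mode = "shared" if lu.share_locks[level] else "exclusive"
        plan.append((level, sorted(names), mode))
    return plan
```

A read-only LU would set `share_locks[LEVEL_NODE] = True` and leave the rest untouched.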

Add LogicalUnit.DeclareLocks · fb8dcb62

Guido Trotter authored 16 years ago

This additional LogicalUnit function is optional to implement, but lets
you change your locking needs for one level just before locking it, but
after the previous levels have already been locked. It is useful, for
example, to calculate what nodes to lock after locking an instance.

Reviewed-by: iustinp

fb8dcb62
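
The hook can be pictured as below. This is a sketch under assumptions, not the real LogicalUnit API: `INSTANCE_NODES`, `InstanceOpSketch`, `declare_locks` and `acquire_all` are hypothetical names, and actual lock acquisition is replaced by recording what would be locked.

```python
LEVEL_INSTANCE = "instance"
LEVEL_NODE = "node"
LEVELS = [LEVEL_INSTANCE, LEVEL_NODE]

# Hypothetical cluster layout: instance name -> nodes it lives on.
INSTANCE_NODES = {"inst1": ["node1", "node2"]}


class InstanceOpSketch:
    def __init__(self, instance_name):
        self.instance_name = instance_name
        # Node names are unknown until the instance itself is locked.
        self.needed_locks = {LEVEL_INSTANCE: [instance_name], LEVEL_NODE: []}

    def declare_locks(self, level):
        # Called just before `level` is locked; earlier levels are held,
        # so looking up the instance's nodes is safe at this point.
        if level == LEVEL_NODE:
            nodes = INSTANCE_NODES[self.instance_name]
            self.needed_locks[LEVEL_NODE] = list(nodes)


def acquire_all(lu):
    """Walk the levels in order, asking the LU before each one."""
    acquired = []
    for level in LEVELS:
        lu.declare_locks(level)
        for name in lu.needed_locks[level]:
            acquired.append((level, name))
    return acquired
```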

LURenameInstance, add/remove relevant locks · 74b5913f

Guido Trotter authored 16 years ago

LURenameInstance forgot to remove the old lock name and add the new one,
making it impossible for parallel LUs to act on the instance (without a
master daemon restart). This also fixes burnin+rename with the
parallelization of {Start,Stop}Instance.

Reviewed-by: iustinp

74b5913f

Rewrite job queue · 85f03e0d

Michael Hanselmann authored 16 years ago

We found several issues in the old job queue implementation. It had race
conditions, deadlocks and other deficiencies.

Short summary:
- _QueuedOpCode and _QueuedJob are now more or less data structures with a few
  utility functions. __Setup is gone.
- DiskJobStorage and JobQueue classes merged into one to reduce code complexity.
- One lock in JobQueue for almost everything. There's also a lock per opcode
  for log messages.

Reviewed-by: iustinp

85f03e0d

workerpool: Log when waiting for a thread · c0a8eb9e
Michael Hanselmann authored 16 years ago
```
Reviewed-by: iustinp
```
c0a8eb9e