- 22 Nov, 2011 1 commit
-
-
Guido Trotter authored
For now the new "memory" parameter stays there, but it will be removed later. The new values are just taken from the old one, in this patch. Signed-off-by:
Guido Trotter <ultrotter@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
- 21 Nov, 2011 10 commits
-
-
Andrea Spadaccini authored
Instead of relying on clients of the class for setting the device speed (and, in general, the DRBD parameters), move this responsibility inside the Assemble method. Signed-off-by:
Andrea Spadaccini <spadaccio@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Andrea Spadaccini authored
In the last merge I erroneously discarded the changes introduced by commit 2a6de57a "Check the results of master IP RPCs". This commit reintroduces them. Signed-off-by:
Andrea Spadaccini <spadaccio@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
Michael Hanselmann authored
Patch tested and confirmed to work by Andrea Spadaccini <spadaccio@google.com>. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Andrea Spadaccini <spadaccio@google.com>
-
Michael Hanselmann authored
Otherwise jobs started after an unclean master shutdown will fail as they depend on the RPC client. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Until now, if masterd received a fatal signal, it would start shutting down immediately. In the meantime it would hang while jobs are still processed. Clients couldn't connect anymore to retrieve a jobs' status. This this patch masterd checks if any job is running before shutting down. If there is it'll check again every five seconds. Once all jobs are finished, it waits another five seconds to give clients a chance to retrieve the jobs' status. After that masterd will shutdown in a clean fashion. If a second signal is received the old behaviour is preserved. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Instead of aborting the main loop as soon as a fatal signal (SIGTERM or SIGINT) is received, additional logic allows waiting for tasks to finish while I/O is still being processed. If no callback function is provided the old behaviour--shutting down on the first signal--is preserved. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
This is needed in case the scheduler user (daemon.Mainloop in this case) has other timeouts at the same time. Needed for clean master shutdown. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Doing so will prevent job submissions (similar to a drained queue), but won't affect currently running jobs. No further jobs will be executed. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Also log a message when a fatal signal was received and use dict.items. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
- 17 Nov, 2011 10 commits
-
-
Agata Murawska authored
Signed-off-by:
Agata Murawska <agatamurawska@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
Iustin Pop authored
Signed-off-by:
Iustin Pop <iustin@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
Iustin Pop authored
If confd is disabled, do not automatically restart it. Furthermore, we can't run maintenance actions if it is disabled so log a warning. Note that I haven't completely disabled the NodeMaintenance class with ENABLE_CONFD = False because I think they are at two different levels (e.g. we might have other maintenance actions done even with confd disabled). Signed-off-by:
Iustin Pop <iustin@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
Iustin Pop authored
Doesn't do anything yet. Signed-off-by:
Iustin Pop <iustin@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
Michael Hanselmann authored
It is not used. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
This is different from “Quiesce” in the sense that this function just changes an internal flag and doesn't wait for the queue to be empty. Tasks already being processed continue normally, but no new tasks will be started. New tasks can still be added, but won't be processed. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
This saves us from returning to the worker code when there is no task to be processed. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
This is in preparation for a clean(er) shutdown of masterd. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
After iallocator ran we can release any unused node locks. Since they must be in exclusive mode this should improve parallelization during instance creation. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
… instead of a variable which needs to be incremented for every step. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
- 16 Nov, 2011 5 commits
-
-
Agata Murawska authored
Signed-off-by:
Agata Murawska <agatamurawska@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Agata Murawska authored
Signed-off-by:
Agata Murawska <agatamurawska@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Agata Murawska authored
Signed-off-by:
Agata Murawska <agatamurawska@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Agata Murawska authored
Signed-off-by:
Agata Murawska <agatamurawska@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
René Nussbaumer authored
This is in preparation to take deeper dict constructs from the command line. You can feed the optionslist directly constructed of type "identkeyval" to it and it returns a fully deflated dict. This is mainly needed for the resource model changes where we have to modify the disk_state which is a 3 level dict: disk_type/name:disk_reserved=10g Signed-off-by:
René Nussbaumer <rn@google.com> Reviewed-by:
Michael Hanselmann <hansmi@google.com>
-
- 15 Nov, 2011 14 commits
-
-
Michael Hanselmann authored
Some methods, such as “_is_owned” and “list_owned”, have been aliased to make them public for a while now. This patch makes the actual implementation public. SharedLock's “is_owned” needs to be aliased to “_is_owned” to remain compatible with Python's built-in threaded.Condition class. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
The locking library doesn't like it when “release()” is called on a lockset or lock which isn't held by the current thread. Instead of modifying the library, which could have other side-effects, this rather simple change avoids errors when a LU simply tries to release all locks, even when it doesn't own any at a certain level. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Also acquire instance and resource locks in shared mode (see comment). Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
If early-release is not used, the resource lock is kept while waiting for disks to sync. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Important for when disks are converted. Release locks once they're not needed anymore. Make liberal use of assertions. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Removing an instance affects available disk space and memory. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Recreating disks conflicts with other disk operations, therefore the node resource lock must be acquired. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Since this doesn't really touch the node, but it conflicts with e.g. growing a disk, the resource lock must be acquired. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Also add one more feedback line. Downgrade instance lock to shared mode while we're only waiting for disks to sync. The node lock is released when not needed anymore. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
The node resource lock is released once the disks are in sync (that is, after wiping). Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Nothing is being written to. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
Nothing is written to. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-
Michael Hanselmann authored
No writes are being done. Signed-off-by:
Michael Hanselmann <hansmi@google.com> Reviewed-by:
Iustin Pop <iustin@google.com>
-