- Jul 21, 2010
-
-
Iustin Pop authored
In case an instance is living on an offline node, it doesn't make sense to refuse moving it because that would create N+1 failures; failing N+1 is still much better than not running at all. Similarly, if the secondary node of an instance is offline, meaning the instance doesn't have any redundancy, we have a worse case than having a secondary that is N+1 failing and it could not accept the instance as primary, but it stil does redundancy for it. To allow this, we rename Node.addPri to addPriEx and introduce an extra parameter (addPri is a partial application of addPriEx and keeps the same signature). Node.addSec gets the same treatement.
-
- Jul 19, 2010
-
-
Iustin Pop authored
This was only used in one place (hbal), and is obsolete by the change to the dual name/alias structure.
-
Iustin Pop authored
This was a regression from the name handling changes, as we started using the original names for the solution list (which is not designed for parsing/feeding back into ganeti).
-
Iustin Pop authored
printSolution is no longer used, as we print the solution iteratively now.
-
- Jul 18, 2010
-
-
Iustin Pop authored
When the field list is prefixed with a plus sign, this will extend the default field list, instead of replacing it entirely.
-
Iustin Pop authored
This patch renames the pri/sec to pcnt/scnt, and adds the real primary and secondary instance lists, the peermap and the index of a node as selectable options.
-
Iustin Pop authored
If the last secondary instance of a peer is deleted (detected by the new peer memory value being equal to zero), then the pair (pdx, 0) should be deleted completely. This is not optimization per se, but rather cleanup (the speedup is at most a percent, and only in some corner cases).
-
- Jul 16, 2010
-
-
Iustin Pop authored
This needs to be abstracted in a separate function, but in the meantime we fix the issue in both places. Signed-off-by:
Iustin Pop <iustin@google.com>
-
- Jun 21, 2010
-
-
Iustin Pop authored
-
Iustin Pop authored
… for the serialization/deserialization of the job and opcode status. Job status 'gone' was not actually used. It can be reintroduced if needed.
-
Iustin Pop authored
This mirrors, again, the Ganeti constats, and are added for future use.
-
Iustin Pop authored
The rename is done such that we match Ganeti's own constants.
-
- Jun 08, 2010
-
-
Iustin Pop authored
Since the current buffer cannot contain (during network reads) an EOM, we should look for the EOM only in the newly-received string. While this shouldn't make much difference, in some tests it cuts the recvMsg total time by around half. On entering recvMsg, we have though to search the old buffer for a message though, since we could have received two Luxi messages on the last network query; this is however a one-off cost, compared to continuously looking for the EOM in the old string (at each receive loop).
-
- Jun 07, 2010
-
-
Iustin Pop authored
All current Luxi calls are supported after this patch. A bug in ArchiveJob is also fixed (Ganeti's job IDs are strings).
-
Iustin Pop authored
While not are directly useful, having them will open some possibilities (e.g. polling for job changes in hbal's -X mode, and auto-archiving the jobs once they are successful).
-
- Jun 02, 2010
-
-
Iustin Pop authored
-
Iustin Pop authored
Currently, we define the LuxiOp type as a simple enumeration, and leave the arguments structure to the users of the Ganeti.Luxi module. This is suboptimal for a couple of reasons: first, we decouple the operation type from operation arguments, and that means we don't use the type system for validation of the arguments; second, the clients themselves have to know about the JSON encoding of the protocol. For the above arguments, we change the operation type to contain the arguments too, and then the entire conversion/serialization is restricted to the Ganeti.Luxi module. Also, the removal of the JSON encoding from the clients results in an overall simplification of the code.
-
- Jun 01, 2010
-
-
Iustin Pop authored
Incomplete pattern match…
-
Iustin Pop authored
These are not comprehensive, but at least we have a start.
-
- May 30, 2010
-
-
Iustin Pop authored
QuickCheck's batch driver (at least v1) doesn't show the test aborts, but simply discards the specific exception and increases the abort count. This makes it hard to debug the tests, so we modify our own test wrapper (which so far only tracked total failures) to show any exceptions.
-
- May 28, 2010
-
-
Iustin Pop authored
Since the unittests are not 'clean' from the p.o.v. of type declarations, and cannot be made clean in all respects (e.g. orphan instances), we silence some warnings for the test target, to have a cleaner output.
-
- May 27, 2010
-
-
Iustin Pop authored
The tests are moved to a separate data structure, and we can select a subset of tests to run.
-
Iustin Pop authored
-
Iustin Pop authored
Some keys are optional in the Ganeti opcodes (e.g. ‘node’ in the OpReplaceDisks), and as such we need to transform them in a Maybe value, instead of failing. The patch reworks a bit fromObj and adds maybeFromObj which parses such optional values. It then uses it in the opcode reading.
-
Iustin Pop authored
This patch removes all old uses of fromJResult with the annotated version, and removes the non-annotated version. All JSON parsing points should now have annotated errors.
-
Iustin Pop authored
This allows, for example, the RAPI backend to detail which information (instance or node data) fails to parse.
-
Iustin Pop authored
Currently fromObj doesn't detail what we're trying to read, which can lead to cryptic messages: "Cannot read Int". The patch changes this function to annotate the error messages with the key/value we're trying to convert, by using a new version of fromJResult. Since the display of the key in tryFromObj is now redundant (it was already redundant in the 'not found' case), we remove it. The new version of fromJResult (annotateJResult) simply prepends a description string to the actual error message.
-
- May 26, 2010
-
-
Iustin Pop authored
-
- May 25, 2010
-
-
Iustin Pop authored
Instance, Node and Text modules have improved coverage.
-
- May 20, 2010
-
-
Iustin Pop authored
The patch adds some simple unit-tests for both the allocation function (we can allocate small instances on an empty cluster, we can allocate in tiered more starting from any size) and the balancing functions (one single instance is placed optimally, a full cluster plus an empty node can be rebalanced). The coverage has increased greatly, since this is the bulk of the algorithm/code. Also, the cluster tests are now being run with different options, since they are much slower.
-
Iustin Pop authored
This is done so we can test a longer pipeline.
-
Iustin Pop authored
This helps debugging via ghci.
-
Iustin Pop authored
After the name patches, we can pass in either the short or the full name, so update the hbal man page accordingly.
-
Iustin Pop authored
-
Iustin Pop authored
This patch introduces some new functionality in the base Element type and in Container which supports searching for all 'known' names of an element, such that both short and full names are accept for various options like '-O' and '--excluded-instances'.
-
Iustin Pop authored
Currently the name used internally is modified and holds the shortened name of the nodes/instances. This has caused issues before, since we always have to strip the suffix from input data and reapply it if we need to send data back to Ganeti. This patch changes the code such that the names are never modified, only the alias, and all the internal computations can forget about the common suffix addition/removal.
-
Iustin Pop authored
This new field ('alias') will hold the shortened/beautified display name. When resetting the name, the alias is reset too, and there's a new function to update only the alias.
-
Iustin Pop authored
First, we reduce the max size of the disks, since Int on 32bits will overflow for big simulated clusters. This is a real issue, that will need fixing in real life, but for now we just "silence" this test. Second, we increase the amount of time a test is allowed to run, otherwise on slower computers some tests might time-out.
-
- May 19, 2010
-
-
Iustin Pop authored
-
Iustin Pop authored
This increases the overall coverage by 5%-10% (depending on coverage type). Some modules are still not unittested at all, as HUnit is a better choice for them.
-