Commits · f52dadb24f25ad780e3a6fcb6b94d61fa0dc1e18 · itminedu / snf-ganeti

Dec 30, 2010

Fix updating of available (V)CPUs in CStats · f52dadb2

Iustin Pop authored 14 years ago


Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

f52dadb2

Add 'Read' instances for most objects · 6bc39970

Iustin Pop authored 14 years ago


This allows a cluster structure to be easily serialized via "read";
together with the already existing instances of Show, this gives a
poor man's serialization/deserialization implementation.

The patch also exports the compDetailedCV function from Cluster.hs, so
that it can be used by other modules too.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

6bc39970

Dec 23, 2010

Change the balancing function · 4715711d

Iustin Pop authored 14 years ago


Currently the balancing function is a modified version of the standard
deviation (stddev divided by list length), due to historical reasons.

While this works fine for small clusters, for big clusters it makes
the balancing effect too "weak", and in some cases it refuses to
balance correctly some clusters. It also makes the balancing behaviour
dependant on the cluster size, which is a big no-no.

Therefore we revert to the normal version of standard deviation, and
we also rename the function to reflect what it does. The new version
correctly balances some corner cases that the previous version didn't,
and passes the current balancing unittests.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Adeodato Simo <dato@google.com>

4715711d

Move some tiered spec functionality to Cluster.hs · 949397c8

Iustin Pop authored 14 years ago


This splits out a bit of code from hspace.hs and moves it into its own
function in Cluster.hs.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

949397c8

Dec 20, 2010

IAllocator: respect the alloc_policy for groups · 73206d0a

Iustin Pop authored 14 years ago


This patch changes the allocate mode to respect the alloc_policy for
groups. It does this by changing the sort key from simply the solution
score, to a tuple with two elements: the alloc policy (which is now an
Ord instance) and the solution score. Also, the unallocable groups are
filtered out in the filterMGResults phase.

The patch also slightly enhances the informational message by
including the policy in the group information, to help debugging.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

73206d0a

hail: display group names in info messages · aec636b9

Iustin Pop authored 14 years ago


This patch switches from the group index to the group name for the
informational messages in the hail results.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

aec636b9

Change the Node.group attribute · 10ef6b4e

Iustin Pop authored 14 years ago


Currently, the Node.group attribute is the UUID of the group, as until
recently Ganeti didn't export the node group properties. Since it does
so now, we make the following changes (again apologies for a big
patch):

- we change the group attribute to be an index, similar to the way an
  Instance.pnode and snode attributes point to the parent node(s)
- on load, we read the group.uuid attribute and we use that to lookup
  the actual group index, from previously-loaded groups info
- this means that we now first read groups, then read nodes using the
  group info, and then read instances using the node info

This patch leaves a few functions showing the group index (ugly since
it's htools internal), will be converted later.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

10ef6b4e

Dec 09, 2010

Improve error reporting for small clusters · dec88196

Iustin Pop authored 14 years ago


When doing a two-node allocation on a cluster/group in which only one
node is online, or a one-node allocation without any online nodes, we
didn't show a valid error mesage. The patch changes tryAlloc to "fail
hard" in this case, to make the failure explicit.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

dec88196

hail/allocate: implement multi-group support · 9b1584fc

Iustin Pop authored 14 years ago


This is a bit hackish. We add a new function that takes the input data,
splits it into groups, runs the original tryAlloc for each group, and
then chooses the best solution, but adds the log messages from all the
groups, as to give better debugging information. In hail, we just point
to this new function.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

9b1584fc

Add a 'log' attribute to allocation solutions · 859fc11d

Iustin Pop authored 14 years ago


And also a couple of functions for describing a given solution; these
will be used in the future instead of the ones currently in hail.

The patch also enhances the description of failure messages.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

859fc11d

Change AllocSolution from tuple to its own type · 85d0ddc3

Iustin Pop authored 14 years ago


Tuples are good for two, three, at most four elements. Beyond that, the
continuous pattern matching and construction/deconstruction becomes
tedious.

Since in the future we'll probably keep more information in the
AllocSolution type, we change it now from a triple to a "real" data
type. We also do some cleanups: adding a real emptyAlloc value, instead
of the previous hardcoded ones, and add some more comments on how we do
the multi-evacuation.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

85d0ddc3

Dec 01, 2010

Cleanup AllocSolution after AllocElement changes · a334d536

Iustin Pop authored 14 years ago


Since we added the score to AllocElement, we don't need to wrap
AllocElement in yet another tuple, just to attach the cluster score. So
we simplify the AllocSolution type.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

a334d536

AllocElement: extend with the cluster score · 7d3f4253

Iustin Pop authored 14 years ago


AllocElement, a type used as a result of allocations, holds the status
of the nodes after the allocation. In most cases, we'll compare this
allocation result with others, to see which allocation decision makes
the most sense. This comparison is done via the cluster score.

However, if we later need to redo this computation, as part of other
comparisons, we'd need to evaluate it again, etc. So it's easier to just
compute the score at the place where we compute the node list in the
initial step.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

7d3f4253

Add two utility functions for the Result type · 06fb841e

Iustin Pop authored 14 years ago


Actually, this just moves the functions from the QC module to Types, and
removes a duplicate entry from Cluster.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

06fb841e

Add Cluster.splitCluster for node groups · f4161783

Iustin Pop authored 14 years ago


This splits a top-level cluster information into the component node
groups. Instance go to the group of their primary node, but otherwise we
don't disallow split instances.

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

f4161783

Add two functions for checking cluster consistency · 32b8d9c0

Iustin Pop authored 14 years ago


For now, we don't support instances allocated across two groups, and we
will reject such clusters. The isClusterConsistent function will return
a list of inconsistent instances, potentially allowing operation without
touch them (but only the rest).

Signed-off-by: Iustin Pop <iustin@google.com>
Reviewed-by: Balazs Lecz <leczb@google.com>

32b8d9c0

Nov 09, 2010

Fix tag exclusion weight · 306cccd5

Iustin Pop authored 14 years ago

Currently, the tag exclusion metric has a weight of one, which means
there might be cases where we won't move instances around because it
upsets the cluster metrics. However, we do want to make a higher effort
for cleaning up tag collisions, so we increase the weight to an
empirically-determined value of 2.

306cccd5

Sep 03, 2010
- Use the mingain options in the balancing algorithm · 848b65c9
  Iustin Pop authored 14 years ago
```
Also adds them in hbal.
```
  848b65c9
Aug 30, 2010

Change iterateAlloc to return the instance list · 94d08202

Iustin Pop authored 14 years ago

The Cluster.iterateAlloc and tieredAlloc functions are changed to also
return the updated instance list, since it is needed to have a “full”
cluster view.

94d08202

Jul 27, 2010

hail: fix error message for failed multi-evac · 0ca66853

Iustin Pop authored 14 years ago

Currently we show the instance index, but this makes no sense outside
the current running program. Instead, we show the instance name.

0ca66853

Jul 21, 2010

Change the meaning of the N+1 fail metric · c3c7a0c1

Iustin Pop authored 14 years ago

Currently, this metric tracks the nodes failing the N+1 check. While
this helps (in some cases) to evacuate such nodes, it's not a good
metric since rarely it will change during a step (only at the last
instance moving away). Therefore we replace it with the count of
instances living on such nodes, which is much better because:
- moving an instance away while the node is still N+1 failing will still
  reflect in the score as an optimization
- moving the last instance causing an N+1 failure will result in a heavy
  decrease of this score, thus giving the right bonus to clear this
  status

c3c7a0c1

Introduce per-metric weights · 8a3b30ca

Iustin Pop authored 14 years ago

Currently all metrics have the same weight (we just sum them together).
However, for the hard constraints (N+1 failures, offline nodes, etc.)
we should handle the metrics differently based on their meaning. For
example, an instance living on a primary offline node is worse than an
instance having its secondary node offline, which in turn is worse than
an instance having its secondary node failing N+1.

To express this case in our code, we introduce a table of weights for
the metrics, with which we can influence their relative importance.

8a3b30ca

Allow balancing moves to introduce N+1 errors · 2cae47e9

Iustin Pop authored 14 years ago

This patch switches the applyMove function to the extended versions of
Node.addPri and addSec, and passes the override flag based on the state
of the node that we're moving away from.

2cae47e9

Jul 19, 2010

hbal: print short names in steps list · 14c972c7

Iustin Pop authored 14 years ago

This was a regression from the name handling changes, as we started
using the original names for the solution list (which is not designed
for parsing/feeding back into ganeti).

14c972c7

Remove an obsolete function · fb33aaaf
Iustin Pop authored 14 years ago
```
printSolution is no longer used, as we print the solution iteratively
now.
```
fb33aaaf

Jul 18, 2010

Allow '+' in node list fields · 6dfa04fd

Iustin Pop authored 14 years ago

When the field list is prefixed with a plus sign, this will extend the
default field list, instead of replacing it entirely.

6dfa04fd

May 20, 2010

Add more unit tests for allocation/balance · 3fea6959

Iustin Pop authored 15 years ago

The patch adds some simple unit-tests for both the allocation function
(we can allocate small instances on an empty cluster, we can allocate in
tiered more starting from any size) and the balancing functions (one
single instance is placed optimally, a full cluster plus an empty node
can be rebalanced). The coverage has increased greatly, since this is
the bulk of the algorithm/code.

Also, the cluster tests are now being run with different options, since
they are much slower.

3fea6959

Move two functions from hspace to Cluster.hs · 3ce8009a
Iustin Pop authored 15 years ago
```
This is done so we can test a longer pipeline.
```
3ce8009a
Make CStats instance of show · 8423f76b
Iustin Pop authored 15 years ago
```
This helps debugging via ghci.
```
8423f76b

Stop modifying names for internal computations · 3e4480e0

Iustin Pop authored 15 years ago

Currently the name used internally is modified and holds the shortened
name of the nodes/instances. This has caused issues before, since we
always have to strip the suffix from input data and reapply it if we
need to send data back to Ganeti.

This patch changes the code such that the names are never modified, only
the alias, and all the internal computations can forget about the common
suffix addition/removal.

3e4480e0

May 18, 2010

Remove the noLimit values and always use limits · f4c0b8c5

Iustin Pop authored 15 years ago

This patch moves from allowing no-limits for disk/cpu ratios, and always
use a real limit. For disk, it's simple since we use 0, which means no
reservations for disks. For CPU, we set an (arbitrary) limit of 64 v/p,
which should be reasonable as a default limit (it can be changed via the
command line).

f4c0b8c5

May 04, 2010

Fix hspace's KM metrics · e2436511

Iustin Pop authored 15 years ago

We returned the KM_POOL_* metrics as the final state, not as the delta
between the final and the initial state.

e2436511

Apr 15, 2010

Add a new function to compute allocation deltas · 9b8fac3d

Iustin Pop authored 15 years ago

Given two cluster states, the new function can answer the following
questions:

- how much resources currently allocated
- how much resources finally allocated (delta from above is how much we
  can actually allocate on the cluster)
- unallocable resources (whatever is left free after the previous step)

9b8fac3d

Introduce total vcpu tracking in CStats · 86ecce4a

Iustin Pop authored 15 years ago

We add a new field that tracks the available virtual cpus (expressed as
node cpus times the vcpu ratio).

86ecce4a

Feb 25, 2010
- A number of small fixes from hlint · 5182e970
  Iustin Pop authored 15 years ago
  
  5182e970
Feb 23, 2010
- balance function: use the movable flag directly · c424cdc8
  Iustin Pop authored 15 years ago
```
Instead of deciding based on secondary node, use the new flag.
```
  c424cdc8
Feb 22, 2010

Add a tryEvac function · 12b0511d

Iustin Pop authored 15 years ago


This will be used by the node evacuate IAllocator request type.

Signed-off-by: Iustin Pop <iustin@google.com>

12b0511d

Move a type declaration to Node.hs · 1fe81531

Iustin Pop authored 15 years ago


We'll need AllocElement in both Cluster and IAlloc in the future, so we
move it to Node.hs which is imported by both.

Signed-off-by: Iustin Pop <iustin@google.com>

1fe81531

Change an internal type from Maybe to list · 23f9ab76

Iustin Pop authored 15 years ago


In preparation for multiple responses, we change from Maybe to List
(both used in the container sense).

This allows us to keep the same workflow for all kind of requests.

Signed-off-by: Iustin Pop <iustin@google.com>

23f9ab76

Implement evacuation mode in hbal · 2e28ac32

Iustin Pop authored 15 years ago


This mode restricts the list of instances to be moved to the instances
living on the offline (and drained) nodes.

Signed-off-by: Iustin Pop <iustin@google.com>

2e28ac32