    Change the meaning of the N+1 fail metric · c3c7a0c1
    Iustin Pop authored
    Currently, this metric tracks the nodes failing the N+1 check. While
    this helps (in some cases) to evacuate such nodes, it is not a good
    metric, since it will rarely change during a step (only when the last
    instance moves away). Therefore we replace it with the count of
    instances living on such nodes, which is much better because:
    - moving an instance away while the node is still failing N+1 will
      still be reflected in the score as an improvement
    - moving the last instance causing an N+1 failure will result in a
      large decrease of this score, thus giving the right bonus for
      clearing this status
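
    The difference between the two meanings can be illustrated with a
    minimal sketch. This is not the code from this commit: the Node class,
    its fields, and both function names are hypothetical, and the sketch
    assumes the metric is a plain, unnormalized count.

```python
from dataclasses import dataclass


@dataclass
class Node:
    # Hypothetical model of a node; not the project's real data type.
    name: str
    instances: int   # number of instances living on the node
    fails_n1: bool   # True if the node currently fails the N+1 check


def old_metric(nodes):
    """Previous meaning: number of nodes failing the N+1 check."""
    return sum(1 for n in nodes if n.fails_n1)


def new_metric(nodes):
    """New meaning: number of instances living on N+1-failing nodes."""
    return sum(n.instances for n in nodes if n.fails_n1)


# Three illustrative cluster states (made-up numbers).
before = [Node("a", 3, True), Node("b", 2, False)]
# One instance moved from "a" to "b"; "a" still fails N+1.
after_one_move = [Node("a", 2, True), Node("b", 3, False)]
# A second instance moved; "a" no longer fails N+1.
after_clear = [Node("a", 1, False), Node("b", 4, False)]

for label, state in [("before", before),
                     ("after one move", after_one_move),
                     ("after clearing", after_clear)]:
    print(label, old_metric(state), new_metric(state))
```

    In this sketch the old count stays at 1 after the first move, so the
    step looks like no progress, while the new count drops from 3 to 2.
    Once the node stops failing, the new count falls from 2 to 0 in one
    step, which is the bonus for clearing the status.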