1. 05 Jul, 2009 4 commits
    • Iustin Pop's avatar
      Add computation of the failure reason in hspace · 44763b51
      Iustin Pop authored
      This patch enhances hspace to report why the allocation sequence
      stopped, both in absolute error count and for the top reason.
      44763b51
    • Iustin Pop's avatar
      Return correct failure data from Node.add* · c43c3354
      Iustin Pop authored
      This patch alters the Node.addPri/addSec to return correct failure data.
      It removes the computeFailN1 function from the module as that used to
      combine both mem and disk checks in the same function and thus the real
      error reason cannot be deduced when using it.
      c43c3354
    • Iustin Pop's avatar
      Introduce a new type for allocation results · f2280553
      Iustin Pop authored
      Currently the allocation/move operations workflow return ‘Maybe a’,
      which is very convenient but loses all details about the failure mode.
      
      This patch introduces a new data type which encodes the specific failure
      mode. It is not yet used correctly (e.g. all node operations result in
      FailN1), but the workflow is updated. Most of the changes are in the
      hail/hspace tools, the library code required only trivial adjustments.
      f2280553
    • Iustin Pop's avatar
      Remove hn1 and related code · 266aea94
      Iustin Pop authored
      hn1 was deprecated for a while and this patch removes it altogether. The
      support code in Cluster.hs is also removed.
      266aea94
  2. 03 Jul, 2009 4 commits
    • Iustin Pop's avatar
      Display two more stats in hspace · 78ecfa8f
      Iustin Pop authored
      This adds two new stats - sum of reserved ram and disk.
      78ecfa8f
    • Iustin Pop's avatar
      Fix totalResources avail disk computation · 301789f4
      Iustin Pop authored
      This uses the newly-added Node.availDisk to compute the actual available
      disk correctl, and display the total allocatable disk in hspace.
      301789f4
    • Iustin Pop's avatar
      Add an availDisk node function · fe3d6f02
      Iustin Pop authored
      This function returns the amount of available disk, which depends on
      whether a low disk limit has been configured or not and on the free disk
      space of the node.
      fe3d6f02
    • Iustin Pop's avatar
      Add two new autocomputed vars to Nodes · 836533fa
      Iustin Pop authored
      Currently we track the max disk usage/max vcpus as percentages, however
      sometimes it's easier to check against minimum free disk or maximum
      number of cpus, as units instead of percentages.
      
      This patch adds two new variables, lo_dsk, hi_cpu, which are recomputed
      whenever we change the m_dsk/m_cpu 'master' values.
      836533fa
  3. 02 Jul, 2009 3 commits
    • Iustin Pop's avatar
      Add a new type for cluster statistics · 1a7eff0e
      Iustin Pop authored
      Currently totalResources returns a 5-tuple of integers. This is not easy
      to handle, as each change on the return type means that each caller must
      be updated.
      
      This patch adds a new type for cluster stats and uses that instead as
      its return type. This simplifies its callers while complicating only a
      little the computation of the stats.
      1a7eff0e
    • Iustin Pop's avatar
      Enhance hspace resource display · 621de5b7
      Iustin Pop authored
      The display of cluster resources is extracted into a separate function
      and enhanced to display more stats.
      621de5b7
    • Iustin Pop's avatar
      Add display of more stats in hspace · e2af3156
      Iustin Pop authored
      This patch changes Cluster.totalResources to compute more details about
      the cluster status, and enhances hspace to display more of these.
      e2af3156
  4. 16 Jun, 2009 2 commits
  5. 12 Jun, 2009 5 commits
  6. 11 Jun, 2009 4 commits
    • Iustin Pop's avatar
      Add two new node attributes · 844eff86
      Iustin Pop authored
      Two new min disk free ratio and max cpu usage attributes are added to the
      nodes. These will be used in the future to restrict allocation.
      844eff86
    • Iustin Pop's avatar
      Fix 'unused X' warnings · c6484f0b
      Iustin Pop authored
      This removes some unused functions and imports to cleanup the warnings.
      c6484f0b
    • Iustin Pop's avatar
      Fix the various monomorphism warning · 78694255
      Iustin Pop authored
      In a few places (e.g. tryRead or any printf call) it's a little bit hard
      to add the correct type signatures, but in the it is possible to fix
      these warnings (which can bite one in subtle cases).
      78694255
    • Iustin Pop's avatar
      Small changes to the node list output · 3c64b5aa
      Iustin Pop authored
      This is just some cleanup of the node list output, adding pcpu/vcpu
      counters, and making the display slightly nicer.
      3c64b5aa
  7. 10 Jun, 2009 5 commits
  8. 05 Jun, 2009 1 commit
  9. 04 Jun, 2009 1 commit
    • Iustin Pop's avatar
      Fix the ReplacePrimary instance move · 70db354e
      Iustin Pop authored
      During a replace-primary instance move, on the real cluster the instance
      is temporarily started on the secondary, and as such we must check that
      the secondary node can hold it for this duration. Currently the code
      does not, and depending on cluster scoring it will put instances on such
      'bad' secondaries (usually without enough memory).
      
      This patch fixes this by adding/removing the instance as a primary on
      the secondary node; and then using the result node as the new secondary.
      (Since we're in a monad, we could have just tried the allocation and
      dropped the result, but this is more clear).
      70db354e
  10. 02 Jun, 2009 1 commit
  11. 01 Jun, 2009 10 commits