• Iustin Pop's avatar
    Reduce chance of ssh failures in verify cluster · b544cfe0
    Iustin Pop authored
    The cluster verify builds a sorted list of nodes and passes that to all
    the nodes (in parallel) for ssh checks. This means that for a cluster
    with N nodes, there will be approximately N simultaneous connections to
    the first node, then to the second node, etc. This, coupled with the
    ssh daemon's “MaxStartups” parameter, can create false alarms about ssh
    connectivity.
    
    This patch randomizes the node list in the backend (therefore, each node
    should have it's own order of ssh-ing to the other nodes) and the chance
    of these alarms should be reduced.
    
    Reviewed-by: ultrotter
    b544cfe0
backend.py 49.4 KB