Elsevier

Journal of Algorithms

Volume 30, Issue 1, January 1999, Pages 144-165
Journal of Algorithms

Regular Article
Fault-Local Distributed Mending

https://doi.org/10.1006/jagm.1998.0972Get rights and content

Abstract

As communication networks grow, existing fault handling tools that involveglobalmeasures such as global time-outs or reset procedures become increasingly unaffordable, since their cost grows with the size of the network. Rather, for a fault handling mechanism to scale to large networks, its cost must depend only on the number of failed nodes (which, thanks to today's technology, grows much more slowly than the networks). Moreover, it should allow the nonfaulty regions of the networks to continue their operation even during the recovery of the faulty parts. This paper introduces the conceptsfault localityandfault-locally mendableproblems, which are problems for which there are correction algorithms (applied after faults) whose cost depends only on the (unknown) number of faults. We show thatanyinput-output problem is fault-locally mendable. The solution involves a novel technique combining data structures and “local votes” among nodes, which may be of interest in itself.

References (33)

  • Y. Afek et al.

    Applying static network protocols to dynamic networks

    Proceedings 28th IEEE Symposium on Foundations of Computer Science

    (October 1987)
  • B. Awerbuch, I. Cidon, S. Kutten, Optimal maintenance of replicated informations, in, Proceedings 31st IEEE Symposium...
  • B. Awerbuch et al.

    Optimal broadcasting with partial knowledge

    SIAM J Comput.

    (1998)
  • B. Awerbuch, B. Patt-Shamir, G. Varghese, Self-stabilization by local checking and correction, in, Proceedings of the...
  • Y. Afek, S. Kutten, M. Yung, Memory-efficient self-stabilizing protocols for general networks, in, Proceedings of the...
  • B. Awerbuch, S. Kutten, Y. Mansour, B. Patt-Shamir, G. Varghese, Time-optimal self-stabilizing synchronization, in,...
  • D.E. Comer

    Internetworking with TCP/IP

    (1991)
  • A. Bar-Noy et al.

    Designing broadcasting algorithms in the postal model for message-passing systems

    Math. Systems Theory

    (1994)
  • J-C. Bermond, D. Peleg, The power of small coalitions in graphs, in, Proceedings of the 2nd Colloquium on Structural...
  • J-C. Bermond, J. Bond, D. Peleg, S. Perennes, Tight bounds on the size of 2-monopolies, in, Proceedings of the 3rd...
  • I. Cidon et al.

    Distributed control for fast networks

    IEEE Trans. Commun.

    (1995)
  • T. D. Chandra, S. Toueg, Unreliable failure detector for asynchronous systems, in, Proceedings of the 10th ACM...
  • S. Ghosh, A. Gupta, T. Herman, S. V. Pemmaraju, Fault-containing self-stabilizing algorithms, in, Proceedings of the...
  • T. D. Chandra, V. Hadzilacos, S. Toueg, The weakest failure detector for solving consensus, in, Proceedings of the 11th...
  • I. Cidon et al.

    New models and algorithms for future networks

    IEEE Trans. Inform. Theory

    (1995)
  • I. Chlamtac, S. Kutten, A spatial reuse TDMA/FDMA for mobile multihop radio networks, in, Proceedings of IEEE INFOCOM,...
  • Cited by (36)

    • Error-sensitive proof-labeling schemes

      2022, Journal of Parallel and Distributed Computing
    • Dynamic networks of finite state machines

      2020, Theoretical Computer Science
    • An analysis of fault detection strategies in wireless sensor networks

      2017, Journal of Network and Computer Applications
      Citation Excerpt :

      In a majority of cases, the faults are local to specific component of WSN. Faults are extremely local in the network and only affect a few components of the network (Kutten and Peleg, 1999). Generally, the entire WSN is not faulty.

    • Irreversible conversion processes with deadlines

      2014, Journal of Discrete Algorithms
      Citation Excerpt :

      The spreading of something like a property or a commodity within a network is a recurrent feature in many contexts such as social influence [8,12,16,20], gene expression networks [15], immune systems [1], cellular automata [2], percolation [4], marketing strategies [7,16], and distributed computing [5,10,17–19].

    • Reversible iterative graph processes

      2012, Theoretical Computer Science
    View all citing articles on Scopus
    *

    Alexander Goldberg lecturer.

    Supported in part by a Walter and Elise Haas Career Development Award and by a grant from the Israel Science Foundation. Part of the work was done while visiting the IBM T. J. Watson Research Center.

    f1

    [email protected]

    f2

    [email protected]

    View full text