skip to main content
10.1145/1921249.1921254acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmicroConference Proceedingsconference-collections
research-article

A reconfigurable fault-tolerant deflection routing algorithm based on reinforcement learning for network-on-chip

Authors Info & Claims
Published:04 December 2010Publication History

ABSTRACT

We propose a reconfigurable fault-tolerant deflection routing algorithm (FTDR) based on reinforcement learning for NoC. The algorithm reconfigures the routing table through a kind of reinforcement learning---Q-learning using 2-hop fault information. It is topology-agnostic and insensitive to the shape of the fault region. In order to reduce the routing table size, we also propose a hierarchical Q-learning based deflection routing algorithm (FTDR-H) with area reduction up to 27% for a switch in an 8 x 8 mesh compared to the original FTDR. Experimental results show that in the presence of faults, FTDR and FTDR-H are better than other fault-tolerant deflection routing algorithms and a turn model based fault-tolerant routing algorithm.

References

  1. C. Constantinescu, "Trends and challenges in VLSI circuit reliability," IEEE Micro, 23(4): 14--19, July-August 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Z. Lu, M. Zhong and A. Jantsch, "Evaluation of on-chip networks using deflection routing," In Proceedings of ACM Great Lakes Symposium on VLSI, pages 296--301, May 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Moscibroda and O. Mutlu, "A case for bufferless routing in on-chip networks," In Proceedings of International Symposium on Computer Architecture, pages 196--207, June 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Hayenga, N. E. Jerger and M. Lipasti, "SCARAB: a single cycle adaptive routing and bufferless network," In Proceedings of International Symposium on Microarchitecture, pages 244--254, June 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Kohler and M. Radetzki, "Fault-tolerant architecture and deflection routing for degradable NoC switches," In Proceedings of IEEE International Symposium on Networks-on-Chip, pages 22--31, May 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Feng, Z. Lu, A. Jantsch, J. Li and M. Zhang, "FoN: Fault-on-Neighbor aware routing algorithm for Networks-on-Chip," In Proceedings of the 23rd IEEE International SoC Conference, pages 441--446, September 2010.Google ScholarGoogle Scholar
  7. D. Fick, A. Deorio, G. Chen, V. Bertacco, D. Sylvester and D. Blaauw, "A highly resilient routing algorithm for fault-tolerant NoCs," In Proceedings of Design, Automation and Test in Europe Conference and Exhibition, pages 21--26, April 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Wu, "Fault-tolerant adaptive and minimal routing in mesh-connected multicomputers using extended safety levels," IEEE Transactions on Parallel and Distributed Systems, 11(2):149--159, February 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Y. J. Suh, B. V. Dao, J. Duato and S. Yalamanchili, "Software-based rerouting for fault-tolerant pipelined communication," IEEE Transactions on Parallel and Distributed Systems, 11(3):193--211, March 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Z. Zhang, A. Greiner and S. Taktak, "A reconfigurable routing algorithm for a fault-tolerant 2D-mesh Network-on-Chip," In Proceedings of ACM/IEEE Design Automation Conference, pages 441--446, June 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Holsmark, M. Palesi and S. Kumar, "Deadlock free routing algorithms for irregular mesh topology NoC systems with rectangular regions," Journal of Systems Architecture, 54(3):427--440, March 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. V. Puente, J. A. Gregorio, F. Vallejo and R. Beivide, "Immunet: dependable routing for interconnection networks with arbitrary topology," IEEE Transactions on Computers, 57(12):1676--1689, December 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Mejia, M. Palesi, J. Flieh, S. Kumar, P. Lopez, R. Holsmark and J. Duato, "Region-based routing: a mechanism to support efficient routing algorithms in NoCs," IEEE Transactions on VLSI Systems, 17(3):356--369, May 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Majer, C. Bobda, A. Ahmadinia and J. Teich, "Packet routing in dynamically changing networks on chip," In Proceedings of IEEE International Parallel and Distributed Processing Symposium, pages 154b, April 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. E. Nilsson, M. Millberg, J. Oberg and A. Jantsch, "Load distribution with the proximity congestion awareness in a network on chip," In Proceedings of Design, Automation and Test in Europe Conference and Exhibition, pages 1126--1127, March 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. X. Chen, Z. Lu, A. Jantsch and S. Chen, "Supporting distributed shared memory on multi-core network-on-chips using a dual microcoded controller," In Proceedings of Design, Automation and Test in Europe Conference and Exhibition, pages 39--44, March 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. A. Boyan and M. L. Littman, "Packet routing in dynamically changing networks: a reinforcement learning approach," Advances in Neural Information Processing Systems, Vol. 6, pages 671--678, 1994.Google ScholarGoogle Scholar

Index Terms

  1. A reconfigurable fault-tolerant deflection routing algorithm based on reinforcement learning for network-on-chip

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        NoCArc '10: Proceedings of the Third International Workshop on Network on Chip Architectures
        December 2010
        62 pages
        ISBN:9781450303972
        DOI:10.1145/1921249

        Copyright © 2010 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 December 2010

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate46of122submissions,38%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader