nach oben

Design Automation for Embedded Systems

Erschienen in:

01.06.2013

Instruction scheduling with k-successor tree for clustered VLIW processors

verfasst von: Xuemeng Zhang, Hui Wu, Jingling Xue

Erschienen in: Design Automation for Embedded Systems | Ausgabe 2/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Clustering is a well-known technique for improving the scalability of classical VLIW (Very Long Instruction Word) processors. A clustered VLIW processor consists of multiple clusters. Each cluster has a local register file and a set of functional units. This paper proposes a novel phase coupled, priority-based heuristic for scheduling a set of operations in a basic block on a clustered VLIW processor. Our heuristic converts the instruction scheduling problem to the problem of scheduling a set of operations with a common deadline. The priority of each operation v _i is the l _max(v _i)-successor-tree-consistent deadline. This deadline is the upper bound on the latest completion time of v _i in any feasible schedule for a relaxed problem where the precedence-latency constraints only between v _i and all its successors are considered. We have simulated our heuristic and the Integrated heuristic on the 808 basic blocks taken from the MediaBench II benchmark suite using three processor models. On average, for the three processor models, our heuristic improves over the Integrated heuristic by 13 %, 18 %, 16 %, respectively.

Vorheriger Artikel Fault buffers

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Our scheduling heuristic can be extended to handle an arbitrary communication network.

The LDW operations load data from the memory to registers. The ADD and SSUB operations perform addition and subtraction.

Hennessy JL, Patterson DA (2006) Computer architecture: a quantitative approach, 4th edn. Elsevier, Amsterdam

Terechko A, Le Thenaff E, Garg M, van Eijndhoven J, Corporaal H (2003) Inter-cluster communication models for clustered VLIW processors. In: Proceedings of symposium on high performance computer architectures

Nagpal R, Srikant YN (2008) Pragmatic integrated scheduling for clustered VLIW architectures. Softw Pract Exp 38:227–257 CrossRef

Ullman JD (1976) Complexity of sequencing problems. Wiley, New York

Ellis JR (1986) Bulldog: a compiler for VLIW architectures. MIT Press, Cambridge

Jang S, Carr S, Sweany P, Kuras D (1998) A code generation framework for VLIW architectures with partitioned register banks. In: Proceedings of 3rd international conference on massively parallel computing systems

Lapinskii VS, Jacome MF (2002) Cluster assignment for high-performace embedded VLIW processors. ACM Trans Des Autom Electron Syst 7(3):430–454 CrossRef

Özer E, Banerjia S, Conte TM (1998) Unified Assign and schedule: a new approach to scheduling for clustered register file microarchitectures. In: Proceedings of the 31st annual international symposium on microarchitecture

Leupers R (2000) Instruction scheduling for clustered VLIW DSPs. In: Proceedings of the international conference on parallel architecture and compilation techniques

10.

Kailas K, Ebcioglu K, Agrawala A (2001) CARS: a new code generation framework for clustered ILP processors. In: Proceedings of the 2001 international symposium on high performance computer architecture

11.

Sánchez J, Gonzálezor A (2000) Instruction scheduling for clustered VLIW architectures. In: Proceedings of 13th international symposium on system synthesis

12.

Zalamea J, Llosa J, Ayguade E, Valero M (2001) Modulo scheduling with integrated register spilling for clustered VLIW architectures. In: Proceedings of the 34th annual international symposium on microarchitecture, pp 160–169

13.

Gibbons PB, Muchnick SS (1986) Efficient instruction scheduling for a pipelined architecture. In: Proceedings of the ACM SIGPLAN conference on programming language design and implementation

14.

Codina JM, Sánchez J, González A (2001) A unified modulo scheduling and register allocation technique for clustered processors. In: Proceedings of 2001 international conference on parallel architecture and compilation techniques

15.

Qian Y, Carr S, Sweany P (2002) Optimizing loop performance for clustered VLIW architectures. In: Proceedings of 2002 international conference on parallel architecture and compilation techniques

16.

Aleta A, Codina JM, Sánchez J, González A, Kaeli D (2009) AGAMOS: a graph-based approach to modulo scheduling for clustered microarchitectures. IEEE Trans Comput 58(6):770–783 CrossRefMathSciNet

17.

TI TMS320C64xx DSPs. http://www.ti.com

18.

Micali S, Vazirani V (1980) An algorithm for finding maximum matching in general graphs. In: Proceedings of the 21st IEEE symposium on foundations of computer science

19.

mediabench. Mediabench II benchmark. http://euler.slu.edu/~fritts/mediabench/

Titel: Instruction scheduling with k-successor tree for clustered VLIW processors
verfasst von: Xuemeng Zhang
Hui Wu
Jingling Xue
Publikationsdatum: 01.06.2013
Verlag: Springer US
Erschienen in: Design Automation for Embedded Systems / Ausgabe 2/2013
Print ISSN: 0929-5585
Elektronische ISSN: 1572-8080
DOI: https://doi.org/10.1007/s10617-012-9103-0

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 2/2013

On the hard-real-time scheduling of embedded streaming applications

Model-based implementation of distributed systems with priorities

The OMLP family of optimal multiprocessor real-time locking protocols

Virtualizing on-chip distributed ScratchPad memories for low power and trusted application execution

Introductions to special issue on ESWEEK 2011

Symbolic system-level design methodology for multi-mode reconfigurable systems

Premium Partner