Skip to main content
Top

Towards a Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms

  • 2024
  • OriginalPaper
  • Chapter
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The chapter discusses the challenges of deploying large deep neural networks in resource-constrained edge environments and the need for adaptive optimization algorithms. It introduces a new framework that allows flexible configurations of input parameters and automatic selection of regression models for predicting module inference latency. The proposed Multi-task Encoder-Decoder Network (MEDN) is highlighted as a more accurate and efficient alternative to existing regression models. The framework's ability to measure device dynamics and handle various input parameters, including Inferable Parameters, is emphasized. Experimental results demonstrate MEDN's superior performance and the effectiveness of the Time/Space-efficient Auto-selection algorithm. Future research directions are also outlined, focusing on further enhancing the framework's capabilities.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
Towards a Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms
Authors
Jingran Shen
Nikos Tziritas
Georgios Theodoropoulos
Copyright Year
2024
DOI
https://doi.org/10.1007/978-3-031-57808-3_3
This content is only visible if you are logged in and have the appropriate permissions.

Premium Partner

    Image Credits
    Neuer Inhalt/© ITandMEDIA, Nagarro GmbH/© Nagarro GmbH, AvePoint Deutschland GmbH/© AvePoint Deutschland GmbH, AFB Gemeinnützige GmbH/© AFB Gemeinnützige GmbH, USU GmbH/© USU GmbH, Ferrari electronic AG/© Ferrari electronic AG