Skip to main content

2024 | OriginalPaper | Buchkapitel

9. Intelligence Inference on IoT Devices

verfasst von : Qiyang Zhang, Ying Li, Dingge Zhang, Ilir Murturi, Victor Casamayor Pujol, Schahram Dustdar, Shangguang Wang

Erschienen in: Learning Techniques for the Internet of Things

Verlag: Springer Nature Switzerland

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

With the rapid advancement of artificial intelligence (AI), the proliferation of deep neural networks (DNNs) has ushered in a transformative era, revolutionizing modern lifestyles and enhancing production efficiency. However, the substantial computational and data requirements generated by Internet of Things (IoT) devices present a significant bottleneck, rendering traditional cloud-based computing models inadequate for real-time processing tasks. In response to these challenges, developers have increasingly turned to cloud offloading as a solution, despite the high infrastructure costs and heavy reliance on network conditions associated with this approach. Meanwhile, the emergence of SoCs has enabled on-device execution, particularly on high-tier platforms capable of effectively handling SOTA DNNs. This chapter offers a comprehensive review of intelligent inference approaches, with a specific emphasis on reducing inference time and minimizing transmitted bandwidth between IoT devices and the cloud. The review encompasses various aspects, including the background of inference, hardware architectures supporting inference, a diverse range of intelligent applications, inference libraries tailored for IoT devices, and different types of inference techniques for applications. Additionally, this work addresses the current challenges in intelligent inference, discusses future development trends, and provides future research directions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Adadi, Amina, and Mohammed Berrada. 2018. Peeking inside the black-box: A survey on explainable artificial intelligence (XAI). IEEE access 6: 52138–52160.CrossRef Adadi, Amina, and Mohammed Berrada. 2018. Peeking inside the black-box: A survey on explainable artificial intelligence (XAI). IEEE access 6: 52138–52160.CrossRef
Zurück zum Zitat Ain, Qurat-ul et al. 2018. IoT operating system based fuzzy inference system for home energy management system in smart buildings. Sensors 18 (9): 2802.CrossRef Ain, Qurat-ul et al. 2018. IoT operating system based fuzzy inference system for home energy management system in smart buildings. Sensors 18 (9): 2802.CrossRef
Zurück zum Zitat Alkhabbas, Fahed, et al. 2020. A goal-driven approach for deploying self-adaptive IoT systems. In 2020 IEEE International Conference on Software Architecture (ICSA), 146–156. Piscataway: IEEE.CrossRef Alkhabbas, Fahed, et al. 2020. A goal-driven approach for deploying self-adaptive IoT systems. In 2020 IEEE International Conference on Software Architecture (ICSA), 146–156. Piscataway: IEEE.CrossRef
Zurück zum Zitat Almeida, Mario, et al. 2022. Dyno: Dynamic onloading of deep neural networks from cloud to device. ACM Transactions on Embedded Computing Systems 21 (6): 1–24.CrossRef Almeida, Mario, et al. 2022. Dyno: Dynamic onloading of deep neural networks from cloud to device. ACM Transactions on Embedded Computing Systems 21 (6): 1–24.CrossRef
Zurück zum Zitat Azizi, Shekoofeh, et al. 2023. Synthetic data from diffusion models improves imagenet classification. arXiv preprint. arXiv:2304.08466. Azizi, Shekoofeh, et al. 2023. Synthetic data from diffusion models improves imagenet classification. arXiv preprint. arXiv:2304.08466.
Zurück zum Zitat Bajrami, Xhevahir, et al. 2018. Face recognition performance using linear discriminant analysis and deep neural networks. International Journal of Applied Pattern Recognition 5 (3): 240–250.CrossRef Bajrami, Xhevahir, et al. 2018. Face recognition performance using linear discriminant analysis and deep neural networks. International Journal of Applied Pattern Recognition 5 (3): 240–250.CrossRef
Zurück zum Zitat Bradski, Gary, Adrian Kaehler, et al. 2000. OpenCV. Dr. Dobb’s Journal of Software Tools 3 (2): 1–81. Bradski, Gary, Adrian Kaehler, et al. 2000. OpenCV. Dr. Dobb’s Journal of Software Tools 3 (2): 1–81.
Zurück zum Zitat Cheng, Yu, et al. 2015. An exploration of parameter redundancy in deep networks with circulant projections. In Proceedings of the IEEE International Conference on Computer Vision, 2857–2865. Cheng, Yu, et al. 2015. An exploration of parameter redundancy in deep networks with circulant projections. In Proceedings of the IEEE International Conference on Computer Vision, 2857–2865.
Zurück zum Zitat Choudhary, Tejalal, et al. 2020. A comprehensive survey on model compression and acceleration. Artificial Intelligence Review 53: 5113–5155.CrossRef Choudhary, Tejalal, et al. 2020. A comprehensive survey on model compression and acceleration. Artificial Intelligence Review 53: 5113–5155.CrossRef
Zurück zum Zitat Courville, Vanessa, and Vahid Partovi Nia. 2019. Deep learning inference frameworks for ARM CPU. Journal of Computational Vision and Imaging Systems 5 (1): 3–3. Courville, Vanessa, and Vahid Partovi Nia. 2019. Deep learning inference frameworks for ARM CPU. Journal of Computational Vision and Imaging Systems 5 (1): 3–3.
Zurück zum Zitat Deng, Yunbin. 2019. Deep learning on mobile devices: A review. In Mobile Multimedia/Image Processing, Security, and Applications 2019. Vol. 10993, 52–66. Bellingham: SPIE. Deng, Yunbin. 2019. Deep learning on mobile devices: A review. In Mobile Multimedia/Image Processing, Security, and Applications 2019. Vol. 10993, 52–66. Bellingham: SPIE.
Zurück zum Zitat Developers, TensorFlow. 2022. TensorFlow. In Zenodo. Developers, TensorFlow. 2022. TensorFlow. In Zenodo.
Zurück zum Zitat Donta, Praveen Kumar, and Schahram Dustdar. 2022. The promising role of representation learning for distributed computing continuum systems. In 2022 IEEE International Conference on Service-Oriented System Engineering (SOSE), 126–132. Piscataway: IEEE.CrossRef Donta, Praveen Kumar, and Schahram Dustdar. 2022. The promising role of representation learning for distributed computing continuum systems. In 2022 IEEE International Conference on Service-Oriented System Engineering (SOSE), 126–132. Piscataway: IEEE.CrossRef
Zurück zum Zitat Donta, Praveen Kumar, Boris Sedlak, et al. 2023. Governance and sustainability of distributed continuum systems: A big data approach. Journal of Big Data 10 (1): 1–31.CrossRef Donta, Praveen Kumar, Boris Sedlak, et al. 2023. Governance and sustainability of distributed continuum systems: A big data approach. Journal of Big Data 10 (1): 1–31.CrossRef
Zurück zum Zitat Dustdar, Schahram, and Ilir Murturi. 2020. Towards distributed edge-based systems. In 2020 IEEE Second International Conference on Cognitive Machine Intelligence (CogMI), 1–9. Piscataway: IEEE. Dustdar, Schahram, and Ilir Murturi. 2020. Towards distributed edge-based systems. In 2020 IEEE Second International Conference on Cognitive Machine Intelligence (CogMI), 1–9. Piscataway: IEEE.
Zurück zum Zitat Dustdar, Schahram, and Ilir Murturi. 2021. Towards IoT processes on the edge. In Next-Gen Digital Services. A Retrospective and Roadmap for Service Computing of the Future: Essays Dedicated to Michael Papazoglou on the Occasion of His 65th Birthday and His Retirement, 167–178. Dustdar, Schahram, and Ilir Murturi. 2021. Towards IoT processes on the edge. In Next-Gen Digital Services. A Retrospective and Roadmap for Service Computing of the Future: Essays Dedicated to Michael Papazoglou on the Occasion of His 65th Birthday and His Retirement, 167–178.
Zurück zum Zitat Flamis, Georgios, et al. 2021. Best practices for the deployment of edge inference: The conclusions to start designing. Electronics 10 (16): 1912.CrossRef Flamis, Georgios, et al. 2021. Best practices for the deployment of edge inference: The conclusions to start designing. Electronics 10 (16): 1912.CrossRef
Zurück zum Zitat Girshick, Ross, et al. 2015. Region-based convolutional networks for accurate object detection and segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 38 (1): 142–158.CrossRef Girshick, Ross, et al. 2015. Region-based convolutional networks for accurate object detection and segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 38 (1): 142–158.CrossRef
Zurück zum Zitat Guo, Anqi, et al. 2023. Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training. In Proceedings of the 37th International Conference on Supercomputing, 336–347. Guo, Anqi, et al. 2023. Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training. In Proceedings of the 37th International Conference on Supercomputing, 336–347.
Zurück zum Zitat Guo, Peizhen, and Wenjun Hu. 2018. Potluck: Cross-application approximate deduplication for computation-intensive mobile applications. In Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 271–284. Guo, Peizhen, and Wenjun Hu. 2018. Potluck: Cross-application approximate deduplication for computation-intensive mobile applications. In Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 271–284.
Zurück zum Zitat Han, Seungyeop, et al. 2016. MCDNN: An approximation-based execution framework for deep stream processing under resource constraints. In Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services, 123–136. Han, Seungyeop, et al. 2016. MCDNN: An approximation-based execution framework for deep stream processing under resource constraints. In Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services, 123–136.
Zurück zum Zitat Haris, Jude, Gibson, Perry, Cano, José, Agostini, Nicolas Bohm and Kaeli, David. 2022. Hardware/Software Co-Design of Edge DNN Accelerators with TFLite. 107 (8): 1–4. Haris, Jude, Gibson, Perry, Cano, José, Agostini, Nicolas Bohm and Kaeli, David. 2022. Hardware/Software Co-Design of Edge DNN Accelerators with TFLite. 107 (8): 1–4.
Zurück zum Zitat Hu, Chuang, et al. 2019. Dynamic adaptive DNN surgery for inference acceleration on the edge. In IEEE INFOCOM 2019-IEEE Conference on Computer Communications, 1423–1431. Piscataway: IEEE.CrossRef Hu, Chuang, et al. 2019. Dynamic adaptive DNN surgery for inference acceleration on the edge. In IEEE INFOCOM 2019-IEEE Conference on Computer Communications, 1423–1431. Piscataway: IEEE.CrossRef
Zurück zum Zitat Huang, Jin, et al. 2020. Clio: Enabling automatic compilation of deep learning pipelines across iot and cloud. In Proceedings of the 26th Annual International Conference on Mobile Computing and Networking, 1–12. Huang, Jin, et al. 2020. Clio: Enabling automatic compilation of deep learning pipelines across iot and cloud. In Proceedings of the 26th Annual International Conference on Mobile Computing and Networking, 1–12.
Zurück zum Zitat Huynh, Loc N., et al. 2017. DeepMon: Mobile GPU-based deep learning framework for continuous vision applications. In Proceedings of the 15th Annual International Conference on Mobile Systems, Applications, and Services, 82–95. Huynh, Loc N., et al. 2017. DeepMon: Mobile GPU-based deep learning framework for continuous vision applications. In Proceedings of the 15th Annual International Conference on Mobile Systems, Applications, and Services, 82–95.
Zurück zum Zitat Iandola, Forrest N., et al. 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and\(<\) 0.5 MB model size. arXiv preprint. arXiv:1602.07360. Iandola, Forrest N., et al. 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and\(<\) 0.5 MB model size. arXiv preprint. arXiv:1602.07360.
Zurück zum Zitat Jayakodi, Nitthilan Kanappan, Janardhan Rao Doppa, et al. 2020. SETGAN: Scale and energy trade-off gans for image applications on mobile platforms. In Proceedings of the 39th International Conference on Computer-Aided Design, 1–9. Jayakodi, Nitthilan Kanappan, Janardhan Rao Doppa, et al. 2020. SETGAN: Scale and energy trade-off gans for image applications on mobile platforms. In Proceedings of the 39th International Conference on Computer-Aided Design, 1–9.
Zurück zum Zitat Jayakodi, Nitthilan Kanappan, Syrine Belakaria, et al. 2020. Design and optimization of energy-accuracy tradeoff networks for mobile platforms via pretrained deep models. ACM Transactions on Embedded Computing Systems (TECS) 19 (1): 1–24.CrossRef Jayakodi, Nitthilan Kanappan, Syrine Belakaria, et al. 2020. Design and optimization of energy-accuracy tradeoff networks for mobile platforms via pretrained deep models. ACM Transactions on Embedded Computing Systems (TECS) 19 (1): 1–24.CrossRef
Zurück zum Zitat Jeong, Hyuk-Jin, et al. 2018. IONN: Incremental offloading of neural network computations from mobile devices to edge servers. In Proceedings of the ACM Symposium on Cloud Computing, 401–411. Jeong, Hyuk-Jin, et al. 2018. IONN: Incremental offloading of neural network computations from mobile devices to edge servers. In Proceedings of the ACM Symposium on Cloud Computing, 401–411.
Zurück zum Zitat Jiang, Xiaotang, et al. 2020. MNN: A universal and efficient inference engine. In Proceedings of Machine Learning and Systems. Vol. 2, 1–13. Jiang, Xiaotang, et al. 2020. MNN: A universal and efficient inference engine. In Proceedings of Machine Learning and Systems. Vol. 2, 1–13.
Zurück zum Zitat Jiao, Meng, et al. 2020. A GRU-RNN based momentum optimized algorithm for SOC estimation. Journal of Power Sources 459: 228051.CrossRef Jiao, Meng, et al. 2020. A GRU-RNN based momentum optimized algorithm for SOC estimation. Journal of Power Sources 459: 228051.CrossRef
Zurück zum Zitat Kang, Yiping, et al. 2017. Neurosurgeon: Collaborative intelligence between the cloud and mobile edge. ACM SIGARCH Computer Architecture News 45 (1): 615–629.CrossRef Kang, Yiping, et al. 2017. Neurosurgeon: Collaborative intelligence between the cloud and mobile edge. ACM SIGARCH Computer Architecture News 45 (1): 615–629.CrossRef
Zurück zum Zitat Kounoudes, Alexia Dini et al. 2021. User-centred privacy inference detection for smart home devices. 2021 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/IOP/SCI), 210–218. Piscataway: IEEE. Kounoudes, Alexia Dini et al. 2021. User-centred privacy inference detection for smart home devices. 2021 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/IOP/SCI), 210–218. Piscataway: IEEE.
Zurück zum Zitat Kouris, Alexandros, et al. 2022. Multi-exit semantic segmentation networks. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXI, 330–349. Berlin: Springer.CrossRef Kouris, Alexandros, et al. 2022. Multi-exit semantic segmentation networks. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXI, 330–349. Berlin: Springer.CrossRef
Zurück zum Zitat Laskaridis, Stefanos, Stylianos I. Venieris, Hyeji Kim, et al. 2020. HAPI: Hardware-aware progressive inference. In Proceedings of the 39th International Conference on Computer-Aided Design, 1–9. Laskaridis, Stefanos, Stylianos I. Venieris, Hyeji Kim, et al. 2020. HAPI: Hardware-aware progressive inference. In Proceedings of the 39th International Conference on Computer-Aided Design, 1–9.
Zurück zum Zitat Laskaridis, Stefanos, Stylianos I. Venieris, Mario Almeida, et al. 2020. SPINN: Synergistic progressive inference of neural networks over device and cloud. In Proceedings of the 26th Annual International Conference on Mobile Computing and Networking, 1–15. Laskaridis, Stefanos, Stylianos I. Venieris, Mario Almeida, et al. 2020. SPINN: Synergistic progressive inference of neural networks over device and cloud. In Proceedings of the 26th Annual International Conference on Mobile Computing and Networking, 1–15.
Zurück zum Zitat Lebedev, Mikhail, and Pavel Belecky. 2021. A survey of open-source tools for FPGA-based inference of artificial neural networks. In 2021 Ivannikov Memorial Workshop (IVMEM), 50–56. Piscataway: IEEE.CrossRef Lebedev, Mikhail, and Pavel Belecky. 2021. A survey of open-source tools for FPGA-based inference of artificial neural networks. In 2021 Ivannikov Memorial Workshop (IVMEM), 50–56. Piscataway: IEEE.CrossRef
Zurück zum Zitat Lebedev, Vadim, and Victor Lempitsky. 2016. Fast convnets using group-wise brain damage. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2554–2564. Lebedev, Vadim, and Victor Lempitsky. 2016. Fast convnets using group-wise brain damage. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2554–2564.
Zurück zum Zitat Leiserson, Charles E., et al. 2020. There’s plenty of room at the Top: What will drive computer performance after Moore’s law? Science 368 (6495): eaam9744. Leiserson, Charles E., et al. 2020. There’s plenty of room at the Top: What will drive computer performance after Moore’s law? Science 368 (6495): eaam9744.
Zurück zum Zitat Leon, Vasileios, et al. 2022. Systematic embedded development and implementation techniques on intel myriad VPUs. In 2022 IFIP/IEEE 30th International Conference on Very Large Scale Integration (VLSI-SoC), 1–2. Piscataway: IEEE. Leon, Vasileios, et al. 2022. Systematic embedded development and implementation techniques on intel myriad VPUs. In 2022 IFIP/IEEE 30th International Conference on Very Large Scale Integration (VLSI-SoC), 1–2. Piscataway: IEEE.
Zurück zum Zitat Li, En, et al. 2018. Edge intelligence: On-demand deep learning model co-inference with device-edge synergy. In Proceedings of the 2018 Workshop on Mobile Edge Communications, 31–36. Li, En, et al. 2018. Edge intelligence: On-demand deep learning model co-inference with device-edge synergy. In Proceedings of the 2018 Workshop on Mobile Edge Communications, 31–36.
Zurück zum Zitat Li, Hongshan, et al. 2018. JALAD: Joint accuracy-and latency-aware deep structure decoupling for edge-cloud execution. In 2018 IEEE 24th International Conference on Parallel and Distributed Systems (ICPADS), 671–678. Piscataway: IEEE.CrossRef Li, Hongshan, et al. 2018. JALAD: Joint accuracy-and latency-aware deep structure decoupling for edge-cloud execution. In 2018 IEEE 24th International Conference on Parallel and Distributed Systems (ICPADS), 671–678. Piscataway: IEEE.CrossRef
Zurück zum Zitat Li, Liangzhi, et al. 2018. Deep learning for smart industry: Efficient manufacture inspection system with fog computing. IEEE Transactions on Industrial Informatics 14 (10): 4665–4673.CrossRef Li, Liangzhi, et al. 2018. Deep learning for smart industry: Efficient manufacture inspection system with fog computing. IEEE Transactions on Industrial Informatics 14 (10): 4665–4673.CrossRef
Zurück zum Zitat Li, Ying, et al. 2023. Federated domain generalization: A survey. arXiv preprint. arXiv:2306.01334. Li, Ying, et al. 2023. Federated domain generalization: A survey. arXiv preprint. arXiv:2306.01334.
Zurück zum Zitat LiKamWa, Robert, and Lin Zhong. 2015. Starfish: Efficient concurrency support for computer vision applications. In Proceedings of the 13th Annual International Conference on Mobile Systems, Applications, and Services, 213–226. LiKamWa, Robert, and Lin Zhong. 2015. Starfish: Efficient concurrency support for computer vision applications. In Proceedings of the 13th Annual International Conference on Mobile Systems, Applications, and Services, 213–226.
Zurück zum Zitat Liu, Hongye, et al. 2016. Deep relative distance learning: Tell the difference between similar vehicles. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2167–2175. Liu, Hongye, et al. 2016. Deep relative distance learning: Tell the difference between similar vehicles. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2167–2175.
Zurück zum Zitat Liu, Shaoshan, et al. 2019. Edge computing for autonomous driving: Opportunities and challenges. Proceedings of the IEEE 107 (8): 1697–1716.CrossRef Liu, Shaoshan, et al. 2019. Edge computing for autonomous driving: Opportunities and challenges. Proceedings of the IEEE 107 (8): 1697–1716.CrossRef
Zurück zum Zitat Lucas, Bruce D., and Takeo Kanade. 1981. An iterative image registration technique with an application to stereo vision. In IJCAI’81: 7th International Joint Conference on Artificial Intelligence. Vol. 2, 674–679. Lucas, Bruce D., and Takeo Kanade. 1981. An iterative image registration technique with an application to stereo vision. In IJCAI’81: 7th International Joint Conference on Artificial Intelligence. Vol. 2, 674–679.
Zurück zum Zitat Mao, Jiachen, et al. 2017. MoDNN: Local distributed mobile computing system for deep neural network. In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017, 1396–1401. Piscataway: IEEE.CrossRef Mao, Jiachen, et al. 2017. MoDNN: Local distributed mobile computing system for deep neural network. In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017, 1396–1401. Piscataway: IEEE.CrossRef
Zurück zum Zitat Mohammadi, Mehdi, and Ala Al-Fuqaha. 2018. Enabling cognitive smart cities using big data and machine learning: Approaches and challenges. IEEE Communications Magazine 56 (2): 94–101.CrossRef Mohammadi, Mehdi, and Ala Al-Fuqaha. 2018. Enabling cognitive smart cities using big data and machine learning: Approaches and challenges. IEEE Communications Magazine 56 (2): 94–101.CrossRef
Zurück zum Zitat Owens, John D., et al. 2008. GPU computing. In Proceedings of the IEEE 96 (5): 879–899.CrossRef Owens, John D., et al. 2008. GPU computing. In Proceedings of the IEEE 96 (5): 879–899.CrossRef
Zurück zum Zitat Panda, Priyadarshini, et al. 2016. Conditional deep learning for energy-efficient and enhanced pattern recognition. In 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE), 475–480. Piscataway: IEEE. Panda, Priyadarshini, et al. 2016. Conditional deep learning for energy-efficient and enhanced pattern recognition. In 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE), 475–480. Piscataway: IEEE.
Zurück zum Zitat Polino, Antonio, et al. 2018. Model compression via distillation and quantization. arXiv preprint. arXiv:1802.05668. Polino, Antonio, et al. 2018. Model compression via distillation and quantization. arXiv preprint. arXiv:1802.05668.
Zurück zum Zitat Rastegari, Mohammad, et al. 2016. XNOR-Net: Imagenet classification using binary convolutional neural networks. In European conference on computer vision, 525–542. Berlin: Springer. Rastegari, Mohammad, et al. 2016. XNOR-Net: Imagenet classification using binary convolutional neural networks. In European conference on computer vision, 525–542. Berlin: Springer.
Zurück zum Zitat Ren, Wei-Qing, et al. 2023. A survey on collaborative DNN inference for edge intelligence. In Machine Intelligence Research, 1–25. Ren, Wei-Qing, et al. 2023. A survey on collaborative DNN inference for edge intelligence. In Machine Intelligence Research, 1–25.
Zurück zum Zitat Romero, Adriana, et al. 2014. Fitnets: Hints for thin deep nets. arXiv preprint. arXiv:1412.6550. Romero, Adriana, et al. 2014. Fitnets: Hints for thin deep nets. arXiv preprint. arXiv:1412.6550.
Zurück zum Zitat Sedlak, Boris, et al. 2022. Specification and operation of privacy models for data streams on the edge. In 2022 IEEE 6th International Conference on Fog and Edge Computing (ICFEC), 78–82. Piscataway: IEEE.CrossRef Sedlak, Boris, et al. 2022. Specification and operation of privacy models for data streams on the edge. In 2022 IEEE 6th International Conference on Fog and Edge Computing (ICFEC), 78–82. Piscataway: IEEE.CrossRef
Zurück zum Zitat Sengupta, Abhronil, et al. 2019. Going deeper in spiking neural networks: VGG and residual architectures. Frontiers in Neuroscience 13: 95.CrossRef Sengupta, Abhronil, et al. 2019. Going deeper in spiking neural networks: VGG and residual architectures. Frontiers in Neuroscience 13: 95.CrossRef
Zurück zum Zitat Soto, José Angel Carvajal, et al. 2016. CEML: Mixing and moving complex event processing and machine learning to the edge of the network for IoT applications. In Proceedings of the 6th International Conference on the Internet of Things, 103–110. Soto, José Angel Carvajal, et al. 2016. CEML: Mixing and moving complex event processing and machine learning to the edge of the network for IoT applications. In Proceedings of the 6th International Conference on the Internet of Things, 103–110.
Zurück zum Zitat Sun, Yi, Chen, Yuheng, Wang, Xiaogang, Tang, Xiaoou. 2014. Deep learning face representation by joint identification-verification. Advances in Neural Information Processing Systems 27 (8): 1–8. Sun, Yi, Chen, Yuheng, Wang, Xiaogang, Tang, Xiaoou. 2014. Deep learning face representation by joint identification-verification. Advances in Neural Information Processing Systems 27 (8): 1–8.
Zurück zum Zitat Targ, Sasha, et al. 2016. Resnet in resnet: Generalizing residual architectures. arXiv preprint. arXiv:1603.08029. Targ, Sasha, et al. 2016. Resnet in resnet: Generalizing residual architectures. arXiv preprint. arXiv:1603.08029.
Zurück zum Zitat Teerapittayanon, Surat, et al. 2016. Branchynet: Fast inference via early exiting from deep neural networks. In 2016 23rd International Conference on Pattern Recognition (ICPR), 2464–2469. Piscataway: IEEE.CrossRef Teerapittayanon, Surat, et al. 2016. Branchynet: Fast inference via early exiting from deep neural networks. In 2016 23rd International Conference on Pattern Recognition (ICPR), 2464–2469. Piscataway: IEEE.CrossRef
Zurück zum Zitat Tsigkanos, Christos, et al. 2019. Dependable resource coordination on the edge at runtime. Proceedings of the IEEE 107 (8): 1520–1536.CrossRef Tsigkanos, Christos, et al. 2019. Dependable resource coordination on the edge at runtime. Proceedings of the IEEE 107 (8): 1520–1536.CrossRef
Zurück zum Zitat Viola, Paul, and Michael Jones. 2001. Rapid object detection using a boosted cascade of simple features. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001. Vol. 1, I–I. Piscataway: IEEE. Viola, Paul, and Michael Jones. 2001. Rapid object detection using a boosted cascade of simple features. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001. Vol. 1, I–I. Piscataway: IEEE.
Zurück zum Zitat Wang, Qipeng, et al. 2022. Melon: Breaking the memory wall for resource-efficient on-device machine learning. In Proceedings of the 20th Annual International Conference on Mobile Systems, Applications and Services, 450–463. Wang, Qipeng, et al. 2022. Melon: Breaking the memory wall for resource-efficient on-device machine learning. In Proceedings of the 20th Annual International Conference on Mobile Systems, Applications and Services, 450–463.
Zurück zum Zitat Wang, Yang, et al. 2017. Effective multi-query expansions: Collaborative deep networks for robust landmark retrieval. IEEE Transactions on Image Processing 26 (3): 1393–1404.MathSciNetCrossRef Wang, Yang, et al. 2017. Effective multi-query expansions: Collaborative deep networks for robust landmark retrieval. IEEE Transactions on Image Processing 26 (3): 1393–1404.MathSciNetCrossRef
Zurück zum Zitat Wu, Jiaxiang, et al. 2016. Quantized convolutional neural networks for mobile devices. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4820–4828. Wu, Jiaxiang, et al. 2016. Quantized convolutional neural networks for mobile devices. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4820–4828.
Zurück zum Zitat Xiang, Yecheng, and Hyoseung Kim. 2019. Pipelined data-parallel CPU/GPU scheduling for multi-DNN real-time inference. In 2019 IEEE Real-Time Systems Symposium (RTSS), 392–405. Piscataway: IEEE.CrossRef Xiang, Yecheng, and Hyoseung Kim. 2019. Pipelined data-parallel CPU/GPU scheduling for multi-DNN real-time inference. In 2019 IEEE Real-Time Systems Symposium (RTSS), 392–405. Piscataway: IEEE.CrossRef
Zurück zum Zitat Xu, Daliang, et al. 2022. Mandheling: Mixed-precision on-device DNN training with DSP offloading. In Proceedings of the 28th Annual International Conference on Mobile Computing And Networking, 214–227. Xu, Daliang, et al. 2022. Mandheling: Mixed-precision on-device DNN training with DSP offloading. In Proceedings of the 28th Annual International Conference on Mobile Computing And Networking, 214–227.
Zurück zum Zitat Xu, Mengwei, Jiawei Liu, et al. 2019. A first look at deep learning apps on smartphones. In The World Wide Web Conference, 2125–2136. Xu, Mengwei, Jiawei Liu, et al. 2019. A first look at deep learning apps on smartphones. In The World Wide Web Conference, 2125–2136.
Zurück zum Zitat Xu, Mengwei, Tiantu Xu, et al. 2021. Video analytics with zero-streaming cameras. In 2021 USENIX Annual Technical Conference (USENIX ATC 21), 459–472. Xu, Mengwei, Tiantu Xu, et al. 2021. Video analytics with zero-streaming cameras. In 2021 USENIX Annual Technical Conference (USENIX ATC 21), 459–472.
Zurück zum Zitat Xu, Mengwei, Xiwen Zhang, et al. 2020. Approximate query service on autonomous iot cameras. In Proceedings of the 18th International Conference on Mobile Systems, Applications, and Services, 191–205. Xu, Mengwei, Xiwen Zhang, et al. 2020. Approximate query service on autonomous iot cameras. In Proceedings of the 18th International Conference on Mobile Systems, Applications, and Services, 191–205.
Zurück zum Zitat Yim, Junho, et al. 2017. A gift from knowledge distillation: Fast optimization, network minimization and transfer learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4133–4141. Yim, Junho, et al. 2017. A gift from knowledge distillation: Fast optimization, network minimization and transfer learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4133–4141.
Zurück zum Zitat Yu, Yong, et al. 2019. A review of recurrent neural networks: LSTM cells and network architectures. Neural Computation 31 (7): 1235–1270.MathSciNetCrossRef Yu, Yong, et al. 2019. A review of recurrent neural networks: LSTM cells and network architectures. Neural Computation 31 (7): 1235–1270.MathSciNetCrossRef
Zurück zum Zitat Zhang, Qiyang, Xiang Li, et al. 2022. A comprehensive benchmark of deep learning libraries on mobile devices. In Proceedings of the ACM Web Conference 2022, 3298–3307. Zhang, Qiyang, Xiang Li, et al. 2022. A comprehensive benchmark of deep learning libraries on mobile devices. In Proceedings of the ACM Web Conference 2022, 3298–3307.
Zurück zum Zitat Zhang, Qiyang, Zuo Zhu, et al. 2023. Energy-efficient federated training on mobile device. IEEE Network 35 (5): 1–14.MathSciNet Zhang, Qiyang, Zuo Zhu, et al. 2023. Energy-efficient federated training on mobile device. IEEE Network 35 (5): 1–14.MathSciNet
Zurück zum Zitat Zhang, Xiangyu, et al. 2018. Shufflenet: An extremely efficient convolutional neural network for mobile devices. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 6848–6856. Zhang, Xiangyu, et al. 2018. Shufflenet: An extremely efficient convolutional neural network for mobile devices. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 6848–6856.
Zurück zum Zitat Zhao, Zhuoran, et al. 2018. Deepthings: Distributed adaptive deep learning inference on resource-constrained IoT edge clusters. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 37 (11): 2348–2359.CrossRef Zhao, Zhuoran, et al. 2018. Deepthings: Distributed adaptive deep learning inference on resource-constrained IoT edge clusters. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 37 (11): 2348–2359.CrossRef
Zurück zum Zitat Zhou, Kanglei, et al. 2022. TSVMPath: Fast regularization parameter tuning algorithm for twin support vector machine. Neural Processing Letters 54 (6): 5457–5482.MathSciNetCrossRef Zhou, Kanglei, et al. 2022. TSVMPath: Fast regularization parameter tuning algorithm for twin support vector machine. Neural Processing Letters 54 (6): 5457–5482.MathSciNetCrossRef
Metadaten
Titel
Intelligence Inference on IoT Devices
verfasst von
Qiyang Zhang
Ying Li
Dingge Zhang
Ilir Murturi
Victor Casamayor Pujol
Schahram Dustdar
Shangguang Wang
Copyright-Jahr
2024
DOI
https://doi.org/10.1007/978-3-031-50514-0_9

Premium Partner