跳到主要导航 跳到搜索 跳到主要内容

On-demand Edge Inference Scheduling with Accuracy and Deadline Guarantee

  • Yechao She*
  • , Minming Li
  • , Yang Jin
  • , Meng Xu
  • , Jianping Wang
  • , Bin Liu
  • *此作品的通讯作者
  • City University of Hong Kong
  • Tsinghua University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

To meet increasing demands for machine-learning-based applications, pushing inference services to the network edge has been a trend. This work aims to design an on-demand edge inference scheduler with accuracy and deadline guarantee for repetitive tasks. Specifically, we consider an edge server that is preinstalled with multiple early-exit Deep Neural Networks (DNNs), and each DNN-exit pair can provide inference service of different quality. We also consider tasks' diversity in quality of service requirements and related utility. We aim to maximize the system's total utility by optimizing service assignment and time scheduling subject to resource, accuracy, and deadline constraints. We present this problem's integer linear problem formulation and show this problem is NP-hard even for the offline case. This problem is challenging due to the coupled effect of service assignment and time scheduling. To derive low-complexity scheduling solutions, we introduce a task-service graph and convert this problem into a service assignment selection problem with schedulability constraints. Then, we design a polynomial complexity algorithm with $\frac{\rho}{\delta}$-approximation ratio for the offline problem, with $\rho$ referring to the task-wise utility ratio, $\delta$ referring to the maximum number of concurrent tasks. To handle the online problem, we propose an online heuristic algorithm. Simulation results show that the proposed algorithms outperform the state-of-the-art baseline algorithms.

源语言英语
主期刊名2023 IEEE/ACM 31st International Symposium on Quality of Service, IWQoS 2023
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798350399738
DOI
出版状态已出版 - 2023
已对外发布
活动31st IEEE/ACM International Symposium on Quality of Service, IWQoS 2023 - Orlando, 美国
期限: 19 6月 202321 6月 2023

出版系列

姓名IEEE International Workshop on Quality of Service, IWQoS
2023-June
ISSN(印刷版)1548-615X

会议

会议31st IEEE/ACM International Symposium on Quality of Service, IWQoS 2023
国家/地区美国
Orlando
时期19/06/2321/06/23

指纹

探究 'On-demand Edge Inference Scheduling with Accuracy and Deadline Guarantee' 的科研主题。它们共同构成独一无二的指纹。

引用此