Deploying machine learning on edge devices is becoming increasingly important, driven by new applications such as smart homes, smart cities, and autonomous vehicles. Unfortunately, it is challenging to deploy deep neural networks (DNNs) on resource-constrained devices: these workloads are computationally intensive and often require cloud-scale resources. Prior solutions attempted to address these challenges either by sacrificing accuracy or by relying on cloud resources for assistance. In this paper, we propose a containerized partition-based runtime adaptive convolutional neural network (CNN) acceleration framework for Internet of Things (IoT) environments. The framework leverages spatial partitioning through convolution layer fusion to dynamically select the optimal partition according to the availability of computational resources and network conditions. By containerizing each partition, we simplify model updates and deployment, and we use Docker and Kubernetes to efficiently handle runtime resource management and scheduling of containers.
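The spatial-partitioning idea behind the framework can be illustrated with a minimal NumPy sketch. This is a toy single-channel, two-layer version, not the paper's implementation: two 3x3 convolution layers are "fused" so that each spatial partition receives its output tile plus a 4-column input halo (2 columns per 3x3 layer), letting every tile be computed independently on a separate device and simply concatenated, with no intermediate feature-map exchange between layers. The function names and the two-layer setup are illustrative assumptions.

```python
import numpy as np

def conv3x3_valid(x, k):
    """3x3 'valid' convolution (cross-correlation) over a 2-D array."""
    H, W = x.shape
    out = np.zeros((H - 2, W - 2))
    for i in range(H - 2):
        for j in range(W - 2):
            out[i, j] = np.sum(x[i:i + 3, j:j + 3] * k)
    return out

def fused_partition(x, k1, k2, n_parts=2):
    """Spatially partition the output of two fused 3x3 conv layers.

    Each 'valid' 3x3 conv shrinks the width by 2, so output columns
    [c0, c1) of the fused pair depend only on input columns [c0, c1 + 4).
    Each tile (plus its halo) can therefore run in its own container,
    and the tiles concatenate into the exact full-image result.
    """
    out_w = x.shape[1] - 4  # width after two valid 3x3 convs
    bounds = np.linspace(0, out_w, n_parts + 1, dtype=int)
    tiles = []
    for c0, c1 in zip(bounds[:-1], bounds[1:]):
        tile_in = x[:, c0:c1 + 4]  # output tile + 4-column halo
        tiles.append(conv3x3_valid(conv3x3_valid(tile_in, k1), k2))
    return np.concatenate(tiles, axis=1)
```

The key property of fusion is visible in `fused_partition`: the halo is paid once for the whole fused stack, so partitions never need to synchronize between layers — which is what makes per-partition containers practical.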
|Original language||English (US)|
|State||Published - 2019|
|Event||2nd USENIX Workshop on Hot Topics in Edge Computing, HotEdge 2019, co-located with USENIX ATC 2019 - Renton, United States|
Duration: Jul 9 2019 → …
Bibliographical note
Funding Information:
The authors would like to thank the anonymous reviewers for their feedback. We also extend special thanks to Irfan Ahmad for suggestions on the camera ready. This work was supported in part by NSF Awards 1439622 and 1812537, and NSF XPS Award 60053525.