Optimal Policy for Deployment of Machine Learning Modelson Energy-Bounded Systems

Abstract

With the recent advances in both machine learning and embedded systems research, the demand to deploy computational models for real-time execution on edge devices has increased substantially. Without deploying computational models on edge devices, the frequent transmission of sensor data to the cloud results in rapid battery draining due to the energy consumption of wireless data transmission. This rapid power dissipation leads to a considerable reduction in the battery lifetime of the system, therefore jeopardizing the real-world utility of smart devices. It is well-established that for difficult machine learning tasks, models with higher performance often require more computation power and thus are not power-efficient choices for deployment on edge devices. However, the trade-offs between performance and power consumption are not well studied. While numerous methods (e.g., model compression) have been developed to obtain an optimal model, these methods focus on improving the efficiency of a ‘‘single’’ model. In an entirely new direction, we introduce an effective method to find a combination of ‘‘multiple’’ models that are optimal in terms of power-efficiency and performance by solving an optimization problem in which both performance and power consumption are taken into account. Experimental results demonstrate that on the ImageNet dataset, we can achieve a 20% energy reduction with only 0.3% accuracy drop compared to Squeeze-and-Excitation Networks. Compared to a pruned convolutional neural network for human activity recognition, while consuming 1.7% less energy, our proposed policy achieves 1.3% higher accuracy.

Publication
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI)
Avatar
Iman Mirzadeh
Research Assistant

I am a PhD student and Graduate Research Assistant at the Washington State University Embedded and Pervasive Systems Laboratory (EPSL) under supervision of Dr. Hassan Ghasemzadeh. I am interested in the real-world challenges of working with machine learning models such as energy constraints and human-in-the-loop interactions with these models. Specifically, I am focusing on Model Optimization (such as model compression), where my goal is to build more efficient models or use the existing models more efficiently. Before joining EPSL, I was an ML Engineer at Sokhan AI, where we provided accurate and scalable Natural Language Processing (NLP) and Computer Vision (CV) services to businesses.