Adaptive Performance Control of Computing Systems via Distributed Cooperative Control: Application to Power Management in Computing Clusters

Advanced control and optimization techniques offer a theoretically sound basis to enable self-managing behavior in distributed computing models such as utility computing. To tractably solve the performance management problems of interest, including resource allocation and provisioning in such distri...

Full description

Saved in:
Bibliographic Details
Published in:IEEE International Conference on Autonomic Computing : 3rd International Conference on Autonomic Computing : ICAC 2006 : proceedings pp. 165 - 174
Main Authors: Mianyu Wang, Kandasamy, N., Guez, A., Kam, M.
Format: Conference Proceeding
Language:English
Published: IEEE 2006
Subjects:
ISBN:1424401755, 9781424401758
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Advanced control and optimization techniques offer a theoretically sound basis to enable self-managing behavior in distributed computing models such as utility computing. To tractably solve the performance management problems of interest, including resource allocation and provisioning in such distributed computing environments, we develop a fully decentralized control framework wherein the optimization problem for the system is first decomposed into sub-problems and each sub-problem is solved separately by individual controllers to achieve the overall performance objectives. Concepts from optimal control theory are used to implement individual controllers. The proposed framework is highly scalable, naturally tolerates controller failures and allows for the dynamic addition/removal of controllers during system operation. As a case study, we apply the control framework to minimize the power consumed by a computing cluster subject to a dynamic workload while satisfying the specified quality-of-service goals. Simulations using real-world workload traces show that the proposed technique has very low control overhead and adapts quickly to both workload variations and controller failures.
ISBN:1424401755
9781424401758
DOI:10.1109/ICAC.2006.1662395