Towards efficient service placement and server selection for large-scale deployments
Faculty of Sciences. Mathematics and Computer Science
S.l. :IEEE, 2008
Proceedings of the 4th Advanced International Conference on Telecommunications (AICT), pp 13-18, Athens, Greece, 2008
Currently many service providers offer their services using a private and proprietary hard- and software infrastructure. These infrastructures often share many similarities. Hence we believe a generic service delivery architecture, that allows service providers to offer a large array of different services on a shared infrastructure, would provide many advantages over current silo-based approaches. In this paper we propose the first step towards such an architecture, namely several algorithms for dynamically allocating server and network resources to a set of services and selecting a suitable service instance for each client. Service instances are placed on a set of servers, taking into account network resources (available bandwidth), server resources (CPU and memory) and service quality of service (QoS) demands (maximum transmission delay and bandwidth requirements). The optimization goal is to maximize the percentage of satisfied demand (answered requests) and minimize the total number of used servers for service hosting. Each service has a relative priority, which can be manually adjusted to influence the placement scheme.