To date, the study of dispatching, or load balancing, in server farms has focused primarily on minimizing response time. A server farm is typically modeled as a front-end router that uses a dispatching policy to route each arriving job to one of several servers, each of which serves all the jobs in its queue via Processor-Sharing. The common assumption has been that all jobs are equally important or valuable, in that they are equally sensitive to delay. Our work departs from this assumption: we model each arrival as having a random value parameter, independent of the arrival's service requirement (job size). Given such value heterogeneity, the appropriate metric is no longer the minimization of response time, but rather the minimization of value-weighted response time. In this context, we ask: what is a good dispatching policy for minimizing the value-weighted response time metric? We propose a number of new dispatching policies motivated by this goal. Via a combination of exact analysis, asymptotic analysis, and simulation, we deduce many unexpected results regarding dispatching.
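To make the setting concrete, the following sketch simulates a small farm of Processor-Sharing servers fed by a Poisson arrival stream and estimates the value-weighted mean response time E[V·T] under two illustrative dispatching policies (uniformly random routing and Join-the-Shortest-Queue). These policies, the two-point value distribution, and all parameter choices are assumptions for illustration only; they are not the policies proposed in the paper.

```python
import random

def dispatch_random(servers, size, value):
    # Route uniformly at random, ignoring queue state and job value.
    return random.randrange(len(servers))

def dispatch_jsq(servers, size, value):
    # Join-the-Shortest-Queue: pick the server with the fewest jobs.
    return min(range(len(servers)), key=lambda s: len(servers[s]))

def simulate_ps_farm(dispatch, lam=1.5, n_servers=2, n_jobs=20000, seed=1):
    """Simulate a farm of rate-1 Processor-Sharing servers fed by a
    Poisson(lam) stream; return the mean value-weighted response time."""
    random.seed(seed)
    t, arrivals, completed, weighted_sum = 0.0, 0, 0, 0.0
    servers = [[] for _ in range(n_servers)]  # jobs: [remaining, value, t_arr]
    next_arrival = random.expovariate(lam)

    def advance(dt):
        # Under PS, each of a server's k jobs receives dt/k service in dt time.
        for q in servers:
            if q:
                share = dt / len(q)
                for job in q:
                    job[0] -= share

    while completed < n_jobs:
        # Next completion: a server with k jobs finishes its smallest
        # remaining job after (min remaining) * k time units.
        next_comp, comp_s = float("inf"), None
        for s, q in enumerate(servers):
            if q:
                dt = min(job[0] for job in q) * len(q)
                if t + dt < next_comp:
                    next_comp, comp_s = t + dt, s
        if arrivals < n_jobs and next_arrival < next_comp:  # arrival event
            advance(next_arrival - t)
            t = next_arrival
            size = random.expovariate(1.0)        # job size, mean 1
            value = random.choice([1.0, 10.0])    # value independent of size
            servers[dispatch(servers, size, value)].append([size, value, t])
            arrivals += 1
            next_arrival = t + random.expovariate(lam)
        else:                                               # completion event
            advance(next_comp - t)
            t = next_comp
            q = servers[comp_s]
            i = min(range(len(q)), key=lambda j: q[j][0])
            _, value, t_arr = q.pop(i)
            weighted_sum += value * (t - t_arr)
            completed += 1
    return weighted_sum / n_jobs
```

Running `simulate_ps_farm(dispatch_random)` and `simulate_ps_farm(dispatch_jsq)` with the same seed lets one compare the value-weighted metric across policies; since neither of these baseline policies uses the value parameter, any policy that does (as studied in the paper) has room to improve on both.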
Bibliographical note and funding information:
The authors would like to thank the reviewers for their helpful comments. Special thanks to Bruno Gaujal and Gautam Iyer for their assistance in refining some of the paper's technical details. The second author's work has been supported by the Academy of Finland in the TOP-Energy project (grant no. 268992). The third author's work was funded by NSF CMMI-1334194 as well as a Computational Thinking grant from Microsoft Research.
- cμ rule
- Heterogeneous values
- Holding cost
- Server farms
- Task assignment