Anthill: a Framework for the Development of Agent

Transcript Anthill: a Framework for the Development of Agent

Robust Processing Rate Allocation
with Feedback Control for
Proportional Slowdown Differentiation
Xiaobo Zhou
Department of Computer Science
University of Colorado at Colorado Springs
1
Outline
 Proportional Slowdown Differentiation (PSD)
 State-of-the-Art
 An Integrated Approach to PSD
– Queueing-theoretical processing rate allocation
– control-theoretical feedback control
 Performance Evaluation
 Research Plan
[email protected]
2
What is Differentiated Services
 Internet Engineering Task Force (IETF), April 1998
www.ietf.org/html.charters/diffserv-charter.html
 The Goal
– To define configurable types of packet forwarding (called Per-Hop
Behaviors, PHBs), which can provide local (per-hop) service
differentiation for large aggregates of network traffic, as opposed to
end-to-end performance guarantees for individual flows
Best-effort services
(Same-service-to-all)
Integrated Services
(Reservations-based)
Differentiated Services
(relative vs. absolute)
[email protected]
3
Why Differentiated Services
 Network Service Providers want to:
– Offer a scalable service differentiation (defined in SLA’s)
on core routers in stead of current best-effort service
– Improve revenues through premium pricing and
competitive differentiation
 Applications seek better than best effort:
– Bandwidth
– Packet Delay characteristics
– Packet loss characteristics
– Jitter characteristics
[email protected]
4
End-to-End Differentiation
 Why Service Differentiation on Servers?
– To provide predictable and controllable differentiation
QoS levels to different request classes of clients
– Diverse service expectations and constraints from
Internet applications and users, making the current sameservice-to-all model inadequate and limiting
 End-to-end DiffServ
– Network core:
• Per-hop differentiated queueing delay and loss rate
– Network edge:
• Service differentiation on Servers and Proxies
[email protected]
5
Models and Properties
 Models:
– Absolute differentiated services: clients receive an absolute
share of resource usages; possible low resource utilization
• For hard real-time applications
– Relative differentiated services: higher classes will receive
relatively better (or no worse) QoS than lower classes
• For soft real-time applications
 Properties:
– Predictability: differentiation schedules must be consistent,
independent of variations of the class workloads
– Controllability: a number of controllable parameters
adjustable for quality differentiation between classes
– Fairness: lower classes not be over-compromised,
especially when workload is low
[email protected]
6
A Proportional DiffServ Model
 A proportional differentiation model assigns quality factors to
the traffic classes in proportion to their pre-specified
differentiation weights, independent of class workloads
qi
i
qj = j , for all i, j, = 1,2,...,n
 It is popular
– differentiation predictability
– proportional fairness
[email protected]
7
QoS Metrics on Servers
 Multimedia Applications
– Mutli-dimensional QoS metric
• Responsiveness
• Image size, resolution, formats
• Streaming bandwidth
– Audio sample rate and sample size
– Video frame rate, frame size, and color depth
 Web Applications
– Responsiveness
– Throughput
[email protected]
8
Response Time vs. Slowdown


Arrival Rate
Service Rate
Client / Incoming link
Queue
Server / Outgoing link
 Response time
E[W/X] =E[W]W[X-1]
– Queueing delay + service time
 E[W]/E[X]
– Favors requests that need more service time
 Slowdown
– queueing delay / service time
– gives equal weights to requests regardless of service time
– A high slowdown also means a server is heavily loaded
* Clients expect long delay for “large” requests, and anticipate
short delay for “small” requests
[email protected]
9
State-of-the-Art
 Queueing-delay differentiation
– Strict priority based packet/request scheduling
– Time-dependent priority based request packet/scheduling
 Response time differentiation
– Strict priority based request scheduling
– Adaptive process allocation for proportional differentiation
 Slowdown differentiation
– queueing-theoretical Processing rate allocation
– M/M/1 PS queue for stretch factor differentiation
– M/G_P/1 FCFS queue
[email protected]
10
Challenges and Contributions
 A closed form of slowdown for M/GP/1 FCFS Q
 Average slowdown on Task servers
 Processing rate allocation scheme for PSD
 Control-theoretical approach for robust PSD
[email protected]
11
A Heavy-tailed Distribution
 The Pareto distribution is a typical heavy-tailed
f ( x)  k  x  1
 , k  0, x  k
 In practice, there is some upper bound on the maximum
size of a job (p) -- Bounded Pareto distribution
1
  1
 1
f ( x) 

k
x

K(

,
k,
p)
x
 , k  0, p  x  k

1  (k / p)
f(x)
Power law w/ exp - -1
p
k
[email protected]
x
12
Preliminary of Slowdown
 Lemma 1
– Given an M/GP/1 FCFS queue on a server, where the arrival
process has rate  and X denotes the Bounded Pareto service time
density distribution. Let W be a job’s queueing delay (W is
indepenent to X from a FCFS queue), and S be its slowdown.
According to Pollaczek-Khinchin Formula,
2
1

E
[
x
]
E
[
x
]
E[ S ]  E[W ]E[ X 1 ] 
.
2(1  E[ X ])
E[ X ]  
K ( , k , p)

 1

f ( x) xdx  
K (  1, k , p)
 1

(ln
p

ln
k
)
K
(

,
k
,
p
)

p
k
p
K ( , k , p)
K (  2, k , p)
K ( , k , p)
f ( x) x 1dx 
.
K (  1, k , p)
E[ X ]   f ( x) x 2 dx 
2
k
E[ X ]  
1
p
k
[email protected]
13
Slowdown on a Task Server
 What is a task server?
–
–
–
–
A processing unit, handling a request class in FCFS manner
Let c i be the normalized processing rate of task server i
\sum_{i=1}^{N} c i = 1
0 < c i  1 for 0  i N
A process, a thread, a processor, a server node
 Lemma 2
– Given an M/GP/1 FCFS queue on a task server i with
processing rate. Xi denotes the Bounded Pareto service time
density distribution on the task server:
• E[Xi] = 1/c i E[X]
• E[X2i] = 1/c2 i E[X2]
• E[X-1i] = c i E[X-1]
[email protected]
14
Processing Rate Allocation
 PSD model
E[Si ] i
=
, for all i, j, = 1,2,...,N
E[Si ] j
i E[ x 2 ]E[ x 1 ]
i b
E[Si ] 
( E[Si ] 
)
2(ci  i E[ X ])
2(ci  i b)
1  i  N.
 A Proportional Processing Rate Allocation
i 1  i 1 i E[ X ]
N
ci 
i

N
i 1
i /  i
 i E[ X ]
 i E[ x 2 ]E[ x 1 ]i 1 i /  i
1  i  N.
N
E[Si ] 
2(1  E[ X ]i 1 i )
N
[email protected]
1  i  N.
15
Simulation Model
 Processing procedure is partitioned into sampling periods
– Request generator
– Load estimator
– Rate allocator
 GNU Scientific library (GSL)
[email protected]
16
Effectiveness of Rate Allocation
 Simulated and expected slowdowns of 2 classes (1: 2= 1:2/1:4)
[email protected]
17
Effectiveness of Rate Allocation
 Simulated and expected slowdowns of 3 classes (1: 2: 2= 1:2:3)
[email protected]
18
Predictability vs. Variance
 Percentiles of simulated slowdown ratios for 2 and 3 classes
[email protected]
19
Microscopic Views
 Queueing-theoretical allocation is based on the average, a
macro-behavior of class load instead of micro-behaviors,
such as experienced slowdowns of individual requests.
50% vs. 90%
[email protected]
20
Drawbacks of Q-based Approach
 Queueing theory can be applied to calculate a request
class’s average slowdown based on the allocated
processing rate. However, we cannot control the variance
of slowdown simultaneously
 Processing rate allocation is based on the average load
conditions of classes, instead of per-request experienced
slowdown: macro-behavior vs. micro-behavior
 Load condition is stochastic, it is difficult to accurately
estimate a class’s load based on its history; estimation
errors may cause inaccurate rate allocation in the short
time scales and slowdown deviation between achieved
slowdown ratio and predicted slowdown ratio.
 So, how to improve micro-behavior so more robust?
– Integrating control theory and queueing theory
[email protected]
21
Queueing & Control Integration
 Queueing theoretical rate predictor
 A control loop is used for each pair of adjacent classes
– Sensor/monitor measures the achieved slowdown ratio
– Deviation controller adjusts the rate allocation
– Actuator translate the abstract controller output to physical action
[email protected]
22
PID Control
 PID (proportional integral derivative) controller
– Simplicity: adjust the rate allocations in proportion to the
difference between the achieved slowdown ratio and desired one
 A linear feedback control function
– f(e i (k)) = g  e i (k)

S (k )
ei (k )  i 1  i 1
i
Si (k )
//g is the control gain parameter
 Rate allocation adjustment
– At the end of sampling period k, the adjustment for k+1 period
c (k  1)
c (k )
( i
)  ( i
)  f (ei (k )).
ci 1 (k  1)
ci 1 (k )
– Rate allocation for k+1 period is
ci (k  1)
c (k )
c (k )
 i
 ( i
),
ci 1 (k  1) ci 1 (k )
ci 1 (k )
1  i  N.
[email protected]
23
A New Simulation Model
 Integration of queueing and control theory
– Feedback controller
– Comparator (sensor/monitor)
[email protected]
24
Performance Evaulation
 Integrated approach vs. queueing-theoretical approach
[email protected]
25
Performance Evaulation
 System load is 0.8 and 3: (2 : 1) = 4: (2 : 1)
[email protected]
26
Performance Evaulation
 Sensitivity analyses of the integrated approach
Load:0.4->0.2->0.4
[email protected]
27
Future Work
 Evaluate different control techniques
 Integration of process allocation and admission control
with feedback for robust responsiveness differentiation
[email protected]
28
P&P for IDF Applications
 Multi-dimensional Input & Requirements
–
–
–
–
–
Distributed data sources
Different data formats
Different data priority levels
Different decision requirements
Different workload characteristics
 Multi-dimensional Platform and Performance Metric
– Cluster node partitioning
– Performance measurement
– Performance differentiation
[email protected]
29

Anthill: a Framework for the Development of Agent

Transcript Anthill: a Framework for the Development of Agent

Directory