Self-Similarity in Network Traffic

Download Report

Transcript Self-Similarity in Network Traffic






2

Something “feels the same” regardless of scale
What is that???
4

Something “feels the same” regardless of scale
Self-similar in nature
5

Something “feels the same” regardless of scale
The Koch snowflake fractal
6

Something “feels the same” regardless of scale
The Koch snowflake fractal
7

Something “feels the same” regardless of scale
The Koch snowflake fractal
8

Something “feels the same” regardless of scale
9
Categories:

Exact self-similarity: Strongest Type

Approximate self-similarity: Loose Form

Statistical self-similarity: Weakest Type
10
Approximate self-similarity:
Recognisably similar but not exactly so.
e.g. Mandelbrot set
Statistical self-similarity:
Only numerical or statistical
measures that are preserved
across scales
11
In case of Stochastic Objects
e.g. time-series
Self-similarity is used in the distributional sense
12

Recently, network packet traffic has been
identified as being self-similar.

Current network traffic modeling using
Poisson distributing (etc.) does not take into
account the self-similar nature of traffic.

This leads to inaccurate modeling of network
traffic.
13

A Poisson process
 When observed on a fine time scale will appear
bursty
 When aggregated on a coarse time scale will
flatten (smooth) to white noise

A Self-Similar (fractal) process
 When aggregated over wide range of time scales
will maintain its bursty characteristic
14
Ethernet traffic August’89 trace
packets per time unit
15
16
17
18
Reality (self-similar):
Bursty Data
Streams
Aggregation
Bursty Aggregate
Streams
Consequence: Inaccuracy
Current Model:
Bursty Data
Streams
Aggregation
Smooth Pattern
Streams
19

Long-range Dependence
 autocorrelation decays slowly

Hurst Parameter
 Developed by Harold Hurst (1965)
 H is a measure of “burstiness”
▪ also considered a measure of self-similarity
 0<H<1
 H increases as traffic increases
▪ i.e., traffic becomes more self-similar
20

X = (Xt : t = 0, 1, 2, ….) is covariance stationary random process (i.e.
Cov(Xt,Xt+k) does not depend on t for all k)

Let X(m)={Xk(m)} denote the new process obtained by averaging the
original series X in non-overlapping sub-blocks of size m.
e.g. X(1)= 4,12,34,2,-6,18,21,35
Then
X(2)=8,18,6,28
X(4)=13,17

Mean , variance 2

Suppose that Autocorrelation Function r(k)  k -β, 0<β<1
21

X is exactly second-order self-similar if
 The aggregated processes have the same
autocorrelation structure as X. i.e.
 r (m) (k) = r(k), k0 for all m =1,2, …

X is asymptotically second-order self-similar if
the above holds when [ r (m) (k)  r(k), m  ]

Most striking feature of self-similarity: Correlation
structures of the aggregated process do not
degenerate as m  
22
ACF
lag
23
24

Correlation structures of their aggregated processes
degenerate as m  
i.e. r (m) (k)  0 as m  , for k = 1,2,3,...

Short Range Dependence Processes:
 Exponential Decay of autocorrelations
 i.e. r(k) ~ pk , as k  , 0 < p < 1
 Summation is finite
25

Processes with Long Range Dependence are
characterized by an autocorrelation function that
decays hyperbolically as k increases
 r(k )  
k

Important Property:
This is also called non-summability of correlation
26

The intuition behind long-range dependence:
 While high-lag correlations are all individually
small, their cumulative affect is important
 Gives rise to features drastically different from
conventional short-range dependent processes
27

Hurst Parameter H , 0.5 < H < 1

Three approaches to estimate H (Based on
properties of self-similar processes)
 Variance Analysis of aggregated processes
 Rescaled Range (R/S) Analysis for different block
sizes: time domain analysis
 Periodogram Analysis: frequency domain analysis
(Whittle Estimator)
28

Variance of aggregated processes decays as:
 Var(X(m)) = am-b as m infinite,

For short range dependent processes
(e.g. Poisson Process):
 Var(X(m)) = am-1 as m infinite,

Plot Var(X(m)) against m on a log-log plot

Slope > -1 indicative of self-similarity
29
Slope=-0.7
Slope=-1
30
For a given set of observations,
( X k : k  1,2,....n),
Samplemean X (n), SampleVariance S 2 (n)
Rescaled Adjusted Range or R/S statistic is given by
R ( n)
1

[max(0,W1 ,W2 ,......Wn )  min(0,W1 ,W2 ,......Wn )]
S ( n ) S ( n)
where
Wk  ( X1  X 2  .... X k )  kX (n)
31








Xk = 14,1,3,5,10,3
Mean = 36/6 = 6
W1 =14-(1*6 )=8
W2 =15-(2*6 )=3
W3 =18-(3*6 )=0
W4 =23-(4*6 )=-1
W5 =33-(5*6 )=3
W6 =36-(6*6 )=0
R/S = 1/S*[8-(-1)] = 9/S
32

For self-similar data, rescaled range or R/S
statistic grows according to cnH
 H = Hurst Paramater, > 0.5

For short-range processes ,
 R/S statistic ~ dn0.5

History: The Nile river
 In the 1940-50’s, Harold Edwin Hurst studied the 800-year record of
flooding along the Nile river.
 (yearly minimum water level)
 Finds long-range dependence.
33
Slope =
0.79
Slope = 1.0
Slope = 0.5
34

Provides a confidence interval

Property: Any long range dependent process
approaches fractional Gaussian noise (FGN),
when aggregated to a certain level

Test the aggregated observations to ensure
that it has converged to the normal
distribution
35

Self-similarity manifests itself in several
equivalent fashions:
 Non-degenerate autocorrelations
 Slowly decaying variance
 Long range dependence
 Hurst effect
36

Leland and Wilson collected hundreds of
millions of Ethernet packets without loss and
with recorded time-stamps accurate to within
100µs.

Data collected from several Ethernet LAN’s at
the Bellcore Morristown Research and
Engineering Center at different times over
the course of approximately 4 years.
38
39
H=1
H=0.5
H=0.5
Estimate H  0.8
40
High Traffic
5.0%-30.7%
Mid Traffic
3.4%-18.4%
Low Traffic
1.3%-10.4%
Packets
Higher Traffic, Higher H
41

Observation shows “contrary to Poisson”
 Network Utilization
H
As number of Ethernet users increases, the resulting
aggregate traffic becomes burstier instead of smoother
42

Pre-1990: host-to-host workgroup traffic

Post-1990: Router-to-router traffic

Low period router-to-router traffic consists
mostly of machine-generated packets
 Tend to form a smoother arrival stream, than low
period host-to-host traffic
43

Ethernet LAN traffic is statistically self-similar

H : the degree of self-similarity
H : a function of utilization
H : a measure of “burstiness”



Models like Poisson are not able to capture
self-similarity
44
46

The superposition of many ON/OFF sources whose
ON-periods and OFF-periods exhibit the Noah Effect
produces aggregate network traffic that features the
Joseph Effect.
Noah Effect: high variability Joseph Effect: Self-similar or
or infinite variance
long-range dependent traffic
Also known as packet train models
47

Traditional traffic models: finite variance
ON/OFF source models

Superposition of such sources
behaves like white noise, with only short
range correlations
48

Questions related to self-similarity can be
reduced to practical implications of Noah
Effect
 Queuing and Network performance
 Network Congestion Controls
 Protocol Analysis
49

The Queue Length distribution
 Traditional (Markovian) traffic: decreases exponentially fast
 Self-similar traffic: decreases much more slowly

Not accounting for Joseph Effect can lead to overly optimistic
performance
Effect of H
(Burstiness)
50

How to design the buffer size?

Trade-off between Packet Lose and Packet Delay
51
Compare SRD and LRD when increase buffer size
Packet Lose
Packet Delay
Short Range Dependence
Decrease Exponentially
Fixed Limit
Long Range Dependence
Decrease Slowly
Always Increase
52

Protocol design should take into account
knowledge about network traffic such as the
presence or absence of the self-similarity.

Parsimonious Models




Small number of parameters
Every parameter has a physically meaningful interpretation
e.g. Mean , Variance 2, H
Doesn’t quantify the effects of various factors in traffic
53
Demonstrated the existence of self-similarity in
Ethernet Traffic irrespective of time scales

Proposed the degree of self-similarity can be
measured by Hurst parameter H (higher H implies
burstier traffic)

Illustrated the difference between the self-similar and
standard models

Explained Importance of self similarity in design,
control, performance analysis
54

http://ita.ee.lbl.gov/html/contrib/BC.html
55