Chapter 4 Lecture Presentation

Download Report

Transcript Chapter 4 Lecture Presentation

Proposed future direction
for CHEETAH
Malathi Veeraraghavan
University of Virginia
August 23, 2006

Outline

Strategy discussion:

What's our goal for the CHEETAH network:


Bandwidth sharing mode:


eScience network or a scalable GP network?
Book-Ahead (BA) or Immediate-Request (IR)?
Tactical aspects:

Network evolution

Networking software modules

Application software modules

Interconnection to HOPI/DRAGON
1
Observation

"Many e-science experiments are unique
applications that involve collaboration
among a handful of facilities. As a result,
networks supporting these experiments
are optimized to provide maximum
throughput to a few facilities, as opposed
to moderate throughput to millions of
users, which is the raison d'etre for
commercial networks."
2
eScience networks

eScience network requirements





Number of users small
Hard to achieve high utilization; also not impt.
Overprovision network to keep call blocking
rate low
We can then focus on creating software to
allow scientists to automatically create highspeed application-specific topologies: AST,
UCLP, OSCARS, USN scheduler, BRUW
Bandwidth-sharing algorithms of less concern
3
General-purpose commercial
networks

Has to be scalable: large number of users




Metcalfe's statement: Value of a network
increases exponentially with the number of
users
High utilization is an important goal
Low call blocking probability or low waiting
time for resources
Focus on efficient bandwidth-sharing
algorithms
4
Circuit/VC service on GP
commercial networks

Just for ISPs/enterprise admins:


needs similar to eScience
 router-to-router circuits
 limited number of users
 high-bandwidth, long-held circuits
 low price not a high priority
 need BA mode of bandwidth sharing
For end users



large number of users
can only offer moderate BW and limited call holding
times
IR mode of sharing becomes feasible
5
BW sharing modes in circuit/VC networks
m is the link capacity
expressed in channels
e.g., if 1Gbps circuits
are assigned on a 10Gbps link,
m = 10
Large m
Moderate throughput
Small m
immediate-request
with call blocking + retries
("call queueing")
Short calls
Bank teller
(video, gaming)
immediate-request
with delayed-start times
(file transfers)


High throughput
Long calls
Doctor's office
book-ahead
Mean waiting time is proportional to mean call holding time
Can afford to have a queueing based solution if calls are short
6
Impact of increasing m at different
values of link utilization Ud
1000
U =90%
d
U =90%
d
800
U =80%
d
U =80%
d
U =60%
m=10
d
0.4
Pq=41%
U =60%
d
U =40%
d
0 0
10
400
d
U =40%
0.2
600

0.6
PQ
Prob. of arriving job finding
all m circuits busy
0.8
200
1
2
10
10
m
03
10
Offered load: call arrival rate/call departure rate
1
Link capacity expressed in channels
High-rate per-call circuits
Low-rate per-call circuits
7
Impact of mean call holding time 1 / 
5
10
30

m=1000, =1call/hour
4
m=10
3
10
18

2
m=10, =1call/hour
10
12

1
m=10, =10calls/hour
10
6
0
10
0
Mean waiting time for
delayed calls

E[W d ] (minutes)
24
m=100, =1call/hour
N
Number of ports
aggregating traffic
on to the link
10
5
10
15
20
1/ (minutes)
 ' : per host call-generation rate
Ud: 90%
m=100
m=1000
0
25
30
E[Wd ] 
1
m (1  U d )
8
Main findings of analysis

Two key parameters:

If m is small (per-circuit BW is high)
 and mean call holding time is large


and mean call holding is small (file transfers)



then need BA to avoid long waiting times
then use "call queueing"
If m is large, switch hardware costs increase
 N, number of aggregation ports, high
 level of demultiplexing high
Moderate m: best choice
9
Book-Ahead (BA) or
Immediate-Request (IR)?
Bandwidthsharing
mechanisms
Book-Ahead (BA)
Immediate-Request (IR)
eScience
networks
very large file transfers need
high-BW and long holding time
+ remote viz. need to reserve
other resources such as
displays
None?
general-purpose
networks
circuit service to only
ISPs/enterprise admins
- router-to-router circuits
circuit service for end users
- host-to-host + router-torouter (end-to-end)
- partial-path router-to-router
circuits on congested links
(called in by end user)
10
Support for the BA mechanism of
bandwidth sharing





Since RSVP-TE does not have parameters for BA
calls (call duration, start time), this mode is not
implemented in switch controllers
Need an external scheduler to manage bandwidth
into the future
Easiest to make it centralized - one per domain
Cannot utilize the BW management software
implemented in switch controllers as part of
GMPLS control-plane software
The BA mode is necessary for high-BW, long-held
calls
11
Support for the IR mechanism of
bandwidth sharing


Switches have built-in (G)MPLS controlplane software (RSVP-TE/OSPF-TE)
Bandwidth management is part of RSVP-TE
switch controller software



Hence it is distributed bandwidth management
Need to limit call holding time - reminders
for renewals and automatic release
Moderate-to-high per-call bandwidth
12
To implement BA, IR, or both?

Implement only BA





Develop and "standardize" protocols for scheduler-toscheduler signaling for interdomain circuits (one
centralized scheduler per domain)
Implement scheduler and test with other networks
Create software tools to enable scientists and
ISP/enterprise admins to visualize network topologies
and request appropriate circuits/VCs
High-BW, long-held: Therefore AAA is a must
Path being pursued by DRAGON, USN, OSCARS, UCLP
13
Opportunity missed if the whole optical
testbed community only experiments with BA

What opportunity?




Enable the creation of large-scale circuit/VC networks
with moderate-rate circuits that can support a brand
new class of applications
 economic value for the telcom industry
A "reservations-oriented" mode of networking to
complement today's connectionless Internet
 ala airlines that complement roadways
Could prove useful to FIND, GENI, net-neutrality
Alternative pricing models for bandwidth
14
What "brand new class of applications?"





Video, video, video
Gaming
Remote software access + Sync. storage
Async storage
Multimedia (large) files in web sites
15
Video applications





Improve quality of conferencing, telephony,
surveillance, entertainment and distancelearning by a significant degree
Expend bandwidth for a higher-quality, lower
latency, multi-camera, auto-movement, automixing experience
Make the "flat world" flatter
Energy savings/environmental benefits
Moderate bandwidth - IR with call
blocking/retries
16
Gaming applications



Current gamers buy personal graphics cards
Players talk of "lag" caused by differences in graphics
processing speeds
Moderate-speed circuits can enable a new class of
games in which rapidly-changing scenes are possible
compare movies in which multiple story lines keep
scenes changing vs. gaming scenes
Players connect to graphics servers
Data transferred is not GL commands, but rather
rendered bits (doable?)
Moderate bandwidth - IR with call blocking/retries




17
Remote software access/sync storage

Remote software access





Synchronous storage access


Reduce computer administration cost
Personal computers vs. machine rooms
I loaded 22 new applications on my new laptop
 Instead: connect and run!
Virtual Computing Laboratory: Mladen Vouk, NCSU
Disaster recovery
Moderate bandwidth - IR with call
blocking/retries
18
Asynchronous storage

Asynchronous storage depots will lower
costs for





backups
disaster recovery
Need for increased storage grows with
multimedia files
High bandwidth, short calls
IR with delayed start
19
Larger files in web sites

Multimedia files in web sites

Imagine the use of video/audio files in all sorts of web sites
instead of ASCII


My own course PPT files: I use audio sparingly because of
bandwidth
Think assembly instructions for electric fans, furniture


Think hotel web pages




Kinesthetic learning - show me a video
Show me exactly where the beach is relative to my room; do I
have a balcony - saying it in text format is one thing; seeing it in a
video format quite another!
Content distribution network & web caching
High bandwidth, short calls
IR with delayed start
20
Are all these "high"-BW apps just a matter of
increasing BW of links in the current Internet?



No
The socialistic mode of bandwidth sharing
on the Internet discourages individual
investment in network bandwidth
Age-old question:

should we pay for bandwidth with tax dollars "free" for the whole community?


"Tragedy of the commons" (Tanenbaum)
should we create a network where individuals
can pay for bandwidth on congested links more
directly? - think higher-toll HOV lanes
21
What does all this mean?

Let's build a scalable circuit/VC network in which
bandwidth is shared in IR mode




Scalability will create "Metcalfe's value"
Provides an opportunity to finally recoup our investment
in (G)MPLS technologies
 standards creation effort
 implementation: Cisco, Juniper, Sycamore, Movaz
Assign at least a few of the optical testbeds that we
are investing in now to study whether this IR mode of
bandwidth sharing can help with our understanding of
net-neutrality, economic growth, FIND questions
IR more natural in data world unlike in airlines (BA)
22
Argument: IR is just a "now" in BA

BA and IR cannot coexist without some
form of bandwidth partitioning



BA allows for high-BW, long-duration calls
IR calls will suffer a high call blocking rate if
supported through BA scheduler (the "addnow-as-an-option-in-scheduler" solution)
Should you admit an IR call if it arrives a few
seconds before start time of a BA call and
hope it completes before the BA call start
time, or reject the call and waste bandwidth?
23
CHEETAH and TSI


The CHEETAH network solves only part of the
TSI problem
Other problems




Cray computer I/O problem
Local-area connectivity within NCSU
If the CHEETAH project was a production
solution to support TSI, we should spend money
to solve these two problems for TSI
But as an experimental short-lived networking
project, where should we focus?
24
Outline

Strategy discussion:

What's our goal for the CHEETAH network:


Bandwidth sharing:


eScience network or a scalable GP network?
Book-Ahead (BA) or Immediate-Request (IR)?
Tactical aspects:

CHEETAH network evolution

Networking software modules

Application software modules

Interconnection to HOPI/DRAGON
25
Network evolution to support IR

Current CHEETAH network only supports
10 circuits per OC192 link


remember IR mode does not work well when m,
the link capacity in channels, is small (i.e., 10)
Recoup OC1-crossconnect capability of the
SN16Ks from its current 1Gbps use

Has three advantages



supports higher m; better for IR
GMPLS standards based signaling
Call setup delay: 166ms for two-hop instead of
1.5sec!
26
Network evolution options

Four options:




VLAN-enabled NICs + VLSR for SN16K
15454 with VLSR
IP router with VLSR
Ethernet switch with VLSR
27
Example: web caching application
VT
UVa
C'ville, VA mvsut6
UTK
duke
VLANs
CHEETAH
zelda4
ORNL
wukong
UN
C
Raleigh, NC
Atlanta, GA
zelda3
NCSU
Gatech
UGa
Xiuduan Fang, [email protected]
Bob Gisiger, [email protected]
28
VLAN-enabled NICs + SN16K VLSR


SN16K has data-plane support to map a sub-Gb/s VLAN on
an Ethernet port to a corresponding number of OC1s on a
SONET port
But, it does not have control-plane support for this type
of circuit




Even GMPLS support for the GbE port mapping to a 21-OC1
VCAT signal is an experimental release just for CHEETAH
usage
Because GMPLS support for such hybrid circuits is nonstandard
Can implement our own (non-standard) solution as a VLSR
But, goal is to use off-the-shelf switches with GMPLS
support to demonstrate IR mode
29
15454/VLSR at each PoP


Make the 15454 serve as the intermediary between
Ethernet NICs in hosts and SONET based SN16Ks at
CHEETAH PoPs
A 15454 VLSR could be useful for other projects, UCLP,
Ultralight


Two problems:



Cisco has no plans to implement a GMPLS control-plane engine
for the 15454
Non-standard solution for hybrid circuits
VLAN ID continuity requirement
Cannot support partial-path circuits
30
IP router/VLSR at each PoP




Use channelized OCxx SONET interfaces
to connect IP router to SN16K
Connect web caches to router
Have routers initiate pure SONET circuit
setup
Use PBR or just ordinary routing table
update to map flows to different OCxx
circuits; support multiple circuits from one
web cache
31
CHEETAH wide-area network
UVa
Raleigh PoP (MCNC)
SN16000
via NCREN
OC192 Control GbE/
card
card
10GbE
card
CUNY
NCSU
End hosts
OC-192 (via NLR/SLR/NCREN)
ORNL PoP
SN16000
End hosts
Atlanta PoP (SOX/SLR)
SN16000
GbE/ Control
OC192
10GbE card
card
card
OC192 Control GbE/
10GbE
card
card
card
End hosts
OC-192
ORNL
GaTech
32
CHEETAH evolution
to support sub-Gb/s circuits
UVa
GbE
Raleigh PoP
SN16000
CUNY
GbE
GbE
OC192 Control OC192
card
card
card
NCSU
GbE
End hosts
OC-192 (via NLR/SLR/NCREN)
ORNL PoP
SN16000
GbE
End hosts
Atlanta PoP
SN16000
OC192 Control OC192
card
card
card
OC-192
GbE
ORNL
GbE
OC192 Control OC192
card
card
card
End hosts
GbE
GaTech
33
IP router/VLSR at each PoP

Can support end-to-end circuits






web caching
CDN servers
video apps at 10-15Mbps - map to one OC1
storage depots
Has the potential to support PPCs (partial-path
circuits)
Place router with VLSR in enterprises at edge of
GbE cheetah access link
34
Ethernet switch/VLSR at each PoP

Does not help with the problems noted in today's
Gb/s circuit use of the SN16K





long call setup delays: 1.5sec
non-standard solution
high per-circuit BW
Using an Ethernet switch/VLSR at an enterprise
(e.g. CUNY) requires all VLANs sharing 1Gbps
CHEETAH access link to be switched to the same
exit SN16K.
Even worse, m=1 if whole 1Gb/s link used for a
circuit
35
Software modules required

Networking software:



CVLSR for IP router
CTCP code to support multiple simultaneous
flows
Application software:



Add CHEETAH API to web caching squid
software
Write software for video apps
CDN and storage software
36
Upcoming year goals specified in
special report
Work item
Milestones and
deliverable
Responsible individuals
Item I
Stabilize the CHEETAH
network; increase user
base; complete doc.
MV and Xuan/Tao
Item II
Extend CHEETAH
network by adding
routers/VLAN
switches/MSPPs
MV and Tao
Item III
Interconnect to
testbeds, such as HOPI,
USN, DRAGON
MV and Tao
37
Upcoming year goals specified in
special report
Work item
Milestones and
deliverable
Responsible individuals
Item IV
Develop software,
both apps and
networking s/w, such
as CVLSRs
Router CVLSR: Mark
CTCP: Helali
Item V
Support ORNL and
NCSI in TSI apps
Mark and Tao/MV
Item VI
Enhance theoretical
understanding of
sharing modes
IR blocking/retries: XF
BA: Xiangfei
IR delayed start: Mark
Apps: Xiuduan
38
Equipment required

IP routers with channelized SONET cards with GA GMPLS
UNI implementation





need one for ORNL PoP
if we can partner with SOX in ATL, NCREN in Raleigh, MAX
in McLean, purchase channelized OC192 cards
IP routers with GbE blades for V. Tech and UVA
If NC 454 transponders are unavailable, purchase
transponders for DC-Raleigh NLR link - since HOPI
doesn't have this link
Colocation costs at NLR McLean and Raleigh
39
Interconnect CHEETAH and HOPI



Through IP routers
With our IP router/VLSR combo, setup
router-to-route SONET OC1 circuits via
cheetah and router-to-router VLAN virtual
circuits through HOPI. At routers, do PBR
mapping for flows or just update routing
tables
This means packets go back to IP layer
between two networks
40
CHEETAH-HOPI interconnection
Web
cache
VLAN
Web
cache
Web
cache
McLean, VA
MPLS
Web
cache
NC
10GbE
Web
cache
Web
cache
CHEETAH
SONET
HOPI network: courtesy of Rick Summerhill
Web
cache
TN
Web
cache
41
GA
HOPI and web caching




Seems like a good match
Rick's black cloud experiment - same as
web caching
Exercises "hybrid" goal of HOPI
Small per-circuit BW possible with VLANs
42
Connection to DRAGON



Spoke with Jerry Sobieski, Aug. 12, 2006
He said DRAGON PoPs have Ethernet
VLAN switches
Therefore, can use similar IP router
demarcation points to interconnect
CHEETAH/HOPI to DRAGON
43
Conclusions

We'd like to


enable and demonstrate general-purpose apps
using circuit/VC service with scalability as a
key goal
support IR mode of bandwidth sharing with
limited per-call bandwidth and limited call
holding times


call blocking with retries
delayed start
44