Welcome June 1998 NOW Finale David E. Culler 6/15/98 NOW Finale NOW Finale 1/98 Many PhDs 2nd PhD 6/97 NPACI NOW Sort 1/97 CS 267 CS 267 6/96 Asplos Workshop II 1/96 NOW II 6/95 1st PhD Inktomi 1/95 CS 258 6/94 NOW I 1/94 Case for.

Download Report

Transcript Welcome June 1998 NOW Finale David E. Culler 6/15/98 NOW Finale NOW Finale 1/98 Many PhDs 2nd PhD 6/97 NPACI NOW Sort 1/97 CS 267 CS 267 6/96 Asplos Workshop II 1/96 NOW II 6/95 1st PhD Inktomi 1/95 CS 258 6/94 NOW I 1/94 Case for.

Welcome
June 1998 NOW Finale
David E. Culler
6/15/98
NOW Finale
NOW Finale
1/98
Many
PhDs
2nd PhD
6/97
NPACI
NOW Sort
1/97
CS 267
CS 267
6/96
Asplos Workshop II
1/96
NOW II
6/95
1st PhD
Inktomi
1/95
CS 258
6/94
NOW I
1/94
Case for NOW
Asplos Workshop I
CS 252
NOW 0
G-Ether
VIA
SCI
Myrinet
Start of Funding
ATM, fddi
NOW Project Timeline
6/98
Metrics of Success
•
•
•
•
•
•
•
•
Project goals?
Papers published?
Technology transfer?
Adoption of approach in the real world?
Students produced?
Marriages?
Research results?
Unexpected research results?
• All of the above?
NOW Finale
Project Goals
• Fundamental change in how we design largescale computing systems
– snap together commodity components
– self-managing, self-tuning, highly available
• Make the “killer network” real
– realize the potential of emerging hardware technology
– and push its effect through the rest of the system
• Integrated system on a building-wide scale
– pool of resources (proc, disk mem)
– remote processor and memory closer than local disk
– federation of systems with local and global role
• The right way to build internet services
NOW Finale
NOW Software Components
Parallel Apps
Large Seq. Apps
Sockets, Split-C, MPI, HPF, vSM
Name Svr
Global Layer Unix
Unix
Workstation
Unix
Workstation
Unix
Workstation
VN segment
Driver
AM L.C.P.
VN segment
Driver
AM L.C.P.
VN segment
Driver
AM L.C.P.
Myrinet Scalable Interconnect
NOW Finale
Active Messages
Unix (Solaris)
Workstation
VN segment
Driver
AM L.C.P.
Adoption of the Approach
NOW Finale
NOW publications
• Over 40 papers and counting
• wide range of important venues
– IEEE Micro, ACM TOCS, ISCA, ASPLOS, SOSP,
SIGMETRICS, OSDI, SIGMOD, SPAA, SC, IPPS/SPDP,
JSPP, USENIX, Hot Interconnects, SW Prac. and Exp.,
SPDT, HPCA, …
• countless presentations
NOW Finale
NOW Students
• Moved on
– Mike Dahlin (UT), Steve Rodriguez (NetApp), Steve Luna
(HP), Lok Tin Liu (Intel), Cedric Krumbein (Microsoft)
• Moving on
– Doug Ghormley (Sandia), Randy Wang (Princeton), Amin
Vahdat (Duke), Andrea Arpaci-Dusseau (Stanford), Steve
Lumetta (UIUC), Rich Martin (Rutgers)
• Finishing
– Remzi Arpaci-Dusseau, Satoshi Asami, Alan Mainwaring,
Jeanna Neefe Mathews, Drew Roselli, Nisha Talagala
• On to other projects in CS
– Brent Chun, Kim Keeton, Chad Yoshikawa, Fred Wong
• and several undergrads
– Josh Coates, Alec Woo, Eric Schein, ...
NOW Finale
Research Results highlighted in
today’s presentations
NOW Finale
Comm. Performance => Evaluation
• Demonstrated on
LogP microbenchmarks with
GAM
• Rich Martin (9:25)
Sensitivity to
Network
Characteristics
Occams Razor: 10µs User to User
Use r Comm La yer
Ker nel Suppor t
Pro cessor
$
$
2µs
2µs
Memory
Bus
NI
Link
0.5 µs
Networ k Fa bric (S witch)
0.5 µs
16
5µs
14
12
g
L
Or
Os
10
µs
From “NOW Communication
Architecture”
Jan 1994 Retreat
8
6
4
2
NOW Finale
U
ltr
a
ar
ag
on
M
ei
ko
P
W
O
N
N
O
W
SS
10
U
ltr
a
ar
ag
on
M
ei
ko
P
W
O
N
N
O
W
SS
10
0
Novel System Design Techniques
• Andrea ArpaciDusseau (9:50)
Implicit
Coscheduling:
From Simulation
To Implementation
And Back Again
NOW is “federalism”
• Large, collective pool of resources
– Not just networked services
• Building block is complete computer
• Authority, control, responsibility divided
between local operating system and global
operating system
• How is the ensemble organized?
• Who does it?
• Based on what?
NOW 695 2
From “On Self-organizing systems,”
June 1995 Retreat
NOW Finale
Understanding Parallel Appln Perf.
Mainframe
MiniVector Supercomputer
supercomputer
Minicomputer
2003 Computer Food Chain
Portable
Computers
Networks of Desktop Computers
Case 24
From “Case for NOW”
Jan 1994 Retreat
NOW Finale
• Frederick Wong
(10:25)
Understanding
Application Scaling:
NAS Parallel
Benchmarks on the
NOW and SGI Origin
2000
Gigabytes sorted
Fast Parallel I/O
9
8
7
6
5
4
3
2
1
0
Minute Sort
SGI Orgin
SGI Power
Challenge
0
• Remzi Arpaci-Dusseau &
Eric Anderson (10:50)
Robust I/O Performance
in River
NOW Finale
10
20
30
40
50
60
Processors
70
80
90
100
Automatic Network Mapping
• Lab Tour
NOW Finale
Scalable Services
Information
appliances
Stationary
desktops
Scalable Servers
• Wingman/NOW
transcoding proxy
demo
NOW Finale
Virtual Networks
Implications (system)
• Independent scheduling
– provide concept of network process
– NI stamps NPID in message and checks against current
process
– Vector inactive messages to kernel, package messages
for current nPID conveniently
» avoid interrupt if attentive, multiple messages per int, . . .
– Context switch support (???)
• Shared Network
– destination should always be able to accept packets
Reality check: 10 ms page fault => 200 KB at 155 Mb/s
=> 750 KB at 622 Mb/s
=> End-to-end flow control needed to ensure that
resources are available at destination w/i net process.
• Virtual Memory
– address translation on dest (miss rate?)
From Jan 1994 Retreat
NOW Finale
• Alan Mainwaring
(1:00)
Communication
Retrospectives
New look at File Systems
Example: Traditional File System
Clients
Server
$
$$$
$
Local
Private
File Cache
°° °
Bottleneck
$
Fast Ch annel (HPPI)
RAID
Disk Storage
Global
Shared
File Cache
• Expensive
• Complex
• Non-Scalable
• Single point of failure
• Server resources at a premium
• Client resources poorly utilized
NOW 8
NOW Finale
• Drew Roselli
(1:25) Huge
File Traces
• Mike Dahlin
(1:50) xFS and
Beyond
• Randy Wang
(2:45)
Intelligent
Disks
Cluster Design
• Steve Lumetta
(3:10) Trends in
Cluster
Architectures
Q2: What is the Hardware Organization?
• Wide scope for innovation
nCUBE:
MEM
M/C
28 dma channels
P
Splat:
CM-5:
M
HP/Me dusa:
sbus
M
$
mbus
P
M
$
$
P
P
Paragon:
Meiko:
M
µ
$
P
mbus
P
M
$
$
P
P
Networks are all over the map as w ell!
From Jan 1994 Retreat
NOW Finale
grap hics
Vast, Cheap Storage
• Nisha Talagala and
Satoshi Asami
(3:35) Large-scale
Storage Devices
NOW Finale
Beyond Clusters
• Amin Vahdat
(3:50) WebOS:
Infrastructure
for World-Wide
Computing
NOW Finale
New Scale and New Technology
• Matt Welsh, Millennium
• Philip Buonodonna, VIA
• Eric Brewer, The Pro-active Infrastructure
Millennium Computational Community
Bus iness
SIMS
BMRC
Che mistry
C.S .
E.E.
Biology
Gigabit Ethernet
Astro
NERSC
M.E .
Physics
N.E .
IEOR
C. E.
Tra nsport
MSME
Econom y
Math
NOW 45
NOW Finale
Many Thanks
• To all of you visitors for coming
– and for guiding us through many retreats
– and for tremendous support
• To the CS division
– an environment that made it possible
• To an incredible group of students who made
NOW a successful project
– by any metric
• I think you will enjoy these final presentations
NOW Finale