Welcome June 1998 NOW Finale David E. Culler 6/15/98 NOW Finale NOW Finale 1/98 Many PhDs 2nd PhD 6/97 NPACI NOW Sort 1/97 CS 267 CS 267 6/96 Asplos Workshop II 1/96 NOW II 6/95 1st PhD Inktomi 1/95 CS 258 6/94 NOW I 1/94 Case for.
Download ReportTranscript Welcome June 1998 NOW Finale David E. Culler 6/15/98 NOW Finale NOW Finale 1/98 Many PhDs 2nd PhD 6/97 NPACI NOW Sort 1/97 CS 267 CS 267 6/96 Asplos Workshop II 1/96 NOW II 6/95 1st PhD Inktomi 1/95 CS 258 6/94 NOW I 1/94 Case for.
Welcome June 1998 NOW Finale David E. Culler 6/15/98 NOW Finale NOW Finale 1/98 Many PhDs 2nd PhD 6/97 NPACI NOW Sort 1/97 CS 267 CS 267 6/96 Asplos Workshop II 1/96 NOW II 6/95 1st PhD Inktomi 1/95 CS 258 6/94 NOW I 1/94 Case for NOW Asplos Workshop I CS 252 NOW 0 G-Ether VIA SCI Myrinet Start of Funding ATM, fddi NOW Project Timeline 6/98 Metrics of Success • • • • • • • • Project goals? Papers published? Technology transfer? Adoption of approach in the real world? Students produced? Marriages? Research results? Unexpected research results? • All of the above? NOW Finale Project Goals • Fundamental change in how we design largescale computing systems – snap together commodity components – self-managing, self-tuning, highly available • Make the “killer network” real – realize the potential of emerging hardware technology – and push its effect through the rest of the system • Integrated system on a building-wide scale – pool of resources (proc, disk mem) – remote processor and memory closer than local disk – federation of systems with local and global role • The right way to build internet services NOW Finale NOW Software Components Parallel Apps Large Seq. Apps Sockets, Split-C, MPI, HPF, vSM Name Svr Global Layer Unix Unix Workstation Unix Workstation Unix Workstation VN segment Driver AM L.C.P. VN segment Driver AM L.C.P. VN segment Driver AM L.C.P. Myrinet Scalable Interconnect NOW Finale Active Messages Unix (Solaris) Workstation VN segment Driver AM L.C.P. Adoption of the Approach NOW Finale NOW publications • Over 40 papers and counting • wide range of important venues – IEEE Micro, ACM TOCS, ISCA, ASPLOS, SOSP, SIGMETRICS, OSDI, SIGMOD, SPAA, SC, IPPS/SPDP, JSPP, USENIX, Hot Interconnects, SW Prac. and Exp., SPDT, HPCA, … • countless presentations NOW Finale NOW Students • Moved on – Mike Dahlin (UT), Steve Rodriguez (NetApp), Steve Luna (HP), Lok Tin Liu (Intel), Cedric Krumbein (Microsoft) • Moving on – Doug Ghormley (Sandia), Randy Wang (Princeton), Amin Vahdat (Duke), Andrea Arpaci-Dusseau (Stanford), Steve Lumetta (UIUC), Rich Martin (Rutgers) • Finishing – Remzi Arpaci-Dusseau, Satoshi Asami, Alan Mainwaring, Jeanna Neefe Mathews, Drew Roselli, Nisha Talagala • On to other projects in CS – Brent Chun, Kim Keeton, Chad Yoshikawa, Fred Wong • and several undergrads – Josh Coates, Alec Woo, Eric Schein, ... NOW Finale Research Results highlighted in today’s presentations NOW Finale Comm. Performance => Evaluation • Demonstrated on LogP microbenchmarks with GAM • Rich Martin (9:25) Sensitivity to Network Characteristics Occams Razor: 10µs User to User Use r Comm La yer Ker nel Suppor t Pro cessor $ $ 2µs 2µs Memory Bus NI Link 0.5 µs Networ k Fa bric (S witch) 0.5 µs 16 5µs 14 12 g L Or Os 10 µs From “NOW Communication Architecture” Jan 1994 Retreat 8 6 4 2 NOW Finale U ltr a ar ag on M ei ko P W O N N O W SS 10 U ltr a ar ag on M ei ko P W O N N O W SS 10 0 Novel System Design Techniques • Andrea ArpaciDusseau (9:50) Implicit Coscheduling: From Simulation To Implementation And Back Again NOW is “federalism” • Large, collective pool of resources – Not just networked services • Building block is complete computer • Authority, control, responsibility divided between local operating system and global operating system • How is the ensemble organized? • Who does it? • Based on what? NOW 695 2 From “On Self-organizing systems,” June 1995 Retreat NOW Finale Understanding Parallel Appln Perf. Mainframe MiniVector Supercomputer supercomputer Minicomputer 2003 Computer Food Chain Portable Computers Networks of Desktop Computers Case 24 From “Case for NOW” Jan 1994 Retreat NOW Finale • Frederick Wong (10:25) Understanding Application Scaling: NAS Parallel Benchmarks on the NOW and SGI Origin 2000 Gigabytes sorted Fast Parallel I/O 9 8 7 6 5 4 3 2 1 0 Minute Sort SGI Orgin SGI Power Challenge 0 • Remzi Arpaci-Dusseau & Eric Anderson (10:50) Robust I/O Performance in River NOW Finale 10 20 30 40 50 60 Processors 70 80 90 100 Automatic Network Mapping • Lab Tour NOW Finale Scalable Services Information appliances Stationary desktops Scalable Servers • Wingman/NOW transcoding proxy demo NOW Finale Virtual Networks Implications (system) • Independent scheduling – provide concept of network process – NI stamps NPID in message and checks against current process – Vector inactive messages to kernel, package messages for current nPID conveniently » avoid interrupt if attentive, multiple messages per int, . . . – Context switch support (???) • Shared Network – destination should always be able to accept packets Reality check: 10 ms page fault => 200 KB at 155 Mb/s => 750 KB at 622 Mb/s => End-to-end flow control needed to ensure that resources are available at destination w/i net process. • Virtual Memory – address translation on dest (miss rate?) From Jan 1994 Retreat NOW Finale • Alan Mainwaring (1:00) Communication Retrospectives New look at File Systems Example: Traditional File System Clients Server $ $$$ $ Local Private File Cache °° ° Bottleneck $ Fast Ch annel (HPPI) RAID Disk Storage Global Shared File Cache • Expensive • Complex • Non-Scalable • Single point of failure • Server resources at a premium • Client resources poorly utilized NOW 8 NOW Finale • Drew Roselli (1:25) Huge File Traces • Mike Dahlin (1:50) xFS and Beyond • Randy Wang (2:45) Intelligent Disks Cluster Design • Steve Lumetta (3:10) Trends in Cluster Architectures Q2: What is the Hardware Organization? • Wide scope for innovation nCUBE: MEM M/C 28 dma channels P Splat: CM-5: M HP/Me dusa: sbus M $ mbus P M $ $ P P Paragon: Meiko: M µ $ P mbus P M $ $ P P Networks are all over the map as w ell! From Jan 1994 Retreat NOW Finale grap hics Vast, Cheap Storage • Nisha Talagala and Satoshi Asami (3:35) Large-scale Storage Devices NOW Finale Beyond Clusters • Amin Vahdat (3:50) WebOS: Infrastructure for World-Wide Computing NOW Finale New Scale and New Technology • Matt Welsh, Millennium • Philip Buonodonna, VIA • Eric Brewer, The Pro-active Infrastructure Millennium Computational Community Bus iness SIMS BMRC Che mistry C.S . E.E. Biology Gigabit Ethernet Astro NERSC M.E . Physics N.E . IEOR C. E. Tra nsport MSME Econom y Math NOW 45 NOW Finale Many Thanks • To all of you visitors for coming – and for guiding us through many retreats – and for tremendous support • To the CS division – an environment that made it possible • To an incredible group of students who made NOW a successful project – by any metric • I think you will enjoy these final presentations NOW Finale