NUMAの構成 - Keio University

Download Report

Transcript NUMAの構成 - Keio University

NORA/Clusters

AMANO, Hideharu Textbook pp.

140-147

NORA (No Remote Access Memory Model)  No hardware shared memory  Data exchange is done by messages (or packets)  Dedicated synchronization mechanism is provided.

  High peak performance Message passing library ( MPI,PVM) is provided.

Message passing ( Blocking ) Send Receive Send Receive

Message passing ( Non-blocking ) Send Receive Send Receive

PVM (Parallel Virtual Machine)  A buffer is provided for a sender.

 Both blocking/non-blocking receive is provided.

 Barrier synchronization

MPI (Message Passing Interface)  Superset of the PVM for 1 to 1 communication.

 Group communication  Various communication is supported.

 Error check with communication tag.

 Next week, a homework based on MPI with cluster will be presented.

Shared memory model vs . Message passing model  Benefits  Distributed OS is easy to implement.

 Automatic parallelize compiler.

  Message passing like Smalltalk requires shared memory.

GHC → KL/I  Message passing  Formal verification is easy (Blocking)  No-side effect (Shared variable is side effect itself)  Small cost

Multicomputers vs. Clusters    Multicomputers    Dedicated hardware (CPU, network) High performance but expensive Hitachi’s SR8000, Cray T3E, etc.

WS/PC Clusters    Using standard CPU boards High Performance/Cost Standard bus often forms a bottleneck Beowluf Clusters    Standard CPU boards, Standard components LAN+TCP/IP Free-software  A cluster with Standard System Area Network(SAN) like Myrinet is often called Beowulf Cluster

Beowluf Clusters (RWCP, using Myrinet)

Networks for NORA machines/Clusters  Direct or Non-symmetric indirect networks  Nodes are connected with links.

 Locality of communication can be used.

 Extension to large size is easy.

Basic direct networks Linear Ring Central concentration Tree つ り x Complete connection Mesh

Metrics of Direct interconnection network ( D and d)    Diameter : D  Number of hops between most distant two nodes through the minimal path degree: d  The largest number of links per a node.

D represents performance and d cost represents Recent trends: Performance: Throughput Cost: The number of long links

Diameter 2(n-1)

Other requirements  Uniformity : Every node/link has the same configuration.

 Extendability: The size can be easily extended.

 Fault Torelance: A single fault on link or node does not cause a fatal damage on the total network.

 Embeddability: Emulating other networks  Bisection Bandwidth

bi-section bandwidth The total amount of data traffic between two halves of the network.

Hypercube 0000 0100 0001 0101 0010 0110 0011 0111 1000 1001 1011 1100 1101 1010 1110 1111

Routing on hypercube 0101→ 1100 Different bits 0000 0100 0001 0101 0010 0110 0011 0111 1000 1001 1011 1100 1101 1010 1110 1111

The diameter of hypercube 0101→1010 All bits are different → the largest distance 0000 0100 0001 0101 0010 0110 0011 0111 1000 1001 1100 1101 1010 1110 1011 1111

Characteristics of hypercube  D=d= logN  High throughput, Bisection Bandwidth  Enbeddability for various networks  Satisfies all fundamental characteristics of direct networks ( Extendability is quistionable )  Most of the first generation of NORA machines are hypercubes ( iPSC , NCUBE , FPS-T )

Problems of hypercube  Large number of links  Large number of distant links  High bandwidth links are difficult for a high performance processors.

  Small D does not contribute performance because of innovation of packet transfer.

Programming is difficult: → Hypercube’s dilemma

Is hypercube extendable?

 Yes ( Theoretical viewpoint )   The throughput increases relational to the system size.

No ( Practical viewpoint )  The system size is limited by the link of node.

Hypercube ’ s dilemma   Programming considering the topology is difficult unlike 2-D,3-D mesh/torus Programming for random communication network cannot make the use of locality of communication.

•2-D/3-D mesh/torus •Killer applications fit to the topology •Partial differential equation, Image processing,… •Simple mapping stratedies •Frequent communicating processes should be Assigned to neighboring nodes

k ary n cube  Generalized mesh/torus  K-ary n digits number is assigned into each node   For each dimension (digit), links are provided to nodes whose number are the same except the dimension in order.

Rap-around links ( n 1→ 0 ) form a torus, otherwise mesh.

k-ary n-cube 0 0 0 1 0 2 1 0 1 1 1 2 2 0 2 1 2 2 3-ary 1-cube 3-ary 2-cube

k-ary n-cube 0 0 0 0 1 0 0 2 0 1 0 0 0 0 1 1 1 0 0 1 1 1 2 0 0 2 1 2 0 0 1 0 1 0 0 0 1 1 1 0 1 2 0 1 2 1 0 2 2 2 0 1 1 0 2 1 1 1 2 1 2 2 2 2 1 1 2 0 2 2 1 2 2 2 2 3-ary 1-cube 3-ary 2-cube 3 ary 3 cube

3 -ary 4-cube 0*** 1*** 2***

k-ary n-cube 300 000 100 200 001 002 400 003 004 010 020 030 040 014 024 034 044 444

5-ary 4-cube 0*** 4*** 3*** 1*** 2***

Properties of k-ary n-cube   A class of networks which has Linear, Ring 2 D/3-D mesh/torus and Hypercube ( binary n cube ) as its member.

Small d=2n but large D ( 1/n O(k ))  Large number of neighboring links  k-ary n-cube has been a main stream of NORA networks. Recently, small-n large-k networks are trendy.

Exercise  Calculate Diameter (D) and degree (d) of the 6 ary 4-cube (mesh-type).