Power Integrity: A Nanoscale VLSI Challenge
Download
Report
Transcript Power Integrity: A Nanoscale VLSI Challenge
Power Integrity: A Nanoscale
VLSI Challenge
Raj Nair, Anasim Corporation
Oct. 2, 2008
Overview
Scaling & its less-known L*di/dt challenge
The Power Wall, breaking through
True-Electromagnetic PI analysis
Energy & Noise Management
Non-disruptive Scaling
Summary
October 2008
© 2008AnaSIM
2
Scaling Progression
October 2008
© 2008AnaSIM
3
Planar CMOS transistor Scaling
Gate
Delay 1
Freq 1
1
1
Tox
Source
L
Body
Drain
1
Gate
0.7 Tox
Source
0.7 L
Body
October 2008
Drain
0.49
0.7
Delay 0.7
1
Freq
1.43
0. 7
A lower Energy*Delay*Cost
product… but challenges led to a
0.7
© 2008AnaSIM severe Power Wall
4
Scaling Challenge: CPU Chip PI
Transistors double
every ~18
months
Power doubles
every ~36
months
Operating modes
create load
shifts
Which create
supply voltage
‘droops’
Microprocessor
Heat Spreader
Package Managed by
Substrate package
Mother Board
Capacitors
(Original
figure from C. Baldwin)
devices
Pentium™ is a trademark of Intel® Corporation
October 2008
© 2008AnaSIM
5
Package CAP Loop-L scaling
Load shift induced voltage noise
equation and derivation of package
component characteristics scaling
gives
and
10
loop-L, pH
Inversely related to
process scaling
(on-die cap) &
(freq. scaling)2
Quintuplet and Triplet loop-L scaling
1
0.1
Q-scaling
References:
0.01
Nair
P858
2001 Intel Assembly Technology Journal – Invited Paper on ‘Pathfinding’
2002 Intel Technology Journal paper “Emerging Directions for Packaging…”
October 2008
© 2008AnaSIM
T-scaling
<<0.1pH!
P860
P1262
Process
P1264
65nm
6
On-die L & L*di/dt challenge
Consider a Roots of Two Scaling [1] scenario:
Capacitance-per-unit-area, Ca, scales by 2 , operating voltage scales by
1
, frequency
2
scales by 2 , and chip area scales by
1
2
.
Following this constant-power scaling direction, and inspecting the change in Dynamic Voltage Droop
in a unit area (/ua) of integrated silicon, we get:
Since C/ua scales by
2 and voltage scales by
1
, I scales by
2* 2.
2
Assuming the effective L/ua doesn't change,
Multiplying I and
I
L
reduces by a factor of
C
1
.
2
L
in the scaled process generation, the dynamic voltage droop amplitude:
C
L
scales by
C
2* 2*
1
, or by a factor of
2
for constant power scaling.
2
References:
Nair, Nair & Bennett, 2008
EDADesignline® publications “A Power Integrity Wall follows the Power Wall” & “Dynamic Voltage Droops & Total PI”
October 2008
© 2008AnaSIM
7
The Power Wall
Processor power doubles
every ~36 months…
Pentium™
processors
Power (Watts)
100
1
486
286
10
8086
8085
8080
386
8008
4004
0.1
’71
’74
’78
’85
’92
’00
’04
’08
CPU power is now capped or reducing
in Multi-Core, SoC Architectures
References:
Nair
2001 Intel Assembly Technology Journal – Invited Paper on ‘Pathfinding’
2002 Intel Technology Journal paper “Emerging Directions for Packaging…”
October 2008
© 2008AnaSIM
Pentium™ is a trademark of Intel® Corporation
8
Scaling into the Nanoscale Era
Drain induced barrier lowering in short
channel devices makes leakage increase
with Vds (V-Supply)
Source:
Narendra & Chandrakasan
Leakage in Nanometer Technologies, Springer Publications, 2005
October 2008
Sub-threshold channel leakage dominates
V-supply dependent leakage.
Greater challenges…
High-K is only a partial solution
© 2008AnaSIM
9
Power, Performance, Leakage
Power
Active power estimated as
(P=CV2f), V P
With supply-V scaling, VT
must scale for performance
Leakage
IOFF 10е(VT/S), and S
~= 85mV/decade
IOFF rises 10X with 85mV
reduction in VT
Short channel barrier
lowering (DIBL, ) IOFF
Source: S. Narendra, Tyfone, Inc.
October 2008
© 2008AnaSIM
10
Power Trend
54%
42%
18%
4%
0%
0%
0%
Power trend
15
Present
10
Switching
Leakage
Total
5
0
1.4X
Sub-threshold
leakage
0.01
0.1
1
Channel length (um)
Leakage power now equals active power!!
Both strongly dependent upon supply voltage.
Reference: Narendra, ICCAD ‘03
October 2008
© 2008AnaSIM
11
Performance w/ Voltage Scaling
IDS (VGSVT)
15nm
Delay & Frequency
Linear dependence in
deep nanoscale CMOS
CVdd/Ids ~constant
Nanoscale CMOS delay
and performance are
roughly constant within
a ΔVdd range of Vdd
Opportunity
Use lowest possible Vdd!
500
Vg = 0.8V
Intel’s 15nm NMOS
m A/ m m)
25 nm
Drain Current (
400
0.7V
300
0.6V
200
0.5V
0.4V
100
0.3V
0
Accurate supply noise estimation
a must
© 2008AnaSIM
October 2008
0
0.2
0.4
0.6
0.8
Drain Voltage (V)
Chau et al., IEDM 2000
12
}
}
}
Voltage Minimization is Key
Total power strongly V-dependent
Active power proportional to V2
Leakage (tunneling) also related as Vx (DIBL, Efield, tunneling distance reduction with V)
Energy / task minimized similarly
ΔIDS ~linearly related to ΔV in nanoscale processes
How low can you go?
Digital: supply gating
Analog: fine grain supply voltage control
Need accurate noise, power integrity estimation
Reference: Nair, AZ Nanotechnology Symposium 2006
October 2008
© 2008AnaSIM
13
Supply Noise Analyses Examples
Package simulation
Chip == current source
Chip grid IR drop analysis
Power Grid == Resistance
Are these the high levels of approximation (for design optimization) desirable??
October 2008
© 2008AnaSIM
14
On-Chip Analysis Challenge
Atomic or Abstract?
Polygonal Analyses
Nanoscale IC’s today face
exploding computational
complexity (R, L, C, I, dI)
Energy & Efficiency
October 2008
Analyze supply surface
ripples by ‘molecular’
interactions?
© 2008AnaSIM
Must know IC’s supply
ripples for optimization
15
Meeting the Challenge
Differential Power
ECD: Continuum models
Grid is uniform; treat as a
voltage-continuum along a
single surface USPTO PUB
Include R, L, C and solve
‘true-electromagnetically’
Abstract silicon, package
October 2008
Voltage is a potential
difference; treat power
grid differentially
Partition hierarchically &
exploit symmetry
© 2008AnaSIM
Include distributed models
for silicon loads, CAP, pkg
and board components
16
Abstraction & Physics-based Sims
High levels of Abstraction
Comprehensive Modeling
All grid electromagnetic
properties, R, L, C used
Actual block load current
profiles used; di/dt, load
activity factors included
Physics based Simulation
October 2008
Power GRID as SURFACE
DISTRIBUTED circuit load
currents & capacitance
SYMMETRY in physical as
well as electrical aspects
© 2008AnaSIM
Field solver employed for
Maxwell’s equations on
‘surfaces’ / NO ‘models’
17
On-Die CAP for Noise Reduction
With Die Caps
Without Die Caps
Simple, lumped SPICE analyses indicate On-Die CAP helps in ΔVCC reduction
Area cost, Gate Oxide leakage are concerns
Reference: Narendra, ICCAD ‘03
October 2008
© 2008AnaSIM
18
SoC Power Integrity Simulation
R+L+C Dynamic Noise Simulation in
-fp
9 x 7mm
chip
Differential
noise
5nF /sq. cm
distributed
CAP
Explicit CAP
LENS
100mA peak
noise pulse
of 100ps
width
Pulse noise
source
Power grid
simulation
Do CAPACITORS (reactive devices) really absorb/expend noise energy?
Source: D. Bennett, ANASIM Corp.,
October 2008
-fp power integrity aware floor planner,
www.anasim.com
Use slide show © 2008AnaSIM Animation slide
19
Resonant effects; More / Less CAP?
Single active circuit block in a 4x4mm IC
resonance
-fp ‘what-if’ experiments showing effect of gate switching
time and on-chip de-cap on maximum voltage droop.
© 2008AnaSIM
October 2008
20
CAP Connectivity & Noise
Analysis on a CLOCK chip
Corner CAPs connected to IO Ring
Corner CAPs connected to Core Grid
CAPACITOR blocks from IO ring corners connected into Core
power grid increased noise in the core grid
Source: ANASIM Corp.,
October 2008
-fp power integrity aware floor planner,
www.anasim.com
© 2008AnaSIM
21
SoC Power Grid a Noise Conduit
Low impedance grids
conduct and sum up
supply noise
Low energy loss in
global power grids
more, sustained noise
Scaling and high perf.
high local di/dt &
loop inductance leads
to greater local noise
Reference: Bennett, EEDesign 2003 article, www.anasim.com
October 2008
© 2008AnaSIM
Animation
slide
22
Power Gating & Noise Flow
Power Gating transforms preferred pathways for noise flow in addition to transient
noise generation due to large switched capacitances…
Source: ANASIM Corp.,
October 2008
-fp power integrity aware floor planner,
www.anasim.com
© 2008AnaSIM
Animation
slide
23
Example: System-level Chip Sim
GUI or Netlist capture
Experiment-1 results
-fp simulation schematic illustration
(hyperlinked image)
October 2008
Chip NETLIST
Load current profiles are
pulse100gap100 and
pulse200gap200
SYMMETRY in physical as
well as electrical aspects
Chip grid ANIMATION &
Mirror
Notice substantial voltage
variation of top left corner
Cap 200pF added: results
© 2008AnaSIM
Chip grid ANIMATION &
Mirror
24
Chip Grid R, L + C Design
Maximum noise with grid wire width, 3nf cap
300
Noise at 10u wire width and varied on-chip cap
200
250
175
Noise (mv)
200
150
150
100
125
50
100
Noise (mv)
2
0
5
10
20
30
40
3
4
5
On-chip CAP (nf)
50
Wire width (microns)
With fixed on-chip capacitance value, increase in grid wire width
(reduction in resistance with minimal benefit in inductance)
reduces noise to a point
Increase in capacitance on-die has sub-linear benefit in noise
reduction; more CAP is not always good…
October 2008
© 2008AnaSIM
25
How does it help design?
-fp bridges the PI gap
Complements IR Drop
Energy: Optimizes supply
traditional IC analysis with
true-electromagnetic sims
voltage domain levels
Optimizes power grids;
front-end CAP planning
Reduces cost: Routing,
Chip Area, Design Effort
October 2008
Minimizes costly design
iterations
© 2008AnaSIM
System-Level analysis
26
Dynamic Energy Management
Power Performance,
Energy Battery Life
Power & Energy
Task completion at low
power with low frequency,
but same energy (PT)
Must minimize voltage!
Circuits/Design in SoC’s
Learning from CPUs’ VID bus and Adaptive
Voltage Positioning for Power Integrity
Adaptive Voltage Scaling,
Dynamic Voltage scaling
PowerWise™ compliant
example in figure
Benefits in V-domains
Figure source: Mobile Handset Designline How To, 2007
October 2008
© 2008AnaSIM
27
External Voltage & Noise Control
Distributed Local Voltage Regulation
Multiplies bandwidth: smaller the regulator, faster it can be
Local placement ensures low latency, high loop bandwidth
Decentralized, simple, hardware energy management design
Issues: Component testing
Stacked Active Passives Integration (SAPI)
Non-disruptive, practical and low-cost
References: Nair, US Patents 6084385, 6081105, 5955870, USPTO publication 20030081389, US Patent 7291896
October 2008
© 2008AnaSIM
28
Prescott-CPU ANR Inclusion
ANR Schematic & the PSC POR Model: Illustrations
30m
18n
R1
L1
5
V1
20m
6n
4m
385p
R3
L3
R4
L4
6m
R2
Trace
3.3n
L2
22u
C2
ANR Component
Q1
Reservoir CAP
Board
Package conn.
54p
vid
Vvr
0.42m
Rmb
Lmb
6m/n_osc
Rosc
504u
C1
18p
0.22m
0.26m
3.5p
Lsock Rsock
3.5m/n_mlcc
Rmlcc
Rpkg
6.5m/n_idc
Ridc
Lpkg
3.3n/n_osc
Losc
1.2n/n_mlcc
Lmlcc
60p/n_idc
Lidc
504u*n_osc
Cosc
20u*n_mlcc
Cmlcc
385p/n_2t
2.2u*n_idc
L2t
Cidc
Reference: Intel® Prescott CPU-PKG simulation model, ComLSI ANR
October 2008
© 2008AnaSIM
6.5m/n_2t
R2t
40ps/188n
Rdie
188n
Cdie
1
G1
V2
10u*n_2t
C2t
29
Prescott Pre-ANR
Nom. VDD
October 2008
© 2008AnaSIM
30
Prescott Post-ANR
Nom. VDD
No spatial info,
V as f(x,y,t)
October 2008
© 2008AnaSIM
31
Advanced SiP Simulation
Near load systems
Active Noise Regulator*
Distributed Local (POL)
Voltage Regulators
Chip power grid noise
Spatial & Temporal
Power supply variation
in x, y and t
Data can feed into
future Dynamic Timing
Analysis?
Simulation speed allows
‘what-if’ experiments
for optimization
Reference:
ANR attached to top left corner
* Nair & Bennett, ComLSI
Power Management Designline article http://www.powermanagementdesignline.com/howto/175800373
October 2008
Use slide show © 2008AnaSIM Animation slide
of grid
32
Managing Leakage Power
Classical techniques
Dual or Multi-VT processes
High-K gate oxide
SLEEP Transistors (gate-level)
Threshold voltage modulation
Adaptive Body Bias / Dynamic Body Bias
Supply voltage modulation
Most of these
complicate the
design/tool flow
substantially…
Adaptive Voltage Scaling / Dyn. Voltage Scaling
Power Gating
Scalable, fundamental solutions?
October 2008
© 2008AnaSIM
33
Addressing Scaling AND Power
Planar device Scaling
VT, VDD reaching limits
Loss of channel control
Variations (VT, Leff…)
Leakage limiting gate
dielectric scaling
FinFET
Double Gate devices
Lower leakage
Lower parasitics (C, R)
Lower fabrication cost
Lower delays higher
performance!
Source: L. Mathew, SOI 2007
October 2008
© 2008AnaSIM
34
Planar Contender from Intel®
High-K integrated, provides
low-leakage on-die CAP
30, 20 and 15nm transistors
claimed…
Reference: Marczyk & Chau, Intel®, 2005
October 2008
© 2008AnaSIM
35
Double Gate Variants
FINFET
ITFET
MIGFET
Gate 1
Gate 2
Source: L. Mathew, SOI 2007
October 2008
© 2008AnaSIM
36
Lower Total Cost of DG Devices
Substantially reduced
Improved
Processing steps
Leakage
Parasitics/Performance
Record ION/IOFF
No area penalty /
greater area utilization
Novel circuits feasible
But challenges remain
Process, Design
Source: L. Mathew, SOI 2007
October 2008
“Very promising for low power, low cost
Handheld
applications”
© 2008AnaSIM
37
Summary
Supply voltage minimization is key to IC energy
management
Accurate, true-electromagnetic, spatio-temporal
noise analysis essential
Innovative power delivery and integrity
management solutions are also needed
CMOS Scaling has some distance to go with
innovative devices
3-D (SiP, PoP, SAPI) integration will drive a more
feasible continuation of Moore’s Law
October 2008
© 2008AnaSIM
38
Anasim Info
Anasim bringing sea change into SoC
methodology with physics-based analyses and
high levels of abstraction
Benefits to chip resource usage, area, energy,
performance, and total design effort/cost
Fills the GAP in Total Power Integrity analyses
Non-disruptive, Win-Win-Win engagement
Links, tel. [email protected] +1 480-694-5984
October 2008
Anasim White Papers pifp1.pdf, pifp2.pdf, pifp3.pdf
Product - -fp brochure
ComLSI, parent co.
© 2008AnaSIM
39
Backup
October 2008
© 2008AnaSIM
40
PI-FP Tool Environment
Simulation netlist
.TRAN 200e-12
.PLOT 20
.ACC 0.0060
.PRINTNODE ALL
Ggrid1 0.2 0.2 0.0005 0.0080 0.030
10e-9 10e-9
Igrid1 0.1 0.1 0.02 0.02 pulse.txt 1
Ttline1 1 2 0.01 10e-9 100e-12 0.3
Ngrid1 1 0.11 0.11
pulse.txt : Current Source
0
22E-12
40E-12
60E-12
80E-12
100E-12
120E-12
140E-12
160E-12
180E-12
200E-12
October 2008
© 2008AnaSIM
0
0.030901699
0.058778525
0.080901699
0.095105652
0.1
0.095105652
0.080901699
0.058778525
0.030901699
0
41
PI-FP Tool Environment contd.
Multi-Grid design
Planar or 3D
© 2008AnaSIM
Include multiple
chips in stacked
or planar design
Code efficiency
October 2008
L calculation
Each GRID on its
own core (CPU)
42
Floorplanning / Optimization
GRID wire width, spacing, pitch
DECAP optimization
Area savings
Block placement tweaks for PI
Metal resource savings, routing / timing facilitation
Noise generation, propagation
Chip-Package co-simulation
Operating voltage (Energy) tuning
Resonance detection and avoidance…
October 2008
© 2008AnaSIM
43
IO Ring impact on Core Noise
Analysis on a customer CLOCK chip
The voltage regulators, connecting between the IO Ring and the Core Grid
are seen to become significant noise injection nodes with the inclusion of
loads and the IO Ring. Pictures above are snapshots of dynamic plots.
October 2008
© 2008AnaSIM
44