Virtex-4 Overview

Download Report

Transcript Virtex-4 Overview

Reconfigurable FPGAs for
Space – Present and Future
Rick Padovani
Xilinx, Inc.
MAPLD 2005
Abstract
The capability to implement reconfigurable digital systems based on FPGA technology is a reality today.
Reconfigurability is defined in a continuum ranging from rapid design development, post-deployment hardware
modifications, through to runtime reconfiguration for processing and computing. In addition, designer are increasingly
looking to FPGA-based computing as performance improvements of traditional Von Neuman processors begin to level
off. These topics are of increasing interest to designers of Space-based systems.
Two emerging technologies, Rad Hard by Design (RHBD), and runtime Partial Reconfiguration (PR) will dramatically
increase the efficiency and reduce the cost of using reconfigurable FPGAs in Space Applications. Today’s
reconfigurable FPGAs are susceptible to Single-Event Effects (SEEs) which can corrupt the configuration memory and
affect the user’s design. Reconfigurable FPGAs can be made virtually immune to SEEs through the use of TripleModule Redundancy (TMR) and configuration memory scrubbing, although these techniques bring added PCB
complexity and reduce the number of available logic cells. Efforts are underway to introduce RHBD FPGAs that will be
immune to SEEs. FPGAs employing RHBD configuration memory will not require TMR or configuration memory
scrubbing for protection against SEEs and will offer increased reconfigurable capability for field upgrades and runtime
Partial Reconfiguration (PR).
Runtime PR offers a means for changing design modules on-the-fly, while the “base” design continues to operate
uninterrupted. This allows multiple design modules to time-share the same physical silicon resources, thereby reducing
device resource utilization, device count, and power consumption.
Partial Reconfiguration is available today, and will become increasingly important for space-based systems where PCB
footprint, mass, and power consumption are of even greater concern. This paper will review the present and future
state of commercial process technology, reconfigurable FPGA architecture, FPGAs for Space, and the benefits offered
by PR and RHBD.
Padovani
Page 2
MAPLD 2005/P245
Outline
• The current state of commercial process
technology and FPGA architecture
• Computing and the Future
• Reconfiguration use models
• Partial Reconfiguration
• FPGAs for Space-based applications
• Rad Hard by Design Development
• Conclusions
Padovani
Page 3
MAPLD 2005/P245
Moore’s Law Continues
Fueling Reprogrammable FPGA Advances
Mature
FPGA Product
Technology
180 nm
Developing
FPGA Product
Technology
150 nm
130 nm
90 nm
65 nm
Future
Process Technology
• Plan continuation of 2 year
Technology node cycle
• “Traditional Scaling” is starting to be
effected by the fundamental material
limits of the planar CMOS process
• “Equivalent Scaling” or the
assimilation of new materials,
structures and functional integration
will drive continued scaling
45 nm
32 nm
22 nm
8 nm
1999
Padovani
2001
2003
2005
2007
Page 4
2009
2011
2013
2015
MAPLD 2005/P245
2017
Architectural Evolution
Device Complexity and Performance
Reconfigurable FPGAs
Domain-optimized
System Logic
Programmable
“System in a
Package”
System
Logic
Block
Logic
Glue
Logic
• FPGA Fabric
• Block RAM
• FPGA Fabric
1985
Padovani
1992
•
•
•
•
•
• FPGA Fabric
• Block RAM
• Embedded Registers
and Multipliers
Platform
• FPGA Fabric
• Clock Management
• Block RAM
Logic
• Multi-standard
• Embedded Registers Programmable IO
and Multipliers
• Embedded
FPGA Fabric
• Clock Management
Microprocessor
Block RAM
•
Multi-standard
• Multigigabit
Embedded Registers
Programmable
IO
Transceivers
and Multipliers
•
Embedded
• Embedded DSPClock Management
Microprocessor
optimized Multiplers
Multi-standard
• Multigigabit
• Embedded Ethernet
Programmable IO
Transceivers
MACs
2000
2002
Page 5
2004
2005
MAPLD 2005/P245
Are Von Neumann processors
running out of steam?
GHz
10.0
Clock Speed
1.0
0.1
1997 1998 1999 2000 2001 2002 2003 2004 2005
0.45
(MOPS/MHz/Million Transistors)
0.40
0.30
Compute Density
of Processors
0.20
0.15
– Increased cache size
– Longer pipelines
– Trying to do more per
cycle
• This approach also
nearing its limit
0.35
0.25
• Lack of increased clock
speed is being
addressed by:
0.10
0.05
Source: UC Berkeley HERC and CPUscorecard.com
0.00
Pentium MMX
(P55C) 1997
Padovani
Celeron
(Mendocino)
1998
Pentium III EB
1999
Pentium III-S
2001
Pentium 4
(Willamette) 2001
Pentium 4
(Northwood)
2002
Page 6
MAPLD 2005/P245
What’s next for Computing Platforms?
•
•
•
•
Hyperthreading?
Clusters?
Configurable instruction sets?
Configurable coprocessors?
In general, the need for parallel execution is now
recognized as a requirement, as is the desire for
customizable instruction sets
Padovani
Page 7
MAPLD 2005/P245
Reconfigurable FPGAs to the rescue
• For at least 15 years people have seen the Von Neumann limitations
and have argued that FPGAs were the ultimate supercomputer
– Better programmability – not stuck with a fixed ALU
– Parallel processing – not just hyperthreading but limitless opportunities for
parallelism
– No wasted cost on features that you don’t need
• Some traction over the years, but very limited
–
–
–
–
Numerous chess-playing machines from Deep Thought to Hydra
Craig Venter used Xilinx chips for the Human Genome project
Other people are using Xilinx chips for Bioinformatics
Cray, SGI and others have been using FPGAs as coprocessors to offload
certain operations
– Berkeley Emulation Engine is a recent example
– Numerous companies represented in the consortium have been extolling the
virtues of FPGA computing for a long time
Padovani
Page 8
MAPLD 2005/P245
Raw Processing Performance
Characteristics and Comparisons
IO Bandwidth
d
an
y
or
w
Three axes of performance
• Computational capability
• Memory Bandwidth
• IO Bandwidth
th
id
B
em
M
250
200
Virtex 4
150
100
50
Pentium
0
Computation
Computation
(GOPS)
Memory
Bandwidth
(GB/sec)
Pentium
Padovani
Page 9
IO Bandwidth
(Gbps)
Virtex-4
MAPLD 2005/P245
Why has adoption taken so long?
• Traction has been limited by programming model
– Direct C translation to gates
• Definite progress in development and productization
• Limited customer acceptance in the supercomputer market
but picture may be changes
– Direct HDL design
• Difficult to implement current applications of supercomputing
in HDL
• Need for high connectivity lowers performance
To date, the only model in widespread use for supercomputingtype applications is HDL
Padovani
Page 10
MAPLD 2005/P245
New use models enabled with
Reconfigurable FPGAs
Spectrum of Reconfiguration
Occasionally
Field Upgrades
Periodic
Rapid Design
Frequent
Data Processing
Networking
Run-time
Signal Processing
• More efficient use of hardware
– Adaptive hardware algorithms
– Design modules that time share device resources
• Reduced device count and lower power consumption
Padovani
Page 11
MAPLD 2005/P245
FPGA Partial Reconfiguration
• Think of an FPGA as Two
Layers
User Logic Layer
– Configuration Memory Layer
– User Logic Layer
• Configuration memory controls
functions on user logic layer
• Partial Reconfiguration allows a
portion of device to be changed
while the rest is still running
• Documented in XAPP 290
Padovani
Page 12
Configuration Memory Layer
What FPGA Configuration Memory Controls
• All interconnection (wiring)
• Logic Definition (Look-up Tables or “LUTs”)
• Multiply by, divide by, etc.
• Inversion
• Feature selection
• Interface to hardwired blocks, e.g. PPC
• Pipeline on/off
• ECC enable/disable
• BRAM width
• I/O Modes
>really EVERYTHING!
MAPLD 2005/P245
Partial Reconfiguration Modules
(PRMs)
XC2VP30
• One or more PR regions
can be defined
PR Region A
PRM_A0
PRM_A1
PRM_A2
• Multiple PRMs can be
defined for each region
PR Region B
PRM_B0
PRM_B1
PRM_B2
Padovani
Page 13
MAPLD 2005/P245
Reconfigurable FPGAs for Space
Product Development Strategies – Present and Future
• Exploit inherent TID hardness of advanced commercial
processes
• SEL immunity achieved with epitaxial layer on P+ substrate,
multiple substrate taps and lower core voltage
• SEE hardness will improved with RadHard by Design techniques:
Configuration
Memory Layer
User Logic
Layer
Hardwired FPGA
Control Logic
Padovani
Present Rad Tolerant Products
Future RadHard by Design Products
• Effective immunity by utilizing
Partial Reconfiguration to “scrub”
Configuration Memory
• Immunity by RadHard Circuit Design
• Eliminates need for scrubbing and
configuration manager circuit overhead
• Xilinx TMR (XTMR) confers
effective SEU and SET immunity
• TMR significant
• Immunity by RadHard Circuit Design
• Multiple Embedded Cores, e.g., PPC,
use TMR with hardened FPGA fabric
• Dramatic increase in available resources
and ease of design
• Small cross section SEFIs
requires full device reconfiguration
• FPGA power cycle is not required
• Immunity by RadHard Circuit Design
• Eliminates SEFIs
Page 14
MAPLD 2005/P245
FPGA Radiation Tolerance
TID Trends vs Product/Technology
350nm - XQ4000XL
−
60K Rads (Si)
• 220nm - XQVR (Virtex)
−
200K Rads (Si)
• 130nm - XQR2VP
−
250K Rads (Si)
• 90nm (Preliminary)
−
Padovani
200
150
100
100K Rads (Si)
• 150nm - XQR2V (Virtex-II)
−
300
250
(per 1019.6)
•
400
350
TID Krads (Si)
TID tolerance of Military-grade
FPGAs with full production
test:
300K Rad (Si)
50
0
350
300
250
200
nm
150
100
50
Process trends*:
• Gate oxide continues to thin
• Oxide tunnel currents increase
• Gate stress voltage decreases
*See “CMOS SCALING, DESIGN PRINCIPLES and HARDENING-BYDESIGN METHODOLOGIES” by Ron Lacoe, Aerospace Corp
2003 IEEE NSREC Short Course 2003
Page 15
MAPLD 2005/P245
Applied Mitigation
TMR + Scrubbing
FPGA can manage its
own configuration scrubbing!
Scrub
Control
PROM
Virtex-II
TMR
Padovani
Page 16
• Single FPGA with TMR and
Configuration Scrubbing
– Continuous, uninterrupted
operation (except SEFI)
– Can employ readback for
error detection
– Scrub controller detects and
handles SEFIs
– Critical data processing
applications
(Communications,
Navigation)
MAPLD 2005/P245
Xilinx TMR
(XTMR)
Single-String
XTMR
Padovani
Page 17
MAPLD 2005/P245
Logic Capacity of Virtex Rad Tolerant Families
Current Virtex Families with TMR Mitigation vs. Future RHBD Families
125
Available Logic Cells (K)
Current Virtex Families
100
Future RadHard
by Design Families
75
50
25
0
Padovani
XQVR1000
XQR2V6000
XQR2VP70
SIRF 4V100
(Virtex)
(Virtex-II)
(Virtex-II pro)
(Virtex-4 RHBD)
Page 18
MAPLD 2005/P245
RadHard by Design Program
SEU Immune Reconfigurable FPGA (SIRF)
• Phase-I Test Chip
– Vehicle to determine and prove
hardening strategies for key
architectural elements
– Test Chip includes a range of
design variants for each key
element
– Radiation Testing in 1Q06
• Phase-2 Product
Implementation
Phase-1: Design Feasibility, Test Chip and Trade Study
Phase-2: SIRF Product Development and Fabrication
19
– Optimal RHBD implementation
of Virtex-4 architecture
– Embedded hard core, e.g., PPC
and MGT, hardening strategies
evaluated during Phase-1 and
current V-IIpro testing
Xilinx Proprietary Presentation
Conclusions
• CMOS scaling will continue well into the next decade fueling reconfigurable
FPGA architectural advances and system-level integration
• The computing industry is trying to increase performance with parallel
execution and reconfigurability today and this is clearly the way of the future
• Performance of FPGAs as a compute platform exceed conventional
processors in all three performance vectors; implementing an effective
programming model is the main issue the industry is working hard to solve
• Partial Reconfiguration capability is here today enabling new use models and
software support tools are imminent
• Rad Tolerant Reconfigurable FPGAs available today achieve virtual SEE
immunity by applying Partial Reconfiguration and soft TMR techniques
• Rad Hard by Design Reconfigurable FPGAs are under development and will
offer a dramatic increase in available logic cells and radiation performance
while freeing up reconfiguration resources for more efficient use of hardware,
reconfigurable processing or computing applications
Padovani
Page 20
MAPLD 2005/P245