4G and H.264 - University of Michigan

Download Report

Transcript 4G and H.264 - University of Michigan

1
The Next Generation Challenge for
Software Defined Radio
Mark Woh1, Sangwon Seo1, Hyunseok Lee1, Yuan Lin1,
Scott Mahlke1, Trevor Mudge1, Chaitali Chakrabarti2, Krisztian Flautner3
1
2
Advanced Computer Architecture Lab, University of Michigan
Department of Electrical Engineering, Arizona State University
3
ARM, Ltd.
1
3G Wireless
2
Large Coverage
Outdoor - High Mobility
Up to 14Mbps
University of Michigan -SAMOS 2007
2
Expected Wireless Growth
3
The growth of wireless will require more bandwidth
University of Michigan -SAMOS 2007
3
4G Wireless
Macro Cells
4
Pico Cells
Isolated HotSpots – 1Gbps Coverage
Large Coverage – 100Mbps Coverage
Indoor – Very Low Mobility
Outdoor - High Mobility
What we need
 Adaptive high performance transmission system
 Great candidate for SDR
University of Michigan -SAMOS 2007
4
Next Generation Wireless – 4G
·
5
IFFT
MIMO
encoder
Channel
Encoder
TX
...
...
DEMOD
(OFDM)
...
...
MOD
(OFDM)
MIMO
decoder
Channel
Decoder
RX
Antenna
·
FFT
·
·
STBC
VBLAST
·
·
Turbo code
LDPC code
 3 Major Components to 4G



Modulation/Demodulation
Multiple-Input Multiple-Out (MIMO)
Channel Decoder/Encoders
University of Michigan -SAMOS 2007
5
Modulation - OFDM
….
6
Properties of OFDM
-High Spectral Efficiency
-Low Intersymbol Intereference
-Flat Fading Subcarriers
….
Can sustain high data rates with
multiple users
-Nfsc
-fsc
0
fsc
Nfsc
Can be implemented with IFFT/FFT
University of Michigan -SAMOS 2007
6
Major Component of Modulation – FFT/IFFT
x[0]
x[1]
complex
mult
complex
add
X[0]
complex
sub
X[1]
eiw
Very wide data level parallelism
Requires complex operations
University of Michigan -SAMOS 2007
7
7
MIMO (Multiple Input – Multiple Out)


8
Previously we used single antenna systems
Now we use multiple antennas to increase the channel
capacity
 Diversity - High Reliability
 Space Time Block Codes (STBC)
 Multiplexing – High Throughput
 Vertical-BLAST (V-BLAST)
University of Michigan -SAMOS 2007
8
Space Time Block Codes (STBC)
-x[2]*
9
x[1]*
Time
x[1]
Transmit Antennas
x[2]
Tx1
Channel
Tx2
h11
h22
h21
Rx1
Receive Antennas
Noise
Received
Signal
h12
Rx2
n21
n22
n11
n12
y11 = h11x[1] + h12x[2] + n11
y21 = h21x[1] + h22x[2] + n21
y12 = -h11x[2]* + h12x[1]* + n12
y22 = -h21x[2]* + h22x[1]* + n22
Channel
Estimation
h11
h21
Combiner
h12
h22
Channel
Estimation
~x[1]
~x[2]
~x[1] = h11*y11 + h12y12* + h21*y21 + h22y22*
~x[2] = h12*y11 - h11y12* + h22*y21 - h21y22*
University of Michigan -SAMOS 2007
9
STBC
10
Receiver Antenna 1 and 2
y21 y11
y22* y12*
h22 h21 h12 h11
Complex
Multiply
Accumulate
~x[1] ~x[2]
Conjugate
+Negation
Channel
Estimation
Requires complex operations
Low Data Movement
Highly parallelizable
University of Michigan -SAMOS 2007
10
Vertical-BLAST (V-BLAST)
11
Mod
Demod
V-Blast
Detector
S/P
Mod
Demod
M Transmitters
R Receivers
Channel
Estimation
1
2
3
4
1
2
3
4
1
2
3
4
5
6
7
8
5
6
7
8
5
6
7
8
Nulling
Vector
1
9
10
11
12
9
10
11
12
9
10
11
12
13
14
15
16
13
14
15
16
13
14
15
16
Data Stream of 4 Tx
2
3
4
Strongest
Signal
Linear Combination
of Data
Subtract
Strong Signal
Repeat
University of Michigan -SAMOS 2007
11
V-BLAST

12
Implementation Based on Square Root Method for V-BLAST
 Original requires repeated pseudo-inverse calculation for finding the
strongest signal
This algorithm has reduces complexity


Complexity
 Requires matrix operations on complex numbers
 Many Matrix Transformations
University of Michigan -SAMOS 2007
12
Channel Decoding


13
3G Technologies in 4G
 Viterbi
 Turbo Decoder
New to 4G
 LDPC
 Better performance characteristics compared to Turbo and Viterbi
University of Michigan -SAMOS 2007
13
LDPC
L Node
14
Original Value
L0
L1
L2
L3
L4
L5
L6
L7
1
1
0
1
0
1
0
0
1
H=
Message from Check Nodes Decision
E1 → 0
E0 → 0
E1 → 1
E0 → 0
E0 → 1
E1 → 0
E2 → 0
1
1
E0 → 1
E3 → 1
E1 → 0
E2 → 0
E3 → 1
E3 → 0
E2 → 1
E3 → 0
0
0
1
E2 → 1
1
0
1
1
1
0
0
1
0
0
0
0
1
0
0
1
1
1
1
0
0
1
1
0
1
0
1
0
0
1
0
1
0
1
E0
E1
E2
E3
0
L0
L1
L2
1
1
0
L 3 L4
1
0
L5
L6
L7
1
0
1
Message Sent = [ 1
0
0
1
0
1
0
1]
Message Recieved = [ 1
1
0
1
0
1
0
1]
University of Michigan -SAMOS 2007
14
LDPC
15
Min-Sum Decoding Used
Regular LDPC code
Can get benefit from Wide SIMD
 Can do the Bit Node and Check Node
Alignment of Check and Bit nodes is a problem
University of Michigan -SAMOS 2007
15
SODA PE Architecture
16
SODA
DSP
1. SIMD pipeline
Pred.
Regs
3. Local
memory
5. DMA
System Interconnect
Global
Memory
Local
SIMD
Memory
SIMD
ALU+
Mult
SIMD
Reg.
File
E
X
RF
S
T
V
DMA
ALU
Local
Scalar
Memory
AGU
RF
E
X
W
B
SIMD
Shuffle
Network
(SSN)
W
B
SIMD
to
Scalar
(VtoS)
V
T
S
2. Scalar pipeline
Scalar
RF
E
X
AGU
ALU
W
B
Scalar
ALU
W
B
4. AGU pipeline
SIMD – 32 Wide, 16-bit datapath, Predicate Execution
University of Michigan -SAMOS 2007
16
4G Workload on SODA


17
Key 4G
algorithms
100 Mbps
1 Gbps
MCycle/s
MCycle/s
FFT
2x360
4x360
IFFT
2x360
4x360
STBC
240
-
V-BLAST
-
1900
LDPC
7700
4x18500
100 Mbps 4G requires 8Ghz SODA PE
1 Gbps 4G requires 20Ghz SODA PE
University of Michigan -SAMOS 2007
17
18
4000
0.8
3500
0.7
3000
0.6
2500
0.5
2000
0.4
1500
0.3
1000
0.2
500
0.1
0
0
180nm 130nm
ITRS Scaled Frequency
Vdd (V)
Power (W)
Frequency (Mhz)
SODA With Technology Scaling
90nm
65nm
45nm
32nm
22nm
Fixed Scaled Frequency
Scaled Power
180nm
130nm
90nm
65nm
45nm
32nm
22nm
1.8
1.3
1.1
1.1
1
0.9
0.8
University of Michigan -SAMOS 2007
18
SDR Challenges In 4G



19
We can’t do any of 4G with technology scaling on one core
 Would 8GHz cores even be an energy efficient solution?
What about 1Gbps?
 Are we ever going to get a 20GHz core?
Cannot rely on technology scaling to give us 4G for free
 4G SDR will require algorithmic and architectural innovations
University of Michigan -SAMOS 2007
19
4G Algorithm-Architectural Co-design

Architectural improvements (SODA II)
 Specialized functional units
 CISC-like complex arithmetic operations
 Specialized data movement hardware
 Less strain on the memory system

 Wider SIMD
 How wide can we go?
 More PEs
 What does the interconnect look like?
Algorithmic optimization through parallelization
 Reduce intra-kernel communication
 Reduce memory accesses
 Arithmetic is much cheaper than data movement
University of Michigan -SAMOS 2007
20
20
Thanks

21
Questions?
University of Michigan -SAMOS 2007
21
Successive Cancelling for V-BLAST
22
V-BLAST successive interference cancelling (SIC)
The ith ZF-nulling vector wi is defined as the unique
minimum-norm vector satisfying
 Orthogonal to the subspace spanned by the contributions to yi due
to the symbols not yet estimated and cancelled and is given by the
~
ith row of H
University of Michigan -SAMOS 2007
22
Alamouti Scheme
23
Assumption: the channel remains unchanged over two consecutive symbols
 Rate = 1
 Diversity order = 2
Simple decoding
University of Michigan -SAMOS 2007
23
Advantages of Software Defined Radio
24
Multi-mode operations
Lower costs




Faster time to market
Prototyping and bug fixes
Chip volumes
UWB
innovations
 Cognitive radio
University of Michigan -SAMOS 2007
802.16a
SDR
Longevity of platforms
Enables future wireless communication
EDGE
Bluetooth
802.16a
802.11b
WCDMA
24
802.11n