signal & image processing 1 thales communications france

Download Report

Transcript signal & image processing 1 thales communications france

SIGNAL & IMAGE PROCESSING
SPEECH & IMAGE PROCESSING
(TSI/LMM - Laboratoire MultiMédia)
Contacts :
Frédéric Chartier
Tél : +33 1 46 13 31 05
Gwénaël Guilmin
Tél : +33 1 46 13 28 35
Fax : +33 1 46 13 25 55
email :
[email protected]
[email protected]
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
1
SIGNAL & IMAGE PROCESSING
2
 Propose Technical strategy, research innovation and advanced studies
 Perform advanced and feasibility studies, demonstrators and SP
modules in Thales Com. products
 Maximise Efficiency/synergy within Thales Com.for SP R&D
 Maintain close links with French administration, SMEs, University
laboratories and European research actors
 Provide expertise and support for Thales Com. units in its field
 Hire and Train young engineers in SP domain
 Disseminate new technologies and best practices within Thales Com.
 Represent Thales Com. within Thales Common Efficiency Teams
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Missions
SIGNAL & IMAGE PROCESSING
Technical and Technological Challenges
Civilian
Technologies
Software Radio
High Data Rate
Radio modem
Antenna
Processing
Wireless
Telecom
Signal & Image
Processing
Evolutions
Multimedia
and Internet
SIP
framework
DSP use
generalised
THALES COMMUNICATIONS FRANCE
Electronic Warfare
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
3
SIGNAL & IMAGE PROCESSING
4
 Multimedia
: 11 engineers (4 experts)
 Radiocommunications
: 16 engineers (5 experts)
 Sensor Processing
: 26 engineers (5 experts)
 Software Development
: 16 engineers (2 experts)

2 technicians, 1 secretary, 8 thesis students
80
person
s
Active participation to CNRS SP working group
memberships in IEEE, SEE and EURASIP
6 patents and 15 publications per year on average
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Team
SIGNAL & IMAGE PROCESSING
5
Multimedia
Low and very low rate speech compression
Watermarking
JPEG 2000 & Video Codec
Modem
VLF, HF, VUHF and satellite modems
Single and multi-carrier modulations
Spread spectrum and CDMA
Source and channel coding optimisation
Spectral efficiency optimisation
Antenna
Processing
Antenna diversity & Jammer-interference rejection
High resolution direction finding
Array optimisation on perturbing platforms
Smart antennas and SDMA
Signal
Analysis
Detection, numbering (energy, cyclic, high order stats., ...)
Recognition/identification of modulation and coding schemes
Blind demodulation and equalisation
Localisation
Software radio
Digital exciters and receivers, amplifier linearisation
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Domains of expertise
SIGNAL & IMAGE PROCESSING
6
 Compression
 Low and very low bit rate compression research and development activity
 LPC : 800 and 2400 bit/s
 HSX : 1200, 2400 and 3200 bit/s
 CELP 4.8 kbit/s and TETRA (4567 bit/s)
 VLBR : 200 to 400 bit/s (combining recognition and synthesis)
 Wide Band Low Bite rate speech Coder : 3200 bit/s
 Knowledge/Implementation of higher bit rate coders, but no research activity
 Vocal Activity Detector, echo cancellation.
 Noise reduction : passive pre-processing or processing included in vocoder
 System optimisation of channel and source coding
 Best adaptation to service and system/propagation environment
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Speech processing
SIGNAL & IMAGE PROCESSING
7
MOS
5
4
3
2
G 728
G 729
WBLBR
(92)
G 723-1 (96)
ST 4591
ST 4591
(96)
(02)
(02)
VLBR
GSM
HSX
FS 1016
ST4209
(87)
ST 4198
(90)
(83)
(87)
ST 4479
LPC 10
(93)
(83)
G711
(72)
G726
(88)
1
1k
2k
THALES COMMUNICATIONS FRANCE
4k
8k
16k
32k
64k
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Speech processing
SIGNAL & IMAGE PROCESSING
8
Indicative
Quality
5
G.711
(64 kb/s)
G.721
(32 kb/s)
G.728 G.729
G.723
(16 kb/s) (8 kb/s)
(5.3 kb/s)
Minimum qual. for high cost application
4
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Speech processing
Consumer quality
3
HSX
(2,4 kb/s)
2
LPC 10
(2,4 kb/s)
1
1970
1980
THALES COMMUNICATIONS FRANCE
1990
2000
Minimum qual. For low cost application
SIGNAL & IMAGE PROCESSING
9
 Standards
 THC coders chosen for STANAG 4479 (800 bit/s) in 1994
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
THC Major achievements
 ETSI TETRA (4567 bit/s) for PMR (licence to Motorola, Nokia, Philips/Simoco,..)
 Present participation at NATO for new low bit rate coder STANAG 4591 (1200
and 2400 bit/s, associated noise reduction)
 Products
 LPC10e implementation within Spartacus, Syracuse, HF processor
 Vocoder ASIC for the PR4G (LPC 800, LPC10e 2400, ACELP 4800)
 Vocoders (SW) for the PR4G/VS4 (LPC 800, LPC10e 2400, ACELP 4800)
 HSX in Sawari, Synthesis in a consumer pager (Info-realité) and analysis in PC,
OKI (Asic), Leo (Singapore).
 Tetra coders in base-station for ISR
 G723.1 and G726 in ATM switch
THALES COMMUNICATIONS FRANCE
SIGNAL & IMAGE PROCESSING
10
Vocoder
STANAG 4479, 800 b/s
Simulation TR
PC
C25 C54x
C50
ASIC C30 C62 sharc
C40
For/C/FixC
x
x
x(*)
x
STANAG 4198, 2400 b/s LPC For/C/FixC
x
x
x(*)
x
x
Product
PR4G, PHF, Sawari,info Tel
PR4G, PHF,Spartacus, Syr. II
HSX 2400 b/s
C/FixC
x
x
HSX 1200 b/s
C/FixC
x
xS
ACELP, 4800 b/s
For/C/FixC
x
x(*)
TETRA, 4567 b/s
FixC
x
x
ITU G723-1, 6.4/5.3 kb/s
C/FixC
x
x
ATM switch
ITU G726, 16,24,32,40 kb/s
C
x
x
ATM switch
ITU G728, LD CELP 16kb/s
C/FixC
x
x
ATM switch
ITU G729, CS ACELP 8kb/s
FixC
x
GSM
C
STANAG 4591 (2400/1200 b/s)
C/FixC
THALES COMMUNICATIONS FRANCE
x
x
x
Aztec, Sawari, OKI
InfoTelecom, OKI
x
PR4G, PHF, Spartacus
Rameau, ISR
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Existing vocoders at THC
SIGNAL & IMAGE PROCESSING
11
 Sherbrooke University (Canada)
 ACELP specialists
 University of Rennes (noise reduction)
 hand-free telephone
 ENST Paris & ESIEE
 Very Low Bit Rate Speech Coding (combining recognition and
synthesis).
 Wide Band Low Bite rate speech Coder.
 Fraunhofer institute
 MPEG II layer 3, MPEG 4 audio coders
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Cooperations
SIGNAL & IMAGE PROCESSING
12
 VLBR speech Codec
 Thanks to the developed speech encoding solution, the system will
be used on Very-Low-Bit-Rate channel, lower than 400 bits/s.
 This technology could be also used to:
 speech recognition,
 speaker/language identification,
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
VLBR speech Codec
SIGNAL & IMAGE PROCESSING
13
 Very Low Bit Rate speech coding by indexing natural
speech units of variable size
 Solution based on a new concept making use of
various speech processing technologies
 Temporal Decomposition (TD) for robust segmentation of
speech
 HMM modelling for determination of speech units
 Harmonic/Stochastic modelling for speech re-synthesis by
concatenating identified speech units
 Jan Cernocky, PhD Thesis (Orsay) 1998
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
VLBR speech Codec
SIGNAL & IMAGE PROCESSING
VLBR speech Codec
VLBR Encoder
Prosody
Analysis
…
Spectral
Analysis
Codebook
synthesis units
Prosody
Encoding
Codebook
HMM models
HMM-based
Recognition
Determination of
optimal synthesis
units (DTW)
CODER
Pitch and Energy
Profiles
THALES COMMUNICATIONS FRANCE
HMM index
Index of synthesis unit
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
Input speech
signal
14
SIGNAL & IMAGE PROCESSING
15
VLBR Decoder
Pitch and Energy
Profiles
Index of synthesis unit
HMM index
Prosody
Decoding
Extraction of
synthesis units
HNM
Synthesis
Output synthesised
speech signal
THALES COMMUNICATIONS FRANCE
Codebook
synthesis units
DECODER
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
VLBR speech Codec
SIGNAL & IMAGE PROCESSING
16
 WLBR speech Codec algorithms
 Parametric Wide Band speech coder (from 50Hz to
7000Hz).
 Bit-rate: below 4 kbits (3200 bit/s & 3600 bit/s)
 Wide Band speech pre-processing
– Noise Reduction, spectral compression, temporal
speed modification
 Voice activity detection.
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
WLBR speech Codec
SIGNAL & IMAGE PROCESSING
17
 Intérêt pour applications professionnelles:
 offrir un plus produit (il n’existe pas encore de codeur de ce
type actuellement), la cible visée étant très intéressée par
ce genre d ’amélioration.
 Le débit reste compatible des réseaux HF/VUHF
 « Simple » évolution du codeur HSX (implémentation
maîtrisée, C fixe disponible)
 Intérêt pour applications civiles:
 La seule norme civile existante en WB (AMR WB) offre un
débit supérieur à 10 kbit/s. Les utilisateurs vont demander
de plus en plus une qualité WB.
 Notre offre produit: codeur propriétaire WB à très bas débit,
marché potentiel: portail web, enregistreur Numérique,
PDA, radio numérique.
THALES COMMUNICATIONS FRANCE
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
WLBR speech Codec
SIGNAL & IMAGE PROCESSING
18
 Codage large bande (0-7kHz)
 Amélioration de la qualité perçue
 Aide à la discrimination des fricatives
 Rehaussement de l’intelligibilité
 Extension pleine bande (full band)
 Modèle paramétrique sur toute la bande (AR ordre 16)
 Choix algorithmiques
 Longueur de trame : 360 éch.
 Voisement sur 0-4kHz
Ordre 16
 4 fréquences de coupure
 Bande haute non voisée
0
fc
THALES COMMUNICATIONS FRANCE
7kHz
This document is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written approval
WLBR speech Codec