智慧型訊號處理與多媒體資訊環境 Intelligent Signal Processing and Multi-media Information Environment 主持人:李琳山
Download ReportTranscript 智慧型訊號處理與多媒體資訊環境 Intelligent Signal Processing and Multi-media Information Environment 主持人:李琳山
智慧型訊號處理與多媒體資訊環境 Intelligent Signal Processing and Multi-media Information Environment 主持人:李琳山 共同主持人:貝蘇章,吳家麟 參與研究教授:鄭士康,陳良基,陳志宏,曹建和,馮世邁 李宇旼,歐陽明,陳文進,黃肇雄 大綱 • 計劃主題 • 主要工作項目 • 重要成績舉例 • 計劃產出列表 • 下年度工作計劃 • 錄影帶及系統展示 • 總檢討及結論 卓 越 計 畫 National Taiwan University Intelligent Signal Processing • Utilizing External Knowledge – example: speech recognition using lexicon and language model • Adaptive Algorithms based on Signal Characteristics/Conditions – example: video processing considering local signal characteristics • Learning Capabilities – learning new knowledge and developing new adaptation mechanisms Signals Signal Processing Adaptive Algorithm Output External Knowledge Networks Learning Capabilities 卓 越 計 畫 National Taiwan University Information-related Activities, Applications and Services in Future Network Era Future Integrated Networks Real-time Information Private Services – personal notebook – weather, traffic – flight schedule – stock price – sports scores Knowledge Archieves – digital libraries Electronic Commerce Intelligent Working Environment – e-mail processors virtual banking – virtual museums – on-line transactions – intelligent agents – on-line investments – teleconferencing – distant learning – – business databases – home appliances – network entertainments 卓 越 計 畫 National Taiwan University Information-related Activities, Applications and Services in Future Network Era Future Integrated Networks Real-time Information Private Services Knowledge Archieves Electronic Commerce Intelligent Working Environment • Multi-media, Multi-lingual, Multi-functionalities • Cross-cultures, Cross-domains, Cross-regions • Integrating All Knowledge Systems and Information-related Activities and Services Globally • All Knowledge and Information/Services Represented in Form of Multi-media, Multi-lingual Signals • Multi-media, Multi-lingual Signals will be the Core for Future Human Knowledge and Information/Services 卓 越 計 畫 National Taiwan University Vision - Intelligent Multi-media Information Environment Future Integrate d Networks Information-related Activities and Services: – Knowledge Archieves – Real-time Information – Private Services .. . Application Tasks for Intelligent Signal Processing: – Voice Conversational Interfaces – Video/Audio Compression and Manipulation – Graphics/Virtual Reality – Multi-media Information Retrieval .. . Terminal Equipments: – Personal Computers – Telephone Sets – PDA’s – Handsets – Vehicular Electronics – Home Appliances Users .. . Basic Signal Processing Technologies : – Intelligent Signal Processing – Speech Signal Processing – Video/Audio Signal Processing – Biomedical Signal Processing – Multi-media Signal Processing .. . 卓 越 計 畫 National Taiwan University Vision - Intelligent Multi-media Information Environment Future Integrate d Networks Application Tasks for Intelligent Signal Processing: Informationrelated Activities and Services: . Terminal Equipments: . .. Users .. . Basic Signal Processing Technologies : .. . .. • Information Environment – terminal equipments, computers, software, networks, knowledge/information/services • Content Engineering – Processing of Network Knowledge/Information/Services in Form of Multi-media Signals • User Interface in Form of Multi-media Signals • Intelligent Multi-media Signal Processing 卓 越 計 畫 National Taiwan University Intelligent Signal Processing and Multi-media Information Environment • User Interface in Form of Multi-media Signals user terminals • Content Engineering - Processing of Global Knowledge/Information satellites in Form of Multi-media Signals Networks C radio cable servers Global Information 卓 越 計 畫 National Taiwan University Integration with Other Projects • Multi-media Information Environment • Microwave and Millimeter-wave Technologies satellites user terminals Networks C radio cable • Communications and Networking Technologies servers Global Information • Intelligent Signal Processing 卓 越 計 畫 National Taiwan University Vision - Voice Access of Global Multi-media Information under Broadband Wireless Environment 3G Cellular Systems EDGE/ UWC136 AP PSTN Intelligent Agent Core Network The Internet Web Server Corporate Intranet WLAN Broadband Wireless Access ATM or IP Backbone • At Any Time, from Anywhere • As Handset Size Shrinks While Required Functionalities Grows Continuously, Voice Interface will be the Key 卓 越 計 畫 National Taiwan University 主要工作項目 (1) 智慧型高等訊號處理 (2) 中文自然語言及語音處理 (3) 視訊及音訊處理 (4) 生理訊號處理 (5) 多媒體訊號處理 (6) 處理器及晶片設計 (7) 系統整合 卓 越 計 畫 National Taiwan University 89年度所完成之研究項目 一、智慧型高等訊號處理 • 著重深入的基礎研究,建立學理基礎 –二度空間離散傅立葉分數轉換之研究 –投影式離散傅立葉分數轉換之研究 –離散希爾伯特轉換設計: Analytic Design Type and Closed-form Maximally Flat Type –高速無線通訊之訊號處理研究: 調變解調模擬、無線通道模型建立、無線通道模擬 –最小雜訊之小波原理研究: 梯形小波原理、最小雜訊之架構、非正交轉換 –小波多載波原理之研究: 小波與多載波系統之關係、小波多載波系統之最佳化、最佳小波 多載波系統之效益評估 卓 越 計 畫 National Taiwan University Fast Optimization of FIR Decision Feedback Equalizer • Derived a New Algorithm for Optimizing FIR Decision Feedback Equalizer Giving Low Complexity and Better Performance 1x10 0 Number of Clock Cycles 3.5E+04 3.0E+04 2.5E+04 2.0E+04 Direct Computation New Approach 1x10 -1 Previous Approach 1x10 -2 1.5E+04 1.0E+04 Al-Dhahir-Cioffi Previous Approach 1x10 -3 5.0E+03 Fast DFE New Approach Optimization 0.0E+00 1 2 3 4 5 6 7 8 9 10 11 12 Number of Channel Taps 1x10 -4 0 5 10 15 Eb /N0 (dB) 20 25 30 卓 越 計 畫 National Taiwan University 89年度所完成之研究項目 二、中文自然語言及語音處理 • 運用語言知識、考慮中文結構 – PAT-tree為基礎之語言模型技術研究 –改良型搜尋及語言解碼技術研究 –口語對話技術之學理分析 卓 越 計 畫 National Taiwan University Syllable-based One-pass Search • Finding the Optimal Sentence from an Unknown Utterance Using 3 Knowledge Sources:Acoustic Models, Lexicon and Language Model • Based on a Lattice of Syllable Candidates t Acoustic Models Syllable Lattice Word Graph w1 w2 P(w1)P(w2 |w1)...... P(w1)P(w2 |w1)...... w1 Lexicon w2 Language Models 卓 越 計 畫 National Taiwan University Statistical Formulation and Analysis for Spoken Dialogue Systems • A Spoken Dialogue System Formulated as a Process of Transmitting a set of Semantic Slots from the User to the System • Finite State Machine Representation unknown (u,x ) known verified (k,c ) (v,c ) (k,e ) (v,e ) u/k/v: unknown, known, or verified c/e/x: correct, error, or don’t care • Channel Model for Slot Transmission – slot lost rate Rl : desired slots lost – slot misunderstanding rate Rm : wrong slots received causing misunderstanding User slot i slot lost Random slot i Tests Random Rl , Rm Select j no misunderstanding slot j (error) System 卓 越 計 畫 National Taiwan University 89年度所完成之研究項目 三、視訊及音訊處理 • 使所有的知識及資訊有聲有色、多采多姿 –轉換壓縮域視訊處理分析研究(MPEG Compressed Video Analysis): 換景自動偵測(Abrupt Scene Change detection and Gradual Scene Changes in dissolve sequence)、閃光燈特效偵測(Flash light detection)、字幕偵測(Caption detection) –彩色影像量化研究: Dependent Color Scalar Quantitzation、Quaternion moment preserving threshold technique、Self-organization Kohnuen Map Neural Network clustering – 音訊之浮水印(Audio Watermarking)技術 –音樂訊號音色辨識基礎研究: 聲音特徵值研究及萃取、類神經網路研究及程式撰寫、訓練辨識 樣本、系統測試及參數調整 –傳送樂音代碼之網路音樂會實作: 樂音產生器、網路連結、網路對話、系統整合及系統測試 卓 越 計 畫 National Taiwan University Flashlight Detection • Flashlight – homogeneous white and bright • Low variance – unusual intra-coded MB in B frame – unusual consecutive two abrupt scene changes 卓 越 計 畫 National Taiwan University 網路音樂會雛型 • 「網路音樂會」架構 – 合成音效 – 虛擬電子琴 – CSocket網路通訊(Server-Client) – 網路與音樂控制的結合 卓 越 計 畫 National Taiwan University 89年度所完成之研究項目 四、生理訊號處理 • 讓使用者接觸資訊世界更為方便自然 –眼動滑鼠之眼球運動追蹤系統建立 –眼動滑鼠之硬體系統 卓 越 計 畫 National Taiwan University Optical Mouse by Eye Movements • A New Human-computer Interface Controlled by Eye Movements • Position of Pupil can be Detected and Used for Cursor Control with an Infrared Optical Sensor Array 卓 越 計 畫 National Taiwan University Cursor Control 視動滑鼠不反應區之電腦畫面︰中間灰色框框為不反 應區,眼球可在中間注視休息而不影響電腦游標之位 置。正常操作時,此框框會消失隱藏於於幕後,以免 妨礙使用者選項。 卓 越 計 畫 National Taiwan University 89年度所完成之研究項目 五、多媒體訊號處理 • 不同訊號間之整合互動 – 即時環場實境及浮水印 – 可變速率及抗拒錯誤之視訊壓縮 – 臉部表情合成技術之開發 – 跨平台使用者共享瀏覽器系統規劃 – 建構半球型虛擬實境顯示裝置(Spherical display equipment) 卓 越 計 畫 National Taiwan University Scalable Video Codec • Scalable – decoding visual information at varying rates from a single compressed bitstream. Football: 1.5M bps, 16 fps S=1 (88x72) S=3 (352x288) S=0 (44x36) S=2 (176x144) 卓 越 計 畫 National Taiwan University Error – Resilience Video Without/With Error-Resilience Techniques 卓 越 計 畫 National Taiwan University VideoVR • Constructing Panoramic Images Automatically in Realtime from Capturing a Scene over 360 Degrees 卓 越 計 畫 National Taiwan University VideoVR with Watermark • Protecting the Intellectual Properties of Images and Associated Software • Resist Various Attacks: Compression, Pixel-shifting, Cropping, etc. 卓 越 計 畫 National Taiwan University 89年度所完成之研究項目 六、處理器及晶片設計 • 使訊號處理走出電腦,跨出視窗,進入手機等各種終端設 備,融入日常生活 –數位訊號處理器核心指令集之開發設計 – C語言模擬分析 –智慧型視訊處理系統應用模擬程式開發 卓 越 計 畫 National Taiwan University DSP Core Instruction Set Design • Computational – – – – • • • • ALU MAC SFT CMP Data Movement Program Flow Special Instructions Total 92 Instructions • CMAC/CMUL, SQS – Complex MAC/MUL – I*I+Q*Q • ACS – Dual Add-Compare-Select operation • TRCBK – Traceback • FIR2 – Two FIR operations in one cycle 卓 越 計 畫 National Taiwan University DSP Simulator Interface • • • • • • Step/Free Running Run # clocks Run to PC=# Monitor Register Monitor Memory Interrupt Generation 卓 越 計 畫 National Taiwan University An Example Layout/Specification for A 16-bit DSP for 3G Wireless Technology TSMC 0.35um CMOS SPQM 5.6 x 5.6 mm2 480K 208 CQFP 60 MHz One 1K x 28-bit (Program) On-Chip Mamory Two 2K x 16-bit (Data) Power Consumption 165mW @ 3.3V, 60MHz Full-Custom Design for DataPath Design Methodology Cell-Based Design for Others Die Size Core Transistor Count Package Max Clock Rate 卓 越 計 畫 National Taiwan University Block diagram for the Example Data Memory 0 Program Memory AG 0 Am0 Am1 Am2 Am3 Am4 Am5 Am6 Am7 Data Memory 1 Program Sequencer / Decoder AG 1 Bm0 Bm1 Bm2 Bm3 Bm4 Bm5 Bm6 Bm7 Data Path ALU CMP D0 MAC SFT D1 卓 越 計 畫 National Taiwan University 89年度所完成之研究項目 七、系統整合 •系統應用層次的整合 –第一階段整合實驗室之初步建設 卓 越 計 畫 National Taiwan University Integration Plan • A Good Environment for Integration – good results of different areas can be naturally integrated when available • An Intelligent, Multi-media, Paperless Information Environment – virtual electronic classroom, multi-media teleconferencing room, intelligent office, intelligent lab or any other working environment – conference room setting as an example • Network Architecture Campus Backbone Intranet 卓 越 計 畫 National Taiwan University 各工作項目之整合性及關連性 • (2) (3) (4)為三種不同性質形態的訊號處理,共同圍繞著(1)構成 核心技術 • (6)提供所需關鍵性零組件 • (5) (7)為整合性技術 核心技術 (2)中文自然語言 及語音處理 (4) 生理 訊號 處理 (3) 視訊 (1)智慧型高等 及 訊號處理 音訊 處理 整合性技術 (5)多媒體訊號處理 (7)系統整合 (6)處理器及晶片設計 關鍵性零組件 卓 越 計 畫 National Taiwan University 目標 • 訊號處理與網路通訊結合 • 訊號處理與知識及資訊處理結合 • 整合諸多訊號處理技術 • 前瞻性的開闊視野,集中明確的方向 • 整合台大在訊號處理各相關領域的基礎向前邁進 卓 越 計 畫 National Taiwan University Vision - Voice Access of Global Multi-media Information under Broadband Wireless Environment 3G Cellular Systems EDGE/ DWC136 AP PSTN Intelligent Agent Core Network The Internet Web Server Corporate Intranet WLAN Broadband Wireless Access ATM or IP Backbone • At Any Time, from Anywhere • As Handset Size Shrinks While Required Functionalities Grows Continuously, Voice Interface will be the Key 卓 越 計 畫 National Taiwan University Vision - Intelligent Multi-media Information Environment Future Integrated Networks Information-related Activities and Services: – – – – Application Tasks for Intelligent Signal Processing: – Voice Command and Dictation – Voice Conversational Interfaces – Video/Audio Compression and Manipulation – Graphics/Virtual Reality – Multi-media Information Retrieval Knowledge Archieves Real-time Information Electronic Commerce Private Services .. . .. . Terminal Equipments: – – – – – – Personal Computers Telephone Sets PDA’s Handsets Vehicular Electronics Home Appliances Users .. . Basic Signal Processing Technologies : – – – – – Intelligent Signal Processing Speech Signal Processing Video/Audio Signal Processing Biomedical Signal Processing Multi-media Signal Processing • Information Environment .. . – terminal equipments, computers, software, networks, knowledge/information/services • Representation/Extraction of Network Knowledge/Information/Services in Form of Multi-media Signals • User Interface in Form of Multi-media Signals 卓 越 計 • Intelligent Multi-media Signal Processing 畫 National Taiwan University Integration with Other Projects • Microwave and Millimeter-wave Technologies • Multi-media Information Environment satellites user terminals Networks C radio cable • Communications and Networking Technologies Global Information servers • Intelligent Signal Processing 卓 越 計 畫 National Taiwan University Demo’s and Video • Intelligent Signal Processing and Multi-media Information Environment – Content Engineering production, retrieval, presentation, protection – User Interface 3-dim mouse, optical mouse, speech interface – Initial Integration An intelligent, multi-media, paperless information environment 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 一、國際期刊論文已發表及接受24篇,已投稿審查中至少4篇 1. “MINLAB: Minimum noise structure for ladder-based biorthogonal filter banks”, IEEE Trans. Signal Processing, pp.465-77, Feb. 2000. 2. “Prediction based lower triangular transform, ”IEEE Trans. Signal Processing, pp. 1947-56, July 2000. 3. “Prefect discrete multitone modulation with optimal transceivers, ” IEEE Trans. Signal Processing, pp. 1702-12, June 2000. 4. C. C. Tseng, S. C. Pei, and S. C. Hsia,“Computation of fractional derivatives using Fourier transform and digital FIR differentiator,” Signal Processing, Vol.80, No.1, pp.151-159 Jan. 2000. 5. S. C. Pei, and J. J. Ding,“Closed form discrete fractional and affine Fourier transform,” IEEE Trans. on Signal Processing, Vol.48, No.5, pp.1338-1353, May 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 6. S. C. Pei, and P.H. Wang,“ Closed-form design and efficient implementation of generalized maximally flat half-band FIR filters,” IEEE Signal Processing Letters, Vol.7, No.6, pp.149-151, June 2000. 7. S. C. Pei, B.R. Chiou and P.H. Wang,“Programmable fractional sample delay filters with flatness compromise between magnitude reponse and group delay, ” IEEE Trans. On Circuits and Systems, Part II : Analog and Digital Signal Processing, Vol.47, No.8, pp.783-787, Aug 2000 8. S. C. Pei and PH. Wang,“Design of arbitrary cut-off 2-D diamond-shaped FIR filters using the Bernstein Polynomial, ”IEEE Signal Processing Letters, Vol.7, No.11, pp.310-313, Nov. 2000. 9. S. C. Pei and M.H.Yeh,“Discrete fractional Hilbert transform,”IEEE Trans. On Circuits and Sytems, Part II : Analog and Digital Signal Processing, Vol.47, No.11, Nov. 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 10. S. C. Pei and J. J. Ding,“Simplified fractional Fourior transform,”to appear in J. Opt. Soc. Am. A, Dec, 2000. 11. S. C. Pei and J. J. Ding,“The integer transforms analogous to discrete trigometric transforms,”IEEE Trans. on Signal Processing, Vol.48, No.12, Dec 2000. 12. Lin-shan Lee, Yumin Lee, “Voice Access of Global Information for Broadband Wireless: Technologies of Today and Challenges of Tomorrow” (invited paper), to appear on Proceedings of the IEEE, Feb. 2001. 13. Jeih-weih Hung, Jia-lin Shen and Lin-shan Lee, “New Approaches for Domain Transformation and Parameter Combination for Improved Accuracy in Parallel Model Combination (PMC) Techniques,” paper accepted by IEEE Transactions on Speech and Audio Processing. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 14. Bor-shen Lin, Lin-shan Lee, “Computer-aided Analysis and Design for Spoken Dialogue Systems Based on Quantitative Simulations,” paper accepted by IEEE Transactions on Speech and Audio Processing. 15. S. C. Pei and Y. Z. Chou,“ Efficient MPEG compressed video analysis using macroblock type information,” IEEE Trans. on Multimedia, Vol.1, No.4, pp.321-331, Dec. 1999. 16. S. C. Pei, C. M. Cheng and L.F. Ho,“Limited color display for compressed image and video,”IEEE Trans. on Circuits and Systems for Video Technology, Vol.10, No.6, pp.913-922, Sept. 2000. 17. Jiann-Rong Wu and Ming Ouhyoung,“On Latency Compensation and Its Effects for Head Motion Trajectories in Virtual Environments,”pp. 79-90, Vol. 16, No. 2, The Visual Computer, 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 18. Ding-Yun Chen, Ming Ouhyoung, and Ja-Ling Wu,“A Shift-Resisting Public Watermark System for Protecting Image Processing Software,” IEEE Trans. on Comsumer Electronics, Vol. 46, No. 3, pp. 404-414, August 2000. 19. Chien-Feng Huang, I-Chen Lin, Ming Ouhyoung,“High Resolution Calibration of Motion Capture Data for Realistic Facial animation,”Vol. 11, No. 9, pp.1141-1150, Journal of Software, ISSN 1000-9825, September 2000, China Computer Federation. 20. Yuh-Ming Huang and Ja-Ling Wu, “Polynomial Transform Based Algorithms for Computing 2-D Generalized DFT, Generalized DHT and skew circular convolution ”to appear in Signal Processing. 21. Yuh-Ming Huang, Ja-Ling Wu, and Chi-Lun Chang, “A Generalized Output Pruning Algorithm for Matrix-Vector Multiplication and Its Application to Compute Discrete Cosine Transform,” vol.48, No.2, pp.561-563, IEEE Trans.on Signal Processing Feb.2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 22. H. C. Chang, J. Y. Jiu, L. L. Chen, and, L. G. Chen, “Design and Implementation of Low Power DCT Chip for Portable Multimedia Terminal”, Journal of VLSI Signal Processing, Vol. 26, pp. 319-332, November 2000 23. R. X. Chen, L. G. Chen, and L. Chen, “System Design Consideration for Digital Wheelchair Controller”, IEEE Trans. on Industrial Electronics, Vol. 47, No. 4, pp. 898-907, August 2000. 24. T. H. Tsai and L. G. Chen, “A Novel Architecture of Inverse Quantization and Multichannel Processing for MPEG-2 Audio Decoding”, IEEE Trans. on Circuits and Systems II: Analog and Digital Signal Processing, Vol. 47, No. 1, pp. 75-78, January 2000. 25. “Minimum redundancy for ISI free FIR DMT transceiver,” submitted to IEEE Trans. Signal Processing. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 26. “Discrete multitone modulation with principle component filter banks,” submitted to IEEE Trans. Information Theory. 27. “Optimality of orthogonal DMT Transceivers for distorted channels with colored noise”, paper submitted to IEEE Trans. Signal Processing. 28. “ISI free FIR filterbank Transceivers for frequency selective channels”, paper submitted to IEEE Trans. Signal Processing. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 二、國際會議論文已發表及接受44篇 1. “Optimality of principle component filter banks for discrete multitone communication systems,” in Proc. IEEE Int. Symp. Circ. Syst., Geneva, 2000. 2. “Minimal factorization of lapped unimodular transforms,” in Proc. IEEE Int. Conf. Acoust. Speech, Signal Processing, Turkey, 2000. 3. “On the Duality of Optimal DMT Systems and Biorthogonal Subband Coders,” in Proc. IEEE Int. Conf. Acoust. Speech, Signal Processing, Turkey, June 2000. 4. “Design of Causal Stable IIR Filter Bank with Powers-of-two Coefficients,” in Proc. Eusipco, Finland, sep. 2000. 5. “Design of FIR filter bank transceivers with effective band separation,” in Proc. Eusipco, Finland, Sep. 2000. 6. “Minimum redundancy ISI free FIR filter bank transceiver,” in SPIE, San Diego, CA, July 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 7. Yumin Lee, Vip Desai, “Fast Optimization of FIR DFE for Wireless Data Communications,” GLOBECOM 2000. 8. S. C. Pei and J. J. Ding,”Integer discrete Fourier transform and its extension to integer trigometric transforms,”Proc. of IEEE Int'l Symp. on Circuits and Systems, Geneva, Switzerland, May 2000. 9. S. C. Pei and P. H. Wang,”Closed-form design of maximally flat R-regular Mth-band FIR filter,” Proc. of IEEE Int'l Symp. on Circuits and Systems, Geneva, Switzerland, May 2000. 10. C. C. Tseng and S. C. Pei,”Discrete-time Hilbert transformer,”Proc. of IEEE Int'l Symp. on Circuits and Systems, Geneva, Switzerland, May 2000. 11. S. C. Pei and J. J. Ding,”Eigenfunctions of the canonical transform and the self-imaging problems in optical system,”Proc. of IEEE Int'l Conf. on Acoustics, Speech and Signal Processing, Istanbul, Turkey, June 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 12. S. C. Pei and P.H. Wang,”Closed-form design of generalized maximally flat low-pass FIR filters using generating functions,”Proc. of IEEE Int'l Conf. On. 13. Lin-shan Lee, Lee-Feng Chien, “Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era”, 2nd International Conference on Language Resources and Evaluation, Athens, Greece, MayJune 2000, pp. 931-936. 14. Berlin Chen, Hsin-min Wang, Lin-shan Lee, “Retrieval of Broadcast News Speech in Mandarin Chinese Collected in Taiwan Using Syllable-level Statistical Characteristics”, IEEE International Conference on Acoustics, Speech and Signal Processing, Istanbul, Turkey, June 2000, SPP9.14, pp. III-1771-1774 15. Bor-shen Lin, Lin-shan Lee, “Fundamental Performance Analysis for Spoken Dialogue Systems Based on A Quantitative Simulation Approach”, IEEE International Conference on Acoustics, Speech and Signal Processing, Istanbul, Turkey, June 2000, SPL9.2, pp. II-1221-1224. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 16. Lin-shan Lee, Lee-Feng Chien, Yumin Lee, “Global Information Access by Chinese Spoken Language in A Wireless Era Overview with Some Recent Results”, International Symposium on Chinese Spoken Language Processing, Oct. 2000, Beijing, China. 17. Bor-shen Lin, Lin-shan Lee, “Computer-Aided Design/Analysis for Chinese Spoken Dialogue System”, International Symposium on Chinese Spoken Language Processing, Oct. 2000, Beijing, China. 18. Berlin Chen, Hsin-ming Wang, Lin-shan Lee, “Retrieval of Mandarin Broadcast News Using Spoken Queries”, International Conference on Spoken Language Processing, Oct. 2000, Beijing, China. 19. Kuan-ting Chen, Wen-wei Liau, Hsin-ming Wang, Lin-shan Lee, “Fast Speaker Adaptation Using Eigenspace-based Maximum Likelihood Linear Regression”, International Conference on Spoken Language Processing, Oct. 2000, Beijing, China. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 20. Jeih-wei Hung, Hsin-ming Wang, Lin-shan Lee, “Automatic Metric-based Speech Segmentation for Broadcast News via Principal Component Analysis”, International Conference on Spoken Language Processing, Oct. 2000, Beijing, China. 21. Hsiao-Chuan Wang, Frank Seide, Chiu-yu Tseng, Lin-Shan Lee, “MAT2000: Design, Collection, and Validation of a Mandarin 2000-Speaker Telephone Speech Database”, 6th International Conference on Spoken Language Processing, Oct. 2000, Beijing, China, Vol.IV, 460-463. 22. S. C. Pei , C. L. Tseng, and C. C. Wu,”The illuminant invariant matching of images using color histogram normalization,”Proc. of 4th Asian Conference on Computer Vision, Taipei, Taiwan, R. O. C., Jan. 2000. 23. S. C. Pei and C. M. Cheng,”Limited Color display for compressed video, ”Proc. of IEEE Int'l Symp. on Circuits and Systems, Geneva, Switzerland, May 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 24. S. C. Pei and 1. K. Tam,”Effective color interpolation in CCD color filter array using signal correlation,”Proc. of IEEE Int'l Conf. On Image Processing, Vancouver, B.C. Canada, Sept. 2000. 25. S. C. Pei and Y. Z. Chou,”Wipe detection in MPEG compressed video based on the macroblock information,”Proc. of IEEE Int'l Conf. On Image Processing, Vancouver, B.C. Canada, Sept. 2000. 26. Borching Su and Shyh-Kang Jeng, "Multi-timbre chord classification using wavelet transform and self-organized map neural networks," accepted by IEEE Int. Conf. Acoust. Speech, Signal Processing 2001 27. Deng-Rung. Liu, Meng-Jyi. Shieh, Yu-Chung. Lee and Wen-Chin. Chen, "On the Design and Implementation of an MPEG-4 Scene Editor", Proceedings of IEEE International Conference on Consumer Electronics, June 2000, pp. 120-121. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 28. I-Chen Lin, Chien-Feng Huang, Jia-Chi Wu, Ming Ouhyoung, "A Low Bitrate Web-enabled Synthetic Head with Speech-driven Facial Animation", to appear in Workshop on Computer Animation and Sumulation'2000, Switzerland, Aug. 2000. 29. Ding-Yun Chen, Chun-Hsiang Huang, Ja-Ling Wu, and Ming Ouhyoung, "A Shift-Resisting Watermark System for Panoramic Images", pp. 8-9, Proc. IEEE International Conference on Consumer Electronics, June, LA. , 2000. 30. S-W. Liu, Ja-Ling Wu, and C-H Huang, “A Fully Scalable Video Codec,” Accepted by SPIE-Electronic Imaging’2000 . 31. C-H Huang and Ja-Ling Wu, “A Blind Watermarking Algorithm with Semantic Meaningful Watermarks, ”Accepted by the Asilomar International Conference Oct.2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 32. C-T Hsu and Ja-Ling Wu, “Image Watermarking by wavelet Decomposition,” Accepted by the Allied Academies Internation Conference, Hawaii, Oct,2000. 33. Y-S Tung, D-R Liu, B-H Liou, Ja-Ling Wu, W-C Chen, O-Y Ming, and J-H Huang, “A study and Implementation of MPEG-4 Hybrid-Media Virtual Environment,” CVGIP’2000, PP.1-38~1-42, Aug.2000. 34. Y-S Tung, Ja-Ling Wu, C-C Ho, “Architecture design of an MPEG-4 system, ” IEEE Inte’l conf. On Consumer Electronics (ICCE), June,2000. 35. D-Y Chen, C-H Huang, Ja-Ling Wu, O-Y Ming, “A shift-resisting blind watermarking system for panoramic images,” Proc.ICCE’2000, June 2000. 36. C-H Huang and Ja-Ling Wu, “A Watermark Optimization Technique based on Genetic Algorithms,” Proc. SPIE-Visual Communications and Image Processing, Feb.2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 37. C-W Tsai and Ja-Ling Wu, “A Modified Symmetrical Reversible Variable Length Code And Its Theoretical Boards,” Proc. SPIE-Visual Communications and Image Processing, Feb.2000. 38. C. Y. Chen, T. C. Wang, and L. G. Chen, “A Programmable VLSI Architecture for 2D Discrete Wavelet Transform,” 2000 IEEE International Symposium on Circuits and Systems (ISCAS'2000), Geneva, Swiss, May 2000. 39. H. C. Chang, L. G. Chen, M. Y. Hsu and Y. C. Chang, “Performance Analysis and Architecture Evaluation of MPEG-4 Video Codec System,” 2000 IEEE International Symposium on Circuits and Systems (ISCAS'2000), Geneva, Swiss, May 2000. 40. H. C. Chang, Y. C. Chang and L. G. Chen, “MPEG-4 Video Bitstream Structure Analysis and Its Parsing Architecture Design,” 2000 IEEE International Symposium on Circuits and Systems (ISCAS'2000), Geneva, Swiss, May 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 41. S. Y. Chien, S. Y. Ma, and L. G. Chen, “An Efficient Video Segmentation Algorithm for Real-time MPEG-4 Camera System,” in Proceedings of Visual Communication and Image Processing (VCIP2000), 2000. 42. H. C. Chang, Y. C. Wang, M. Y. Hsu and L. G. Chen,”Efficient Algorithms and Architectures for MPEG-4 Object-based Video Coding,”2000 IEEE Workshop on Signal Processing System (SiPS 2000), Lafayette, Louisiana, October 2000. 43. S. Y. Ma, S. Y. Chien, and L. G. Chen, “An Efficient Moving Object Segmentation for MPEG-4 Encoding Systems,” 2000 IEEE International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS 2000), Honolulu, Hawaii, U.S.A., Nov. 2000. 44. Y. W. Huang, S. Y. Chien, S. Y. Ma, and L. G. Chen, “Analysis of Global Motion Effects on Video Segmentation,” 2000 Asia Pacific Conference on Multimedia Technology and Applications (APCMTA’2000), Kaohsiung, Taiwan, R.O.C., Dec. 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 三、專利12件(美國專利5件、中華民國專利6件、大陸專利1件) 1. “Array architecture with data-rings for 3-step hierarchical search block matching algorithm”, U.S. Patent No. 6,118,901, 2000.9.12~2017.10.31 2. “Novel architecture for inverse quantization and multichannel processing in MPEG-II audio decoding”, U.S. Patent 09/354,797(announcing), 2000.08.28. 3. “High-frequency CMOS dual/multi modules prescaler”, U.S. Patent 6,094,466, 2000.07.25~2017.01.10. 4. “Methods for compressing and re-constructing a color image in a computer system”, U.S. Patent 09/042,061(announcing), 2000.06.09. 5. “System and Method of Recognizing Continuous Mandarin Speech Utilizing Chinese Hidden Markov Models”, U.S. Patent No.6,067,520 , issue on May 23 2000. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 6. “一種以眼球控制電腦游標系統”,中華民國專利編號︰87114679 7. “於一電腦系統中壓縮彩色圖像及重建被壓縮彩色圖像的方法”,No. 410301(announcing), 中華民國專利, 2000.11.01 8. “CMOS主動像素感應器”, No. 117230,中華民國專利, 2000.07.01~2018.12.13 9. “利用直接式的高輸出量與高度規則的二維8乘8離散餘弦轉換/反 離散餘弦轉換之架構”,No. 118022,中華民國專利, 2000.06.21~2018.03.02 10. “高頻互補式金氧半雙模\多模前置分頻器”, No. 114359,中華民國專利, 2000.04.11~2016.09.22 11. “於MPEG–II音頻訊號解碼中合成次頻帶濾波器的方法”, No. 110857,中華民國專利, 2000.01.01~2018.07.31 12. “中文電腦之國語語音輸入系統及方法”,中國大陸專利申請案號 94,102,358.3,民國89年9月16日通知核准 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 四、畢業博士生3人 黃聖傑、陳瑞熙、李仁貴 五、畢業碩士生33人 張恆,楊志清,廖達昌,許家禎,譚耀光,何適宇,曾易聰, 林韋成,劉書維,呂立偉,洪紹華,劉尹婷,黃獻毅,方俊忠, 江雨潔,曾俊豪,林昂賢,李郁中,黃建峰,洪振盛,賴怡瑩, 高笙庭,王宏瑾,王崇憲,江振國,胡君怡,陳紀光,楊宗嵐, 許美雲,張永基,廖文偉,郭建良,徐志文 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 六、出席國際會議35人次 1. Int. Conf. Acoust. Speech, Signal Processing(ICASSP), Istanbul, Turkey, June 2000, 6人次. 2. 2000 IEEE International Symposium on Circuits and Systems (ISCAS'2000), Geneva, Switzerland, May 2000, 2人次. 3. International symposium on Chinese Spoken Language Processing, Beijing, Oct 2000, 3人次. 4. 2000 IEEE International Conference on Communications(ICC 2000)June 2000, 2人次. 5. 2000 IEEE Global Telecommunications Conference (Globecom 2000), Nov. ~ Dec., 2000, 2人次. 6. IEEE International Conference on Consumer Electronics, June, LA. , 2000, 3人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 7. Int’l Symposium on Optical Science, Engineering, and Instrumentation(SPIE), San Diego, CA, USA, July 2000, 1人次. 8. IEEE Int’l Conf. On Image Processing, Vancouver, Canada Sept. 2000, 1 人次. 9. European Signal Processing Conference(Eusipco),Finland, Sep. 2000, 1人次. 10. 4th Asian Conference on Computer Vision, Taipei, Jan. 2000, 1人次. 11. Workshop on Computer Animation and Sumulation'2000, Switzerland, Aug. 2000, 1人次. 12. The Allied Academies International Conference, Hawaii, Oct, 2000, 1人次. 13. Visual Communication and Image Processing (VCIP2000), 2000, 1人次. 14. 2000 IEEE Workshop on Signal Processing Systems (SiPS 2000), Lafayette, Louisiana, October 2000, 1人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 15. 2000 IEEE International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS 2000), Honolulu, Hawaii, U.S.A., Nov. 2000, 1人次. 16. 2000 Asia Pacific Conference on Multimedia Technology and Applications (APCMTA’2000), Kaohsiung, Taiwan, R.O.C., Dec. 2000, 1人次. 17. ICAT 2000, 3人次. 18. International symposium on Chinese Spoken Language Processing, Beijing, Oct 2000, 3人次. 19. 2nd International Conference on Language Resources and Evaluation, Athens, Greece, May-June 2000, 1人次 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 七、在國際會議擔任特殊角色17人次 1. General Co-Chair: The Tenth International Conference on Artificial Reality and Tele-existence October 25-27, 2000, National Taiwan University, Taipei, Taiwan, 1人次. 2. Member of Permanent Council, International Conference on Spoken Language Processing, Beijing, China, Oct 2000, 1人次. 3. Vice Chair, Technical Program Committee, International Conference on Spoken Language Processing, Beijing, China, Oct 2000, 1人次. 4. Member of Scientific Committee, 2nd International Conference on Language Resources and Evaluation, Athens, Greece, May-June 2000, 1人 次. 5. Member of International Steering Committee, IEEE International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), Oct 2000, 1人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 6. Member of International Steering Committee, International Symposium on Chinese Spoken Language Processing, Beijing, China, Oct 2000, 1人次. 7. General Convener, COCOSDA Workshop, Beijing, China, Oct 2000, 1人 次. 8. Panelist, International Symposium on Chinese Spoken Language Processing, Beijing, China, 1人次. 9. Member of International Advisory Board, Asian Pacific Conference on Communications(APCC), Aug 2000, 1人次. 10. Session Chair, 2000 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'2000), Istanbul, Turkey, June 2000, 1人次. 11. Session Chair, 2000 IEEE International Symposium on Circuits and Systems (ISCAS'2000), Geneva, Switzerland, May 2000, 2人次. 12. Session Chair, International Conference on Spoken Language Processing, Beijing, China, Oct 2000, 2人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 13. Session Chair, International Symposium on Chinese Spoken Language Processing, Beijing, China, 1人次. 14. Session Chair, 2nd International Conference on Language Resources and Evaluation, Athens, Greece, May-June 2000, 1人次. 15. Session Chair, 2000 IEEE Workshop on Signal Processing Systems (SiPS 2000), Lafayette, Louisiana, October 2000, 1人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 八、在國際期刊擔任特殊角色10人次 1. Associate Editor, IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, 1人次. 2. Associate Editor, IEEE Transactions on Circuits and Systems for Video Technology, 1人次. 3. Associate Editor, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 1人次. 4. Guest Editor, The Journal of VLSI Signal Processing-Systems for Signal, Image, and Video Technology, 1人次. 5. Associate Editor, Journal of Circuits, Systems, and Signal Processing, 1人次. 6. Member of International Advisory Board, National Language Processing(by Oxford University Press), 1人次. 7. Member of International Advisory Board, IEICE Transactions on Communications, 1人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 8. Member of International Advisory Board, Journal of Communications and Networks, 1人次. 9. Member of Editorial Board, Wireless Personal Communications(by Kluwer Academic Publications), 1人次. 10. Member of Editorial Board, International Journal of Pattern Recognition and Artificial Intelligence(by World Scientific Publishing Co.), 1人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 九、在國際學會擔任特殊角色14人次 1. 2. 3. 4. 5. 6. 7. 8. 9. Fellow of IEEE, 3人. IEEE Distinguished Lecturer, 1人次 Member (appointed by president), Strategic Planning Committee, IEEE Communications Society, 1人次. Chair, Multimedia Systems and Applications Technical Committee, IEEE Circuits and Systems Society, 1人次. Convener, the International Coordinating Committee of Speech Database and Assessment(COCOSDA), 1人次. Advisor, Asia Pacific Board, IEEE Communications Society, 1人次. Member, Design and Implementation of Signal Processing Systems Technical Committee, IEEE Signal Processing Society, 1人次. Member, Signal Processing and Communication Electronics Technical Committee, IEEE Communications Society, 2人次. Member, VLSI Systems and Applications Technical Committee, IEEE Circuits and Systems Society, 1人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 10. Member, Visual Signal Processing and Communications Technical Committee, IEEE Circuits and Systems Society, 1人次. 11. Vice Chair, Asia Pacific Board Chapter Coordination Committee, IEEE Communications Society, 1人次. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 十、技術移轉3項 1. 2. 3. 「可程式化之二維離散小波轉換」,傑霖科技,89.5.17. 「可程式化之二維離散小波轉換」,凌陽科技,89.5.26. 「JPEG 解碼器之硬體架構及技術」,圓剛科技,89.10.23. 卓 越 計 畫 National Taiwan University 89年度計畫產出列表 十一、整體列表 國際期刊論文(已發表及接受) 國際會議論文(已發表及接受) 美國 中華民國 專刊(已核定及公告) 大陸 合計 博士 畢業研究生 碩士 出席國際會議 在國際會議擔任特殊角色 在國際期刊擔任特殊角色 在國際學會擔任特殊角色 技術移轉產業界 24篇 44篇 5件 6件 1件 12件 3人 33人 35人次 17人次 10人次 14人次 3項 卓 越 計 畫 National Taiwan University 90年度預定進行之研究項目 工作項目 內容 1.智慧型 高等訊 號處理 預定完成年月 1)分數式傅氏轉換技術及其應用(貝蘇章) ‧離散傅立葉分數轉換的延伸及改良 Closed-Form 離散傅立葉分數轉換 快速轉換法的研發 90.06 90.12 2)希爾伯特轉換及微分器技術及其應用研究 (貝蘇章) ‧數位微分器設計 分數微分器之設計 Power Series Method Fractional Delay Design 90.06 90.12 90.12 卓 越 計 畫 National Taiwan University 90年度預定進行之研究項目 工作項目 內容 1.智慧型 高等訊 號處理 預定完成年月 3)高速無線通訊中之可適性訊號處理研究(李宇 旼) ‧高速無線通訊可適性等化器基礎研究 資料蒐集 演算法理論推導 等化器模擬 ‧高速無線通訊可適性等化器與干擾消除之配合 資料蒐集 干擾模式之建立 演算法理論推導 90.03 90.09 90.12 90.03 90.06 90.12 4)最小雜訊小波原理及訊號壓縮研究(馮世邁) ‧最小雜訊小波的影像壓縮之研究 資料蒐集 影像之特性研究 最小雜訊小波之影像壓縮 彩色影像之壓縮 90.03 90.06 90.09 90.12 卓 越 計 畫 National Taiwan University 90年度預定進行之研究項目 工作項目 內容 預定完成年月 1.智慧型 5)小波多載波技術研究(馮世邁) ‧通道特性於小波多載波之影響 高等訊 Twisted pair之通道特性 號處理 FEXT and NEXT crosstalks Effect of change of channel parameters 效益評估 2.中文自 1)聲學處理技術基礎研究(李琳山) 雜訊及電話通道處理技術 然語言 語者調適技術 及語音 處理 2)語言處理技術基礎研究(李琳山) 中文詞分群技術 90.03 90.06 90.09 90.12 90.06 90.12 90.09 卓 越 計 畫 National Taiwan University 90年度預定進行之研究項目 工作項目 內容 預定完成年月 3.視訊及 1)彩色影像量化與分割技術研究(貝蘇章) ‧彩色影像分割及其應用 音訊處 自然彩色影像的區域分割 理 文書自動化處理的背景分離 色盲彩色盤的數字抽取 90.06 90.09 90.12 2)以內容為主的影像資料庫查詢(貝蘇章) (Content-Based Image Data-Base Retrieval) ‧以彩色特徵做影像查詢 彩色座標系統及影像量化研究 彩色特徵抽取 彩色區域介定及分割 影像查詢系統實驗摸擬 90.03 90.06 90.09 90.12 卓 越 計 畫 National Taiwan University 90年度預定進行之研究項目 工作項目 內容 預定完成年月 3.視訊及 3)音樂訊號處理基礎研究(鄭士康) • 節奏辨識研究 音訊處 原始音樂訊號讀寫程式撰寫及濾波後音樂訊號節 理 奏特性分析 濾波器群組設計及實作及微分整流器設計及實作 共振濾波器群組設計及實作 系統整合、系統測試及參數調整 • 音高辨識研究 Constant-Q濾波器研討及試裝 小波轉換應用於音高辨識之研究 音高辨識類神經網路系統研究 系統整合、系統測試及參數調整 90.03 90.03 90.09 90.12 90.03 90.06 90.09 90.12 卓 越 計 畫 National Taiwan University 90年度預定進行之研究項目 工作項目 內容 預定完成年月 4.生理訊 1)眼動滑鼠系統設計研發(陳志宏) 眼動滑鼠軟體系統之完成 號處理 90.03 2)頭動滑鼠系統設計研發(陳志宏) 肢體運動、頭部運動感應器系統建立 90.09 5.多媒體 1)視訊、音訊、語言、文件、虛擬實境之編輯、 吳家麟、陳文進、黃肇雄、歐陽 訊號處 標記及互動研究( 明) 理 物品編輯展示工具之開發 虛擬實境展示工具之開發 90.03 90.12 2)跨平台使用者共享瀏覽器之研究(陳文進、黃 肇雄) 共享技術研發及瀏覽器輔助工具撰寫 90.06 卓 越 計 畫 National Taiwan University 90年度預定進行之研究項目 工作項目 內容 預定完成年月 5.多媒體 3)呈現立體場景之多媒體虛擬電子教室(歐陽明、 吳家麟) 訊號處 建構半球形之虛擬實境顯示裝置及軟體開發 理 90.03 6. 處理器 1)數位訊號處理器核心指令集設計(陳良基) 硬體描述語言(HDL)設計 及晶片 設計 2)智慧型視訊處理器之模擬分析及設計(陳良基) 90.09 (CAVE) 硬體C層次模擬分析 7. 系統整 1)整合實驗室之建立(李琳山) 第二階段整合實驗室 合 90.12 90.12 卓 越 計 畫 National Taiwan University 總檢討與結論 • 主題之重要性及創新性 – 未來網路資訊世界之前瞻性目標,整合各種訊號處理技術並 與網路通訊及資訊科技融合之遠景,及諸多創新性技術均為 不爭之事實 • 提升我國學術水準之效益及達到學術卓越及國際一流水 準之程度 – 論文發表,在國際會議、國際期刊、國際學會中之角色及參 與、居全球領先地位之研究項目等事實,均可證明本計畫的 研究在國際學界主流中有一定地位 卓 越 計 畫 National Taiwan University 總檢討與結論 • 熟習關鍵技術之人才培育 –所培養之研究生,他們的諸多研究成果(論文發表、系統實作、 專利等)、參與競賽成績等均為有關效益之證明 • 對產業發展及實際應用之效益 – 智慧型網路資訊科技居「知識經濟」之核心地位,與產業界 之互動、專利、技術移轉、實際操作之系統、人才培育等均 為有關效益之證明 卓 越 計 畫 National Taiwan University 總檢討與結論 • 主持人及參與教授勝任計畫之程度 – 由論文發表,在國際會議、期刊、學會中之角色及參與,居 全球領先地位之研究項目,專利、技術移轉,與產業界的互 動,實際操作之系統,人才培育等均為具體明確指標 • 人力、經費配置、進度控制、計畫書內容 – 相當理想 • 第一年只是開始,未來進展仍有賴各方鼎力支持 卓 越 計 畫 National Taiwan University