研究生: |
李建鋒 Chian-Feng Le |
---|---|
論文名稱: |
實現H.264可變區塊移動估測單元之高效能超大型積體電路架構 An Efficient VLSI Architecture for H.264 Variable Block Size Motion Estimation |
指導教授: |
黃文吉
Hwang, Wen-Jyi |
學位類別: |
碩士 Master |
系所名稱: |
資訊工程學系 Department of Computer Science and Information Engineering |
論文出版年: | 2005 |
畢業學年度: | 93 |
語文別: | 中文 |
論文頁數: | 52 |
中文關鍵詞: | 視訊編碼 、VLSI架構 、可變區塊移動估測單元 、H.264視訊壓縮標準 |
英文關鍵詞: | Video Coding, VLSI Architecture, Variable Block Size Motion Estimation, H.264 Standard |
論文種類: | 學術論文 |
相關次數: | 點閱:402 下載:1 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文針對H.264可變區塊移動估測單元提出了一個有效率和彈性化的VLSI架構,可以針對4×4區塊大小及其整數倍數區塊大小的區塊,執行全區搜尋區塊比對演算法。本架構將會把在原始畫面中的每一個16×16大小的巨區塊(Macroblock)切割成16個沒有互相重疊4×4大小的子區塊,稱為基本子區塊(primitive subblocks),本架構中包含16個模組和一個可變區塊移動估測處理器(VBSME processor)。每一個模組中,我們利用串接一維心脈陣列(cascading 1D systolic array)來針對不同基本子區塊的執行區塊比對的動作,這樣的串接一維心脈陣列可以讓本架構有高度的計算吞吐量、高度的彈性化和百分之百的處理單元利用率,每一個基本子區塊皆會同時執行移動估測的動作,並利用這16個基本子區塊組合出41個不同大小的子區塊,在本架構中,我們利用可變區塊移動估測處理器(VBSME processor)由基本子區塊計算所得之絕對誤差總合(SAD)同時計算出所有41個子區塊的絕對誤差總合(SAD)。本論文所出的新架構和已發表的H.264可變區塊移動估測架構相比有著較低的計算延遲和高度的計算吞吐量。
This paper proposes a novel flexible VLSI architecture for the implementation of variable block size motion estimation(VBSME).The architecture is able to perform a full motion search on integral multiples of 4×4 block sizes. To use the architecture, each 16×16 marcoblock of the source frames should be partitioned into sixteen modules and one VBSME processor. Each module, realized by cascading 1D systolic arrays, is responsible for the block-matching operations of a different primitive subblock. The realization has the advantages of high throughput, high flexibility and 100﹪ processing element (PE) utilization. The motion estimation of all the primitive subblocks are performed in parallel. These primitive subblocks are used to form 41 subblocks with different sizes. We use the VBSME processor to concurrently compute the sums of absolute differences (SADs) of all the 41 subblocks from the SADs of the primitive subblocks. This new architecture has lowest latency and highest throughput over other existing VBSME architectures for the hardware implementation of H.264 encoders.
[1] I.E.G. Richardson, H.264 and MPEG-4 Video Compression, John Wiley & Sons, 2003.
[2] J.F. Shen, T.C. Wang and L.G. Chen, “A novel low-power full-search block-matching motion-estimation design for H.263+,” IEEE Trans. Circuits and Systems for Video Technology, pp.890-897, 2001.
[3] L. de Vos and M. Schobinger, “VLSI architecture for a flexible block matching processor,” IEEE Trans. Circuits and Systems for Video Technology, Vol.5, pp.417-428, 1995.
[4]Z. He, M. L. Liou, Philip.C.H. Chan, and R. Li,“An Efficient VLSI Architecture for New Three-step Search Algorithm,” Proceeding of the 38th IEEE Midwest symposium on Circuits and Systems, vol.2, pp.1228-1231, 1996.
[5]P. Pirsch, “VLSI Architectures for Video Compression-A Survey,” Proc. IEEE, Vol.83, pp.220-246, 1995.
[6] A.P. Chandrakasan and R.W. Brodersen, “Minimizing Power Consumption in Digital CMOS Circuits,” Proceedings of the IEEE, Vol. 83, pp.498-523, 1995.
[7] L. Thuyen, M. Glesner, “Configurable VLSI-architectures for both standard DCT and shape-adaptive DCT in future MPEG-4 circuit implementations,” IEEE Trans. Circuits and Systems, vol. 2, pp.461-464, 2000.
[8] J. M. Shapiro, “Embedded Image Coding Using Zerotrees of Wavelet
Coefficients,” IEEE Trans. Signal Processing, vol. 41, pp.3445-3462, 1993.
[9] A. Said and W. A. Pearlman, “A New, Fast and Efficient Image Coder,Based on Set Partitioning in Hierarchical Trees,” IEEE Trans.
Circuits and Systems for Video Technology, vol. 6, pp. 243-250,
1996.
[10] K. R. Rao and J. J. Hwang, Techniques and Standards for Image,
Video and Audio Coding, Prentice Hall, 1996.
[11]M. J. Chen, L. G. Chen and T. D. Chiuech, “One-Dimensinal Full Search Motion Estimation Algorithm for Video Coding,” IEEE Trans. Circuits and Systems for Video Technology, pp.504-509, 1994.
[12] H. M. Jong, L. G. Chen and T. D. Chiueh, “Accuracy Improvement
and Cost Reduction of 3-Step Search Block-Matching Algorithm for
Video Coding,” IEEE Trans. Circuits and Systems for Video
Technology, vol. 4, pp. 88-90, 1994.
[13] R. Li, B. Zeng and M. L. Liou, “A New Three-Step Search Algorithm for Block Motion Estimation,” IEEE Trans. Circuits and Systems for Video Technology, vol. 4, pp. 433-442, Aug. 1994.
[14] S. Zhu and K. K. Ma, “A New Diamond Search Algorithm for Fast
Block Matching Motion Estimation,” IEEE Trans. Image Processing,
vol. 9, pp. 287-290, 2000.
[15]Y. K. Lai, Y. L. Lai, Y. C. Liu, P. C. Wu and L. G. Chen, “VLSI Implementation of the Motion Estimator with Two-Dimensional Data-Reuse,” IEEE Transactions. Consumer Electronics , vol. 44, pp.623-629, 1998.
[16]S. Saponara and L. Fanucci, “Data-adaptive Motion Estimation Algorithm and VLSI Architecture Design fot Low Power Video Systems,” IEE Proceeding. Computers and digital techniques, vol. 151, pp.51-59, 2004.
[17]S. Y. Yap and J. V. McCanny, “A VLSI Architecture for Variable Block Size Video Motion Estimation,” IEEE Trans. Circuits and Systems, pp.384-389, Vol. 51,2004.