簡易檢索 / 詳目顯示

研究生: 周文瑞
Chou, Wen-Jui
論文名稱: 以多核心圖形處理器加速影像處理之研究
A Study on Acceleration of Image Processing Using Multicore Graphical Processing Units
指導教授: 林政宏
Lin, Cheng-Hung
學位類別: 碩士
Master
系所名稱: 電機工程學系
Department of Electrical Engineering
論文出版年: 2016
畢業學年度: 104
語文別: 中文
論文頁數: 49
中文關鍵詞: 多核心圖形處理器影像處理全景圖轉換車牌定位車牌辨識
英文關鍵詞: multicore graphic processing units, image processing, panoramic pictures transformation, vehicle license plate localization, vehicle license plate recognition
DOI URL: https://doi.org/10.6345/NTNU202204468
論文種類: 學術論文
相關次數: 點閱:222下載:36
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文研究以多核心圖形處理器(Multicore Graphic Processing Units)加速影像處理演算法,我們以全向圖(omnidirectional pictures)轉換成全景圖(panoramic pictures)及車牌辨識(vehicle license plate recognition)系統為例,提出平行演算法並以多核心圖形處理器進行相關演算法加速。
    論文首先針對橢圓拋物曲面全向圖轉換成全景圖的演算法進行平行化研究,本論文提出了一個階層式的平行架構包含資料平行(data parallelism)與任務平行(task parallelism)兩個階層,其中資料平行階層是透過執行圖形處理器的大量執行緒平行轉換每個像素從全向圖移轉至全景圖,而任務平行階層是透過圖形處理器多串流技術(multiple stream),以管線化(pipelining)的方式平行執行多個影像的轉換。任務平行可以藉由重疊影像處理器的核心運算與資料傳輸的執行時間來改善整體的效能。實驗結果顯示相較於CPU,透過圖形處理器,我們可以得到6.33倍的改善。
    論文第二部分,我們針對車牌辨識系統進行平行化研究,一個車牌辨識系統主要包含車牌定位、車牌校正、文字切割與文字辨識等四大步驟。首先在車牌定位部分,我們透過灰階轉換、直方圖等化、二值化、輪廓萃取與剛性物體偵測之核心演算法取得車牌的位置,然後在車牌校正方面,我們使用仿射轉換中的單映性以校正歪斜的車牌。在文字分割方面,我們利用輪廓萃取及邊緣偵測將文字與車牌面積進行計算,並將車牌中的文字分割取出。最後在文字辨識部份,我們利用樣板比對法(template matching)作為文字辨識的方法,為了縮短辨識系統計算的時間,我們透過圖形處理器加速車牌文字辨識的計算速度相較於CPU,我們可以得到100倍的改善。

    關鍵字:多核心圖形處理器、影像處理、全景圖轉換、車牌定位、車牌辨識

    This thesis proposes to accelerate image processing algorithms using multicore Graphic Processing Units (GPUs). Taking the transformation of omnidirectional pictures to panoramic pictures and vehicle license plate recognition system as cases, we propose parallel approaches to accelerate relative image processing algorithms using GPUs.
    First, we study to parallelize the transformation of elliptical omnidirectional pictures to panoramic pictures. We propose a hierarchical parallelism architecture which includes data parallelism and task parallelism. The data parallelism issues large amount of threads to simultaneously map each pixel of an elliptical omnidirectional pictures to the corresponding position in a panoramic pictures. On the other hand, the task parallelism adopts multiple stream technique to pipeline the transformation of multiple images. The task parallelism improves the overall throughput by overlapping the latency of kernel computation and data transmission time. Experimental results demonstrate that the proposed algorithm achieves 6.33 times of performance improvement as compared to CPU counterpart.
    Furthermore, we study on the parallelization of vehicle license plate recognition system. A vehicle license plate recognition system composes of four stages including plate localization, plate calibration, text segmentation, and text recognition. First, in the step of plate localization, we obtain the position of a plate via the steps of gray transformation, histogram equalization, image binarization, contour extraction, and rigid object detection. Then, in the step of plate calibration, we adopt single affine transformation to calibrate skew license plates. Furthermore, in the step of text segmentation, we segment texts by extracting the edges and contours of texts and compare their area with that of a license plate. Finally, we perform text recognition using template matching algorithm. In order to reduce the elapsed time of text recognition, we propose to accelerate template matching algorithm using GPUs, compared to the CPU, we can get 100 times improvement.

    Keywords: multicore graphic processing units, image processing, panoramic pictures transformation, vehicle license plate localization, vehicle license plate recognition

    中 文 摘 要 i 英 文 摘 要 iii 致 謝 v 目 錄 vi 圖 目 錄 viii 表 目 錄 x 符號說明 xi 第一章 緒論 1 1.1 研究背景與目的 1 1.2 研究動機 1 1.3 論文架構 2 第二章 文獻探討 3 2.1 影像處理加速技術 3 2.1.1 FPGA 加速技術 3 2.1.2 異質硬體加速方法 3 2.1.3 CUDA加速技術 4 2.2 欲解決效能瓶頸之目標演算法 4 2.2.1 全向圖轉全景圖 5 2.2.2 車牌辨識系統 5 2.3 車牌辨識系統所需演算法 6 2.3.1 灰階轉換 6 2.3.2 直方圖等化 7 2.3.3 二值化 10 2.3.4 去雜訊膨脹侵蝕 13 2.3.5 輪廓找尋法 15 2.3.6 單應性(homography) 15 2.3.7 機器學習演算法 15 2.3.8 字元辨識 16 第三章 影像處理演算法系統架構 18 3.1 全向圖轉全景圖之核心演算法 18 3.2 車牌辨識系統 20 3.2.1 車牌辨識的方法與流程 20 3.2.2 取輪廓 21 3.2.3 挑選符合條件的輪廓 22 3.3 車牌校正 23 3.4 SVM分類器 25 3.4.1 訓練流程與測試流程 28 3.5 文字切割 28 3.5.1 方法流程 28 3.5.2 樣板匹配 29 第四章 圖形處理器用於加速影像處理演算法之系統架構 32 4.1 管線化 (Pipeline) 處理 32 4.2 CUDA 與全景影像轉換 33 4.3 CUDA 與車牌樣板比對 37 第五章 模擬分析與實驗 39 5.1 實驗環境 39 5.2 平行化以全向圖轉全景圖之結果 39 5.3 平行化做樣板匹配之結果 41 5.4 歪斜車牌轉正執行結果 42 5.5 行化的車牌偵測與字元辨識系統實行結果 42 第六章 結論與未來展望 45 參考文獻 46 自  傳 49

    [1] K. Steffen, E. Markus, H. Sorin A, and S. Integrierte Schaltungen, " Real time image processing based on reconfigurable hardware acceleration," in Proceedings of Workshop Heterogeneous Reconfigurable Systems on Chip, 2002.
    [2] P. Cuong , A. Zaid and B. Koen "Heterogeneous hardware accelerator architecture for streaming image processing," in Proceedings of International Conference on Advanced Technologies for Communications, pp. 374-379, 2013.
    [3] C. Edwards, "Graphical future," in Engineering & Technology, vol. 4, no. 10, pp. 36-39, 6-19 June 6 2009.
    [4] S. K. Nayar, "Catadioptric omnidirectional camera," in Computer Vision and Pattern Recognition, 1997. in Proceedings of Conference on IEEE Computer Society, 1997, pp. 482-488.
    [5] V. Nalwa, "A true omnidirectional viewer," technical report, Bell Laboratories1996.
    [6] S. Baker and S. K. Nayar, "A theory of catadioptric image formation," in Proceedings of Sixth International Conference on Computer Vision, pp. 35-42, 1998.
    [7] A. Krishnan and N. Ahuja, "Panoramic image acquisition," in Computer Vision and Pattern Recognition, 1996. Proceedings CVPR'96, 1996 IEEE Computer Society Conference on, 1996, pp. 379-384.
    [8] A. Goshtasby and W. A. Gruver, "Design of a single-lens stereo camera system," Pattern Recognition, vol. 26, pp. 923-937, 1993.
    [9] H. F. Ho and W. J. Chou, "Method to enhance panoramic photography resolution and vertical field of view," in Proceedings of the 27th Conference on Computer Vision, Graphics, and Image Processing, Pingtung, Taiwan , 2014.
    [10] Y. Yagi and S. Kawato, "Panorama scene analysis with conic projection," in Intelligent Robots and Systems' 90.'Towards a New Frontier of Applications', Proceedings. IROS'90. IEEE International Workshop on, 1990, pp. 181-187.
    [11] P.-H. Huang, "The Research and Development of Panoramic Total Internal Reflection Lens and Omnidirectional Imager System," 2011.
    [12] G. L. Mariottini, E. Alunno, and D. Prattichizzo, "The epipolar geometry toolbox (EGT) for Matlab," Citeseer2004.
    [13] C. Saravanan, "Color Image to Grayscale Image Conversion," in Proceedings of Second International Conference on Computer Engineering and Applications, Bali Island, pp. 196-199, 2010.
    [14] Johndcook.com. (2009). Converting color to grayscale. [online] Available at: http://www.johndcook.com/blog/2009/08/24/algorithms-convert-color-grayscale/ [Accessed 2 Apr. 2016].
    [15] H. Yeganeh, A. Ziaei and A. Rezaie, "A novel approach for contrast enhancement based on Histogram Equalization," Computer and Communication Engineering, 2008. ICCCE 2008. International Conference on, Kuala Lumpur, 2008, pp. 256-260.
    [16] (finalfrank), f. (2016). [OpenCV] 直方圖等化 @ Frank's 資訊科技潮流站 :: 痞客邦 PIXNET ::. [online] Frank's 資訊科技潮流站. Available at: http://finalfrank.pixnet.net/blog/post/30668256-%5Bopencv%5D-%E7%9B%B4%E6%96%B9%E5%9C%96%E7%AD%89%E5%8C%96 [Accessed 15 Jun. 2016].
    [17] (finalfrank), f. (2016). [影像處理] OTSU演算法程式 @ Frank's 資訊科技潮流站 :: 痞客邦 PIXNET ::. [online] Frank's 資訊科技潮流站. Available at: http://finalfrank.pixnet.net/blog/post/29574285-%5B%E5%BD%B1%E5%83%8F%E8%99%95%E7%90%86%5D-otsu%E6%BC%94%E7%AE%97%E6%B3%95%E7%A8%8B%E5%BC%8F [Accessed 22 Jun. 2016].
    [18] N. Otsu"A Threshold Selection Method from Gray-Level Histograms," in IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62-66, Jan. 1979.
    [19] Fahn, Chin-Shyurng, and Meng-Luen Wu. "An Autonomous Aesthetics-driven Photographing Instructor with Personality Prediction." The 2013 International Conference on Computer Graphics, Visualization, Computer Vision, and Game Technology. Atlantis Press, 2013.
    [20] S. Suzuki, K. Abe, "Topological structural analysis of digitized binary images by border following," Computer Vision, Graphics, and Image Processing, vol. 30, no. 1. pp. 32-46, 1985.
    [21] Fernando, S. (2016). Shape Detection & Tracking using Contours. [online] Opencv-srf.blogspot.tw. Available at: http://opencv-srf.blogspot.tw/2011/09/object-detection-tracking-using-contours.html [Accessed 25 Jun. 2016].
    [22] P. Viola, M. Jones, “Robust Real-Time Face Detection. International Journal of Computer Vision”, 57(2), 2004, 137-154.
    [23] J. Gao, J. Yang, “An Adaptive Algorithm for Text Detection from Natural Scenes”, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001.
    [24] V. Shapiro, G. Gluhchev, D. Dimov, “Towards a Multinational Car License Plate Recognition System”, Machine Vision and Application, 17, 2006, 173-183.
    [25] K. Nadeem AM and R. J and H. Hans A, " License plate recognition system," in Proceedings of SPIE's International Symposium on Optical Science, Engineering, and Instrumentation pp. 14-24, 1998.
    [26] E. Patrick and J. Carlos and B. Stefan and M. Donal, " A Hotelling transformation approach for rapid inversion of atmospheric spectra," Journal of Quantitative Spectroscopy and Radiative Transfer, vol. 73, no. 6, pp. 529-543, 2002.
    [27] C. Paolo, F. Paolo, G. Michael N and S. Flavio, " Optical recognition of motor vehicle license plates," in IEEE Transactions on Vehicular Technology, vol. 44, no. 4, pp. 790-799, 1995.
    [28] H. Yo-Ping, L. Shi-Yong and C. Wei-Po, "A template-based model for license plate recognition," in Proceedings of International Conference on Networking, Sensing and Control, vol. 2, pp. 737-742, 2004.
    [29] Upload.wikimedia.org. (2016). [online] Available at: https://upload.wikimedia.org/wikipedia/commons/2/2a/Svm_max_sep_hyperplane_with_margin.png [Accessed 3 Mar. 2016].
    [30] Upload.wikimedia.org. (2016). [online] Available at: https://upload.wikimedia.org/wikipedia/commons/thumb/6/67/5_Stage_Pipeline.svg/300px-5_Stage_Pipeline.svg.png [Accessed 5 Jul. 2016].

    下載圖示
    QR CODE