研究生: |
廖婉雅 Wan-Ya Liao |
---|---|
論文名稱: |
使用馬可夫鏈蒙地卡羅方法之多方位行人偵測 Multi-View Pedestrian Detection Using Markov Chain Monte Carlo Approach |
指導教授: |
陳世旺
Chen, Sei-Wang |
學位類別: |
碩士 Master |
系所名稱: |
資訊工程學系 Department of Computer Science and Information Engineering |
論文出版年: | 2010 |
畢業學年度: | 98 |
語文別: | 中文 |
論文頁數: | 84 |
中文關鍵詞: | 多方位 、行人偵測 、馬可夫鏈蒙地卡羅 、單位球體 、3D行人模型 、遮蔽 |
英文關鍵詞: | Multi-View, pedestrian detection, Markov Chain Monte Carlo, viewsphere, 3D human model, occlusion |
論文種類: | 學術論文 |
相關次數: | 點閱:150 下載:38 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文研究多方位行人偵測的技術,攝影機可以不受架設位置與觀測角度的限制偵測行人。為了要掌握行人於影像中呈現各種不同的型態,我們提出一個多視角(multiple-view)的單位球(unit sphere)來描述行人,稱此單位球為viewsphere,它是由多個巢狀球面所組成,每一層球面均勻分佈許多視點(viewpoints)。我們將一3D行人模型置於球體中心,然後將行人模型投影至每一視點所屬的影像平面,因此可以取得各種不同觀測角度的行人外觀,稱為model views。
本研究首先建立一3D行人模型,此行人模型是由行人頭部和肩膀所組成的上半身,因為上半身的輪廓形成”Ω”形狀,為行人獨有的特徵,即使在擁擠的人群中,這個輪廓也不易消失。利用此輪廓資訊再搭配頭髮的髮色、臉部的膚色和所占區域的面積比資訊,可以於影像中找出行人的位置,即使行人發生遮蔽情況也可成功標示出行人的位置和計算行人的數目。由於行人狀態的解空間很龐大,我們利用馬可夫鏈蒙地卡羅(Markov Chain Monte Carlo)的方法,在解空間中連續取樣,計算行人於影像中的後驗機率 (posterior probability) 分佈,再根據後驗機率分佈,決定出行人最佳狀態。由於馬可夫鏈蒙地卡羅收斂速度慢,因此我們設計三種不同的取樣策略,提升建立後驗機率的效率。
實驗時,在不同場景架設不同高度和不同攝影角度的攝影機,測試本研究所提出的技術。結果證明,可適應各種角度的監視影像,且行人發生遮蔽的情況下,也能正確找出行人位置。
This paper presents a technique for multi-view pedestrian detection. The camera can be mounted anywhere with any viewing direction. We use a multiple-view unit sphere, called the viewsphere, to represent pedestrian. The viewsphere forms a 2D manifold of viewing directions (i.e., viewpoints), which are unidistributed over the spherical surface of the viewsphere. The pedestrian detection problem is formulated as maximum posterior estimation. Due to the high complexity of the solution space, we explore the solution space using a Markov Chain Monte Carlo (MCMC) sampling method. Three kinds of proposal distribution are proposed to further improve the efficiency of MCMC.
[Che 95] Y. Cheng,“Mean Shift, Mode Seeking, and Clustering,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 17, NO. 8, pp. 790-799, 1995.
[Dal 05] N. Dalal, and B. Triggs,“Histograms of Oriented Gradients for Human Detection,”IEEE Conference on Computer Vision and Pattern Recognition, Vol. 1, pp. 886-893, 2005.
[Gav 99] D. M. Gavrila, and V. Philomin,“Real-Time Object Detection for SMART Vehicles,”IEEE International Conference on Computer Vision, Vol. 1, pp. 87-94, 1999.
[Gre 01] H. Greenspan, J. Goldberger, and I. Eshet,“Mixture Model for Face Color Modeling and Segmentation,”Pattern Recognition Letters, vol. 22, pp. 1525-1536, 2001.
[Har 00] S. Haritaoglu, D. Harwood, and L. S. Davis,“W4: Real-Time Surveillance of People and Their Activities,”IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 8, pp. 809-830, 2000.
[Kim 08] Z. Kim,“Real Time Object Tracking based on Dynamic Feature Grouping with Background Subtraction,”IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[Lei 05] B. Leibe, E. Seemann, and B. Schiele,“Pedestrian Detection in Crowded Scenes,”IEEE Conference on Computer Vision and Pattern Recognition, Vol. 1, pp. 878-885, 2005.
[Lin 07] Z. Lin, L.S. Davis, D. Doermann, and D. DeMenthon,“Hierarchical Part-Template Matching for Human Detection and Segmentation,”IEEE International Conference on Computer Vision, pp. 1-8, 2007.
[Lin 09] S. Lin, J. Tang, X. Zhang, and Y. Lv,“Research on traffic moving object detection, trackingand track-generating,” IEEE International Conference on Automation and Logistics, pp. 783-788, 2009.
[Mik 04] K. Mikolajczyk, C. Schmid, and A. Zisserman,“Human Detection based on A Probabilistic Assembly of Robust Part Detectors,”Proc. European Conference on Computer Vision, pp. 69-82, 2004.
[Phu 05] S. L. Phung, A. Bouzerdoum, and D. Chai,“Skin Segmentation Using Color Pixel Classification: Analysis and Comparison,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, NO.1, pp.148-154, 2005.
[Rit 05] J. Rittscher, P. H. Tu, and N. Krahnstoever,“Simultaneous Estimation of Segmentation and Shape,” IEEE Conference on Computer Vision and Pattern Recognition, vol.2, pp. 486-493, 2005.
[Ram 07]D. Ramanan, D. A. Forsyth, A. Zisserman,“Tracking People by Learning Their Appearance, ” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, pp.65 - 81, 2007.
[See 06] E. Seemann, B. Leibe, and B. Schiele, “Multi-Aspect Detection of Articulated Objects, ”IEEE Conference on Computer Vision and Pattern Recognition, pp. 1582-1588, 2006.
[Smi 05] K. Smith, D. Gatica-Perez, and J.-M. Odobez, “Using Particles to Track Varying Numbers of Interacting People,”IEEE Conference on Computer Vision and Pattern Recognition, Vol. 1, pp. 962-969, 2005.
[Sta 99] C. Stauffer, and W.E.L. Grimson,“Adaptive background mixture models for real-time tracking,” IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 246-252, 1999.
[Tie 96] L.Tierney,“Markov Chain Concepts Related to Sampling Algorithm, ” Markov Chain Monte Carlo in Practice, pp. 59-74, 1996.
[Wu 05] B. Wu, and R. Nevatia, “Detection of Multiple, Partially Occluded Humans in a Single Image by Bayesian Combination of Edgelet Part Detectors,”IEEE International Conference on Computer Vision, Vol. 1, pp. 90-97, 2005.
[Wu 08] B. Wu, R. Nevatia, and Y. Li,“Segmentation of Multiple, Partially Occluded Objects by Grouping, Merging, Assigning Part Detection Responses,”IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2008.
[Zha 03] T. Zhao, and R. Nevatia,“Bayesian Human Segmentation in Crowded Situations,”IEEE Conference on Computer Vision and Pattern Recognition, Vol. 2, pp. II - 459-66, 2003.
[Zha 05] L. Zhao, and L. S. Davis,“Closely Coupled Object Detection and Segmentation,”IEEE International Conference on Computer Vision, Vol. 1, pp. 454-461, 2005.
[Zha 08] T. Zhao, R. Nevatia, and B. Wu,“Segmentation and Tracking of Multiple Humans in Crowded Environments,”IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 30, No. 7, pp. 1198-1211, 2008.
[dataset 1] The Caltech Faces Dataset, http://www.vision.caltech.edu/html-files/archive.html, 2010.
[dataset 2] The Georgia Tech Face Database, http://www.anefian.com/research/face_reco.htm, 2010.
[dataset 3] The CAVIAR data set, http://homepages.inf.ed.ac.uk/rbf/CAVIAR, 2010.