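Graduate student: | 白恒瑞 |
---|---|
Thesis title: | 投影片換頁特效分類之研究:以教學影片為例 Detection and Classification of Special Effects within Lecture Slides |
Advisor: | 李忠謀 Lee, Chung-Mou |
Degree: | 碩士 Master |
Department: | 資訊教育研究所 Graduate Institute of Information and Computer Education |
Year of publication: | 2008 |
Academic year of graduation: | 96 |
Language: | Chinese |
Number of pages: | 41 |
Chinese keywords: | 換頁特效 (slide transition effects), 特效分類 (effect classification) |
Thesis type: | Academic thesis |
Usage counts: | Views: 95, Downloads: 9 |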
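Recording tools are now widely available, so anyone can record video. Online learning systems can publish the videos recorded during a teacher's lectures so that students can study or review them at any time. A video of a full class session is long, however, and a student reviewing the material may be unclear about only a few segments. To avoid the time wasted downloading a complete video just to watch a specific segment, video segmentation becomes considerably more important. To make lectures more lively, however, teachers add dynamic effects to their slides beyond static text and images, such as animated text, animated images, and animated slide transitions. This study therefore proposes a method that detects the presence of animated slide transition effects and classifies them correctly.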
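This study consists of two main stages. The first stage extracts the frames that contain changes. It uses the relatively simple and fast pixel difference algorithm, with grayscale and binarized information complementing each other, to detect the segments that contain dynamic changes within a large number of consecutive frames. The second stage detects slide transition effects, which this study defines as three types: the scan-line change type (SCT), the distributed change type (DCT), and the movement change type (MCT). For SCT, the horizontal and vertical projection profiles are computed first. Each frame in the sequence is then compared with the first and last frames using the DPAO algorithm to obtain the minimum distance, and the correlation between the distances to the first frame and the distances to the last frame is calculated. A sequence that shows a sufficient degree of negative correlation is classified as SCT. For DCT, each frame's grayscale image is subtracted from those of the first and last frames, the number of changed pixels is counted, and the same correlation calculation decides whether the sequence is DCT. A sequence that fits neither of these two types is classified as MCT. Finally, block matching is applied to the sequences labeled SCT and DCT to classify them again, in order to achieve higher classification accuracy.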
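As a rough illustration of the changed-pixel correlation test described for DCT, the following Python sketch counts the pixels in which each frame differs from the first and last frames and then computes the Pearson correlation between the two series. It is not the thesis implementation: the function names, the grayscale difference threshold of 30, the correlation threshold of -0.8, and the synthetic test sequence are all assumptions made for illustration. The DPAO distance used for SCT and the final block matching step are not reproduced here.

```python
import numpy as np


def changed_pixel_count(frame, reference, diff_threshold=30):
    """Count pixels whose grayscale value differs from `reference`
    by more than `diff_threshold` (an assumed value)."""
    diff = np.abs(frame.astype(np.int16) - reference.astype(np.int16))
    return int(np.count_nonzero(diff > diff_threshold))


def changing_frame_indices(frames, diff_threshold=30, min_changed=1):
    """Stage one (sketch): indices i where frame i differs from frame i - 1."""
    return [
        i
        for i in range(1, len(frames))
        if changed_pixel_count(frames[i], frames[i - 1], diff_threshold) >= min_changed
    ]


def first_last_correlation(frames, diff_threshold=30):
    """Stage two (sketch): Pearson correlation between each frame's
    changed-pixel count against the first frame and against the last frame."""
    first, last = frames[0], frames[-1]
    to_first = np.array(
        [changed_pixel_count(f, first, diff_threshold) for f in frames], dtype=float
    )
    to_last = np.array(
        [changed_pixel_count(f, last, diff_threshold) for f in frames], dtype=float
    )
    # A constant series has no defined correlation; treat it as "no trend".
    if to_first.std() == 0 or to_last.std() == 0:
        return 0.0
    return float(np.corrcoef(to_first, to_last)[0, 1])


def is_negatively_correlated(frames, corr_threshold=-0.8, diff_threshold=30):
    """True when the two difference series are strongly negatively correlated.
    The threshold of -0.8 is an assumption, not a value from the thesis."""
    return first_last_correlation(frames, diff_threshold) <= corr_threshold


def synthetic_scatter(size=20, steps=10, seed=0):
    """Hypothetical test data: a black slide turns white as randomly
    scattered pixels flip in equal batches, one batch per frame."""
    order = np.random.default_rng(seed).permutation(size * size)
    per_step = (size * size) // steps
    frames = []
    for k in range(steps + 1):
        flat = np.zeros(size * size, dtype=np.uint8)
        flat[order[: k * per_step]] = 255
        frames.append(flat.reshape(size, size))
    return frames
```

In this synthetic sequence the distance to the first frame rises while the distance to the last frame falls, so the correlation is strongly negative. Note that this test alone does not separate SCT from DCT, since a scan-line wipe would produce the same trend; the abstract relies on projection profiles and block matching for that distinction.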
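The experiments use videos in avi format. The method automatically detects the starting point of each slide transition effect and classifies the effect. In the experimental results, detection accuracy exceeds 96% and classification accuracy reaches 91%.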