研究生: |
許浩禎 Hsu Hao Chen |
---|---|
論文名稱: |
使用結構性輸出之機器學習方法於電影精彩度預測 Movie Highlight Detection using Structured Output Learning |
指導教授: |
葉梅珍
Yeh, Mei-Chen |
學位類別: |
碩士 Master |
系所名稱: |
資訊工程學系 Department of Computer Science and Information Engineering |
論文出版年: | 2013 |
畢業學年度: | 101 |
語文別: | 中文 |
論文頁數: | 35 |
中文關鍵詞: | 多媒體內容分析 、時間事件偵測 、機器學習 |
英文關鍵詞: | Multimedia content analysis, Early detection, Structured Output SVM |
論文種類: | 學術論文 |
相關次數: | 點閱:173 下載:22 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
在多媒體內容分析領域的近期研究中,針對時間事件的偵測成為了廣泛探討的議題。一個可靠的時間事件偵測技術存在著許多的應用,例如個人視角的生活觀察及預測大範圍災害持續時間等。從這些應用中,我們可以發現早期事件偵測(Early event detection)也漸漸受到重視。對於人們在觀賞電影時,可以很自然地了解到接下來的劇情將要進入精彩,如果可以讓機器也能如人類感知般,了解到電影的精采度變化,將是一件有趣的工作。在本論文中,我們提出一個藉由機器學習模型 結構性輸出支持向量器 (Structured Output SVM) 的方法實現電影上的精采度偵測器,有別於傳統 SVM的輸出侷限於一個數字或一個標籤,Structured Output SVM的輸出格式可以是一個複雜的結構物件,例如是一張圖片、一個框架或是一段時間區間等等。在預測精彩片段的學習上,Structured Output SVM提供了更有彈性的輸出,使我們能夠更直接的解決問題。在本篇論文中,我們利用電影動作片進行實驗,並透過此模型所計算的信心度自動的辨識出精彩場景。
In multimedia content analysis research, early detection of temporal events from sequential data has been a popular topic. For example, there exist many potential applications from highlight review in first person perspective to video security. This thesis presents an early highlight detection system for movies. The proposed method is based on Structured Output SVM(SOSVM). Unlike conventional SVMs,SOSVM provides structured output, which is more suitable for a highlight detection task. Among many genres of movies, we conducted experiments on action movies. The experimental results show the developed system can be used to automatically identify highlight scene segments with high confident scores.
[1]Wen-Hsing Hsu,Hui-Yu Huang. Constructing a Movie Genre Classifier Based on Low Level Visual Features,MS Thesis, National Tsing Hua University, 2006.
[2]Fang-Yi Wu. Characteristic Color Usage in Different Film Categories,MS Thesis, National Chino Tung University, 2007
[3]Yisong Yue, Thorsten Joachims. Predicting diverse subsets using structural SVMs, ICML 2008: 1224-1231, 2007.
[4]Hoai, Minh & De la Torre. Max-Margin Early Event Detection, CVPR 2012.
[5]Jin-jun Wang, “Sport highlight detection from key word sequences using HMM ”,ICME, 2004
[6]Keng-Sheng Lin and Homer H.Chen, “Automatic highlights extraction for drama video using music emotion and human face features”, MMSP, 2011.
[7]A. Mehrabian, “Pleasure-arousal-dominance: A general framework for describing and measuring individual differences in temperament,”Current Psychology, col. 14, no. 4, pp. 261–292, 1996.
[8]D. Ellis, “Beat Tracking by Dynamic Programming”, J. New Music Research, Special Issue on Beat and Tempo Extraction, vol. 36 no. 1, pp. 51-60, March 2007.
[9]http://windows.microsoft.com/zh-tw/windows-live/movie-maker#t1=overview
[10]Ioannis Tsochantaridis, Thorsten Joachims, Thomas Hofmann and Yasemin Altun , “Large Margin Methods for Structured and Interdependent OutputVariables,” JMLR, Vol. 6, pages 1453-1484,2005