Graduate student: 朱惠銘 (Huei-Ming Chu)
Thesis title: 研究使用詞彙與語意資訊於
Investigating the Use of Lexical and Semantic Information for Automatic Spoken Document Segmentation and Organization
Advisor: 陳柏琳 (Chen, Berlin)
Degree: Master's
Department: Department of Computer Science and Information Engineering
Year of publication: 2005
Academic year of graduation: 93 (ROC calendar)
Language: Chinese
Number of pages: 101
Chinese keywords: 語音文件切割, 語音文件組織, 自我組織圖, 主題混合模型圖示
English keywords: Spoken Document Segmentation, Spoken Document Organization, Self-Organization Map, Topic Mixture Model Map
Thesis type: Academic thesis
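  • Spoken document segmentation refers to automatically marking the boundaries between different topics in a long audio signal, so that a spoken document can be divided into topically cohesive passages. Spoken document organization refers to analyzing which topic each segmented passage belongs to, grouping the passages into topic clusters, and, once the clusters are labeled, presenting them in a hierarchical visualization that is easy for users to browse. Both have received increasing attention in recent years.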

    Spoken document segmentation automatically locates the boundaries between the different topics mentioned in long streams of audio signals, dividing spoken documents into cohesive paragraphs whose sentences share a common central topic. Spoken document organization, in turn, automatically analyzes the subject topics of the short segmented paragraphs, clusters them into groups with topic labels, and arranges them in a hierarchical visual presentation that is easier for users to browse. Both have gained growing attention in the past few years.
    In this thesis, we explored the use of the Hidden Markov Model (HMM) approach, which has proven effective for speech recognition and information retrieval, for spoken document segmentation. In identifying segment boundaries, we exploited not only the lexical information inherent in the spoken document, such as statistical features and language model probabilities, but also acoustic information, such as the pause distribution and the confidence measure. Moreover, the semantic information conveyed in the spoken document was integrated into the HMM segmenter to model the state observation distributions more accurately. We also investigated two unsupervised, data-driven approaches to spoken document organization: the Self-Organizing Map (SOM) and the Probabilistic Latent Semantic Analysis Map (ProbMap). For the ProbMap approach, a topical mixture model approach (TMMmap), which starts from an alternative perspective, was also studied. A series of experiments was conducted on the Topic Detection and Tracking (TDT) spoken document collections to analyze the performance of these approaches and compare the differences between them. Finally, we attempted to incorporate the topic distributions and the topological constraints obtained from spoken document organization into the HMM segmenter. Initial results were very promising.
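
    As a rough illustration of the HMM segmentation idea summarized above, the following sketch treats each topic cluster as a hidden state with its own smoothed unigram language model and finds the best topic sequence over sentences with Viterbi decoding. It is a minimal sketch under stated assumptions, not the thesis's implementation: the function names `train_unigram` and `segment`, the add-alpha smoothing, the fixed `switch_penalty`, and the toy data are all hypothetical, and the acoustic and semantic features mentioned in the abstract are left out.

```python
import math
from collections import Counter


def train_unigram(docs, vocab, alpha=1.0):
    """Add-alpha smoothed unigram log-probabilities for one topic cluster.

    Hypothetical helper: the smoothing scheme is an assumption.
    """
    counts = Counter(w for doc in docs for w in doc)
    total = sum(counts[w] for w in vocab) + alpha * len(vocab)
    return {w: math.log((counts[w] + alpha) / total) for w in vocab}


def segment(sentences, topic_models, switch_penalty=2.0):
    """Viterbi decoding with one hidden topic state per sentence.

    Staying in a topic costs nothing; switching topics costs
    `switch_penalty` (in log-probability units). Returns the indices
    of sentences that start a new segment.
    """
    topics = list(topic_models)

    def emit(topic, sentence):
        lm = topic_models[topic]
        # Out-of-vocabulary words are skipped (an assumption).
        return sum(lm[w] for w in sentence if w in lm)

    def step_cost(prev, cur):
        return 0.0 if prev == cur else switch_penalty

    score = {t: emit(t, sentences[0]) for t in topics}
    backpointers = []
    for sentence in sentences[1:]:
        new_score, pointers = {}, {}
        for t in topics:
            best_prev = max(topics, key=lambda p: score[p] - step_cost(p, t))
            pointers[t] = best_prev
            new_score[t] = (score[best_prev] - step_cost(best_prev, t)
                            + emit(t, sentence))
        backpointers.append(pointers)
        score = new_score

    # Trace the best path back from the best final state.
    state = max(topics, key=lambda t: score[t])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return [i for i in range(1, len(path)) if path[i] != path[i - 1]]


# Toy data (hypothetical): two topic clusters and a four-sentence stream.
vocab = {"rain", "storm", "wind", "vote", "election", "party"}
topic_models = {
    "weather": train_unigram([["rain", "storm", "wind"], ["storm", "rain"]], vocab),
    "politics": train_unigram([["vote", "election", "party"], ["election", "vote"]], vocab),
}
sentences = [["rain", "wind"], ["storm", "rain"], ["vote", "party"], ["election", "vote"]]
boundaries = segment(sentences, topic_models, switch_penalty=2.0)
print(boundaries)
```

    In this sketch, a larger `switch_penalty` makes topic changes more expensive and so produces fewer, longer segments, while a smaller one produces more boundaries.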

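    Acknowledgments
    Chinese Abstract
    Abstract
    List of Figures
    List of Tables
    Chapter 1 Introduction
        1.1 Preface
        1.2 Research Motivation
        1.3 Thesis Content
        1.4 Research Contributions
        1.5 Thesis Outline
    Chapter 2 Literature Review
        2.1 Spoken Document Segmentation
            2.1.1 Hidden Markov Model (HMM)
            2.1.2 Local Context Analysis (LCA)
            2.1.3 Statistical Models
            2.1.4 Aspect Hidden Markov Model (AHMM)
            2.1.5 Other Related Methods
        2.2 Spoken Document Organization and Visualization
            2.2.1 Spoken Document Organization Systems
            2.2.2 Self-Organizing Map
    Chapter 3 Experimental Corpus and Evaluation Methods
        3.1 Description of the Experimental Corpus
        3.2 Performance Evaluation for Automatic Document Segmentation
            3.2.1 Evaluation Directly on the Speech Signal
            3.2.2 Evaluation after Converting Spoken Documents to Text
        3.3 Performance Evaluation for Visual Document Organization
            3.3.1 Distance-Ratio Evaluation
    Chapter 4 Segmentation of Spoken News Documents
        4.1 Hidden Markov Model
            4.1.1 Effect of the Number of Clusters and the Transition Penalty on Performance
            4.1.2 Effect of Language Model Smoothing on Performance
        4.2 Considering Other Information in Spoken Documents
            4.2.1 Considering Speech Recognition Confidence Information
            4.2.2 Considering Pause Duration Information
            4.2.4 Combining the Above Information
        4.3 Adjusting How Observation Probabilities Are Measured
            4.3.1 Incorporating Latent Semantic Analysis
            4.3.2 Hidden Markov Model Incorporating Latent Semantic Analysis Information
    Chapter 5 Topic Organization of Data
        5.1 Visualization with the Self-Organizing Map
            5.1.1 Self-Organizing Map for Visual Organization of Spoken Documents
            5.1.2 Neighborhood Function Settings of the Self-Organizing Map
            5.1.3 Evaluation Results of the Self-Organizing Map
        5.2 Visualization of Probabilistic Topic Organization
            5.2.1 Probabilistic Latent Semantic Analysis (PLSA)
            5.2.2 ProbMap
            5.2.3 Introduction to the Topic Mixture Model
            5.2.4 Topic Mixture Model map (TMMmap)
            5.2.5 Map Evaluation Results
        5.3 Prototype System for Spoken Document Visualization
        5.4 Applying Topic Organization Information to Spoken Document Segmentation
            5.4.1 Applying Self-Organizing Map Information to HMM Spoken Document Segmentation
            5.4.2 Applying Probabilistic Topic Organization Information to HMM Spoken Document Segmentation
    Chapter 6 Conclusions and Future Work
        6.1 Conclusions
        6.2 Future Work
    References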
