簡易檢索 / 詳目顯示

研究生: 施佩君
論文名稱: 新聞論壇多面向分析之研究
指導教授: 柯佳伶
學位類別: 碩士
Master
系所名稱: 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2009
畢業學年度: 97
語文別: 中文
論文頁數: 61
中文關鍵詞: 分類論壇新聞面向探勘摘要
論文種類: 學術論文
相關次數: 點閱:181下載:3
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在網路新聞論壇中,由於文章內容是由一般使用者自由發佈,即使論壇中有以主題區分討論內容,但討論文章內容中仍可能呈現出多種討論觀點面向,使用者不容易從數量龐大的討論文章中有效瀏覽自己感興趣的觀點文章。本論文研究所提出的方法可對一組新聞論壇文章自動分析萃取出重要的討論觀點面向(以關鍵字表示),且建立出面向的階層架構關係,並對各文章自動判斷其所包含的各個面向,提供使用者可依討論觀點面向進行文章瀏覽。在探勘分析過程中,我們會先由標題字詞選出重要且出現頻率高的字詞作為面向,並對這些面向關鍵字探勘出其相關擴展字詞,接著我們利用向量空間模型分別計算整篇文章所包含的字詞與面向擴展字詞的相關程度,以及文章中各個句子所包含的字詞與面向擴展字詞的相似度,再將這兩個結果合併判斷一篇文章所包含之相關面向。實驗結果顯示:本論文系統對各文章所選定的面向與受試者挑選的面向結果一致性很高;且將多個主題的文章混合在一起時,本論文方法也可以將不同主題的文章所涵蓋的面向正確地萃取出來。

    第一章 緒論 1 1-1 研究動機 1 1-2 相關文獻探討 2 1-3 論文方法 7 1-4 論文架構 8 第二章 問題描述與定義 9 2-1 問題描述 9 2-2 基本名詞定義 10 第三章 論文方法 15 3-1 系統簡介 15 3-2 資料蒐集及資料前處理 16 3-3 建立各主題之面向 21 3-4 選定面向 29 第四章 實作系統簡介與實驗評估 37 4-1 新聞論壇多面向分析系統介紹 37 4-2 實驗評估 41 4-3 分析與討論 50 第五章 結論與未來研究方向 52 參考文獻 54

    [1] W. Dakka, P. G. Ipeirotis, and K. R. Wood, “Automatic construction of multifaceted browsing interfaces,” In Proceedings of the 14th ACM international conference on Information and knowledge management (CIKM), 2005.
    [2] W. Dakka, R. Dayal, and P. G. Ipeirotis, “Automatic discovery of useful facet terms,” In Proceedings of the 29th ACM SIGIR conference on Faceted Search, 2006.
    [3] W. Dakka and P. G. Ipeirotis, “Automatic Extraction of Useful Facet Hierarchies from Text Databases,” in Proceedings of the 24th International Conference on Data Engineering (ICDE), 2008.
    [4] D. Dash, J. Rao, N. Megiddo, A. Ailamaki1, and G. Lohman, “Dynamic Faceted Search for Discovery-driven Analysis,” In Proceedings of the 17th ACM international conference on Information and knowledge management (CIKM), 2008.
    [5] G. Erkan and D. R. Radev, “LexPageRank: Prestige in Multi-Document Text Summarization,” In Proceeding of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2004.
    [6] M. Gamon, S. Basu, D. Belenko, D. Fisher, M. Hurst, and A. C. König, “BLEWS: Using Blogs to Provide Context for News Articles,” In National Conference on Artificial Intelligence (AAAI), 2008.
    [7] B. He, C. Macdonald, J. He, and I. Ounis, “An Effective Statistical Approach to Blog Post Opinion Retrieval,” In Proceedings of the 17th ACM international conference on Information and knowledge management (CIKM), 2008.
    [8] M. Hu, A. Sun, and E. Lim, “Comments-Oriented Blog Summarization by Sentence Extraction,” In Proceedings of the 16th ACM international conference on Information and knowledge management (CIKM), 2007.
    [9] M. Hu, A. Sun, and E. Lim, “Comments-Oriented Document Summarization: Understanding Documents with Readers’ Feedback,” In Proceeding of the 31st ACM SIGIR conference on Research and Development in Information Retrieval, 2008.
    [10] L. Ku, Y. Liang, and H. Chen, “Opinion Extraction, Summarization and Tracking in News and Blog Corpora,” In National Conference on Artificial Intelligence (AAAI), 2006.
    [11] X. Ling, Q. Mei, C. Zhai, and B. Schatz, “Mining Multi-Faceted Overviews of Arbitrary Topics in a Text Collection,” In Proceeding of the 11th ACM SIGKDD international conference on Knowledge discovery in data mining, 2008.
    [12] G. Mishne, “Multiple Ranking Strategies for Opinion Retrieval in Blogs,” in Proceedings of the 15th of Text REtrieval Conference (TREC 2006), 2006.
    [13] G. Mishne, “Using Blog Properties to Improve Retrieval,” In proceedings of International Conference on Weblogs and Social Media (ICWSM), 2007.
    [14] G. Salton, “Automatic Information Organization and Retrieval,” McGraw-Hill, New York, 1968.
    [15] E. Stoica, M. A. Hearst, and M. Richardson, “Automating creation of hierarchical faceted metadata structures,” In Proceedings of NAACL/HLT 2007, 2007.
    [16] W. Zhang, C. Yu, and W. Meng, “Opinion Retrieval from Blogs,” In Proceedings of the 16th ACM international conference on Information and knowledge management (CIKM), 2007.

    下載圖示
    QR CODE