簡易檢索 / 詳目顯示

研究生: 洪范文
Hung Fan Wen
論文名稱: 以網站日誌探勘建立網站架構
Using Web Log Mining to Create a Website Structure
指導教授: 謝建成
Shieh, Jiann-Cherng
學位類別: 碩士
Master
系所名稱: 圖書資訊學研究所
Graduate Institute of Library and Information Studies
論文出版年: 2010
畢業學年度: 98
語文別: 中文
論文頁數: 62
中文關鍵詞: 尋獲度網站日誌探勘資訊架構
英文關鍵詞: Findability, Web log mining, Information architecture
論文種類: 學術論文
相關次數: 點閱:150下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在各種資訊被大量數位化的網路環境中,網站已變成各機構所必須建立的資訊據點。使用者是否能快速並且直覺地找到需要的資訊,是網站提供給使用者搜尋所需資訊的能力,亦即為一網站內部的尋獲度。
    由於網站結構的日趨複雜,使用者往往需花費較多的時間才能找到所需的資訊,如能根據使用者的瀏覽行為修正網站的結構,可使網站具有適應性,進而提昇使用者的瀏覽效率。但目前有關網站架構的研究、或是局部調整網頁結構等,大多未能從整體的觀點最佳化網站的結構。因此本研究提出以目標網頁的因素加入傳統的方法中,根據網頁間的瀏覽次數,以權重的大小決定某一網頁的父網頁進而重整網站的結構,以提高網站內部尋獲度。
    本研究提出一演算法計算瀏覽序列之時間門檻與觀察使用者瀏覽行為找出目標網頁,以決定該網頁的所有父候選網頁之權重,藉此重建網站架構,並以任務導向之尋獲度測試成效。測試結果顯示目標網頁次數比重為100%的架構,其架構之尋獲度成效較高且優於現行架構,亦優於傳統只考慮網頁瀏覽次數之架構,進而建構符合使用者行為之網站架構。且目標網頁次數比重的不同所組織的架構也明顯不同,在組織結構上,本研究所建構以目標網頁次數比重為100%的架構是屬於廣而淺的形式,在尋獲度成效上也確實能符合前人研究的結果。相較於研究個案中的現行架構,其廣度稍微減少,更可減少使用者在搜尋項目時的迷失感,較符合一個良好資訊架構的設計方式。該架構雖然在部分任務上表現不盡理想,但整體上有著比原先架構與傳統方法之架構有著較佳的表現。

    In the internet environment where all types of information are digitized, a website has become the information site which all institutes and organizations must establish. Whether users can rapidly and intuitively find out information they need is the ability that a website provide for them to do it; i.e. it is the internal findability of a website.
    Because website structures become increasingly complicated, users often need more time to find out needed information; if website structures can be amended according to users’ browsing behaviors, then they can be adaptive and it can improve users’ browsing efficiency. However, most of the current studies related to website structure or partly-adjusted website structure cannot optimize website structure from a holistic viewpoint. Therefore, this study proposed adding the factor of target page into the traditional method, determining the parent page of a certain page by weighing the browsing times of these pages and further reorganized website structure based on it to improve the internal findability.
    This study proposed an algorithm to calculate the time threshold of user session, observe users’ browsing behaviors and find out the target page to determine the weight of all parent page candidates which was the basis to rebuild the website structure, and tested the outcome by the task-oriented findability. The results showed that the findability of the structure whose target page frequency weight was 100% was higher than the current one and than the traditional structure which only considered page browsing times, and this study further constructed a website structure conforming to user behavior. Moreover, the organization structure was significantly different as the frequency weight of target page was different; as for the organization structure, the structure that the frequency weight of target page was 100% constructed in this study was a broad and shallow form, and the result of findability did answer to the results of former studies. Compared with the current structure in the cases, its breadth was lightly lowered and this could reduce users’ lost feeling during information seeking and was a better design of information architecture. Although this structure did not perform well in some parts of task, it did have a better performance than the former and traditional ones as a whole.

    摘要 i Abstract ii 目次 iii 表目次 v 圖目次 vi 第一章 緒論 1 第一節 研究動機與問題陳述 1 第二節 研究目的 3 第三節 研究範圍與限制 3 第四節 名詞解釋 4 第二章 文獻分析 6 第一節 資訊架構 6 第二節 網際探勘(Web Mining) 11 第三節 網站使用探勘的相關研究 16 第三章 研究設計與實施 24 第一節 研究對象 24 第二節 研究方法 26 第三節 研究架構與步驟 28 第四章 研究結果與分析 31 第一節 網站日誌探勘 31 第二節 網站架構圖 40 第三節 網站尋獲度測試結果 47 第四節 分析結果討論 51 第五章 結論與建議 56 第一節 結論 56 第二節 未來研究建議 57 參考書目 59

    中文文獻
    丁依玲(民96)。以卡片分類法分析大學圖書館網站架構-以國立臺灣師範大學圖書館網站為例。未出版之碩士論文,國立台灣師範大學,台北市。
    邱涵芬(民95)。應用資料探勘與文件分類建構個人化知識查詢網站─以全民健康保險研究資料庫與醫藥新聞為例。未出版之碩士倫文,私立大同大學,台北市。
    徐維坤(民92)。使用資料探勘技術於網頁可用性評估並應用於適應化網站之研究。未出版之碩士論文,國立雲林科技大學,雲林縣。
    張忠琦(民89)。資料探勘於網站使用與網頁內容探索之研究。未出版之碩士論文,私立天主教輔仁大學,台北縣。
    張碧顯(民97)。從網站日誌中發現網站結構及網頁關係。未出版之碩士論文,國立暨南國際大學,南投縣。
    陳世訓(民91)。網站日誌探勘的研究與實作,未出版之碩士論文,國立台灣大學,台北市。
    陳韻如(民91)。超文本資訊架構與閱聽人溝通脈絡、時間序列對於手持小螢幕瀏覽之影響。未出版之碩士論文,國立交通大學,新竹市。
    曾憲雄、蔡秀滿、蘇東興、曾秋蓉與王慶堯(民96)。資料探勘。台北市:旗標出版股份有限公司。
    黃雅慧(民91)。應用網站探勘於網站瀏覽之個人化-以健康檢查業為例。未出版之碩士論文,私立東吳大學,台北市。
    賴玲玲、鍾閔衛(民96)。建構資訊服務Ⅱ:以網頁設計人員為例。教育資料與圖書館學,45(1),33-53。
    戴玉美、劉元芳(民94)。我國IA(信息構建)研究綜述。情報資料工作,143,13-16。
    蘇俊斌(民94)。應用網站探勘技術於網友瀏灠行為分析-以內容服務網站為例。未出版之碩士論文,國立台灣大學,台北市。

    英文文獻
    Bettina Berendt, Bamshad Mobasher, Miki Nakagawa & Myra Spiliopoulou (2002). The impact of site structure and user environment on session reconstruction in web usage mining. WEBKDD 2002, 2703(2003), 159-179.
    Capra, M. G. (2005). Factor analysis of card sort data: an alternative to hierarchical cluster analysis. In Proceedings of the Human Factors and Ergonomics Society 49th Annual Meeting, 651-655.
    Chen, Z., Fu, A.W.C. & Tong F.C.H (2003). Optimal Algorithms for Finding User Access Sessions from Very Large Web Logs. World Wide Web: Internet and Web Information Systems, 6(3).
    Chu-Hui Lee, Yu-Hsiang Fu. (2008) Web usage Mining based on clustering of browsing features. The Eighth International Conference on Intelligent Systems Design and Applications.
    Dixon, K., Lowcay, B. (2007). Creating a great website. Retrieved on January 7, 2009, from http://www.phcris.org.au/phplib/filedownload.php?file=/elib/lib/downloaded_files/publications/pdfs/phcris_pub_1214.pdf
    Etzioni, O. (1996). The World Wide Web: quagmire or gold mine. Communications of The ACM, 39(11), 65-68.
    Hiroshi Ishikawa, Manabu Ohta, Shohei Yokoyama, Junya Nakayama, & Kaoru Katayama. (2002). Web usage mining approaches to page recommendation and restructuring. Intelligent Systems in Accounting, Finance & Management, 11(3), 137-148.
    Hudson, W. (2005). Playing your cards right: getting the most from card sorting for navigation design. Interactions, 12, 56-58.
    Information Architecture Institute (2009). Retrieved march 15, 2009, from http://www.iainstitute.org/en/about/our_mission.php
    Jianhan Zhu, Jun Hong, & John G. Hughes. (2004). PageCluster: Mining Conceptual Link Hierarchies from Web Log Files for Adaptive Web Site Navigation. ACM Transactions on Internet Technology, 4(2), 185-208.
    Kosala, R., Blockeel, H. (2000). Web mining research: a survey. ACM SIGKDD, 2(1), 1-15.
    Kwok-Wai Cheung, Yuxiang Sun. (2003). Mining web site's clusters from link topology and site hierarchy. Proceedings of the 2003 IEEE/WIC International Conference on Web Intelligence (WI'03).
    Masseglia, P. Poncelet, & M. Teisseire. (1999). Using data mining techniques on web access logs to dynamically improve hypertext structure. ACM SIGWEB Newsletter, 8 (3), 13-19.
    Michael Hawley. (2008). Extending Card-Sorting techniques to inform the design of web site hierarchies. Retrieved January 2, 2009, from http://www.uxmatters.com/MT/archives/000332.php
    Mobasher, R. Cooley, & J. Srivastava. (2000). Automatic Personalization Based on Web Usage Mining. Communication of the ACM, 43, 142-151.
    Morville, P. (2005a). Ambient findability: libraries at the crossroads of ubiquitous computing and the Internet. Online, 29(6), 16-21.
    Morville, P. (2005b). Ambient findability: findability hacks. Retrieved January 5, 2009, from http://www.alistapart.com/articles/ambientfindability
    Myra Spiliopoulou. (2000). Web usage mining for web site evaluation. Communications of the ACM, 43(8), 127-134.
    Nielsen, J. (2000). Why You Only Need to Test With 5 Users, Alter Box. Retrieved May 30, 2009, from http://www.useit.com/alertbox/20000319.html
    Nielsen, J. (2004). Card Sorting: How Many Users to Test. Retrieved May 30, 2009, from http://www.useit.com/alertbox/20040719.html
    Nielsen, J. (2006). Quantitative Studies: How Many Users to Test?. Retrieved May 30, 2009, from http://www.useit.com/alertbox/quantitative_testing.html
    R. Cooley, B.Mobasher, & J. Srivastava. (1997). Web Mining: Information and Pattern Discovery on the World Wide Web. In Proceeding of ICTAI'97.
    Ramakrishnan Srikant, Yinghui Yang Mining. (2001). Web logs to improve website organization. Proceedings of the tenth international World Wide Web conference on World Wide Web, 430-437.
    Ramakrishnan Srikant, Yinghui Yang. (2001). Mining web logs to improve website organization. Proceedings of the tenth international World Wide Web conference on World Wide Web, 430-437.
    Robert Cooley, Bamshad Mobasher, & Jaideep Srivastava. (1999). Data Preparation for Mining World Wide Web Browsing Patterns. Knowledge and Information Systems, 1(1).
    Rosenfeld, L., & Morville, P. (2006). Information architecture for the World Wide Web (3rd ed.). Sebastopol: O’Reilly Media.
    Toub, S. (2000). Evaluating information architecture. Retrieved on January 2, 2009, from http://argus-acia.com/white_papers/evaluating_ia.pdf
    Usama Fayyad, Gregory Piatetsky-Shapiro, & Padhraic Smyth. (1996). The KDD Process for Extracting Useful Knowledge from Volumes of Data. Communications of the ACM, 39(11), 27-34.
    W.J.Frawley, G.Paitetsky-Shapiro, & C.J.Matheus. (1991). Knowledge Discovery in Databases : An Overview Knowledge Discovery in Databases. edited by G.Piatetsky-Shapiro and W.J.Frawley, California, AAAI/MIT Press, 1-30.
    Wan Abdul Rahim Wan Mohd Isa, Nor Laila Md Noor, Shafie Mehad. (2008). Website information architecture influences on user performance: An experimental approach. Information Technology(ITSim 2008), 1, 1-9.
    White, B. (2003). Web accessibility, mobility, and findability. In First Latin American Web Congress: proceedings: LA-WEB 2003, Santiago 2003 (pp. 239-240). Los Alamitos, Calif.: IEEE Computer Society.
    Zaphiris, P. (2000). Depth vs. breadth in the arrangement of web links. Proceedings of the IEA 2000/HFES 2000 Congress, 453-456.

    下載圖示
    QR CODE