簡易檢索 / 詳目顯示

研究生: 陳育銘
Chen,Yu-Ming
論文名稱: RAG技術的應用與效能評估-以圖書資訊學領域為例
Application and Performance Evaluation of RAG Technology -A Case Study of Library and Information Science
指導教授: 曾元顯
Tseng, Yuen-Hsien
口試委員: 曾元顯
Tseng, Yuen-Hsien
林頌堅
Lin, Sung-Chien
陳舜德
Chen, Shun-Der
口試日期: 2025/01/06
學位類別: 碩士
Master
系所名稱: 圖書資訊學研究所
Graduate Institute of Library and Information Studies
論文出版年: 2025
畢業學年度: 113
語文別: 中文
論文頁數: 70
中文關鍵詞: 檢索增強生成大型語言模型
英文關鍵詞: Retrieval-Augmented Generation, Large Language Models
DOI URL: http://doi.org/10.6345/NTNU202500473
論文種類: 學術論文
相關次數: 點閱:86下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本研究針對圖書資訊學領域,探討檢索增強生成(Retrieval-Augmented Generation, RAG)技術的應用與效能評估。現有的大型語言模型(如GPT-3)雖展現卓越的文本生成能力,但在面對專業問題時,易受人工智慧幻覺影響,導致生成內容的準確性和相關性不足。RAG技術結合檢索與生成兩個階段,通過檢索外部資料輔助文本生成,提升了內容的專業性與上下文的連貫性,特別適合應用於資訊需求高且專業性強的領域。本研究採用AI生成問題並使用RAG進行回答,結合ChatGPT與人工的評分數據,透過多指標(如F1分數、準確率)對RAG效能進行量化分析。結果顯示,RAG能有效克服傳統LLM在專業領域中的不足,在準確性、相關性和上下文匹配上表現卓越。同時,採用Ragas生成測試集以另一種客觀方式進行評估,進一步驗證RAG技術的效能。然而,研究也發現部分生成回答在忠實度上存在改進空間,特別是在資料支持不足或背景資訊偏差的情境中。本研究證實,RAG技術能顯著提升大型語言模型在圖書資訊學領域文本生成的質量,為專業問題解決提供了更準確與可靠的工具,並為相關領域的研究與應用提供了重要的參考依據。

    This study focuses on the application and performance evaluation of Retrieval-Augmented Generation (RAG) in the field of Library and Information Science. While existing LLMs such as GPT-3 and BERT demonstrate remarkable capabilities in text generation, they are prone to inaccuracies and lack relevance when addressing domain-specific issues, often due to the phenomenon of artificial intelligence hallucination. By integrating retrieval and generation processes, RAG leverages external data to enhance the accuracy and contextual relevance of generated content, making it particularly suitable for fields that demand precise and professional information.In this study, AI-generated questions were answered using RAG, and the results were evaluated through combined scores from ChatGPT and human reviewers using metrics such as F1-score and accuracy. The findings indicate that RAG effectively addresses the limitations of traditional LLMs in professional domains, demonstrating outstanding performance in accuracy, relevance, and contextual alignment. Additionally, testset generated using Ragas were employed to provide an objective evaluation of RAG’s performance, further validating its effectiveness. However, the study also revealed areas for improvement in the fidelity of generated responses, particularly in cases of insufficient data support or contextual misalignment.This research confirms that RAG technology significantly improves the quality of text generation by large language models in the field of Library and Information Science. It offers a more accurate and reliable tool for solving professional problems and provides valuable insights for future research and applications in related domains.

    第一章 緒論 1 第一節 研究背景與動機 1 第二節 研究目的與問題 2 第二章 文獻探討 3 第一節 大型語言模型(Large Language Models, LLMs) 3 第二節 檢索增強生成(Retrieval-Augmented Generation, RAG) 5 第三章 研究方法 8 第一節 研究範圍與限制 8 第二節 研究工具 9 第三節 研究方法與實施 10 第四章 研究結果與分析 19 第一節 ChatGPT與人工結合的評估分析 19 第二節 Ragas評估分析 21 第五章 結論與建議 56 參考文獻 59 附錄一 ChatPDF生成之問題 61

    Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.https://doi.org/10.48550/arXiv.1810.04805
    Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., ... & Amodei, D. (2020). Language models are few-shot learners. arXiv preprint arXiv:2005.14165.https://doi.org/10.48550/arXiv.2005.14165
    Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., ... & Riedel, S. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. arXiv preprint arXiv:2005.11401.https://doi.org/10.48550/arXiv.2005.11401
    Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.
    Ragas. (n.d.). Ragas documentation (Version stable). Retrieved December 24, 2024, from https://docs.ragas.io/en/stable/
    莊晏詞(2024)。生成式AI於醫療領域應用之管制議題。生物產業科技管理叢刊,12(),43-61。https://0-doi-org.opac.lib.ntnu.edu.tw/10.6170/BTMR.202404_12.0004
    涂家章(2024)。生成式AI於文字客服機器人應用。電腦與通訊,(197),11-15。https://0-www-airitilibrary-com.opac.lib.ntnu.edu.tw/Article/Detail?DocID=1019391x-N202404160003-00003
    蔡銘修、林進益(2024)。生成式AI在大專院校工程教育中的挑戰與潛力。臺灣教育評論月刊,13(5),58-63。https://0-www-airitilibrary-com.opac.lib.ntnu.edu.tw/Article/Detail?DocID=P20130114001-N202405030017-00011
    黃冠綸(2023)。中文期刊論文資訊擷取之研究 — 以圖書資訊學領域為例https://hdl.handle.net/11296/puev95

    下載圖示
    QR CODE