
Graduate student: 黃宜程
Huang, Yi-Cheng
Thesis title: 中學生小論文重複性比對與引文行為分析
Analysis of Repetitive Comparisons and Citation Practices in Short Essays by Secondary School Students
Advisor: 曾元顯
Tseng, Yuen-Hsien
Oral defense committee: 林頌堅
Lin, Sung-Chien
李龍豪
Lee, Lung-Hao
曾元顯
Tseng, Yuen-Hsien
Oral defense date: 2023/07/20
Degree: Master
Department: Graduate Institute of Library and Information Studies
Year of publication: 2023
Academic year of graduation: 111
Language: Chinese
Number of pages: 50
Chinese keywords: 高中生、小論文寫作、引用行為、文內引註、重複性比對、學術倫理
English keywords: High school student, Essay writing, Citation practices, In-text citations, Repetitive comparisons, Academic ethics
Research methods: Content analysis, comparative analysis
DOI URL: http://doi.org/10.6345/NTNU202301003
Thesis type: Academic thesis
Usage: 234 views, 17 downloads
  • 本研究將蒐集歷屆小論文寫作得獎作品進行重複性比對與引文行為觀察,擬探討現況小論文得獎作品的重複性行為類型以及臺灣高中生引文行為。期望以協助後續小論文相關研究參考,並提供未來學術倫理教育相關方向以及更加精確的小論文競賽重複性比對方向。
    運用科展比對工具,獲取前500對小論文的重複性比對結果,以進行高重複/具有抄襲特徵小論文的類型分析;利用深度學習bert模型進行小論文引註資料來源類型的分類預測;透過人工檢視1,000篇小論文,取得引用行為的詳盡分析結果。
    重複性比對與引文行為觀察結果概覽:
    一、 排除網址編碼的參考資料和無法剖析的資料,共有255對獨立比對結果。其中高重複/具有抄襲特徵的小論文行為類型依序為:一稿多投/修改後再次投稿(32.3%)、團隊多人分次交稿(28.2%)、疑似抄襲他人作品(25.4%)。
    二、 中學生小論文引註資料的分類預測結果,最常見的使用類別是網路48%,其次是書籍39%。
    三、 透過人工檢視1,000篇小論文整體的文內總引註率為67%,不同變因相關數據如下:
    (一)地區變因:不同地區之間的文內引註率存在顯著差異,海外區的文內引註率最高100%,其次是新竹區84%;相反,花蓮區的文內引註率最低53%,次低為宜蘭區56%。
    (二)年代變因:不同年代間的文內引註率變化呈現一些波動,年代較早的學生比年代較晚的學生有更好的引文行為。2008的文內引註率最高87%,2019年下降到最低點47%。
    (三)年級變因:根據年級層面對於文內引註率的變化進行分析,高一的學生將比高三的學生有更好的引文行為。高中一年級的文內引註率為77%,二年級為68%,三年級則降至61%。
    (四)城鄉變因:非偏鄉區學生比偏鄉區學生有更好的引文行為。偏鄉高級中學文內引註率56%;非偏鄉高級中學文內引註率66%;海外高級中學文內引註率達100%。
    本研究主要貢獻如下:(一)將高重複的小論文行為類型統計分析,對未來的小論文重複性審核方式提供更精確的方向。(二)透過中學生小論文引註資料分類預測,提供關於小論文得獎作品引註資料類別使用情況的洞察。(三)了解影響中學生小論文引文行為的變因,找出可能造成引文行為差異的因素,對未來的學生學術倫理教育提供切入點。

    This research collected award-winning short essays written by secondary school students in previous years and conducted a repetitiveness comparison and an observation of citation practices. It investigates the types of repetitive behavior found in award-winning short essays and the citation practices of high school students in Taiwan. The findings are intended to serve as a reference for future research on short essays, to inform academic ethics education, and to suggest more precise directions for repetitiveness checking in short essay competitions.
    Using the science fair comparison tool, the research obtained repetitiveness comparison results for the top 500 pairs of short essays and used them to analyze the types of essays that are highly repetitive or show plagiarism characteristics. A deep learning BERT model was used to classify and predict the source types of the references cited in the essays. In addition, 1,000 short essays were examined manually to obtain a detailed analysis of citation behavior.
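    The pairwise comparison step described above can be illustrated with a minimal sketch. It does not reproduce the science fair comparison tool, whose internals are not described in this record. It assumes character n-gram Jaccard similarity as a stand-in measure, and the names `char_ngrams`, `jaccard`, and `top_similar_pairs` are hypothetical.

```python
from itertools import combinations


def char_ngrams(text, n=3):
    """Return the set of character n-grams in text, ignoring whitespace."""
    compact = "".join(text.split())
    if len(compact) < n:
        return {compact} if compact else set()
    return {compact[i:i + n] for i in range(len(compact) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; defined as 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def top_similar_pairs(essays, k=500, n=3):
    """Rank all essay pairs by similarity and keep the k highest.

    essays: dict mapping a (hypothetical) essay id to its full text.
    Returns a list of (score, id_a, id_b) tuples, highest score first.
    """
    grams = {eid: char_ngrams(text, n) for eid, text in essays.items()}
    scored = [
        (jaccard(grams[a], grams[b]), a, b)
        for a, b in combinations(sorted(grams), 2)
    ]
    # Sort by descending score; ties are broken by id for a stable order.
    scored.sort(key=lambda item: (-item[0], item[1], item[2]))
    return scored[:k]
```

    Comparing every pair is quadratic in the number of essays, so a production system would more plausibly use an index to retrieve candidate pairs first. The top-ranked pairs would then still need manual review to assign a behavior type.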
    Overview of the repetitiveness comparison and citation practice results:
    1. After excluding references with URL encoding and data that could not be parsed, 255 independent comparison pairs remained. Among them, the most common behavior types for essays that are highly repetitive or show plagiarism characteristics are, in order: (1) submitting the same essay multiple times or resubmitting it after revision (32.3%); (2) multiple team members submitting separately (28.2%); (3) suspected plagiarism of other people's work (25.4%).
    2. In the classification prediction of the sources cited in secondary school students' short essays, the most commonly used category is the Internet (48%), followed by books (39%).
    3. In the manual examination of 1,000 short essays, the overall in-text citation rate is 67%. The figures for each variable are as follows:
    (1) Regional variation: The in-text citation rate differs significantly among regions. The overseas region has the highest rate at 100%, followed by the Hsinchu District at 84%. In contrast, the Hualien District has the lowest rate at 53%, and the Yilan District has the second lowest at 56%.
    (2) Year variation: The in-text citation rate fluctuates across years, and students from earlier years show better citation behavior than students from later years. The rate was highest in 2008 at 87% and fell to its lowest point, 47%, in 2019.
    (3) Grade-level variation: Analyzed by grade level, first-year high school students show better citation behavior than third-year students. The in-text citation rate is 77% for first-year students, 68% for second-year students, and 61% for third-year students.
    (4) Urban-rural variation: Students in non-rural areas show better citation behavior than students in rural areas. The in-text citation rate is 56% for rural senior high schools, 66% for non-rural senior high schools, and 100% for overseas senior high schools.
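    The per-variable rates above are simple proportions: the number of essays with at least one in-text citation divided by the number of essays in the group. A minimal sketch of that arithmetic follows. The record layout, the field name `has_in_text_citation`, and the function `citation_rates` are assumptions for illustration, and the sample data are toy values, not the thesis data.

```python
from collections import defaultdict


def citation_rates(records, key):
    """Compute the in-text citation rate for each value of a grouping field.

    records: iterable of dicts, each with a boolean 'has_in_text_citation'
             (hypothetical field name) and the grouping field named by key.
    Returns a dict mapping each group value to its rate between 0 and 1.
    """
    totals = defaultdict(int)
    cited = defaultdict(int)
    for record in records:
        group = record[key]
        totals[group] += 1
        if record["has_in_text_citation"]:
            cited[group] += 1
    return {group: cited[group] / totals[group] for group in totals}


# Toy records for illustration only.
sample = [
    {"grade": 1, "has_in_text_citation": True},
    {"grade": 1, "has_in_text_citation": False},
    {"grade": 2, "has_in_text_citation": True},
]
rates_by_grade = citation_rates(sample, "grade")
```

    The same function can be called with a region, year, or urban-rural field as `key`, so one pass of manual coding per essay supports all four breakdowns.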

    The main contributions of this research are as follows: (1) The statistical analysis of behavior types in highly repetitive short essays gives a more precise direction for future repetitiveness review of short essays. (2) The classification prediction of sources cited in secondary school students' short essays offers insight into which source categories award-winning essays use. (3) Identifying the variables that affect citation behavior in secondary school students' short essays, and the factors that may cause differences in that behavior, provides an entry point for future academic ethics education.

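    Chapter 1 Introduction
      Section 1 Research Motivation
      Section 2 Definition of Terms
      Section 3 Research Purposes and Questions
    Chapter 2 Literature Review
      Section 1 Current State of Short Essays
      Section 2 Text Repetition Detection Techniques
      Section 3 Plagiarism Behavior among High School Students
      Section 4 Rating of Citation Practices
    Chapter 3 Research Methods and Procedures
      Section 1 Research Scope and Limitations
      Section 2 Research Tools
      Section 3 Research Implementation and Procedures
    Chapter 4 Research Results and Analysis
      Section 1 Behavior Types of Short Essays That Are Highly Repetitive or Show Plagiarism Characteristics
      Section 2 Classification Prediction of Sources Cited in Secondary School Students' Short Essays
      Section 3 Data on Citation Behavior in Secondary School Students' Short Essays
    Chapter 5 Conclusion
    References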

