Graduate student: 簡培修 (Pei Hsiu Chien)
Thesis title: Support Vector Machine Based Questionnaire Marking Recognition Research and Applications (以支持向量機為基礎之問卷填答識別研究)
Advisor: 李忠謀 (Lee, Chung-Mou)
Degree: Doctoral
Department: Department of Computer Science and Information Engineering (資訊工程學系)
Year of publication: 2012
Academic year of graduation: 100 (ROC calendar)
Language: Chinese
Number of pages: 74
Keywords: questionnaire marking recognition (問卷填答識別), form processing system (表單處理系統), support vector machine (支持向量機), exam grading system (試卷評分系統)
Document type: Academic thesis
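  • In an era of flourishing computer networks, some paper questionnaires have moved online, which makes results quick to tally. However, there are still many settings where computers and networks are inconvenient to use, for example dining at a restaurant, shopping in a store, making bank deposits or withdrawals, attending a product launch or seminar, or handling business at a government office. In these settings it is usually impractical to provide computers and network access for filling out a questionnaire, so when feedback must be collected on the spot, a paper questionnaire remains the most direct and convenient channel. To make questionnaires easy to fill out and quick to tally, most are designed around multiple-choice questions. In both academic research and commercial software, the usual way to handle this type of question is still to count the visible pixels in each answer region and use that count as the main basis for deciding whether the region has been marked. However, noise and the variety of ways in which respondents mark answers (ticking, crossing, filling in the box, and so on) often prevent these pixel-counting methods from correctly recognizing whether an option has been marked.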

    Even in this electronic age, paper-based forms are still very much part of daily life. Filling out a service quality questionnaire during a flight, completing a survey after attending a seminar, and filling out a passport application form are all common tasks that still require pen-and-paper input. When a large number of forms must be collected, a form processing system that automatically extracts and tallies the inputs is needed to save time and prevent errors. Most systems recognize marks in regions of interest by counting the visible pixels in them. However, the accuracy of this approach is strongly affected by noise and by the variety of marks that respondents use.
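    The pixel-counting approach mentioned above can be illustrated with a minimal sketch. It is not taken from the thesis: the function names, the 5x5 binary regions, and the 0.2 threshold are all assumptions chosen only to show how a fixed threshold can miss a thin tick and accept a noisy blank box.

```python
# Illustrative baseline only: names, regions, and threshold are hypothetical.

def dark_fraction(region):
    """Return the fraction of dark pixels in a binary region (1 = ink, 0 = paper)."""
    total = sum(len(row) for row in region)
    dark = sum(sum(row) for row in region)
    return dark / total if total else 0.0


def is_marked_by_count(region, threshold=0.2):
    """Treat the region as marked when enough of its pixels are dark."""
    return dark_fraction(region) >= threshold


# A thin tick: only 4 of 25 pixels are dark, so the baseline misses it.
thin_check = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0],
    [1, 0, 1, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

# An unmarked box with scanner noise: 6 of 25 pixels are dark,
# so the baseline wrongly reports a mark.
smudge = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0],
    [0, 0, 0, 0, 1],
]

# A fully filled box: every pixel is dark.
filled = [[1] * 5 for _ in range(5)]

for name, region in [("thin_check", thin_check), ("smudge", smudge), ("filled", filled)]:
    print(name, dark_fraction(region), is_marked_by_count(region))
```

    Raising or lowering the threshold only trades one kind of error for the other, which is the weakness that motivates a learned classifier.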
    The proposed system divides the automatic mark recognition process into two stages. The first stage automatically recognizes the regions of interest and groups them by question. The second stage recognizes the marks made by respondents. The system uses a support vector machine (SVM) as its main technique to cope with noise, and it takes the respondent's intent into account in order to discard crossed-out marks. The system was put to use in two settings. First, it automatically tallied and reported the results of a (university) service quality questionnaire and an end-of-semester course survey. Second, it automatically graded the Intellectual Property Rights Exam taken by incoming freshmen. The SVM classifier detects checked and unchecked marks with accuracy above 99%, and the choice for each question is recognized with accuracy above 98%. Finally, we propose a blend SVM for questionnaires whose options use new types of symbols, which would otherwise usually require training a new SVM. The same questionnaires and exam were used to evaluate the blend SVM. Its accuracy is slightly lower but remains above 95%, which suggests that the blend SVM is suitable for new questionnaires that can tolerate slightly lower accuracy.
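    The step from per-region mark decisions to a per-question choice might look like the following sketch. It is an assumption-laden illustration, not the thesis's implementation: `tally_question`, `tally_form`, the status labels, and the stand-in classifier are hypothetical, and the thesis's SVM classifier and cross-out handling are not reproduced here.

```python
# Hypothetical sketch: combine per-region decisions into one answer per question.
from typing import Callable, Dict, List, Optional, Tuple

Region = List[List[int]]              # binary image patch, 1 = ink
Option = Tuple[str, Region]           # (option label, answer region)
Result = Tuple[Optional[str], str]    # (chosen label or None, status)


def tally_question(options: List[Option], is_marked: Callable[[Region], bool]) -> Result:
    """Return the single marked option, or flag the question as blank or ambiguous."""
    marked = [label for label, region in options if is_marked(region)]
    if len(marked) == 1:
        return marked[0], "ok"
    if not marked:
        return None, "blank"
    return None, "ambiguous"


def tally_form(groups: Dict[str, List[Option]],
               is_marked: Callable[[Region], bool]) -> Dict[str, Result]:
    """Apply tally_question to every group of answer regions on a form."""
    return {qid: tally_question(options, is_marked) for qid, options in groups.items()}


def stand_in_classifier(region: Region) -> bool:
    """Placeholder for a trained classifier; here, two or more ink pixels count as marked."""
    return sum(sum(row) for row in region) >= 2


empty = [[0, 0], [0, 0]]
groups = {
    "Q1": [("A", empty), ("B", [[1, 1], [0, 1]])],
    "Q2": [("A", empty), ("B", empty)],
    "Q3": [("A", [[1, 1], [1, 1]]), ("B", [[1, 0], [0, 1]])],
}
results = tally_form(groups, stand_in_classifier)
print(results)
```

    Passing the classifier in as a function keeps the grouping logic independent of how marks are detected, so a trained SVM's prediction function could be substituted for the placeholder.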

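    Chapter 1: Introduction
        Section 1: Research Background and Motivation
        Section 2: Problems and Challenges
        Section 3: Thesis Organization
        Section 4: Definitions of Terms
    Chapter 2: Literature Review
        Section 1: Processing of Form Documents
        Section 2: Automatic Ballot Recognition
        Section 3: Review of Support Vector Machine Theory
    Chapter 3: System Architecture and Research Methods
        Section 1: System Architecture
        Section 2: Automatic Recognition of Answer Regions
        Section 3: Automatic Grouping of Answer Regions
        Section 4: Overlay Alignment of Completed and Blank Questionnaires
        Section 5: Mark Recognition
    Chapter 4: Experimental Results and Discussion
        Section 1: Recognition of Answer Regions in Various Questionnaire Types
        Section 2: Evaluation of the Suitability of the SVM Method
        Section 3: Processing of Questionnaire Responses
        Section 4: Extended Application: Automatic Exam Grading
        Section 5: Construction and Testing of a General-Purpose SVM
    Chapter 5: Conclusion
    References
    Appendices
        A. Sample Mark Recognition Result File
        B. Rule Templates for Answer Regions
        C. Ballot Data Recognition Results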

