簡易檢索 / 詳目顯示

研究生: 高國瑋
Gao, Guo-Wei
論文名稱: 基於長短期記憶網路的疲勞檢測
Base on Long Short-term Memory Network for Fatigue Detection
指導教授: 陳美勇
Chen, Mei-Yung
口試委員: 蘇順豐
Su, Shun-Feng
林顯易
Lin, Sian-Yi
郭重顯
Guo, Chong-Sian
陳美勇
Chen, Mei-Yung
口試日期: 2022/07/26
學位類別: 碩士
Master
系所名稱: 機電工程學系
Department of Mechatronic Engineering
論文出版年: 2022
畢業學年度: 110
語文別: 中文
論文頁數: 73
中文關鍵詞: 疲勞檢測特徵提取長短期記憶網路可視化機器學習
英文關鍵詞: Fatigue Detection, Feature Extraction, Long Short-term Memory Networks, Visualization, Machine Learning
研究方法: 實驗設計法主題分析
DOI URL: http://doi.org/10.6345/NTNU202201337
論文種類: 學術論文
相關次數: 點閱:78下載:13
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文重點介紹即時疲勞檢測流程。該系統將在 Python 內部完成這一切,並逐步構建它,以便能夠檢測到不同的姿勢,特別是困倦的跡象。 為了做到這一點,我們使用一些關鍵模型並使用 MediaPipe Holistic 來提取關鍵點。 這將使我們能夠從臉部提取關鍵點。 該系統使用 Tensorflow 和 Keras,並建立了一個長短期記憶模型 long short-term memory(LSTM),能夠預測螢幕上顯示的動作。我們需要做的是收集關於我們所有不同關鍵點的一些數據,所以我們收集我們臉上的數據並將它們保存為 Numpy 數據,以便處理多維的陣列或矩陣。人臉檢測方法基於一個深度神經網絡,使用 Sklearn 進行評估和測試,並使用 Matplotlib 幫助進行圖像可視化。能夠從臉部檢測到 468個地標,提取臉部的重要特徵並對數據進行變換,以便將數據導入 LSTM 模型。使用 LSTM 層繼續並預測時間分量,它能夠從多個幀預測動作,而不僅僅是單個幀。使用 Opencv 進行集成,然後使用網路攝影機進行即時預測。本研究成功使用 MediaPipe 與 LSTM 模型相結合,提出一套疲勞檢測的系統。實驗結果顯示,經機器學習後其檢測平均準確率能達到 90%。

    This paper focuses on the instant fatigue detection process. The system will do all this inside python and build it incrementally to be able to detect different poses, especially signs of drowsiness. To do this, we use some key models and use MediaPipe Holistic to extract keypoints. This will allow us to extract keypoints from the face. The system uses Tensorflow and Keras and builds a long short-term memory (LSTM) model that is able to predict actions displayed on the screen. What we need to do is collect some data about all our different keypoints, so we collect data on our faces and save them as Numpy data in order to work with multidimensional arrays or matrices. The face detection method is based on a deep neural network, evaluated and tested using Sklearn and aided in image visualization using Matplotlib. Able to detect 468 landmarks from faces, extract important features of faces and transform the data so that it can be imported into an LSTM model. Continuing and predicting the temporal component using an LSTM layer, it is able to predict action from multiple frames, not just a single frame. Integrate with Opencv, then use a webcam for instant prediction. This study successfully uses MediaPipe combined with LSTM model to propose a fatigue detection system. The experimental results show that the detection accuracy can reach an average of 90% after machine learning.

    目錄 摘要 I Abstract II 誌謝 III 目錄 IV 表目錄 VI 圖目錄 VII 第一章 緒論 1 1.1前言 1 1.2文獻回顧 4 1.3研究動機與目的 10 1.4論文架構 11 第二章 理論基礎 12 2.1遞迴神經網路 12 2.1.1 RNN模型架構 14 2.1.2梯度爆炸 16 2.1.3解決梯度爆炸問題 17 2.1.4長短期記憶網路 18 2.2 MediaPipe 25 第三章 疲勞檢測系統設計 27 3.1系統架構 27 3.1.1色彩轉換 28 3.1.2臉部偵測 29 3.1.3收集特徵數據 30 3.1.4數據預處理 33 3.1.5 LSTM模型訓練 34 3.1.6疲勞預測&儲存權重 36 3.1.7混淆矩陣 37 3.2軟體配置 39 3.3硬體設備 41 第四章 實驗結果與討論 44 4.1實驗方法 44 4.1.1人臉偵測 46 4.1.2 MediaPipe模型距離偵測 47 4.1.3 LSTM網路訓練結果 50 4.1.4系統執行結果 55 4.2光線亮度測試 57 4.2.1光線亮度測試分析 60 4.3臉部部分遮擋之分析 62 4.4不同對象的疲勞檢測 64 4.5 Intel NUC模擬結果 65 4.6不同方法的疲勞偵測比較 67 第五章 結論與未來展望 70 參考文獻 71

    [1] Triyanti V. Iridiastadi H, "Challenges in detecting drowsiness based on driver's behavior, " Mater. Sci. Eng. Conf. Series, vol. 277, issue 1, p. 12-42, 2017.
    [2] Yang Z. Ren H, "Feature Extraction and Simulation of EEG Signals During Exercise-Induced Fatigue, " in IEEE Access, vol. 7, pp. 46389-46398, 2019.
    [3] Gasser T. Sroka L. Möcks J, "The transfer of EOG activity into the EEG for eyes open and closed, " Electroencephalography and clinical neurophysiology, vol. 61, issue 2, pp. 181–193, 1985.
    [4] Mardi Z. Ashtiani S.N.M. Mikaili M, "EEG-based drowsiness detection for safe driving using chaotic features and statistical tests, " J. Med. Signals Sens, vol. 1, issue 2, pp. 130–137, 2011.
    [5] 半導體投資聯盟, "浙江大華股份有限公司官方網站", 15 Apr.2022, www.dahuatech.com/about/company.html.
    [6] Alioua N. Amine A. Rziza M, "Driver's fatigue detection based on yawning extraction, " Int. J. Vehicular Technology, vol. 1, pp. 47–75, 2014.
    [7] Wang T. Shi P, "Yawning detection for determining driver drowsiness," in Proceedings of the IEEE International Workshop on VLSI Design and Video Technology, pp. 373-376, Suzhou, China, May 2005.
    [8] Choi I.H. Kim Y.G, "Head pose and gaze direction tracking for detecting a drowsy driver," IEEE Int. Conf. on Big Data and Smart Computing, Bangkok, Thailand, vol. 9, no. 2, pp. 241–244, 2014.
    [9] CLOUDMATRIX, "GFace面部表情檢測系統", 15 Apr.2022, www.cloudmatrix.com.tw/gfac e-facial-recognition-system/.
    [10] Ray Lin, "駕駛員監控系統DMS介紹", 15 Apr.2022, medium.com/學以廣才/駕駛者監控系統-driver-monitoring-system-dms-2830f3008c81.
    [11] Byoung Chul Ko, "A Brief Review of Facial Emotion Recognition Based on Visual Information," Sensors (Basel), January 2018.
    [12] Kim D.H. Baddar W. Jang J. Ro Y.M, "Multi-objective based Spatio-temporal feature representation learning robust to expression intensity variations for facial expression recognition," IEEE Transactions on Affective Computing, pp.99, April 2017.
    [13] Graves A. Mayer C. Wimmer M. Schmidhuber J. Radig B, "Facial expression recognition with recurrent neural networks," Proceedings of the International Workshop on Cognition for Technical Systems, Santorini, Greece, pp.1–6, October 2008.
    [14] Jain D.K. Zhang Z. Huang K, "Multi angle optimal pattern-based deep learning for automatic facial expression recognition," Pattern Recognition Letters. Volume 139, pp.157-165, November 2020.
    [15] Arqam M. Al-Nuimi, Ghassan J. Mohammed, "Face Direction Estimation based on Mediapipe Landmarks", pp.25-26, August 2021.
    [16] Huang K. Liu X. Fu S. Guo D. & Xu M, "A Lightweight Privacy Preserving CNN Feature Extraction Framework for Mobile Sensing," IEEE Transactions on Dependable and Secure Computing, pp.1441–1455, 2019.
    [17] Jogin M. Mohana M. Madhulika M. Divya G. Meghana R. & Apoorva S, "Feature Extraction using Convolution Neural Networks(CNN) and Deep Learning," 2018 3rd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT), 2018.
    [18] JY, "時間序列分析與預測方法",15 Dec.2021, ithelp.ithome.co m.tw/article s/10272289.
    [19] "梯度消失問題", 維基百科, 15 Apr.2022, zh.wikipedia.org/zh-tw/梯度消失問题。
    [20] "Intel RealSense D435i", Intel® RealSense™ Depth Camera官方網站, 15 Apr.2022, www.intelrealsense.com/depth-camera-d435i/.
    [21] "Intel NUC", Intel官方網站,15 Jun.2022, www.intel.com.tw/content/ww
    w/tw/zh/products/details/nuc/boards.html.
    [22] 計算機視覺, "雙目立體視覺",12 Jul.2022, www.getit01.com/p20180201.
    [23] Omidyeganeh M.et al, "Yawning Detection Using Embedded Smart Cameras," in IEEE Transactions on Instrumentation and Measurement, vol. 65, no. 3, pp. 570-582, March 2016.
    [24] Li X. Hong L. Wang J. Liu X, "Fatigue driving detection model based on multi-feature fusion and semi-supervised active learning," in IET Intelligent Transport Systems, vol. 13, no. 9, pp. 1401-1409, 2019.
    [25] Kazemi V. Sullivan J, "One millisecond face alignment with an ensemble of regression trees," 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, pp. 1867-1874, 2014.
    [26] Ji Y. Wang S. Zhao Y. Wei J. Lu Y, "Fatigue State Detection Based on Multi-Index Fusion and State Recognition Network," in IEEE Access, vol. 7, pp. 64136-64147, 2019.
    [27] Mandal B. Li L. Wang G. S. Lin J, "Towards Detection of Bus Driver Fatigue Based on Robust Visual Analysis of Eye State," in IEEE Transactions on Intelligent Transportation Systems, vol. 18, no. 3, pp. 545-557, March 2017.

    下載圖示
    QR CODE