研究生: |
宋奕泓 Sung, Yi-Hung |
---|---|
論文名稱: |
基於非監督式生成對抗網路及對比學習之水下影像品質回復 Unsupervised Generative Adversarial Network and Contrastive Learning for Underwater Image Restoration |
指導教授: |
康立威
Kang, Li-Wei |
口試委員: |
蔣欣翰
JIANG, SIN-HAN 李曉祺 LI, XIAO-QI 康立威 Kang, Li-Wei |
口試日期: | 2023/01/10 |
學位類別: |
碩士 Master |
系所名稱: |
電機工程學系 Department of Electrical Engineering |
論文出版年: | 2023 |
畢業學年度: | 111 |
語文別: | 中文 |
論文頁數: | 44 |
中文關鍵詞: | 水下影像回復 、生成對抗網路 、非監督式學習 、深度學習 、卷積神經網路 |
英文關鍵詞: | Underwater image restoration, generative adversarial networks, unsupervised learning, deep learning, convolutional neural networks |
研究方法: | 實驗設計法 |
DOI URL: | http://doi.org/10.6345/NTNU202300255 |
論文種類: | 學術論文 |
相關次數: | 點閱:184 下載:43 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
近年來水下環境之相關應用的重要性與日俱增,比如:水下資源探勘及水下環境監控。這些應用往往需要由水下無人載具來擷取水下數位影像資料以供後續之資料分析及其相關應用 (例如:水下物件偵測及水下影像分類等相關應用)。然而水下影像品質受到許多環境因素影響而造成影像退化,包括光線折射、反射等等,如此可能使得基於水下影像之相關應用無法得到良好的效果。近年來,隨著深度學習技術蓬勃發展,研究者提出許多基於深度學習的模型來改善水下影像的品質。目前現有方法中,以具備成對影像資料之監督式深度學習模型為主。成對影像學習雖然能以較輕量模型得到好的影像品質回復效果,但礙於現實難以取得成對的原始水下影像及其還原之陸上影像,因此模型訓練上受到許多限制。為了解決這個限制,許多研究以人工合成之影像來建立成對之訓練影像資料集。然而,人工合成之訓練影像資料集未必能反映真實的水下影像特性。為了解決此問題,最近已有研究提出使用生成對抗網路及非成對影像資料來進行深度學習網路訓練。本論文提出一基於非成對影像資料及生成對抗網路之深度學習模型,來處理水下影像回復的問題。
本論文提出基於非成對訓練影像資料集及利用生成對抗網路架構訓練一影像領域轉換生成器將輸入之水下影像轉換為對應之陸上影像 (回復之水下影像),其中我們利用對比學習及多樣損失函數來進行網路訓練。實驗結果已證實我們的方法可得到較佳的回復影像品質且優於 (或近似) 現有基於成對/非成對訓練資料之基於深度學習之水下影像回復網路。
Nowadays, underwater applications play an important role in many fields, which increase the requirements of high-quality underwater images. However, underwater images may usually suffer from degeneration due to attenuation, color distortion, and noise from artificial lighting sources of imaging devices. To cope with the problem, single underwater image restoration has been popular recently. On the other hand, with the recently rapid development of deep learning techniques, several deep learning-based single underwater image restoration frameworks have been presented. Most of them rely on end-to-end supervised deep networks trained by synthesized paired image datasets, which may not fit real underwater image characteristics.
To solve the problem that real paired underwater training image data are hard to collect, without needing paired training images, a novel unsupervised GAN (generative adversarial network)-based deep learning framework for single underwater image restoration is proposed in this thesis. To train the presented deep model, we introduce contrastive learning with perceptual, style loss, and other types of loss functions in our GAN-based structure to learn an image generator for translating underwater images to the corresponding in-air images. Extensive experiments have shown that the proposed method outperforms (or is comparable with) the state-of-the-art deep learning-based methods relying on paired/unpaired training data quantitatively and qualitatively.
Chen. X, Zhang. P, Quan. L, et al, “Underwater Image Enhancement based on Deep Learning and Image Formation Model,” arXiv:2101.00991v2, 2021.
C. Ancuti, C. O. Ancuti, T. Haber, and P. Bekaert, “Enhancing underwater images and videos by fusion,” in 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012, pp. 81–88.
C. J. Prabhakar and P. U. P. Kumar, “Underwater image denoising using adaptive wavelet subband thresholding,” in 2010 International Conference on Signal and Image Processing, 2010, pp. 322–327.
H. Y. Yang, P. Y. Chen, and C. C. Huang, ‘‘Low complexity underwater image enhancement based on dark channel prior,’’ in Proc. 2nd Int. Conf. Innov. Bio-Inspired Comput. Appl., pp. 17–20, Dec. 2011.
Chongyi Li, Saeed Anwar, Fatih Porikli, "Underwater Scene Prior Inspired Deep Underwater Image and Video Enhancement", arXiv, 2019.
Naik, Ankita Rajaram et al. “Shallow-UWnet : Compressed Model for Underwater Image Enhancement.” AAAI Conference on Artificial Intelligence ,Vol. 35. pp. 15853–15854, 2021.
Y. Guo, H. Li and P. Zhuang, "Underwater Image Enhancement Using a Multiscale Dense Generative Adversarial Network," in IEEE Journal of Oceanic Engineering,vol.45,no.3, pp. 862-870, July 2020.
Lin, J.-C.; Hsu, C.-B.; Lee, J.-C.; Chen, C.-H.; Tu, T.-M. Dilated Generative Adversarial Networks for Underwater Image Restoration. J. Mar. Sci. Eng. 2022.
Islam, Md. Jahidul et al. “Fast Underwater Image Enhancement for Improved Visual Perception.” IEEE Robotics and Automation Letters, pp. 3227-3234, 2020.
Du, R.; Li, W.; Chen, S.; Li, C.; Zhang, Y. Unpaired Underwater Image Enhancement Based on CycleGAN. Information 2022, 13, 1.
Taesung Park, Alexei A Efros, Richard Zhang, and Jun-Yan Zhu. Contrastive learning for unpaired image-to-image translation. In European Conference on Computer Vision, 2020.
K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition,"IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, pp. 770-778,2016.
Karras, Tero & Laine, Samuli & Aittala, Miika & Hellsten, Janne & Lehtinen, Jaakko & Aila, Timo. "Analyzing and Improving the Image Quality of StyleGAN", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.8107-8116, 2020.
Isola, P., Zhu, J. Y., Zhou, T., & Efros, A. A. Image-to-image translation with conditional adversarial networks. IEEE Conference on Computer Vision and
Pattern Recognition (CVPR), pp. 5967-5976, 2017.
Mao, X., Li, Q., Xie, H., Lau, R. Y., Wang, Z., & Paul Smolley, S. "Least squares generative adversarial networks," IEEE international conference on computer vision, pp. 2794-2802, 2017.
J. Han et al., "Single Underwater Image Restoration by Contrastive Learning," IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, pp. 2385-2388, 2021.
Johnson, J., Alahi, A., Fei-Fei, L. (2016). Perceptual Losses for Real-Time Style Transfer and Super-Resolution. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds) Computer Vision – ECCV, 2016.
Kingma, Diederik & Ba, Jimmy. Adam: A Method for Stochastic Optimization. International Conference on Learning Representations, 2014.
Islam, Md Jahidul & Luo, Peigen & Sattar, Junaed."Simultaneous Enhancement and Super-Resolution of Underwater Imagery for Improved Visual Perception", RSS, 2020.
C. Li et al., "An Underwater Image Enhancement Benchmark Dataset and Beyond," in IEEE Transactions on Image Processing, vol. 29, pp. 4376-4389, 2020
Wang, Z., Bovik, A. C., Sheikh, H. R., & Simoncelli, E. P. "Image quality assessment: from error visibility to structural similarity", IEEE transactions on image processing, pp. 600-612, 2004.
J. -Y. Zhu, T. Park, P. Isola and A. A. Efros, "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks," 2017 IEEE International Conference on Computer Vision (ICCV), 2017