國立臺灣師範大學博碩士論文全文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	陳均泓 Chen, Jiun-Hong
論文名稱：	利用強化特定牌型得分方法改良麻將程式 Enhancing the Scores of Specific Hand Types to Improve the Mahjong Program
指導教授：	林順喜 Lin, Shun-Shii
口試委員：	吳毅成 Wu, I-Chen 顏士淨 Yen, Shi-Jim 陳志昌 Chen, Jr-Chang 周信宏 Chou, Hsin-Hung 張紘睿 Chang, Hung-Jui 林順喜 Lin, Shun-Shii
口試日期：	2022/08/03
學位類別：	碩士 Master
系所名稱：	資訊工程學系 Department of Computer Science and Information Engineering
論文出版年：	2022
畢業學年度：	110
語文別：	中文
論文頁數：	47
中文關鍵詞：	麻將、不完全資訊賽局、規則導向、台分
英文關鍵詞：	Mahjong, Imperfect Information Game, Rule-Based, Faan
研究方法:	實驗設計法
DOI URL：	http://doi.org/10.6345/NTNU202201333
論文種類：	學術論文
相關次數：	點閱：354 下載：22
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

近年來隨著人工智慧在電腦對局領域蓬勃發展，電腦對局領域的程式強度已經有了明顯成長。麻將為一個多玩家、機率性且不完全資訊的遊戲，由於麻將遊戲隨機且有限資訊的特性，也增加了遊戲的複雜度及困難度。
本篇論文將會針對臺灣麻將遊戲的規則，並且參考先前的相關研究，針對麻將程式進行改良。本篇論文將延續「利用棄牌資訊強化策略改良麻將程式」論文，使用規則導向與進胡數計算為主要的程式架構，並且針對原程式中的缺點，提出相對應的演算法進行改良。
本論文將會依照進攻與防守兩個方面進行改良。進攻方面以原先的進胡數計算為主要架構達到快速胡牌的目標，並且進一步注重在台分的獲取，能夠獲取更多的分數。防守方面進一步降低程式放槍率，降低分數的損失。
實驗數據顯示，改良後的程式 Seofon_v2，其不同版本與原版程式 Seofon 進行對戰，皆能夠獲得超過56%的勝率。

In recent years, with the vigorous development of the field of computer games in artificial intelligence, the strength of computer games programs has been promoted dramatically. Mahjong is a multiplayer, probabilistic, imperfect information game. These characteristics increase the complexity and difficulty of Mahjong game.
This thesis will focus on the rules of Taiwan Mahjong game, refer to the related research, and make improvements on the Mahjong program. We follow up on the previous thesis titled “Using the Enhancement Strategy from Discarded Tiles Information to Improve Mahjong Program”that used the rule-based approach and the computation of “deficiency number”to be the main framework. Aiming at the shortcomings of the original program, some corresponding algorithms are proposed for improvement.
The efforts are made in two aspects, offensive and defensive. For the offensive objective, it not only keeps the original framework of computing the“deficiency number”in order to achieve the goal of a quick win, but also wants to get more Tai (equivalent to Faan) scoring units if it is possible. For the defensive objective, it will furthermore focus on avoiding discarding a tile that will make an opponent win the game. It means that the program will try to decrease the loss of scores.
The experimental results show that our proposed algorithms implemented on the program Seofon_v2 have more than 56% win rate against the original program Seofon.

致謝 i
摘要 ii
ABSTRACT iii
目錄 iv
圖目錄 vi
表目錄 viii
一、 緒論 1
1.1 研究背景 1
1.2 研究目的 3
二、 文獻探討 5
2.1 名詞解釋與比賽規則 5
2.2 文獻回顧與相關程式 12
三、 演算法設計與程式改良 15
3.1 字牌棄牌優先度 17
3.2 強化獲取特殊牌型台分 21
3.3 槓牌機制 26
3.4 跟打機制 27
四、 實驗結果與比賽結果	31
4.1 實驗環境設置 32
4.2 字牌棄牌優先度版本 33
4.3 進攻模組版本 34
4.4 跟打機制版本 36
4.5 比賽結果 38
五、 結論與未來展望 43
參考文獻 46
                                

[1] D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. V. D. Driessche,
J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe,
J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel, and D. Hassabis, Mastering the Game of Go with Deep Neural Networks and Tree Search, Nature, vol. 529, no. 7587, pp. 484–489, 2016.
[2] D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, Y. Chen, T. Lillicrap, F. Hui, L. Sifre, G. V. D. Driessche,
T. Graepel, and D. Hassabis, Mastering the Game of Go without Human Knowledge, Nature, vol. 550, no. 7676, pp. 354–359, 2017.
[3] D. Silver, T. Hubert, J. Schrittwieser, I. Antonoglou, M. Lai, A. Guez, M. Lanctot,
L. Sifre, D. Kumaran, T. Graepel, T. Lillicrap, K. Simonyan and D. Hassabis, Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm, arXiv:1712.01815v1.
[4] S. Nair, Alpha Zero General, https://github.com/suragnair/alpha-zero-general.
[5] J. Li, S. Koyamada, Suphx: Mastering Mahjong with Deep Reinforcement Learning, arXiv:2003.13590, 2020.
[6] Cycgame麻將平台， https://cycgame.com/cyc/。
[7] S. Li and X. Yan, Let's Play Mahjong!, Computing Research Repository
(CoRR abs/1903.03294), 2019.
[8] 林猷琛，利用他家資訊模組來改良麻將程式，2020，國立台灣師範大學資工所碩士論文。
[9] 林宗翰，利用棄牌資訊強化策略改良麻將程式，2021，國立台灣師範大學資工所碩士論文。
[10] 陳新颺，電腦麻將程式 ThousandWind 的設計與實作，2013，國立台灣師範大學資工所碩士論文。
[11] 吳俊緯，電腦麻將程式 MahJongDaXia 的設計與實作，2015，國立台灣師範大學資工所碩士論文。
[12] 莊立楷，麻將人工智慧之研究，2015，國立交通大學資訊科學與工程研究所碩士論文。

簡易檢索 / 詳目顯示

相關論文