TD Methods Applied to Mixture of Experts for Learning 9 X 9 Go Evaluation Function

Raonak Zaman
Donald C. Wunsch, Missouri University of Science and TechnologyFollow

Abstract

The temporal difference (TD) method is applied on a committee of neural network experts to learn the board evaluation function for the Oriental board game Go. The game has simple rules but requires complex strategies to play well, and, the conventional tree search algorithm for computer games make poor Go program. Thus, the game Go is an ideal problem domain for exploring machine learning algorithms. Here, the neural networks learned a board evaluation function for Go played on 9 x 9 board sizes. Two learning algorithms, e.g., hybrid mixture of experts (HME) and Meta-Pi, are used to train the neural network experts. Both algorithms learned good Go evaluation functions and the neural network based Go engines were able to defeat a public domain rule-based program more than 50% of the times. The performances of the mixture networks are compared with that of a single feedforward network trained similarly.

Recommended Citation

R. Zaman and D. C. Wunsch, "TD Methods Applied to Mixture of Experts for Learning 9 X 9 Go Evaluation Function," Proceedings of the International Joint Conference on Neural Networks, vol. 6, pp. 3734 - 3739, Institute of Electrical and Electronics Engineers (IEEE), Jan 1999.

The definitive version is available at https://doi.org/10.1109/IJCNN.1999.830746

Meeting Name

International Joint Conference on Neural Networks (IJCNN'99) (1999: Jul. 10-16, Washington, DC)

Department(s)

Electrical and Computer Engineering

International Standard Serial Number (ISSN)

1098-7576

Document Type

Article - Conference proceedings

Document Version

Citation

File Type

text

Language(s)

English

Rights

Publication Date

01 Jan 1999

Electrical and Computer Engineering Faculty Research & Creative Works

TD Methods Applied to Mixture of Experts for Learning 9 X 9 Go Evaluation Function

Abstract

Recommended Citation

Meeting Name

Department(s)

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations

Electrical and Computer Engineering Faculty Research & Creative Works

TD Methods Applied to Mixture of Experts for Learning 9 X 9 Go Evaluation Function

Author

Abstract

Recommended Citation

Meeting Name

Department(s)

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Share

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations