Knightcap: a chess program that learns by combining td (lambda) with game-tree search

J Baxter, A Tridgell, L Weaver - arXiv preprint cs/9901002, 1999 - arxiv.org
In this paper we present TDLeaf (lambda), a variation on the TD (lambda) algorithm that
enables it to be used in conjunction with game-tree search. We present some experiments in
which our chess program``KnightCap''used TDLeaf (lambda) to learn its evaluation function
while playing on the Free Internet Chess Server (FICS, fics. onenet. net). The main success
we report is that KnightCap improved from a 1650 rating to a 2150 rating in just 308 games
and 3 days of play. As a reference, a rating of 1650 corresponds to about level B human play …

[引用][C] KnightCap: A chess program that learns by combining TD (lambda) with game-tree search. arXiv 1999

J Baxter, A Tridgell, L Weaver - arXiv preprint cs/9901002
以上显示的是最相近的搜索结果。 查看全部搜索结果