Introduction of TD-Gammon System
In 1992, the TD-Gammon system was developed by Gerald Tesauro at IBM, marking a pioneering effort in the application of reinforcement learning to the game of backgammon. This system utilized temporal difference learning to improve its gameplay through self-play and experience.