Rapid on-line temporal sequence prediction by an adaptive agent

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Scopus citations

Abstract

Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning agent operating in a non-stationary environment. The challenge for a learning agent under these conditions is to exploit the relevant experience from a limited environmental event history while preserving flexibility. We propose a novel time- and space-efficient method for learning temporal sequences and making short-term predictions. Our method operates on-line, requires few exemplars, and adapts easily and quickly to changes in the underlying stochastic world model. Using a short-term memory of recent observations, the method maintains a dynamic space of candidate hypotheses, whose growth is systematically pruned using an entropy measure over the observed predictive quality of each candidate hypothesis. The method compares well against Markov-chain predictions, and adapts faster than learned Markov-chain models to changes in the underlying distribution of events. We demonstrate the method using both synthetic data and empirical experience from a game-playing scenario with human opponents.
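The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea it describes: a short-term memory of recent events, a set of candidate hypotheses keyed by recent context, and entropy-based pruning. The class name `EntropyPrunedPredictor`, the parameters `max_order`, `max_entropy`, and `min_count`, and the choice of context suffixes as hypotheses are all assumptions made for illustration, not the authors' method. The sketch also omits any forgetting mechanism, so it does not reproduce the rapid adaptation to non-stationary environments that the paper reports.

```python
import math
from collections import Counter, defaultdict, deque


class EntropyPrunedPredictor:
    """Illustrative on-line predictor (hypothetical, not the paper's algorithm)."""

    def __init__(self, max_order=3, max_entropy=1.0, min_count=4):
        self.max_entropy = max_entropy  # assumed pruning threshold, in bits
        self.min_count = min_count      # assumed evidence needed before pruning
        # Short-term memory holding the most recent observations.
        self.memory = deque(maxlen=max_order)
        # Candidate hypotheses: context tuple -> counts of the events that followed it.
        self.hypotheses = defaultdict(Counter)

    @staticmethod
    def entropy(counts):
        """Shannon entropy (bits) of the next-event distribution in `counts`."""
        total = sum(counts.values())
        return -sum((c / total) * math.log2(c / total) for c in counts.values())

    def _contexts(self):
        """Suffixes of the short-term memory, longest first, ending with ()."""
        recent = tuple(self.memory)
        return [recent[i:] for i in range(len(recent) + 1)]

    def observe(self, event):
        """Update every active context with the new event, then prune."""
        for ctx in self._contexts():
            counts = self.hypotheses[ctx]
            counts[event] += 1
            # Drop a non-empty context once it has enough evidence and its
            # predictions are still too uncertain. It may be re-created later.
            if (ctx and sum(counts.values()) >= self.min_count
                    and self.entropy(counts) > self.max_entropy):
                del self.hypotheses[ctx]
        self.memory.append(event)

    def predict(self):
        """Return the likeliest next event under the lowest-entropy context."""
        best = None
        for ctx in self._contexts():
            counts = self.hypotheses.get(ctx)
            if not counts:
                continue
            h = self.entropy(counts)
            # Strict comparison keeps the longest context on ties.
            if best is None or h < best[0]:
                best = (h, counts)
        return None if best is None else best[1].most_common(1)[0][0]
```

As a usage example, feeding the repeating sequence `"abcabcabcabc"` one event at a time through `observe` and then calling `predict()` yields `"a"`, since every context ending in `c` has so far been followed only by `a`.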

Original language: English (US)
Title of host publication: Proceedings of the 4th International Conference on Autonomous Agents and Multi agent Systems, AAMAS 05
Editors: F. Dignum, V. Dignum, S. Koenig, S. Kraus, M. Pechoucek, M. Singh, D. Steiner, S. Thompson, M. Wooldridge
Pages: 217-223
Number of pages: 7
State: Published - Dec 1 2005
Event: 4th International Conference on Autonomous Agents and Multi agent Systems, AAMAS 05 - Utrecht, Netherlands
Duration: Jul 25 2005 to Jul 29 2005

Other

Other: 4th International Conference on Autonomous Agents and Multi agent Systems, AAMAS 05
Country/Territory: Netherlands
City: Utrecht
Period: 7/25/05 to 7/29/05

Keywords

  • Markov Decision Process
  • N-gram
  • Rapid Learning
  • Sequence prediction
