A Causal Model Approach to Dynamic Control

Zachary J. Davis, Neil R. Bramley, Bob Rehder, Todd Gureckis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Acting effectively in the world requires learning and controlling dynamic systems, that is, systems involving feedback relations among continuous variables that vary in real time. We introduce a novel class of dynamic control environments using Ornstein-Uhlenbeck processes connected in causal Markov graphs that allow us to systematically test people's ability to learn and control various dynamic systems. We find that performance varied across a range of test environments, roughly matching with complexity defined by a set of models trained on the task (an optimal model, a deep Reinforcement Learning agent, and a PID controller). The testbed of dynamic environments and class of models introduced in this paper lay the groundwork for the systematic study of people's ability to control complex dynamic systems.

Original languageEnglish (US)
Title of host publicationProceedings of the 40th Annual Meeting of the Cognitive Science Society, CogSci 2018
PublisherThe Cognitive Science Society
Pages281-286
Number of pages6
ISBN (Electronic)9780991196784
StatePublished - 2018
Event40th Annual Meeting of the Cognitive Science Society: Changing Minds, CogSci 2018 - Madison, United States
Duration: Jul 25 2018Jul 28 2018

Publication series

NameProceedings of the 40th Annual Meeting of the Cognitive Science Society, CogSci 2018

Conference

Conference40th Annual Meeting of the Cognitive Science Society: Changing Minds, CogSci 2018
Country/TerritoryUnited States
CityMadison
Period7/25/187/28/18

Keywords

  • causal learning
  • control theory
  • dynamic control
  • dynamic decision making
  • reinforcement learning

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Human-Computer Interaction
  • Cognitive Neuroscience

Fingerprint

Dive into the research topics of 'A Causal Model Approach to Dynamic Control'. Together they form a unique fingerprint.

Cite this