Extending unified-theory-of-reinforcement neural networks to steady-state operant behavior
Authored by J J McDowell, Olivia L Calvin
Date Published: 2016
DOI: 10.1016/j.beproc.2016.03.016
Sponsors:
No sponsors listed
Platforms:
Microsoft Visual Basic
Model Documentation:
Other Narrative
Flow charts
Mathematical description
Model Code URLs:
Model code not found
Abstract
The unified theory of reinforcement has been used to develop models of
behavior over the last 20 years (Donahoe et al., 1993). Previous
research has focused on the theory's concordance with the respondent
behavior of humans and animals. In this experiment, neural networks were
developed from the theory to extend the unified theory of reinforcement
to operant behavior on single-alternative variable-interval schedules.
This area of operant research was selected because previously developed
neural networks could be applied to it without significant alteration.
Previous research with humans and animals indicates that the pattern of
their steady-state behavior is hyperbolic when plotted against the
obtained rate of reinforcement (Herrnstein, 1970). A genetic algorithm
was used in the first part of the experiment to determine parameter
values for the neural networks, because values that were used in
previous research did not result in a hyperbolic pattern of behavior.
After finding these parameters, hyperbolic and other similar functions
were fitted to the behavior produced by the neural networks. The form of
the neural network's behavior was best described by an exponentiated
hyperbola (McDowell, 1986; McLean and White, 1983; Wearden, 1981), which
was derived from the generalized matching law (Baum, 1974). In post-hoc
analyses the addition of a baseline rate of behavior significantly
improved the fit of the exponentiated hyperbola and removed systematic
residuals. The form of this function was consistent with human and
animal behavior, but the estimated parameter values were not. (C) 2016
Elsevier B.V. All rights reserved.
Tags
selection
law
Choice
Theoretical note
Bias
Variable-interval schedules
System theory prediction
Concurrent
schedules
Herrnstein equation
Matching theory