


Proofs of the equivalence between the mathematics of RescorlaWagner learning and the mathematics of perceptron learning have a long history (Gluck & Bower, 1988; Sutton & Barto, 1981). One would imagine that the existence of proofs of the computational equivalence between RescorlaWagner learning and perceptron learning would mean that perceptrons would not be able to provide any new insights into classical conditioning. However, this is not correct. Dawson (2008) has shown that if one puts aside the formal comparison of the two types of learning, and uses perceptrons to simulate a wide variety of different classical conditioning paradigms, then some puzzling results occur. One the one hand, perceptrons produce the same results as the RescorlaWagner model for many different paradigms. Given the formal equivalence between the two types of learning, this is not surprising. On the other hand, for some paradigms, perceptrons generate different results than those predicted from the RescorlaWagner model (Dawson, 2008, Chapter 7). If the two types of learning are formally equivalent, then how is it possible for such differences to occur? This is what Dawson (2008) called the perceptron paradox.
Dawson (2008) used the perceptron paradox to motivate a more detailed comparison between RescorlaWagner and perceptron learning. He found that while these two models of learning were equivalent at the computational level of investigation, there were crucial differences between them at the algorithmic level. In order to train a perceptron, the network must first behave (respond to an input pattern) in order for error to be computed to determine weight changes. In contrast, Dawson showed that the RescorlaWagner model defines learning in such a way that behavior is not required!
References:
 Dawson, M. R. W. (2008). Connectionism and classical conditioning. Comparative Cognition and Behavior Reviews, 3 (Monograph), 1115.
 Gluck, M. A., & Bower, G. H. (1988). From conditioning to category learning  an adaptive network model. Journal of Experimental PsychologyGeneral, 117(3), 227247.
 Sutton, R. S., & Barto, A. G. (1981). Toward a modern theory of adaptive networks: Expectation and prediction. Psychological Review, 88(2), 135170.
(Added April 2011)



