Jay McClelland

Lucie Stern Professor in the Social Sciences, Professor of Psychology and, by courtesy, of Linguistics and of Computer Science

Academic Appointments

Professor, Psychology
Professor (By courtesy), Linguistics
Professor (By courtesy), Computer Science
Member, Bio-X
Faculty Affiliate, Institute for Human-Centered Artificial Intelligence (HAI)
Member, Wu Tsai Human Performance Alliance
Member, Wu Tsai Neurosciences Institute

Administrative Appointments

Professor, Department of Psychology (2006 - Present)
Director, Center for Mind, Brain, Computation and Technology (2006 - Present)

Honors & Awards

Distinguished Scientific Contribution Award, American Psychological Association (1996)
Member, National Academy of Sciences (2001-)

Program Affiliations

Symbolic Systems Program

Professional Education

Ph. D., University of Pennsylvania, Cognitive Psychology (1975)

Contact

Alternate Contact Reneé Rittler Administrative Services Manager rittler@stanford.edu
- 6507237431 (office)

Additional Info

ORCID:
https://orcid.org/0000-0002-8217-405X

Current Research and Scholarly Interests

My research addresses topics in perception and decision making; learning and memory; language and reading; semantic cognition; and cognitive development. I view cognition as emerging from distributed processing activity of neural populations, with learning occurring through the adaptation of connections among neurons. A new focus of research in the laboratory is mathematical cognition and reasoning in humans and contemporary AI systems based on neural networks.

Please visit my web page for more information.

2025-26 Courses

Neural Network Models of Cognition
PSYCH 209 (Win)
Independent Studies (18)
- Advanced Reading and Research
  CS 499 (Aut, Win, Spr, Sum)
- Advanced Reading and Research
  CS 499P (Aut, Win, Spr, Sum)
- Curricular Practical Training
  CS 390B (Sum)
- Directed Reading in Neurosciences
  NEPR 299 (Aut, Win, Spr, Sum)
- Graduate Research
  NEPR 399 (Aut, Win, Spr, Sum)
- Graduate Research
  PSYCH 275 (Aut, Win, Spr, Sum)
- Independent Study
  SYMSYS 196 (Aut, Win, Spr, Sum)
- Independent Study
  SYMSYS 296 (Aut, Win, Spr, Sum)
- Individually Supervised Practicum
  PSYCH 299 (Win, Spr, Sum)
- Master's Degree Project
  SYMSYS 290 (Aut, Win, Spr, Sum)
- Ph.D. Research
  CME 400 (Sum)
- Practicum in Teaching
  PSYCH 281 (Aut, Win, Spr, Sum)
- Reading and Special Work
  PSYCH 194 (Aut, Win, Spr, Sum)
- Senior Honors Tutorial
  SYMSYS 190 (Aut, Win, Spr, Sum)
- Senior Project
  CS 191 (Aut, Win, Spr)
- Special Laboratory Projects
  PSYCH 195 (Aut, Win, Spr, Sum)
- Supervised Undergraduate Research
  CS 195 (Aut, Win, Spr, Sum)
- Writing Intensive Senior Research Project
  CS 191W (Aut, Win, Spr)
Prior Year Courses
2024-25 Courses
- Foundations of Cognition
  PSYCH 205 (Spr)
2023-24 Courses
- Neural Network Models of Cognition
  PSYCH 209 (Win)
2022-23 Courses
- Foundations of Cognition
  PSYCH 205 (Spr)
- Neural Network Models of Cognition
  PSYCH 209 (Win)

Stanford Advisees

Doctoral Dissertation Reader (AC)
Joshua Ryu
Doctoral Dissertation Advisor (AC)
Satchel Grant, Sabrina Jones
Master's Program Advisor
Amrita Malhotra, Nicolo di Borgoricco
Doctoral (Program)
Satchel Grant, Jerome Han, Violet Xiang

Graduate and Fellowship Programs

Neurosciences (Phd Program)

All Publications

Reflections on David E. Rumelhart and the Rumelhart Prize TOPICS IN COGNITIVE SCIENCE McClelland, J. L. 2025

Abstract

The 25th anniversary of the Rumelhart Prize in Cognitive Science and a special issue of Topics in Cognitive Science celebrating the achievements of two recent Rumelhart Prize recipients provides an opportunity to reflect on the prize, the scientists that it honors, and the scientific values it seeks to promote. I offer my perspective on these topics as a long-time member of the Cognitive Science Society, a collaborator and friend of David Rumelhart, and as the first chair of the Rumelhart Prize selection committee. I see the prize as celebrating several aspects of what I believe many cognitive scientists aspire to achieve. We seek to make contributions to our understanding of our unique human ability to make sense of the world and of each other. We seek to employ a wide range of tools and methods, as well as insights from a wide range of perspectives. We seek to engage with our colleagues and our students, to create community, and even to have fun while we pursue our scientific goals. The careers of Dave Rumelhart and of the two Rumelhart Prize Winners celebrated in this special issue all richly exemplify these traits.

View details for DOI 10.1111/tops.70016

View details for Web of Science ID 001528865800001

View details for PubMedID 40663709
Grounding mathematics in an integrated conceptual structure, part I: experimental evidence that grounded rules support transfer that formal rules do not. Frontiers in psychology Mickey, K. W., McClelland, J. L. 2025; 16: 1507670

Abstract

Mathematics relies on formal systems of rules that can be treated in isolation or grounded in a conceptual system that provides meaning for the relationships the rules express. Here, we show how the conceptual system provided by the unit circle, a visuospatial structure that provides a meaning for formal expressions in the domain of trigonometry, supports a generalizable understanding of trigonometric relationships, allowing for transfer beyond relationships explicitly taught. We examined the utility of the unit circle in our first study, in which we presented trigonometric identity problems to undergraduates (N = 50) who had prior coursework in pre-calculus trigonometry. Students reported using the unit circle to solve these problems more often than other approaches, and those who reported using the circle solved more problems correctly. Using other students from the same population, we then manipulated the systems they used by presenting a refresher lesson, using either formal rules or rules grounded in relationships on the unit circle (N = 35 in each group). Students in both conditions improved on taught problems, but only students in the grounded condition showed improvement on held-out transfer problems. Using findings from a third study further exploring the grounded condition (N = 64 participants), we found evidence that the circle supported transfer in two ways: by providing a procedure that could be used to solve both taught and transfer problems without rules and by allowing students to appreciate rules as capturing relationships between meaningful quantities, facilitating their application and extension. This project served as the starting place for the development of a curriculum that supports reliance on the unit circle and led to robust learning and retention of trigonometric relationships for most students with sufficient relevant prior knowledge, as described in Part II of this article.

View details for DOI 10.3389/fpsyg.2025.1507670

View details for PubMedID 40528852
Language models, like humans, show content effects on reasoning tasks. PNAS nexus Lampinen, A. K., Dasgupta, I., Chan, S. C., Sheahan, H. R., Creswell, A., Kumaran, D., McClelland, J. L., Hill, F. 2024; 3 (7): pgae233

Abstract

reasoning is a key ability for an intelligent system. Large language models (LMs) achieve above-chance performance on abstract reasoning tasks but exhibit many imperfections. However, human abstract reasoning is also imperfect. Human reasoning is affected by our real-world knowledge and beliefs, and shows notable "content effects"; humans reason more reliably when the semantic content of a problem supports the correct logical inferences. These content-entangled reasoning patterns are central to debates about the fundamental nature of human intelligence. Here, we investigate whether language models-whose prior expectations capture some aspects of human knowledge-similarly mix content into their answers to logic problems. We explored this question across three logical reasoning tasks: natural language inference, judging the logical validity of syllogisms, and the Wason selection task. We evaluate state of the art LMs, as well as humans, and find that the LMs reflect many of the same qualitative human patterns on these tasks-like humans, models answer more accurately when the semantic content of a task supports the logical inferences. These parallels are reflected in accuracy patterns, and in some lower-level features like the relationship between LM confidence over possible answers and human response times. However, in some cases the humans and models behave differently-particularly on the Wason task, where humans perform much worse than large models, and exhibit a distinct error pattern. Our findings have implications for understanding possible contributors to these human cognitive effects, as well as the factors that influence language model performance.

View details for DOI 10.1093/pnasnexus/pgae233

View details for PubMedID 39015546

View details for PubMedCentralID PMC11250216
Systematic Human Learning and Generalization From a Brief Tutorial With Explanatory Feedback. Open mind : discoveries in cognitive science Nam, A. J., McClelland, J. L. 2024; 8: 148-176

Abstract

We investigate human adults' ability to learn an abstract reasoning task quickly and to generalize outside of the range of training examples. Using a task based on a solution strategy in Sudoku, we provide Sudoku-naive participants with a brief instructional tutorial with explanatory feedback using a narrow range of training examples. We find that most participants who master the task do so within 10 practice trials and generalize well to puzzles outside of the training range. We also find that most of those who master the task can describe a valid solution strategy, and such participants perform better on transfer puzzles than those whose strategy descriptions are vague or incomplete. Interestingly, fewer than half of our human participants were successful in acquiring a valid solution strategy, and this ability was associated with completion of high school algebra and geometry. We consider the implications of these findings for understanding human systematic reasoning, as well as the challenges these findings pose for building computational models that capture all aspects of our findings, and we point toward a role for learning from instructions and explanations to support rapid learning and generalization.

View details for DOI 10.1162/opmi_a_00123

View details for PubMedID 38435707

View details for PubMedCentralID PMC10898786
Causal interventions expose implicit situation models for commonsense language understanding Yamakoshi, T., McClelland, J. L., Goldberg, A. E., Hawkins, R. D. edited by Boyd-Graber, J., Okazaki, N., Rogers, A. ASSOC COMPUTATIONAL LINGUISTICS-ACL. 2023: 13265-13293

View details for Web of Science ID 001379548606014
Capturing advanced human cognitive abilities with deep neural networks TRENDS IN COGNITIVE SCIENCES McClelland, J. L. 2022; 26 (12): 1047-1050

Abstract

How can artificial neural networks capture the advanced cognitive abilities of pioneering scientists? I suggest they must learn to exploit human-invented tools of thought and human-like ways of using them, and must engage in explicit goal-directed problem solving as exemplified in the activities of scientists and mathematicians and taught in advanced educational settings.

View details for DOI 10.1016/j.tics.2022.09.018

View details for Web of Science ID 000897495100013

View details for PubMedID 36335015
A weighted constraint satisfaction approach to human goal-directed decision making PLOS COMPUTATIONAL BIOLOGY Li, Y., McClelland, J. L. 2022; 18 (6): e1009553

Abstract

When we plan for long-range goals, proximal information cannot be exploited in a blindly myopic way, as relevant future information must also be considered. But when a subgoal must be resolved first, irrelevant future information should not interfere with the processing of more proximal, subgoal-relevant information. We explore the idea that decision making in both situations relies on the flexible modulation of the degree to which different pieces of information under consideration are weighted, rather than explicitly decomposing a problem into smaller parts and solving each part independently. We asked participants to find the shortest goal-reaching paths in mazes and modeled their initial path choices as a noisy, weighted information integration process. In a base task where choosing the optimal initial path required weighting starting-point and goal-proximal factors equally, participants did take both constraints into account, with participants who made more accurate choices tending to exhibit more balanced weighting. The base task was then embedded as an initial subtask in a larger maze, where the same two factors constrained the optimal path to a subgoal, and the final goal position was irrelevant to the initial path choice. In this more complex task, participants' choices reflected predominant consideration of the subgoal-relevant constraints, but also some influence of the initially-irrelevant final goal. More accurate participants placed much less weight on the optimality-irrelevant goal and again tended to weight the two initially-relevant constraints more equally. These findings suggest that humans may rely on a graded, task-sensitive weighting of multiple constraints to generate approximately optimal decision outcomes in both hierarchical and non-hierarchical goal-directed tasks.

View details for DOI 10.1371/journal.pcbi.1009553

View details for Web of Science ID 000829645400007

View details for PubMedID 35709299

View details for PubMedCentralID PMC9255770
Data Distributional Properties Drive Emergent In-Context Learning in Transformers Chan, S. C. Y., Santoro, A., Lampinen, A. K., Wang, J. X., Singh, A. K., Richemond, P. H., McClelland, J. L. edited by Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K., Oh, A. NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). 2022

View details for Web of Science ID 001213927502022
Placing language in an integrated understanding system: Next steps toward human-level performance in neural language models PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA McClelland, J. L., Hill, F., Rudolph, M., Baldridge, J., Schutze, H. 2020; 117 (42): 25966-25974

Abstract

Language is crucial for human intelligence, but what exactly is its role? We take language to be a part of a system for understanding and communicating about situations. In humans, these abilities emerge gradually from experience and depend on domain-general principles of biological neural networks: connection-based learning, distributed representation, and context-sensitive, mutual constraint satisfaction-based processing. Current artificial language processing systems rely on the same domain general principles, embodied in artificial neural networks. Indeed, recent progress in this field depends on query-based attention, which extends the ability of these systems to exploit context and has contributed to remarkable breakthroughs. Nevertheless, most current models focus exclusively on language-internal tasks, limiting their ability to perform tasks that depend on understanding situations. These systems also lack memory for the contents of prior situations outside of a fixed contextual span. We describe the organization of the brain's distributed understanding system, which includes a fast learning system that addresses the memory problem. We sketch a framework for future models of understanding drawing equally on cognitive neuroscience and artificial intelligence and exploiting query-based attention. We highlight relevant current directions and consider further developments needed to fully capture human-level language understanding in a computational system.

View details for DOI 10.1073/pnas.1910416117

View details for Web of Science ID 000580597300005

View details for PubMedID 32989131

View details for PubMedCentralID PMC7585006
Do estimates of numerosity really adhere to Weber's law? A reexamination of two case studies PSYCHONOMIC BULLETIN & REVIEW Testolin, A., McClelland, J. L. 2021; 28 (1): 158-168

Abstract

Both humans and nonhuman animals can exhibit sensitivity to the approximate number of items in a visual array or events in a sequence, and across various paradigms, uncertainty in numerosity judgments increases with the number estimated or produced. The pattern of increase is usually described as exhibiting approximate adherence to Weber's law, such that uncertainty increases proportionally to the mean estimate, resulting in a constant coefficient of variation. Such a pattern has been proposed to be a signature characteristic of an innate "number sense." We reexamine published behavioral data from two studies that have been cited as prototypical evidence of adherence to Weber's law and observe that in both cases variability increases less than this account would predict, as indicated by a decreasing coefficient of variation with an increase in number. We also consider evidence from numerosity discrimination studies that show deviations from the constant coefficient of variation pattern. Though behavioral data can sometimes exhibit approximate adherence to Weber's law, our findings suggest that such adherence is not a fixed characteristic of the mechanisms whereby humans and animals estimate numerosity. We suggest instead that the observed pattern of increase in variability with number depends on the circumstances of the task and stimuli, and reflects an adaptive ensemble of mechanisms composed to optimize performance under these circumstances.

View details for DOI 10.3758/s13423-020-01801-z

View details for Web of Science ID 000570839700001

View details for PubMedID 32949010

View details for PubMedCentralID PMC7870758
Integration of new information in memory: new insights from a complementary learning systems perspective. Philosophical transactions of the Royal Society of London. Series B, Biological sciences McClelland, J. L., McNaughton, B. L., Lampinen, A. K. 2020; 375 (1799): 20190637

Abstract

According to complementary learning systems theory, integrating new memories into the neocortex of the brain without interfering with what is already known depends on a gradual learning process, interleaving new items with previously learned items. However, empirical studies show that information consistent with prior knowledge can sometimes be integrated very quickly. We use artificial neural networks with properties like those we attribute to the neocortex to develop an understanding of the role of consistency with prior knowledge in putatively neocortex-like learning systems, providing new insights into when integration will be fast or slow and how integration might be made more efficient when the items to be learned are hierarchically structured. The work relies on deep linear networks that capture the qualitative aspects of the learning dynamics of the more complex nonlinear networks used in previous work. The time course of learning in these networks can be linked to the hierarchical structure in the training data, captured mathematically as a set of dimensions that correspond to the branches in the hierarchy. In this context, a new item to be learned can be characterized as having aspects that project onto previously known dimensions, and others that require adding a new branch/dimension. The projection onto the known dimensions can be learned rapidly without interleaving, but learning the new dimension requires gradual interleaved learning. When a new item only overlaps with items within one branch of a hierarchy, interleaving can focus on the previously known items within this branch, resulting in faster integration with less interleaving overall. The discussion considers how the brain might exploit these facts to make learning more efficient and highlights predictions about what aspects of new information might be hard or easy to learn. This article is part of the Theo Murphy meeting issue 'Memory reactivation: replaying events past, present and future'.

View details for DOI 10.1098/rstb.2019.0637

View details for PubMedID 32248773
Intrusions into the shadow of attention: A new take on illusory conjunctions ATTENTION PERCEPTION & PSYCHOPHYSICS Henderson, C. M., McClelland, J. L. 2020; 82 (2): 564-584

Abstract

We present new evidence about illusory conjunctions (ICs) suggesting that their current explanation requires revision. According to Feature Integration Theory (FIT; Treisman & Gelade Cognitive Psychology, 12, 97-136, 1980), focal attention to a single stimulus is required to bind its features into an integrated percept. FIT predicts that if attention is spread over multiple stimuli, features of these different stimuli can be combined into a single percept and produce ICs. Treisman and Schmidt (Cognitive Psychology, 14, 107-141, 1982) and Cohen & Ivry (Journal of Experimental Psychology: Human Perception and Performance, 15(4), 650-663, 1989) supported this prediction. In the latter study, participants viewed brief displays containing two digits and two colored letters. Digit locations were pre-cued, and participants were instructed to prioritize the digits and to spread their attention across the region encompassed by the digits. Cohen & Ivry found that reports of one letter (the 'target') produced ICs when both letters appeared between the digits. Expanding on Cohen & Ivry's paradigm, we find that both letters do not need to appear between the digits to produce ICs. While the target letter was highly susceptible to ICs if the target appeared inside the position of a nearby digit, the position of the other letter was largely irrelevant. Our experimental results also argue that these ICs were not due to mnemonic errors occurring while the digits are being reported. Based on our findings, we propose that attention to the digits casts an attentional 'shadow' projecting towards fixation, interfering with processing of target letters in that shadow and allowing color information from elsewhere in the display to be included in the resulting percept.

View details for DOI 10.3758/s13414-019-01893-3

View details for Web of Science ID 000520803200001

View details for PubMedID 32189233
Exemplar models are useful and deep neural networks overcome their limitations: A commentary on Ambridge (2020) FIRST LANGUAGE McClelland, J. L. 2020; 40 (5-6): 612-615

View details for DOI 10.1177/0142723720905765

View details for Web of Science ID 000515031100001
Numerosity discrimination in deep neural networks: Initial competence, developmental refinement and experience statistics DEVELOPMENTAL SCIENCE Testolin, A., Zou, W. Y., McClelland, J. L. 2020; 23 (5): e12940

Abstract

Both humans and non-human animals exhibit sensitivity to the approximate number of items in a visual array, as indexed by their performance in numerosity discrimination tasks, and even neonates can detect changes in numerosity. These findings are often interpreted as evidence for an innate 'number sense'. However, recent simulation work has challenged this view by showing that human-like sensitivity to numerosity can emerge in deep neural networks that build an internal model of the sensory data. This emergentist perspective posits a central role for experience in shaping our number sense and might explain why numerical acuity progressively increases over the course of development. Here we substantiate this hypothesis by introducing a progressive unsupervised deep learning algorithm, which allows us to model the development of numerical acuity through experience. We also investigate how the statistical distribution of numerical and non-numerical features in natural environments affects the emergence of numerosity representations in the computational model. Our simulations show that deep networks can exhibit numerosity sensitivity prior to any training, as well as a progressive developmental refinement that is modulated by the statistical structure of the learning environment. To validate our simulations, we offer a refinement to the quantitative characterization of the developmental patterns observed in human children. Overall, our findings suggest that it may not be necessary to assume that animals are endowed with a dedicated system for processing numerosity, since domain-general learning mechanisms can capture key characteristics others have attributed to an evolutionarily specialized number system.

View details for DOI 10.1111/desc.12940

View details for Web of Science ID 000514118800001

View details for PubMedID 31977137
Quasi-compositional mapping from form to meaning: a neural network-based approach to capturing neural responses during human language comprehension PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES Rabovsky, M., McClelland, J. L. 2020; 375 (1791): 20190313

Abstract

We argue that natural language can be usefully described as quasi-compositional and we suggest that deep learning-based neural language models bear long-term promise to capture how language conveys meaning. We also note that a successful account of human language processing should explain both the outcome of the comprehension process and the continuous internal processes underlying this performance. These points motivate our discussion of a neural network model of sentence comprehension, the Sentence Gestalt model, which we have used to account for the N400 component of the event-related brain potential (ERP), which tracks meaning processing as it happens in real time. The model, which shares features with recent deep learning-based language models, simulates N400 amplitude as the automatic update of a probabilistic representation of the situation or event described by the sentence, corresponding to a temporal difference learning signal at the level of meaning. We suggest that this process happens relatively automatically, and that sometimes a more-controlled attention-dependent process is necessary for successful comprehension, which may be reflected in the subsequent P600 ERP component. We relate this account to current deep learning models as well as classic linguistic theory, and use it to illustrate a domain general perspective on some specific linguistic operations postulated based on compositional analyses of natural language. This article is part of the theme issue 'Towards mechanistic models of meaning composition'.

View details for DOI 10.1098/rstb.2019.0313

View details for Web of Science ID 000502785400007

View details for PubMedID 31840583

View details for PubMedCentralID PMC6939354
Transforming task representations to perform novel tasks. Proceedings of the National Academy of Sciences of the United States of America Lampinen, A. K., McClelland, J. L. 2020

Abstract

An important aspect of intelligence is the ability to adapt to a novel task without any direct experience (zero shot), based on its relationship to previous tasks. Humans can exhibit this cognitive flexibility. By contrast, models that achieve superhuman performance in specific tasks often fail to adapt to even slight task alterations. To address this, we propose a general computational framework for adapting to novel tasks based on their relationship to prior tasks. We begin by learning vector representations of tasks. To adapt to new tasks, we propose metamappings, higher-order tasks that transform basic task representations. We demonstrate the effectiveness of this framework across a wide variety of tasks and computational paradigms, ranging from regression to image classification and reinforcement learning. We compare to both human adaptability and language-based approaches to zero-shot learning. Across these domains, metamapping is successful, often achieving 80 to 90% performance, without any data, on a novel task, even when the new task directly contradicts prior experience. We further show that metamapping can not only generalize to new tasks via learned relationships, but can also generalize using novel relationships unseen during training. Finally, using metamapping as a starting point can dramatically accelerate later learning on a new task and reduce learning time and cumulative error substantially. Our results provide insight into a possible computational basis of intelligent adaptability and offer a possible framework for modeling cognitive flexibility and building more flexible artificial intelligence systems.

View details for DOI 10.1073/pnas.2008852117

View details for PubMedID 33303652
Generative Continual Concept Learning Rostami, M., Kolouri, S., McClelland, J., Pilly, P., Assoc Advancement Artificial Intelligence ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. 2020: 5545-5552

View details for Web of Science ID 000667722805076
Developing the knowledge of number digits in a child-like robot NATURE MACHINE INTELLIGENCE Di Nuovo, A., McClelland, J. L. 2019; 1 (12): 594-605

View details for DOI 10.1038/s42256-019-0123-3

View details for Web of Science ID 000571267000011
A mathematical theory of semantic development in deep neural networks. Proceedings of the National Academy of Sciences of the United States of America Saxe, A. M., McClelland, J. L., Ganguli, S. 2019

Abstract

An extensive body of empirical research has revealed remarkable regularities in the acquisition, organization, deployment, and neural representation of human semantic knowledge, thereby raising a fundamental conceptual question: What are the theoretical principles governing the ability of neural networks to acquire, organize, and deploy abstract knowledge by integrating across many individual experiences? We address this question by mathematically analyzing the nonlinear dynamics of learning in deep linear networks. We find exact solutions to this learning dynamics that yield a conceptual explanation for the prevalence of many disparate phenomena in semantic cognition, including the hierarchical differentiation of concepts through rapid developmental transitions, the ubiquity of semantic illusions between such transitions, the emergence of item typicality and category coherence as factors controlling the speed of semantic processing, changing patterns of inductive projection over development, and the conservation of semantic similarity in neural representations across species. Thus, surprisingly, our simple neural model qualitatively recapitulates many diverse regularities underlying semantic development, while providing analytic insight into how the statistical structure of an environment can interact with nonlinear deep-learning dynamics to give rise to these regularities.

View details for DOI 10.1073/pnas.1820226116

View details for PubMedID 31101713
Value-based decision making: An interactive activation perspective. Psychological review Suri, G. n., Gross, J. J., McClelland, J. L. 2019

Abstract

Prominent theories of value-based decision making have assumed that choices are made via the maximization of some objective function (e.g., expected value) and that the process of decision making is serial and unfolds across modular subprocesses (e.g., perception, valuation, and action selection). However, the influence of a large number of contextual variables that are not related to expected value in any direct way and the ubiquitous reciprocity among variables thought to belong to different subprocesses suggest that these assumptions may not always hold. Here, we propose an interactive activation framework for value-based decision making that does not assume that objective function maximization is the only consideration affecting choice or that processing is modular or serial. Our framework holds that processing takes place via the interactive propagation of activation in a set of simple, interconnected processing elements. We use our framework to simulate a broad range of well-known empirical phenomena-primarily focusing on decision contexts that feature nonoptimal decision making and/or interactive (i.e., not serial or modular) processing. Our approach is constrained at Marr's (1982) algorithmic and implementational levels rather than focusing strictly on considerations of optimality at the computational theory level. It invites consideration of the possibility that choice is emergent and that its computation is distributed. (PsycINFO Database Record (c) 2019 APA, all rights reserved).

View details for DOI 10.1037/rev0000164

View details for PubMedID 31524426
Modelling the N400 brain potential as change in a probabilistic representation of meaning NATURE HUMAN BEHAVIOUR Rabovsky, M., Hansen, S. S., McClelland, J. L. 2018; 2 (9): 693–705

View details for DOI 10.1038/s41562-018-0406-4

View details for Web of Science ID 000446615600026
Modelling the N400 brain potential as change in a probabilistic representation of meaning. Nature human behaviour Rabovsky, M., Hansen, S. S., McClelland, J. L. 2018; 2 (9): 693-705

Abstract

The N400 component of the event-related brain potential has aroused much interest because it is thought to provide an online measure of meaning processing in the brain. However, the underlying process remains incompletely understood and actively debated. Here we present a computationally explicit account of this process and the emerging representation of sentence meaning. We simulate N400 amplitudes as the change induced by an incoming stimulus in an implicit and probabilistic representation of meaning captured by the hidden unit activation pattern in a neural network model of sentence comprehension, and we propose that the process underlying the N400 also drives implicit learning in the network. The model provides a unified account of 16 distinct findings from the N400 literature and connects human language comprehension with recent deep learning approaches to language processing.

View details for DOI 10.1038/s41562-018-0406-4

View details for PubMedID 31346278
Different Presentations of a Mathematical Concept Can Support Learning in Complementary Ways JOURNAL OF EDUCATIONAL PSYCHOLOGY Lampinen, A. K., McClelland, J. L. 2018; 110 (5): 664–82

View details for DOI 10.1037/edu0000235

View details for Web of Science ID 000437721500004
Concepts, Control, and Context: A Connectionist Account of Normal and Disordered Semantic Cognition PSYCHOLOGICAL REVIEW Hoffman, P., McClelland, J. L., Ralph, M. 2018; 125 (3): 293-328

Abstract

Semantic cognition requires conceptual representations shaped by verbal and nonverbal experience and executive control processes that regulate activation of knowledge to meet current situational demands. A complete model must also account for the representation of concrete and abstract words, of taxonomic and associative relationships, and for the role of context in shaping meaning. We present the first major attempt to assimilate all of these elements within a unified, implemented computational framework. Our model combines a hub-and-spoke architecture with a buffer that allows its state to be influenced by prior context. This hybrid structure integrates the view, from cognitive neuroscience, that concepts are grounded in sensory-motor representation with the view, from computational linguistics, that knowledge is shaped by patterns of lexical co-occurrence. The model successfully codes knowledge for abstract and concrete words, associative and taxonomic relationships, and the multiple meanings of homonyms, within a single representational space. Knowledge of abstract words is acquired through (a) their patterns of co-occurrence with other words and (b) acquired embodiment, whereby they become indirectly associated with the perceptual features of co-occurring concrete words. The model accounts for executive influences on semantics by including a controlled retrieval mechanism that provides top-down input to amplify weak semantic relationships. The representational and control elements of the model can be damaged independently, and the consequences of such damage closely replicate effects seen in neuropsychological patients with loss of semantic representation versus control processes. Thus, the model provides a wide-ranging and neurally plausible account of normal and impaired semantic cognition. (PsycINFO Database Record

View details for DOI 10.1037/rev0000094

View details for Web of Science ID 000431493400001

View details for PubMedID 29733663

View details for PubMedCentralID PMC5937916
Distinct Representations of Magnitude and Spatial Position within Parietal Cortex during Number-Space Mapping JOURNAL OF COGNITIVE NEUROSCIENCE Kanayet, F. J., Mattarella-Micke, A., Kohler, P. J., Norcia, A. M., McCandliss, B. D., McClelland, J. L. 2018; 30 (2): 200–218

Abstract

Mapping numbers onto space is foundational to mathematical cognition. These cognitive operations are often conceptualized in the context of a "mental number line" and involve multiple brain regions in or near the intraparietal sulcus (IPS) that have been implicated both in numeral and spatial cognition. Here we examine possible differentiation of function within these brain areas in relating numbers to spatial positions. By isolating the planning phase of a number line task and introducing spatiotopic mapping tools from fMRI into mental number line task research, we are able to focus our analysis on the neural activity of areas in anterior IPS (aIPS) previously associated with number processing and on spatiotopically organized areas in and around posterior IPS (pIPS), while participants prepare to place a number on a number line. Our results support the view that the nonpositional magnitude of a numerical symbol is coded in aIPS, whereas the position of a number in space is coded in posterior areas of IPS. By focusing on the planning phase, we are able to isolate activation related to the cognitive, rather than the sensory-motor, aspects of the task. Also, to allow the separation of spatial position from magnitude, we tested both a standard positive number line (0 to 100) and a zero-centered mixed number line (-100 to 100). We found evidence of a functional dissociation between aIPS and pIPS: Activity in aIPS was associated with a landmark distance effect not modulated by spatial position, whereas activity in pIPS revealed a contralateral preference effect.

View details for PubMedID 29040015
Bayesian statistics to test Bayes optimality. The Behavioral and brain sciences Turner, B. M., McClelland, J. L., Busemeyer, J. 2018; 41: e246

Abstract

We agree with the authors that putting forward specific models and examining their agreement with experimental data are the best approach for understanding the nature of decision making. Although the authors only consider the likelihood function, prior, cost function, and decision rule (LPCD) framework, other choices are available. Bayesian statistics can be used to estimate essential parameters and assess the degree of optimality.

View details for PubMedID 30767805
The dynamics of multimodal integration: The averaging diffusion model. Psychonomic bulletin & review Turner, B. M., Gao, J., Koenig, S., Palfy, D., L McClelland, J. 2017

Abstract

We combine extant theories of evidence accumulation and multi-modal integration to develop an integrated framework for modeling multimodal integration as a process that unfolds in real time. Many studies have formulated sensory processing as a dynamic process where noisy samples of evidence are accumulated until a decision is made. However, these studies are often limited to a single sensory modality. Studies of multimodal stimulus integration have focused on how best to combine different sources of information to elicit a judgment. These studies are often limited to a single time point, typically after the integration process has occurred. We address these limitations by combining the two approaches. Experimentally, we present data that allow us to study the time course of evidence accumulation within each of the visual and auditory domains as well as in a bimodal condition. Theoretically, we develop a new Averaging Diffusion Model in which the decision variable is the mean rather than the sum of evidence samples and use it as a base for comparing three alternative models of multimodal integration, allowing us to assess the optimality of this integration. The outcome reveals rich individual differences in multimodal integration: while some subjects' data are consistent with adaptive optimal integration, reweighting sources of evidence as their relative reliability changes during evidence integration, others exhibit patterns inconsistent with optimality.

View details for DOI 10.3758/s13423-017-1255-2

View details for PubMedID 28275990
Building on prior knowledge without building it in BEHAVIORAL AND BRAIN SCIENCES Hansen, S. S., Lampinen, A. K., Suri, G., McClelland, J. L. 2017; 40: e268

Abstract

Lake et al. propose that people rely on "start-up software," "causal models," and "intuitive theories" built using compositional representations to learn new tasks more efficiently than some deep neural network models. We highlight the many drawbacks of a commitment to compositional representations and describe our continuing effort to explore how the ability to build on prior knowledge and to learn new tasks efficiently could arise through learning in deep neural networks.

View details for PubMedID 29342701
What Learning Systems do Intelligent Agents Need? Complementary Learning Systems Theory Updated TRENDS IN COGNITIVE SCIENCES Kumaran, D., Hassabis, D., McClelland, J. L. 2016; 20 (7): 512-534

Abstract

We update complementary learning systems (CLS) theory, which holds that intelligent agents must possess two learning systems, instantiated in mammalians in neocortex and hippocampus. The first gradually acquires structured knowledge representations while the second quickly learns the specifics of individual experiences. We broaden the role of replay of hippocampal memories in the theory, noting that replay allows goal-dependent weighting of experience statistics. We also address recent challenges to the theory and extend it by showing that recurrent activation of hippocampal traces can support some forms of generalization and that neocortical learning can be rapid for information that is consistent with known structure. Finally, we note the relevance of the theory to the design of artificial intelligent agents, highlighting connections between neuroscience and machine learning.

View details for DOI 10.1016/j.tics.2016.05.004

View details for Web of Science ID 000379106100007

View details for PubMedID 27315762
Bayesian analysis of simulation-based models JOURNAL OF MATHEMATICAL PSYCHOLOGY Turner, B. M., Sederberg, P. B., McClelland, J. L. 2016; 72: 191-199

View details for DOI 10.1016/j.jmp.2014.10.001

View details for Web of Science ID 000377641300017
You shall know an object by the company it keeps: An investigation of semantic representations derived from object co-occurrence in visual scenes. Neuropsychologia Sadeghi, Z., McClelland, J. L., Hoffman, P. 2015; 76: 52-61

Abstract

An influential position in lexical semantics holds that semantic representations for words can be derived through analysis of patterns of lexical co-occurrence in large language corpora. Firth (1957) famously summarised this principle as "you shall know a word by the company it keeps". We explored whether the same principle could be applied to non-verbal patterns of object co-occurrence in natural scenes. We performed latent semantic analysis (LSA) on a set of photographed scenes in which all of the objects present had been manually labelled. This resulted in a representation of objects in a high-dimensional space in which similarity between two objects indicated the degree to which they appeared in similar scenes. These representations revealed similarities among objects belonging to the same taxonomic category (e.g., items of clothing) as well as cross-category associations (e.g., between fruits and kitchen utensils). We also compared representations generated from this scene dataset with two established methods for elucidating semantic representations: (a) a published database of semantic features generated verbally by participants and (b) LSA applied to a linguistic corpus in the usual fashion. Statistical comparisons of the three methods indicated significant association between the structures revealed by each method, with the scene dataset displaying greater convergence with feature-based representations than did LSA applied to linguistic data. The results indicate that information about the conceptual significance of objects can be extracted from their patterns of co-occurrence in natural environments, opening the possibility for such data to be incorporated into existing models of conceptual representation.

View details for DOI 10.1016/j.neuropsychologia.2014.08.031

View details for PubMedID 25196838

View details for PubMedCentralID PMC4589736
Payoff Information Biases a Fast Guess Process in Perceptual Decision Making under Deadline Pressure: Evidence from Behavior, Evoked Potentials, and Quantitative Model Comparison. journal of neuroscience Noorbaloochi, S., Sharon, D., McClelland, J. L. 2015; 35 (31): 10989-11011

Abstract

We used electroencephalography (EEG) and behavior to examine the role of payoff bias in a difficult two-alternative perceptual decision under deadline pressure in humans. The findings suggest that a fast guess process, biased by payoff and triggered by stimulus onset, occurred on a subset of trials and raced with an evidence accumulation process informed by stimulus information. On each trial, the participant judged whether a rectangle was shifted to the right or left and responded by squeezing a right- or left-hand dynamometer. The payoff for each alternative (which could be biased or unbiased) was signaled 1.5 s before stimulus onset. The choice response was assigned to the first hand reaching a squeeze force criterion and reaction time was defined as time to criterion. Consistent with a fast guess account, fast responses were strongly biased toward the higher-paying alternative and the EEG exhibited an abrupt rise in the lateralized readiness potential (LRP) on a subset of biased payoff trials contralateral to the higher-paying alternative ∼150 ms after stimulus onset and 50 ms before stimulus information influenced the LRP. This rise was associated with poststimulus dynamometer activity favoring the higher-paying alternative and predicted choice and response time. Quantitative modeling supported the fast guess account over accounts of payoff effects supported in other studies. Our findings, taken with previous studies, support the idea that payoff and prior probability manipulations produce flexible adaptations to task structure and do not reflect a fixed policy for the integration of payoff and stimulus information.Humans and other animals often face situations in which they must make choices based on uncertain sensory information together with information about expected outcomes (gains or losses) about each choice. We investigated how differences in payoffs between available alternatives affect neural activity, overt choice, and the timing of choice responses. In our experiment, in which participants were under strong time pressure, neural and behavioral findings together with model fitting suggested that our human participants often made a fast guess toward the higher reward rather than integrating stimulus and payoff information. Our findings, taken with findings from other studies, support the idea that payoff and prior probability manipulations produce flexible adaptations to task structure and do not reflect a fixed policy.

View details for DOI 10.1523/JNEUROSCI.0017-15.2015

View details for PubMedID 26245962
Payoff Information Biases a Fast Guess Process in Perceptual Decision Making under Deadline Pressure: Evidence from Behavior, Evoked Potentials, and Quantitative Model Comparison JOURNAL OF NEUROSCIENCE Noorbaloochi, S., Sharon, D., McClelland, J. L. 2015; 35 (31): 10989-11011

Abstract

We used electroencephalography (EEG) and behavior to examine the role of payoff bias in a difficult two-alternative perceptual decision under deadline pressure in humans. The findings suggest that a fast guess process, biased by payoff and triggered by stimulus onset, occurred on a subset of trials and raced with an evidence accumulation process informed by stimulus information. On each trial, the participant judged whether a rectangle was shifted to the right or left and responded by squeezing a right- or left-hand dynamometer. The payoff for each alternative (which could be biased or unbiased) was signaled 1.5 s before stimulus onset. The choice response was assigned to the first hand reaching a squeeze force criterion and reaction time was defined as time to criterion. Consistent with a fast guess account, fast responses were strongly biased toward the higher-paying alternative and the EEG exhibited an abrupt rise in the lateralized readiness potential (LRP) on a subset of biased payoff trials contralateral to the higher-paying alternative ∼150 ms after stimulus onset and 50 ms before stimulus information influenced the LRP. This rise was associated with poststimulus dynamometer activity favoring the higher-paying alternative and predicted choice and response time. Quantitative modeling supported the fast guess account over accounts of payoff effects supported in other studies. Our findings, taken with previous studies, support the idea that payoff and prior probability manipulations produce flexible adaptations to task structure and do not reflect a fixed policy for the integration of payoff and stimulus information.Humans and other animals often face situations in which they must make choices based on uncertain sensory information together with information about expected outcomes (gains or losses) about each choice. We investigated how differences in payoffs between available alternatives affect neural activity, overt choice, and the timing of choice responses. In our experiment, in which participants were under strong time pressure, neural and behavioral findings together with model fitting suggested that our human participants often made a fast guess toward the higher reward rather than integrating stimulus and payoff information. Our findings, taken with findings from other studies, support the idea that payoff and prior probability manipulations produce flexible adaptations to task structure and do not reflect a fixed policy.

View details for DOI 10.1523/JNEUROSCI.0017-15.2015

View details for Web of Science ID 000361131800008

View details for PubMedID 26245962
Connectionist perspectives on language learning, representation and processing WILEY INTERDISCIPLINARY REVIEWS-COGNITIVE SCIENCE Joanisse, M. F., McClelland, J. L. 2015; 6 (3): 235-247

Abstract

The field of formal linguistics was founded on the premise that language is mentally represented as a deterministic symbolic grammar. While this approach has captured many important characteristics of the world's languages, it has also led to a tendency to focus theoretical questions on the correct formalization of grammatical rules while also de-emphasizing the role of learning and statistics in language development and processing. In this review we present a different approach to language research that has emerged from the parallel distributed processing or 'connectionist' enterprise. In the connectionist framework, mental operations are studied by simulating learning and processing within networks of artificial neurons. With that in mind, we discuss recent progress in connectionist models of auditory word recognition, reading, morphology, and syntactic processing. We argue that connectionist models can capture many important characteristics of how language is learned, represented, and processed, as well as providing new insights about the source of these behavioral patterns. Just as importantly, the networks naturally capture irregular (non-rule-like) patterns that are common within languages, something that has been difficult to reconcile with rule-based accounts of language without positing separate mechanisms for rules and exceptions. WIREs Cogn Sci 2015, 6:235-247. doi: 10.1002/wcs.1340 For further resources related to this article, please visit the WIREs website.The authors have declared no conflicts of interest for this article.

View details for DOI 10.1002/wcs.1340

View details for Web of Science ID 000353886000004
Connectionist perspectives on language learning, representation and processing. Wiley interdisciplinary reviews. Cognitive science Joanisse, M. F., McClelland, J. L. 2015; 6 (3): 235-247

Abstract

The field of formal linguistics was founded on the premise that language is mentally represented as a deterministic symbolic grammar. While this approach has captured many important characteristics of the world's languages, it has also led to a tendency to focus theoretical questions on the correct formalization of grammatical rules while also de-emphasizing the role of learning and statistics in language development and processing. In this review we present a different approach to language research that has emerged from the parallel distributed processing or 'connectionist' enterprise. In the connectionist framework, mental operations are studied by simulating learning and processing within networks of artificial neurons. With that in mind, we discuss recent progress in connectionist models of auditory word recognition, reading, morphology, and syntactic processing. We argue that connectionist models can capture many important characteristics of how language is learned, represented, and processed, as well as providing new insights about the source of these behavioral patterns. Just as importantly, the networks naturally capture irregular (non-rule-like) patterns that are common within languages, something that has been difficult to reconcile with rule-based accounts of language without positing separate mechanisms for rules and exceptions. WIREs Cogn Sci 2015, 6:235-247. doi: 10.1002/wcs.1340 For further resources related to this article, please visit the WIREs website.The authors have declared no conflicts of interest for this article.

View details for DOI 10.1002/wcs.1340

View details for PubMedID 26263227
Capturing Gradience, Continuous Change, and Quasi-Regularity in Sound, Word, Phrase, and Meaning HANDBOOK OF LANGUAGE EMERGENCE McClelland, J. L. edited by MacWhinney, B., OGrady, W. 2015: 53-80

View details for Web of Science ID 000684479900003
Resilient properties of thought and experience LANGUAGE COGNITION AND NEUROSCIENCE McClelland, J. L. 2015; 30 (8): 917-918

View details for DOI 10.1080/23273798.2015.1053816

View details for Web of Science ID 000369060000004
Parallel Distributed Processing at 25: Further Explorations in the Microstructure of Cognition COGNITIVE SCIENCE Rogers, T. T., McClelland, J. L. 2014; 38 (6): 1024-1077

Abstract

This paper introduces a special issue of Cognitive Science initiated on the 25th anniversary of the publication of Parallel Distributed Processing (PDP), a two-volume work that introduced the use of neural network models as vehicles for understanding cognition. The collection surveys the core commitments of the PDP framework, the key issues the framework has addressed, and the debates the framework has spawned, and presents viewpoints on the current status of these issues. The articles focus on both historical roots and contemporary developments in learning, optimality theory, perception, memory, language, conceptual knowledge, cognitive control, and consciousness. Here we consider the approach more generally, reviewing the original motivations, the resulting framework, and the central tenets of the underlying theory. We then evaluate the impact of PDP both on the field at large and within specific subdomains of cognitive science and consider the current role of PDP models within the broader landscape of contemporary theoretical frameworks in cognitive science. Looking to the future, we consider the implications for cognitive science of the recent success of machine learning systems called "deep networks"-systems that build on key ideas presented in the PDP volumes.

View details for DOI 10.1111/cogs.12148

View details for Web of Science ID 000340557000002

View details for PubMedID 25087578
Interactive activation and mutual constraint satisfaction in perception and cognition. Cognitive science McClelland, J. L., Mirman, D., Bolger, D. J., Khaitan, P. 2014; 38 (6): 1139-1189

Abstract

In a seminal 1977 article, Rumelhart argued that perception required the simultaneous use of multiple sources of information, allowing perceivers to optimally interpret sensory information at many levels of representation in real time as information arrives. Building on Rumelhart's arguments, we present the Interactive Activation hypothesis-the idea that the mechanism used in perception and comprehension to achieve these feats exploits an interactive activation process implemented through the bidirectional propagation of activation among simple processing units. We then examine the interactive activation model of letter and word perception and the TRACE model of speech perception, as early attempts to explore this hypothesis, and review the experimental evidence relevant to their assumptions and predictions. We consider how well these models address the computational challenge posed by the problem of perception, and we consider how consistent they are with evidence from behavioral experiments. We examine empirical and theoretical controversies surrounding the idea of interactive processing, including a controversy that swirls around the relationship between interactive computation and optimal Bayesian inference. Some of the implementation details of early versions of interactive activation models caused deviation from optimality and from aspects of human performance data. More recent versions of these models, however, overcome these deficiencies. Among these is a model called the multinomial interactive activation model, which explicitly links interactive activation and Bayesian computations. We also review evidence from neurophysiological and neuroimaging studies supporting the view that interactive processing is a characteristic of the perceptual processing machinery in the brain. In sum, we argue that a computational analysis, as well as behavioral and neuroscience evidence, all support the Interactive Activation hypothesis. The evidence suggests that contemporary versions of models based on the idea of interactive activation continue to provide a basis for efforts to achieve a fuller understanding of the process of perception.

View details for DOI 10.1111/cogs.12146

View details for PubMedID 25098813
Why bilateral damage is worse than unilateral damage to the brain. Journal of cognitive neuroscience Schapiro, A. C., McClelland, J. L., Welbourne, S. R., Rogers, T. T., Lambon Ralph, M. A. 2013; 25 (12): 2107-2123

Abstract

Human and animal lesion studies have shown that behavior can be catastrophically impaired after bilateral lesions but that unilateral damage often produces little or no effect, even controlling for lesion extent. This pattern is found across many different sensory, motor, and memory domains. Despite these findings, there has been no systematic, computational explanation. We found that the same striking difference between unilateral and bilateral damage emerged in a distributed, recurrent attractor neural network. The difference persists in simple feedforward networks, where it can be understood in explicit quantitative terms. In essence, damage both distorts and reduces the magnitude of relevant activity in each hemisphere. Unilateral damage reduces the relative magnitude of the contribution to performance of the damaged side, allowing the intact side to dominate performance. In contrast, balanced bilateral damage distorts representations on both sides, which contribute equally, resulting in degraded performance. The model's ability to account for relevant patient data suggests that mechanisms similar to those in the model may operate in the brain.

View details for DOI 10.1162/jocn_a_00441

View details for PubMedID 23806177
Context, cortex, and associations: a connectionist developmental approach to verbal analogies FRONTIERS IN PSYCHOLOGY Kollias, P., McClelland, J. L. 2013; 4

Abstract

We present a PDP model of binary choice verbal analogy problems (A:B as C:[D1|D2], where D1 and D2 represent choice alternatives). We train a recurrent neural network in item-relation-item triples and use this network to test performance on analogy questions. Without training on analogy problems per se, the model explains the developmental shift from associative to relational responding as an emergent consequence of learning upon the environment's statistics. Such learning allows gradual, item-specific acquisition of relational knowledge to overcome the influence of unbalanced association frequency, accounting for association effects of analogical reasoning seen in cognitive development. The network also captures the overall degradation in performance after anterior temporal damage by deleting a fraction of learned connections, while capturing the return of associative dominance after frontal damage by treating frontal structures as necessary for maintaining activation of A and B while seeking a relation between C and D. While our theory is still far from being complete it provides a unified explanation of findings that need to be considered together in any integrated account of analogical reasoning.

View details for DOI 10.3389/fpsyg.2013.00857

View details for Web of Science ID 000331583200001

View details for PubMedID 24312068

View details for PubMedCentralID PMC3834521
Incorporating rapid neocortical learning of new schema-consistent information into complementary learning systems theory. Journal of experimental psychology. General McClelland, J. L. 2013; 142 (4): 1190-1210

Abstract

The complementary learning systems theory of the roles of hippocampus and neocortex (McClelland, McNaughton, & O'Reilly, 1995) holds that the rapid integration of arbitrary new information into neocortical structures is avoided to prevent catastrophic interference with structured knowledge representations stored in synaptic connections among neocortical neurons. Recent studies (Tse et al., 2007, 2011) showed that neocortical circuits can rapidly acquire new associations that are consistent with prior knowledge. The findings challenge the complementary learning systems theory as previously presented. However, new simulations extending those reported in McClelland et al. (1995) show that new information that is consistent with knowledge previously acquired by a putatively cortexlike artificial neural network can be learned rapidly and without interfering with existing knowledge; it is when inconsistent new knowledge is acquired quickly that catastrophic interference ensues. Several important features of the findings of Tse et al. (2007, 2011) are captured in these simulations, indicating that the neural network model used in McClelland et al. has characteristics in common with neocortical learning mechanisms. An additional simulation generalizes beyond the network model previously used, showing how the rate of change of cortical connections can depend on prior knowledge in an arguably more biologically plausible network architecture. In sum, the findings of Tse et al. are fully consistent with the idea that hippocampus and neocortex are complementary learning systems. Taken together, these findings and the simulations reported here advance our knowledge by bringing out the role of consistency of new experience with existing knowledge and demonstrating that the rate of change of connections in real and artificial neural networks can be strongly prior-knowledge dependent.

View details for DOI 10.1037/a0033812

View details for PubMedID 23978185
A Differentiation Account of Recognition Memory: Evidence from fMRI JOURNAL OF COGNITIVE NEUROSCIENCE Criss, A. H., Wheeler, M. E., McClelland, J. L. 2013; 25 (3): 421-435

Abstract

Differentiation models of recognition memory predict a strength-based mirror effect in the distributions of subjective memory strength. Subjective memory strength should increase for targets and simultaneously decrease for foils following a strongly encoded list compared with a weakly encoded list. An alternative explanation for the strength-based mirror effect is that participants adopt a stricter criterion following a strong list than a weak list. Behavioral experiments support the differentiation account. The purpose of this study was to identify the neural bases for these differences. Encoding strength was manipulated (strong, weak) in a rapid event-related fMRI paradigm. To investigate the effect of retrieval context on foils, foils were presented in test blocks containing strong or weak targets. Imaging analyses identified regions in which activity increased faster for foils tested after a strong list than a weak list. The results are interpreted in support of a differentiation account of memory and are suggestive that the angular gyrus plays a role in evaluating evidence related to the memory decision, even for new items.

View details for Web of Science ID 000314363200008

View details for PubMedID 23092213
Integrating probabilistic models of perception and interactive neural networks: a historical and tutorial review. Frontiers in psychology McClelland, J. L. 2013; 4: 503-?

Abstract

This article seeks to establish a rapprochement between explicitly Bayesian models of contextual effects in perception and neural network models of such effects, particularly the connectionist interactive activation (IA) model of perception. The article is in part an historical review and in part a tutorial, reviewing the probabilistic Bayesian approach to understanding perception and how it may be shaped by context, and also reviewing ideas about how such probabilistic computations may be carried out in neural networks, focusing on the role of context in interactive neural networks, in which both bottom-up and top-down signals affect the interpretation of sensory inputs. It is pointed out that connectionist units that use the logistic or softmax activation functions can exactly compute Bayesian posterior probabilities when the bias terms and connection weights affecting such units are set to the logarithms of appropriate probabilistic quantities. Bayesian concepts such the prior, likelihood, (joint and marginal) posterior, probability matching and maximizing, and calculating vs. sampling from the posterior are all reviewed and linked to neural network computations. Probabilistic and neural network models are explicitly linked to the concept of a probabilistic generative model that describes the relationship between the underlying target of perception (e.g., the word intended by a speaker or other source of sensory stimuli) and the sensory input that reaches the perceiver for use in inferring the underlying target. It is shown how a new version of the IA model called the multinomial interactive activation (MIA) model can sample correctly from the joint posterior of a proposed generative model for perception of letters in words, indicating that interactive processing is fully consistent with principled probabilistic computation. Ways in which these computations might be realized in real neural systems are also considered.

View details for DOI 10.3389/fpsyg.2013.00503

View details for PubMedID 23970868

View details for PubMedCentralID PMC3747375
Retrospective. R. Duncan Luce (1925-2012). Science McClelland, J. L. 2012; 337 (6102): 1619-?

View details for PubMedID 23019641
R. Duncan Luce (1925-2012) SCIENCE McClelland, J. L. 2012; 337 (6102): 1619

View details for DOI 10.1126/science.1229851

View details for Web of Science ID 000309215400034

View details for PubMedID 23019641
Generalization Through the Recurrent Interaction of Episodic Memories: A Model of the Hippocampal System PSYCHOLOGICAL REVIEW Kumaran, D., McClelland, J. L. 2012; 119 (3): 573-616

Abstract

In this article, we present a perspective on the role of the hippocampal system in generalization, instantiated in a computational model called REMERGE (recurrency and episodic memory results in generalization). We expose a fundamental, but neglected, tension between prevailing computational theories that emphasize the function of the hippocampus in pattern separation (Marr, 1971; McClelland, McNaughton, & O'Reilly, 1995), and empirical support for its role in generalization and flexible relational memory (Cohen & Eichenbaum, 1993; Eichenbaum, 1999). Our account provides a means by which to resolve this conflict, by demonstrating that the basic representational scheme envisioned by complementary learning systems theory (McClelland et al., 1995), which relies upon orthogonalized codes in the hippocampus, is compatible with efficient generalization-as long as there is recurrence rather than unidirectional flow within the hippocampal circuit or, more widely, between the hippocampus and neocortex. We propose that recurrent similarity computation, a process that facilitates the discovery of higher-order relationships between a set of related experiences, expands the scope of classical exemplar-based models of memory (e.g., Nosofsky, 1984) and allows the hippocampus to support generalization through interactions that unfold within a dynamically created memory space.

View details for DOI 10.1037/a0028681

View details for Web of Science ID 000306029300007

View details for PubMedID 22775499

View details for PubMedCentralID PMC3444305
Can native Japanese listeners learn to differentiate /r-l/ on the basis of F3 onset frequency? BILINGUALISM-LANGUAGE AND COGNITION Ingvalson, E. M., Holt, L. L., McClelland, J. L. 2012; 15 (2): 255-274

Abstract

Many attempts have been made to teach native Japanese listeners to perceptually differentiate English/r-l/(e.g. rock-lock). Though improvement is evident, in no case is final performance native English-like. We focused our training on the third formant onset frequency, shown to be the most reliable indicator of/r-l/category membership. We first presented listeners with instances of synthetic/r-l/stimuli varying only in F3 onset frequency, in a forced-choice identification training task with feedback. Evidence of learning was limited. The second experiment utilized an adaptive paradigm beginning with non-speech stimuli consisting only of/r/and/l/F3 frequency trajectories progressing to synthetic speech instances of/ra-la/; half of the trainees received feedback. Improvement was shown by some listeners, suggesting some enhancement of/r-l/identification is possible following training with only F3 onset frequency. However, only a subset of these listeners showed signs of generalization of the training effect beyond the trained synthetic context.

View details for DOI 10.1017/S1366728911000447

View details for Web of Science ID 000302083100009

View details for PubMedCentralID PMC3538861
Two Mechanisms of Human Contingency Learning PSYCHOLOGICAL SCIENCE Sternberg, D. A., McClelland, J. L. 2012; 23 (1): 59-68

Abstract

How do humans learn contingencies between events? Both pathway-strengthening and inference-based process models have been proposed to explain contingency learning. We propose that each of these processes is used in different conditions. Participants viewed displays that contained single or paired objects and learned which displays were usually followed by the appearance of a dot. Some participants predicted whether the dot would appear before seeing the outcome, whereas other participants were required to respond quickly if the dot appeared shortly after the display. In the prediction task, instructions guiding participants to infer which objects caused the dot to appear were necessary in order for contingencies associated with one object to influence participants' predictions about the object with which it had been paired. In the response task, contingencies associated with one object affected responses to its pair mate irrespective of whether or not participants were given causal instructions. Our results challenge single-mechanism accounts of contingency learning and suggest that the mechanisms underlying performance in the two tasks are distinct.

View details for DOI 10.1177/0956797611429577

View details for Web of Science ID 000300955100012

View details for PubMedID 22198929
Using time-varying evidence to test models of decision dynamics: bounded diffusion vs. the leaky competing accumulator model FRONTIERS IN NEUROSCIENCE Tsetsos, K., Gao, J., McClelland, J. L., Usher, M. 2012; 6

Abstract

When people make decisions, do they give equal weight to evidence arriving at different times? A recent study (Kiani et al., 2008) using brief motion pulses (superimposed on a random moving dot display) reported a primacy effect: pulses presented early in a motion observation period had a stronger impact than pulses presented later. This observation was interpreted as supporting the bounded diffusion (BD) model and ruling out models in which evidence accumulation is subject to leakage or decay of early-arriving information. We use motion pulses and other manipulations of the timing of the perceptual evidence in new experiments and simulations that support the leaky competing accumulator (LCA) model as an alternative to the BD model. While the LCA does include leakage, we show that it can exhibit primacy as a result of competition between alternatives (implemented via mutual inhibition), when the inhibition is strong relative to the leak. Our experiments replicate the primacy effect when participants must be prepared to respond quickly at the end of a motion observation period. With less time pressure, however, the primacy effect is much weaker. For 2 (out of 10) participants, a primacy bias observed in trials where the motion observation period is short becomes weaker or reverses (becoming a recency effect) as the observation period lengthens. Our simulation studies show that primacy is equally consistent with the LCA or with BD. The transition from primacy-to-recency can also be captured by the LCA but not by BD. Individual differences and relations between the LCA and other models are discussed.

View details for DOI 10.3389/fnins.2012.00079

View details for Web of Science ID 000209165300088

View details for PubMedCentralID PMC3372959
Using time-varying evidence to test models of decision dynamics: bounded diffusion vs. the leaky competing accumulator model FRONTIERS IN NEUROSCIENCE Tsetsos, K., Gao, J., McClelland, J. L., Usher, M. 2012; 6

Abstract

When people make decisions, do they give equal weight to evidence arriving at different times? A recent study (Kiani et al., 2008) using brief motion pulses (superimposed on a random moving dot display) reported a primacy effect: pulses presented early in a motion observation period had a stronger impact than pulses presented later. This observation was interpreted as supporting the bounded diffusion (BD) model and ruling out models in which evidence accumulation is subject to leakage or decay of early-arriving information. We use motion pulses and other manipulations of the timing of the perceptual evidence in new experiments and simulations that support the leaky competing accumulator (LCA) model as an alternative to the BD model. While the LCA does include leakage, we show that it can exhibit primacy as a result of competition between alternatives (implemented via mutual inhibition), when the inhibition is strong relative to the leak. Our experiments replicate the primacy effect when participants must be prepared to respond quickly at the end of a motion observation period. With less time pressure, however, the primacy effect is much weaker. For 2 (out of 10) participants, a primacy bias observed in trials where the motion observation period is short becomes weaker or reverses (becoming a recency effect) as the observation period lengthens. Our simulation studies show that primacy is equally consistent with the LCA or with BD. The transition from primacy-to-recency can also be captured by the LCA but not by BD. Individual differences and relations between the LCA and other models are discussed.

View details for DOI 10.3389/fnins.2012.00079

View details for Web of Science ID 000209165300088

View details for PubMedCentralID PMC3372959
Predicting native English-like performance by native Japanese speakers JOURNAL OF PHONETICS Ingvalson, E. M., McClelland, J. L., Holt, L. L. 2011; 39 (4): 571-584

Abstract

This study tested the predictions of the Speech Learning Model (SLM, Flege, 1988) on the case of native Japanese (NJ) speakers' perception and production of English /ɹ / and /l/. NJ speakers' degree of foreign accent, intelligibility of /ɹ -l/ productions, and ability to perceive natural speech /ɹ -l/ were assessed as a function of length of residency in North America, age of arrival in North America, years of student status in an English environment, and percentage of Japanese usage. Additionally, the extent to which NJ speakers' utilized the F3 onset cue when differentiating /ɹ -l/ in perception and production was assessed, this cue having previously been shown to be the most reliable indicator of category membership. As predicted, longer residencies predicted more native English-like accents, more intelligible productions, and more accurate natural speech identifications; however, no changes were observed in F3 reliance, indicating that though performance improves it does so through reliance on other cues.

View details for DOI 10.1016/j.wocn.2011.03.003

View details for Web of Science ID 000296402000011

View details for PubMedCentralID PMC3196605
Dynamic Integration of Reward and Stimulus Information in Perceptual Decision-Making PLOS ONE Gao, J., Tortell, R., McClelland, J. L. 2011; 6 (3)

Abstract

In perceptual decision-making, ideal decision-makers should bias their choices toward alternatives associated with larger rewards, and the extent of the bias should decrease as stimulus sensitivity increases. When responses must be made at different times after stimulus onset, stimulus sensitivity grows with time from zero to a final asymptotic level. Are decision makers able to produce responses that are more biased if they are made soon after stimulus onset, but less biased if they are made after more evidence has been accumulated? If so, how close to optimal can they come in doing this, and how might their performance be achieved mechanistically? We report an experiment in which the payoff for each alternative is indicated before stimulus onset. Processing time is controlled by a "go" cue occurring at different times post stimulus onset, requiring a response within msec. Reward bias does start high when processing time is short and decreases as sensitivity increases, leveling off at a non-zero value. However, the degree of bias is sub-optimal for shorter processing times. We present a mechanistic account of participants' performance within the framework of the leaky competing accumulator model [1], in which accumulators for each alternative accumulate noisy information subject to leakage and mutual inhibition. The leveling off of accuracy is attributed to mutual inhibition between the accumulators, allowing the accumulator that gathers the most evidence early in a trial to suppress the alternative. Three ways reward might affect decision making in this framework are considered. One of the three, in which reward affects the starting point of the evidence accumulation process, is consistent with the qualitative pattern of the observed reward bias effect, while the other two are not. Incorporating this assumption into the leaky competing accumulator model, we are able to provide close quantitative fits to individual participant data.

View details for DOI 10.1371/journal.pone.0016749

View details for Web of Science ID 000287965200005

View details for PubMedID 21390225

View details for PubMedCentralID PMC3048391
A PDP model of the simultaneous perception of multiple objects CONNECTION SCIENCE Henderson, C. M., McClelland, J. L. 2011; 23 (2): 161-172

View details for DOI 10.1080/09540091.2011.575931

View details for Web of Science ID 000291093700009
Testing multi-alternative decision models with non-stationary evidence FRONTIERS IN NEUROSCIENCE Tsetsos, K., Usher, M., McClelland, J. L. 2011; 5

Abstract

Recent research has investigated the process of integrating perceptual evidence toward a decision, converging on a number of sequential sampling choice models, such as variants of race and diffusion models and the non-linear leaky competing accumulator (LCA) model. Here we study extensions of these models to multi-alternative choice, considering how well they can account for data from a psychophysical experiment in which the evidence supporting each of the alternatives changes dynamically during the trial, in a way that creates temporal correlations. We find that participants exhibit a tendency to choose an alternative whose evidence profile is temporally anti-correlated with (or dissimilar from) that of other alternatives. This advantage of the anti-correlated alternative is well accounted for in the LCA, and provides constraints that challenge several other models of multi-alternative choice.

View details for DOI 10.3389/fnins.2011.00063

View details for Web of Science ID 000209200600059

View details for PubMedCentralID PMC3093747
Are there mental lexicons? The role of semantics in lexical decision BRAIN RESEARCH Dilkina, K., McClelland, J. L., Plaut, D. C. 2010; 1365: 66-81

Abstract

What is the underlying representation of lexical knowledge? How do we know whether a given string of letters is a word, whereas another string of letters is not? There are two competing models of lexical processing in the literature. The first proposes that we rely on mental lexicons. The second claims there are no mental lexicons; we identify certain items as words based on semantic knowledge. Thus, the former approach - the multiple-systems view - posits that lexical and semantic processing are subserved by separate systems, whereas the latter approach - the single-system view - holds that the two are interdependent. Semantic dementia patients, who have a cross-modal semantic impairment, show an accompanying and related lexical deficit. These findings support the single-system approach. However, a report of an SD patient whose impairment on lexical decision was not related to his semantic deficits in item-specific ways has presented a challenge to this view. If the two types of processing rely on a common system, then shouldn't damage impair the same items on all tasks? We present a single-system model of lexical and semantic processing, where there are no lexicons, and performance on lexical decision involves the activation of semantic representations. We show how, when these representations are damaged, accuracy on semantic and lexical tasks falls off together, but not necessarily on the same set of items. These findings are congruent with the patient data. We provide an explicit explanation of this pattern of results in our model, by defining and measuring the effects of two orthogonal factors - spelling consistency and concept consistency.

View details for DOI 10.1016/j.brainres.2010.09.057

View details for Web of Science ID 000285816900006

View details for PubMedID 20869349

View details for PubMedCentralID PMC2993824
Emergence in Cognitive Science TOPICS IN COGNITIVE SCIENCE McClelland, J. L. 2010; 2 (4): 751-770

Abstract

The study of human intelligence was once dominated by symbolic approaches, but over the last 30 years an alternative approach has arisen. Symbols and processes that operate on them are often seen today as approximate characterizations of the emergent consequences of sub- or nonsymbolic processes, and a wide range of constructs in cognitive science can be understood as emergents. These include representational constructs (units, structures, rules), architectural constructs (central executive, declarative memory), and developmental processes and outcomes (stages, sensitive periods, neurocognitive modules, developmental disorders). The greatest achievements of human cognition may be largely emergent phenomena. It remains a challenge for the future to learn more about how these greatest achievements arise and to emulate them in artificial systems.

View details for DOI 10.1111/j.1756-8765.2010.01116.x

View details for Web of Science ID 000283870400012
Letting structure emerge: connectionist and dynamical systems approaches to cognition TRENDS IN COGNITIVE SCIENCES McClelland, J. L., Botvinick, M. M., Noelle, D. C., Plaut, D. C., Rogers, T. T., Seidenberg, M. S., Smith, L. B. 2010; 14 (8): 348-356

Abstract

Connectionist and dynamical systems approaches explain human thought, language and behavior in terms of the emergent consequences of a large number of simple noncognitive processes. We view the entities that serve as the basis for structured probabilistic approaches as abstractions that are occasionally useful but often misleading: they have no real basis in the actual processes that give rise to linguistic and cognitive abilities or to the development of these abilities. Although structured probabilistic approaches can be useful in determining what would be optimal under certain assumptions, we propose that connectionist, dynamical systems, and related approaches, which focus on explaining the mechanisms that give rise to cognition, will be essential in achieving a full understanding of cognition and development.

View details for DOI 10.1016/j.tics.2010.06.002

View details for Web of Science ID 000281099600009

View details for PubMedID 20598626

View details for PubMedCentralID PMC3056446
Integration of Sensory and Reward Information during Perceptual Decision-Making in Lateral Intraparietal Cortex (LIP) of the Macaque Monkey PLOS ONE Rorie, A. E., Gao, J., McClelland, J. L., Newsome, W. T. 2010; 5 (2)

Abstract

Single neurons in cortical area LIP are known to carry information relevant to both sensory and value-based decisions that are reported by eye movements. It is not known, however, how sensory and value information are combined in LIP when individual decisions must be based on a combination of these variables. To investigate this issue, we conducted behavioral and electrophysiological experiments in rhesus monkeys during performance of a two-alternative, forced-choice discrimination of motion direction (sensory component). Monkeys reported each decision by making an eye movement to one of two visual targets associated with the two possible directions of motion. We introduced choice biases to the monkeys' decision process (value component) by randomly interleaving balanced reward conditions (equal reward value for the two choices) with unbalanced conditions (one alternative worth twice as much as the other). The monkeys' behavior, as well as that of most LIP neurons, reflected the influence of all relevant variables: the strength of the sensory information, the value of the target in the neuron's response field, and the value of the target outside the response field. Overall, detailed analysis and computer simulation reveal that our data are consistent with a two-stage drift diffusion model proposed by Diederich and Bussmeyer for the effect of payoffs in the context of sensory discrimination tasks. Initial processing of payoff information strongly influences the starting point for the accumulation of sensory evidence, while exerting little if any effect on the rate of accumulation of sensory evidence.

View details for DOI 10.1371/journal.pone.0009308

View details for Web of Science ID 000274923700012

View details for PubMedID 20174574

View details for PubMedCentralID PMC2824817
Integration of Sensory and Reward Information during Perceptual Decision-Making in Lateral Intraparietal Cortex (LIP) of the Macaque Monkey PLOS ONE Rorie, A. E., Gao, J., McClelland, J. L., Newsome, W. T. 2010; 5 (2): e9308

Abstract

Single neurons in cortical area LIP are known to carry information relevant to both sensory and value-based decisions that are reported by eye movements. It is not known, however, how sensory and value information are combined in LIP when individual decisions must be based on a combination of these variables. To investigate this issue, we conducted behavioral and electrophysiological experiments in rhesus monkeys during performance of a two-alternative, forced-choice discrimination of motion direction (sensory component). Monkeys reported each decision by making an eye movement to one of two visual targets associated with the two possible directions of motion. We introduced choice biases to the monkeys' decision process (value component) by randomly interleaving balanced reward conditions (equal reward value for the two choices) with unbalanced conditions (one alternative worth twice as much as the other). The monkeys' behavior, as well as that of most LIP neurons, reflected the influence of all relevant variables: the strength of the sensory information, the value of the target in the neuron's response field, and the value of the target outside the response field. Overall, detailed analysis and computer simulation reveal that our data are consistent with a two-stage drift diffusion model proposed by Diederich and Bussmeyer for the effect of payoffs in the context of sensory discrimination tasks. Initial processing of payoff information strongly influences the starting point for the accumulation of sensory evidence, while exerting little if any effect on the rate of accumulation of sensory evidence.

View details for DOI 10.1371/journal.pone.0009308

View details for Web of Science ID 000274923700012

View details for PubMedID 20174574

View details for PubMedCentralID PMC2824817
Matching Exact Posterior Probabilities in the Multinomial Interactive Activation Model Khaitan, P., McClelland, J. L. edited by Ohlsson, S., Catrambone, R. COGNITIVE SCIENCE SOCIETY, INC. 2010: 623

View details for Web of Science ID 000392421700200
Semantics in the wild: Context-sensitive inferences about mammals Glick, J., McClelland, J. edited by Ohlsson, S., Catrambone, R. COGNITIVE SCIENCE SOCIETY, INC. 2010: 668

View details for Web of Science ID 000392421700245
Complementary processing systems: A PDP model of the simultaneous perception of multiple objects Henderson, C., McClelland, J. edited by Ohlsson, S., Catrambone, R. COGNITIVE SCIENCE SOCIETY, INC. 2010: 2395

View details for Web of Science ID 000392421700556
Locating Object Knowledge in the Brain: Comment on Bowers's (2009) Attempt to Revive the Grandmother Cell Hypothesis PSYCHOLOGICAL REVIEW Plaut, D. C., McClelland, J. L. 2010; 117 (1): 284-290

Abstract

According to Bowers, the finding that there are neurons with highly selective responses to familiar stimuli supports theories positing localist representations over approaches positing the type of distributed representations typically found in parallel distributed processing (PDP) models. However, his conclusions derive from an overly narrow view of the range of possible distributed representations and of the role that PDP models can play in exploring their properties. Although it is true that current distributed theories face challenges in accounting for both neural and behavioral data, the proposed localist account--to the extent that it is articulated at all--runs into more fundamental difficulties. Central to these difficulties is the problem of specifying the set of entities a localist unit represents.

View details for DOI 10.1037/a0017101

View details for Web of Science ID 000273408500015

View details for PubMedID 20063976
Modeling Unsupervised Perceptual Category Learning IEEE TRANSACTIONS ON AUTONOMOUS MENTAL DEVELOPMENT Lake, B. M., Vallabha, G. K., McClelland, J. L. 2009; 1 (1): 35-43

View details for DOI 10.1109/TAMD.2009.2021703

View details for Web of Science ID 000208067800003
How do we get from propositions to behavior? BEHAVIORAL AND BRAIN SCIENCES Sternberg, D. A., McClelland, J. L. 2009; 32 (2): 226-+

View details for DOI 10.1017/S0140525X09001150

View details for Web of Science ID 000269684500061
A connectionist model of a continuous developmental transition in the balance scale task COGNITION Schapiro, A. C., McClelland, J. L. 2009; 110 (3): 395-411

Abstract

A connectionist model of the balance scale task is presented which exhibits developmental transitions between 'Rule I' and 'Rule II' behavior [Siegler, R. S. (1976). Three aspects of cognitive development. Cognitive Psychology,8, 481-520.] as well as the 'catastrophe flags' seen in data from Jansen and van der Maas [Jansen, B. R. J., & van der Maas, H. L. J. (2001). Evidence for the phase transition from Rule I to Rule II on the balance scale task. Developmental Review, 21, 450-494]. The model extends a connectionist model of this task [McClelland, J. L. (1989). Parallel distributed processing: Implications for cognition and development. In R. G. M. Morris (Ed.), Parallel distributed processing: Implications for psychology and neurobiology (pp. 8-45). Oxford: Clarendon Press] by introducing intrinsic variability into processing and by allowing the network to adapt during testing in response to its own outputs. The simulations direct attention to several aspects of the experimental data indicating that children generally show gradual change in sensitivity to the distance dimension on the balance scale. While a few children show larger changes than are characteristic of the model, its ability to account for nearly all of the data using continuous processes is consistent with the view that the transition from Rule I to Rule II behavior is typically continuous rather than discrete in nature.

View details for DOI 10.1016/j.cognition.2008.11.017

View details for Web of Science ID 000264039900006

View details for PubMedID 19171326
Is a Machine Realization of Truly Human-Like Intelligence Achievable? COGNITIVE COMPUTATION McClelland, J. L. 2009; 1 (1): 17-21

View details for DOI 10.1007/s12559-009-9015-x

View details for Web of Science ID 000207987000003
The Place of Modeling in Cognitive Science TOPICS IN COGNITIVE SCIENCE McClelland, J. L. 2009; 1 (1): 11-38

Abstract

I consider the role of cognitive modeling in cognitive science. Modeling, and the computers that enable it, are central to the field, but the role of modeling is often misunderstood. Models are not intended to capture fully the processes they attempt to elucidate. Rather, they are explorations of ideas about the nature of cognitive processes. In these explorations, simplification is essential-through simplification, the implications of the central ideas become more transparent. This is not to say that simplification has no downsides; it does, and these are discussed. I then consider several contemporary frameworks for cognitive modeling, stressing the idea that each framework is useful in its own particular ways. Increases in computer power (by a factor of about 4 million) since 1958 have enabled new modeling paradigms to emerge, but these also depend on new ways of thinking. Will new paradigms emerge again with the next 1,000-fold increase?

View details for DOI 10.1111/j.1756-8765.2008.01003.x

View details for Web of Science ID 000283862000002
Precis of Semantic Cognition: A Parallel Distributed Processing Approach BEHAVIORAL AND BRAIN SCIENCES Rogers, T. T., McClelland, J. L. 2008; 31 (6): 689-?

View details for DOI 10.1017/S0140525X0800589X

View details for Web of Science ID 000262085200034
A simple model from a powerful framework that spans levels of analysis BEHAVIORAL AND BRAIN SCIENCES Rogers, T. T., McClelland, J. L. 2008; 31 (6): 729-749

View details for DOI 10.1017/S0140525X08006067

View details for Web of Science ID 000262085200051
Objective assessment of deformable image registration in radiotherapy: A multi-institution study MEDICAL PHYSICS Kashani, R., Hub, M., Balter, J. M., Kessler, M. L., Dong, L., Zhang, L., Xing, L., Xie, Y., Hawkes, D., Schnabel, J. A., McClelland, J., Joshi, S., Chen, Q., Lu, W. 2008; 35 (12): 5944-5953

Abstract

The looming potential of deformable alignment tools to play an integral role in adaptive radiotherapy suggests a need for objective assessment of these complex algorithms. Previous studies in this area are based on the ability of alignment to reproduce analytically generated deformations applied to sample image data, or use of contours or bifurcations as ground truth for evaluation of alignment accuracy. In this study, a deformable phantom was embedded with 48 small plastic markers, placed in regions varying from high contrast to roughly uniform regional intensity, and small to large regional discontinuities in movement. CT volumes of this phantom were acquired at different deformation states. After manual localization of marker coordinates, images were edited to remove the markers. The resulting image volumes were sent to five collaborating institutions, each of which has developed previously published deformable alignment tools routinely in use. Alignments were done, and applied to the list of reference coordinates at the inhale state. The transformed coordinates were compared to the actual marker locations at exhale. A total of eight alignment techniques were tested from the six institutions. All algorithms performed generally well, as compared to previous publications. Average errors in predicted location ranged from 1.5 to 3.9 mm, depending on technique. No algorithm was uniformly accurate across all regions of the phantom, with maximum errors ranging from 5.1 to 15.4 mm. Larger errors were seen in regions near significant shape changes, as well as areas with uniform contrast but large local motion discontinuity. Although reasonable accuracy was achieved overall, the variation of error in different regions suggests caution in globally accepting the results from deformable alignment.

View details for DOI 10.1118/1.3013563

View details for Web of Science ID 000261210000071

View details for PubMedID 19175149

View details for PubMedCentralID PMC2673610
Cooperation of complementary learning systems in memory McClelland, J. L. PSYCHOLOGY PRESS. 2008: 2

View details for Web of Science ID 000259264300014
Effects of attention on the strength of lexical influences on speech perception: Behavioral experiments and computational mechanisms COGNITIVE SCIENCE Mirman, D., McClelland, J. L., Holt, L. L., Magnuson, J. S. 2008; 32 (2): 398-417

Abstract

The effects of lexical context on phonological processing are pervasive and there have been indications that such effects may be modulated by attention. However, attentional modulation in speech processing is neither well-documented nor well-understood. Experiment 1 demonstrated attentional modulation of lexical facilitation of speech sound recognition when task and critical stimuli were identical across attention conditions. We propose modulation of lexical activation as a neurophysiologically-plausible computational mechanism that can account for this type of modulation. Contrary to the claims of critics, this mechanism can account for attentional modulation without violating the principle of interactive processing. Simulations of the interactive TRACE model extended to include two different ways of modulating lexical activation showed that each can account for attentional modulation of lexical feedback effects. Experiment 2 tested conflicting predictions from the two implementations and provided evidence that is consistent with bias input as the mechanism of attentional control of lexical activation.

View details for DOI 10.1080/03640210701864063

View details for Web of Science ID 000255341900005

View details for PubMedCentralID PMC2396758
Modeling Unsupervised Perceptual Category Learning 7th IEEE International Conference on Development and Learning Lake, B. M., Vallabha, G. K., McClelland, J. L. IEEE. 2008: 25–30

View details for Web of Science ID 000265407300005
Modeling Unsupervised Perceptual Category Learning Lake, B. M., Vallabha, G. K., McClelland, J. L., IEEE IEEE. 2008: 25-30

View details for DOI 10.1109/DEVLRN.2008.4640800

View details for Web of Science ID 000265407300005
Connectionist Models of Cognition CAMBRIDGE HANDBOOK OF COMPUTATIONAL PSYCHOLOGY Thomas, M. S. C., McClelland, J. L. edited by Sun, R. 2008: 23-59

View details for Web of Science ID 000304675100003
A single-system account of semantic and lexical deficits in five semantic dementia patients COGNITIVE NEUROPSYCHOLOGY Dilkina, K., McClelland, J. L., Plaut, D. C. 2008; 25 (2): 136-164

Abstract

In semantic dementia (SD), there is a correlation between performance on semantic tasks such as picture naming and lexical tasks such as reading aloud. However, there have been a few case reports of patients with spared reading despite profound semantic impairment. These reports have sparked an ongoing debate about how the brain processes conceptual versus lexical knowledge. One possibility is that there are two functionally distinct systems in the brain-one for semantic and one for lexical processing. Alternatively, there may be a single system involved in both. We present a computational investigation of the role of individual differences in explaining the relationship between naming and reading performance in five SD patients, among whom there are cases of both association and dissociation of deficits. We used a connectionist model where information from different modalities feeds into a single integrative layer. Our simulations successfully produced the overall relationship between reading and naming seen in SD and provided multiple fits for both association and dissociation data, suggesting that a single, cross-modal, integrative system is sufficient for both semantic and lexical tasks and that individual differences among patients are essential in accounting for variability in performance.

View details for DOI 10.1080/02643290701723948

View details for Web of Science ID 000257087600002

View details for PubMedID 18568816
Language is not just for talking - Redundant labels facilitate learning of novel categories PSYCHOLOGICAL SCIENCE Lupyan, G., Rakison, D. H., McClelland, J. L. 2007; 18 (12): 1077-1083

Abstract

In addition to having communicative functions, verbal labels may play a role in shaping concepts. Two experiments assessed whether the presence of labels affected category formation. Subjects learned to categorize "aliens" as those to be approached or those to be avoided. After accuracy feedback on each response was provided, a nonsense label was either presented or not. Providing nonsense category labels facilitated category learning even though the labels were redundant and all subjects had equivalent experience with supervised categorization of the stimuli. A follow-up study investigated differences between learning verbal and nonverbal associations and showed that learning a nonverbal association did not facilitate categorization. The findings show that labels make category distinctions more concrete and bear directly on the language-and-thought debate.

View details for Web of Science ID 000251206100011

View details for PubMedID 18031415
Language is not just for talking - Redundant labels facilitate learning of novel categories PSYCHOLOGICAL SCIENCE Lupyan, G., Rakison, D. H., McClelland, J. L. 2007; 18 (12): 1077-1083

Abstract

In addition to having communicative functions, verbal labels may play a role in shaping concepts. Two experiments assessed whether the presence of labels affected category formation. Subjects learned to categorize "aliens" as those to be approached or those to be avoided. After accuracy feedback on each response was provided, a nonsense label was either presented or not. Providing nonsense category labels facilitated category learning even though the labels were redundant and all subjects had equivalent experience with supervised categorization of the stimuli. A follow-up study investigated differences between learning verbal and nonverbal associations and showed that learning a nonverbal association did not facilitate categorization. The findings show that labels make category distinctions more concrete and bear directly on the language-and-thought debate.

View details for DOI 10.1111/j.1467-9280.2007.02028.x

View details for Web of Science ID 000251206100011

View details for PubMedID 18031415
Unsupervised learning of vowel categories from infant-directed speech PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA Vallabha, G. K., McClelland, J. L., Pons, F., Werker, J. F., Amano, S. 2007; 104 (33): 13273-13278

Abstract

Infants rapidly learn the sound categories of their native language, even though they do not receive explicit or focused training. Recent research suggests that this learning is due to infants' sensitivity to the distribution of speech sounds and that infant-directed speech contains the distributional information needed to form native-language vowel categories. An algorithm, based on Expectation-Maximization, is presented here for learning the categories from a sequence of vowel tokens without (i) receiving any category information with each vowel token, (ii) knowing in advance the number of categories to learn, or (iii) having access to the entire data ensemble. When exposed to vowel tokens drawn from either English or Japanese infant-directed speech, the algorithm successfully discovered the language-specific vowel categories (/I, i, epsilon, e/ for English, /I, i, e, e/ for Japanese). A nonparametric version of the algorithm, closely related to neural network models based on topographic representation and competitive Hebbian learning, also was able to discover the vowel categories, albeit somewhat less reliably. These results reinforce the proposal that native-language speech categories are acquired through distributional learning and that such learning may be instantiated in a biologically plausible manner.

View details for DOI 10.1073/pnas.0705369104

View details for Web of Science ID 000248899600013

View details for PubMedID 17664424

View details for PubMedCentralID PMC1934922
Using domain-general principles to explain children's causal reasoning abilities DEVELOPMENTAL SCIENCE McClelland, J. L., Thompson, R. M. 2007; 10 (3): 333-356

Abstract

A connectionist model of causal attribution is presented, emphasizing the use of domain-general principles of processing and learning previously employed in models of semantic cognition. The model categorizes objects dependent upon their observed 'causal properties' and is capable of making several types of inferences that 4-year-old children have been shown to be capable of. The model gives rise to approximate conformity to normative models of causal inference and gives approximate estimates of the probability that an object presented in an ambiguous situation actually possesses a particular causal power, based on background knowledge and recent observations. It accounts for data from three sets of experimental studies of the causal inferencing abilities of young children. The model provides a base for further efforts to delineate the intuitive mechanisms of causal inference employed by children and adults, without appealing to inherent principles or mechanisms specialized for causal as opposed to other forms of reasoning.

View details for DOI 10.1111/j.1467-7687.2007.00586.x

View details for Web of Science ID 000245812200006

View details for PubMedID 17444974
Convergent approaches to the understanding of autonomous mental development IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION McClelland, J. L., Plunkett, K., Weng, J. 2007; 11 (2): 133-136

View details for DOI 10.1109/TEVC.2006.890280

View details for Web of Science ID 000245518500001
Success and failure of new speech category learning in adulthood: Consequences of learned Hebbian attractors in topographic maps COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE Vallabha, G. K., McClelland, J. L. 2007; 7 (1): 53-73

Abstract

The influence of a native language on learning new speech sounds in adulthood is addressed using a network model in which speech categories are attractors implemented through interactive activation and Hebbian learning. The network has a representation layer that receives topographic projections from an input layer and has reciprocal excitatory connections with deeper layers. When applied to an experiment in which Japanese adults were trained to distinguish the English /r/-/l/ contrast (McCandliss, Fiez, Protopapas, Conway, & McClelland, 2002), the model can account for many aspects of the experimental results, such as the time course and outcome of the learning, how it varies as a function of feedback, the relative efficacy of adaptive and initially easy training stimuli versus nonadaptive and difficult stimuli, and the development of a discrimination peak at the acquired category boundary. The model is also able to capture some aspects of the individual differences in learning.

View details for Web of Science ID 000247048500006

View details for PubMedID 17598735
Success and failure of new speech category learning in adulthood: Consequences of learned Hebbian attractors in topographic maps COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE Vallabha, G. K., McClelland, J. L. 2007; 7 (1): 53-73

Abstract

The influence of a native language on learning new speech sounds in adulthood is addressed using a network model in which speech categories are attractors implemented through interactive activation and Hebbian learning. The network has a representation layer that receives topographic projections from an input layer and has reciprocal excitatory connections with deeper layers. When applied to an experiment in which Japanese adults were trained to distinguish the English /r/-/l/ contrast (McCandliss, Fiez, Protopapas, Conway, & McClelland, 2002), the model can account for many aspects of the experimental results, such as the time course and outcome of the learning, how it varies as a function of feedback, the relative efficacy of adaptive and initially easy training stimuli versus nonadaptive and difficult stimuli, and the development of a discrimination peak at the acquired category boundary. The model is also able to capture some aspects of the individual differences in learning.

View details for DOI 10.3758/CABN.7.1.53

View details for Web of Science ID 000247048500006

View details for PubMedID 17598735
Gradience of gradience: A reply to Jackendoff (Ray Jackendoff) LINGUISTIC REVIEW McClelland, J. L., Bybee, J. 2007; 24 (4): 437-455

View details for DOI 10.1515/TLR.2007.019

View details for Web of Science ID 000253293500007
Response to McQueen et <i>al</i>.:: Theoretical and empirical arguments support interactive processing TRENDS IN COGNITIVE SCIENCES Mirman, D., McClelland, J. L., Holt, L. L. 2006; 10 (12): 534

View details for DOI 10.1016/j.tics.2006.10.003

View details for Web of Science ID 000242932200004
A homeostatic rule for inhibitory synapses promotes temporal sharpening and cortical reorganization PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA Moldakarimov, S. B., McClelland, J. L., Ermentrout, G. B. 2006; 103 (44): 16526-16531

Abstract

Experience with transient stimuli leads to stronger neural responses that also rise and fall more sharply in time. This sharpening enhances the processing of transients and may be especially relevant for speech perception. We consider a learning rule for inhibitory connections that promotes this sharpening effect by adjusting these connections to maintain a target homeostatic level of activity in excitatory neurons. We analyze this rule in a recurrent network model of excitatory and inhibitory units. Strengthening inhibitory-->excitatory connections along with excitatory-->excitatory connections is required to obtain a sharpening effect. Using the homeostatic rule, we show that repeated presentations of a transient signal will "teach" the network to respond to the signal with both higher amplitude and shorter duration. The model also captures reorganization of receptive fields in the sensory hand area after amputation or peripheral nerve resection.

View details for DOI 10.1073/pnas.0607589103

View details for Web of Science ID 000241879500082

View details for PubMedID 17050684

View details for PubMedCentralID PMC1637615
Connectionist models of development DEVELOPMENTAL SCIENCE Munakata, Y., McClelland, J. L. 2003; 6 (4): 413-429

View details for DOI 10.1111/1467-7687.00296

View details for Web of Science ID 000184909000006
Differentiation and integration in human language - Reply to Marslen-Wilson and Tyler TRENDS IN COGNITIVE SCIENCES McClelland, J. L., Patterson, K. 2003; 7 (2): 63-64

View details for DOI 10.1016/S1364-6613(02)00048-7

View details for Web of Science ID 000181370200009

View details for PubMedID 12584023
Stipulating versus discovering representations BEHAVIORAL AND BRAIN SCIENCES Plaut, D. C., McClelland, J. L. 2000; 23 (4): 489-+

View details for DOI 10.1017/S0140525X00473358

View details for Web of Science ID 000166610300027
Neural models of memory CURRENT OPINION IN NEUROBIOLOGY Hasselmo, M. E., McClelland, J. L. 1999; 9 (2): 184-188

Abstract

Neural models assist in characterizing the processes carried out by cortical and hippocampal memory circuits. Recent models of memory have addressed issues including recognition and recall dynamics, sequences of activity as the unit of storage, and consolidation of intermediate-term episodic memory into long-term memory.

View details for DOI 10.1016/S0959-4388(99)80025-7

View details for Web of Science ID 000080043500006

View details for PubMedID 10322183
Considerations arising from a complementary learning systems perspective on hippocampus and neocortex HIPPOCAMPUS McClelland, J. L., Goddard, N. H. 1996; 6 (6): 654-665

Abstract

We discuss a framework for the organization of learning systems in the mammalian brain, in which the hippocampus and related areas form a memory system complementary to learning mechanisms in neocortex and other areas. The hippocampal system stores new episodes and "replays" them to the neocortical system, interleaved with ongoing experience, allowing generalization as cortical memories form. The data to account for include: 1) neurophysiological findings concerning representations in hippocampal areas, 2) behavioral evidence demonstrating a spatial role for hippocampus, 3) and effects of surgical and pharmacological manipulations on neuronal firing in hippocampal regions in behaving animals. We hypothesize that the hippocampal memory system consists of three major modules: 1) an invertible encoder subsystem supported by the pathways between neocortex and entorhinal cortex, which provides a stable, compressed, invertible encoding in entorhinal cortex (EC) of cortical activity patterns, 2) a memory separation, storage, and retrieval subsystem, supported by pathways between EC, dentate gyrus and area CA3, including the CA3 recurrent collaterals, which facilitates encoding and storage in CA3 of individual EC patterns, and retrieval of those CA3 encodings, in a manner that minimizes interference, and 3) a memory decoding subsystem, supported by the Shaffer collaterals from area CA1 to area CA3 and the bi-directional pathways between EC and CA3, which provides the means by which a retrieved CA3 coding of an EC pattern can reinstate that pattern on EC. This model has shown that 1) there is a trade-off between the need for information-preserving, structure-extracting encoding of cortical traces and the need for effective storage and recall of arbitrary traces, 2) long-term depression of synaptic strength in the pathways subject to long-term potentiation is crucial in preserving information, 3) area CA1 must be able to exploit correlations in EC patterns in the direct perforant path synapses.

View details for Web of Science ID A1996WF54400008

View details for PubMedID 9034852
HIPPOCAMPAL CONJUNCTIVE ENCODING, STORAGE, AND RECALL - AVOIDING A TRADE-OFF HIPPOCAMPUS OREILLY, R. C., MCCLELLAND, J. L. 1994; 4 (6): 661-682

Abstract

The hippocampus and related structures are thought to be capable of 1) representing cortical activity in a way that minimizes overlap of the representations assigned to different cortical patterns (pattern separation); and 2) modifying synaptic connections so that these representations can later be reinstated from partial or noisy versions of the cortical activity pattern that was present at the time of storage (pattern completion). We point out that there is a trade-off between pattern separation and completion and propose that the unique anatomical and physiological properties of the hippocampus might serve to minimize this trade-off. We use analytical methods to determine quantitative estimates of both separation and completion for specified parameterized models of the hippocampus. These estimates are then used to evaluate the role of various properties and of the hippocampus, such as the activity levels seen in different hippocampal regions, synaptic potentiation and depression, the multi-layer connectivity of the system, and the relatively focused and strong mossy fiber projections. This analysis is focused on the feedforward pathways from the entorhinal cortex (EC) to the dentate gyrus (DG) and region CA3. Among our results are the following: 1) Hebbian synaptic modification (LTP) facilitates completion but reduces separation, unless the strengths of synapses from inactive presynaptic units to active postsynaptic units are reduced (LTD). 2) Multiple layers, as in EC to DG to CA3, allow the compounding of pattern separation, but not pattern completion. 3) The variance of the input signal carried by the mossy fibers is important for separation, not the raw strength, which may explain why the mossy fiber inputs are few and relatively strong, rather than many and relatively weak like the other hippocampal pathways. 4) The EC projects to CA3 both directly and indirectly via the DG, which suggests that the two-stage pathway may dominate during pattern separation and the one-stage pathway may dominate during completion; methods the hippocampus may use to enhance this effect are discussed.

View details for DOI 10.1002/hipo.450040605

View details for Web of Science ID A1994PZ98800004

View details for PubMedID 7704110
LEARNING THE GENERAL BUT NOT THE SPECIFIC CURRENT BIOLOGY MCCLELLAND, J. L. 1994; 4 (4): 357-358

Abstract

Amnesia patients have a normal ability to learn categories from examples, even though they fail to learn the examples themselves; computational models of brain function suggest how and why.

View details for DOI 10.1016/S0960-9822(00)00079-8

View details for Web of Science ID A1994NF78300012

View details for PubMedID 7857400
LEARNING CONTINUOUS PROBABILITY-DISTRIBUTIONS WITH SYMMETRICAL DIFFUSION NETWORKS COGNITIVE SCIENCE MOVELLAN, MCCLELLAND, J. L. 1993; 17 (4): 463-496

View details for DOI 10.1016/0364-0213(93)90001-O

View details for Web of Science ID A1993MR45600001
NEURAL NETWORK MODELS AND COGNITIVE NEUROPSYCHOLOGY PSYCHIATRIC ANNALS FARAH, M. J., MCCLELLAND, J. L. 1992; 22 (3): 148-153

View details for DOI 10.3928/0048-5713-19920301-12

View details for Web of Science ID A1992HJ56600008
A COMPUTATIONAL MODEL OF SEMANTIC MEMORY IMPAIRMENT - MODALITY SPECIFICITY AND EMERGENT CATEGORY SPECIFICITY JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL FARAH, M. J., MCCLELLAND, J. L. 1991; 120 (4): 339-357

Abstract

It is demonstrated how a modality-specific semantic memory system can account for category-specific impairments after brain damage. In Experiment 1, the hypothesis that visual and functional knowledge play different roles in the representation of living things and nonliving things is tested and confirmed. A parallel distributed processing model of semantic memory in which knowledge is subdivided by modality into visual and functional components is described. In Experiment 2, the model is lesioned, and it is confirmed that damage to visual semantics primarily impairs knowledge of living things, and damage to functional semantics primarily impairs knowledge of nonliving things. In Experiment 3, it is demonstrated that the model accounts naturally for a finding that had appeared problematic for a modality-specific architecture, namely, impaired retrieval of functional knowledge about living things. Finally, in Experiment 4, it is shown how the model can account for a recent observation of impaired knowledge of living things only when knowledge is probed verbally.

View details for DOI 10.1037/0096-3445.120.4.339

View details for Web of Science ID A1991GQ85400001

View details for PubMedID 1837294
ON THE CONTROL OF AUTOMATIC PROCESSES - A PARALLEL DISTRIBUTED-PROCESSING ACCOUNT OF THE STROOP EFFECT PSYCHOLOGICAL REVIEW Cohen, J. D., Dunbar, K., McClelland, J. L. 1990; 97 (3): 332-361

Abstract

Traditional views of automaticity are in need of revision. For example, automaticity often has been treated as an all-or-none phenomenon, and traditional theories have held that automatic processes are independent of attention. Yet recent empirical data suggest that automatic processes are continuous, and furthermore are subject to attentional control. A model of attention is presented to address these issues. Within a parallel distributed processing framework, it is proposed that the attributes of automaticity depend on the strength of a processing pathway and that strength increases with training. With the Stroop effect as an example, automatic processes are shown to be continuous and to emerge gradually with practice. Specifically, a computational model of the Stroop task simulates the time course of processing as well as the effects of learning. This was accomplished by combining the cascade mechanism described by McClelland (1979) with the backpropagation learning algorithm (Rumelhart, Hinton, & Williams, 1986). The model can simulate performance in the standard Stroop task, as well as aspects of performance in variants of this task that manipulate stimulus-onset asynchrony, response set, and degree of practice. The model presented is contrasted against other models, and its relation to many of the central issues in the literature on attention, automaticity, and interference is discussed.

View details for Web of Science ID A1990DN33800002

View details for PubMedID 2200075
A SIMULATION-BASED TUTORIAL SYSTEM FOR EXPLORING PARALLEL DISTRIBUTED-PROCESSING BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS McClelland, J. L., Rumelhart, D. E. 1988; 20 (2): 263-275

View details for Web of Science ID A1988M935500040
THE CASE FOR INTERACTIONISM IN LANGUAGE PROCESSING ATTENTION AND PERFORMANCE MCCLELLAND, J. L. 1987: 3-36

View details for Web of Science ID A1987K817400001
THE TRACE MODEL OF SPEECH-PERCEPTION COGNITIVE PSYCHOLOGY MCCLELLAND, J. L., ELMAN, J. L. 1986; 18 (1): 1-86

View details for DOI 10.1016/0010-0285(86)90015-0

View details for Web of Science ID A1986AXS9000001

View details for PubMedID 3753912
STRUCTURAL FACTORS IN FIGURE PERCEPTION PERCEPTION & PSYCHOPHYSICS MCCLELLAND, J. L., MILLER, J. 1979; 26 (3): 221-229

View details for DOI 10.3758/BF03199872

View details for Web of Science ID A1979HN33900008
VISUAL FACTORS IN WORD PERCEPTION PERCEPTION & PSYCHOPHYSICS JOHNSTON, J. C., MCCLELLA.JL,, 1973; 14 (2): 365-370

View details for DOI 10.3758/BF03212406

View details for Web of Science ID A1973R353300029

Jay McClelland

Lucie Stern Professor in the Social Sciences, Professor of Psychology and, by courtesy, of Linguistics and of Computer Science

Academic Appointments

Administrative Appointments

Honors & Awards

Program Affiliations

Professional Education

Contact

Additional Info

Links

Current Research and Scholarly Interests

2025-26 Courses

2024-25 Courses

2023-24 Courses

2022-23 Courses

Stanford Advisees

Graduate and Fellowship Programs

All Publications

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract