Common coding theory

Last updated

Common coding theory is a cognitive psychology theory describing how perceptual representations (e.g. of things we can see and hear) and motor representations (e.g. of hand actions) are linked. The theory claims that there is a shared representation (a common code) for both perception and action. More important, seeing an event activates the action associated with that event, and performing an action activates the associated perceptual event. [1]

Contents

The idea of direct perception-action links originates in the work of the American psychologist William James and more recently, American neurophysiologist and Nobel prize winner Roger Sperry. Sperry argued that the perception–action cycle is the fundamental logic of the nervous system. [2] Perception and action processes are functionally intertwined: perception is a means to action and action is a means to perception. Indeed, the vertebrate brain has evolved for governing motor activity with the basic function to transform sensory patterns into patterns of motor coordination.

Background

The classical approach to cognition is a 'sandwich' model which assumes three stages of information processing: perception, cognition and then action. In this model, perception and action do not interact directly, instead cognitive processing is needed to convert perceptual representations into action. For example, this might require creating arbitrary linkages (mapping between sensory and motor codes). [3]

In contrast, the common coding account claims that perception and action are directly linked by a common computational code. [4]

This theory, put forward by Wolfgang Prinz and his colleagues from the Max Planck Institute for Human Cognitive and Brain Sciences, claims parity between perception and action. Its core assumption is that actions are coded in terms of the perceivable effects (i.e., the distal perceptual events) they should generate. [5] This theory also states that perception of an action should activate action representations to the degree that the perceived and the represented action are similar. [6] Such a claim suggests that we represent observed, executed and imagined actions in a commensurate manner and makes specific predictions regarding the nature of action and perceptual representations. First, representations for observed and executed actions should rely on a shared neural substrate. Second, a common cognitive system predicts facilitation of action based on directly prior perception and vice versa. Third, such a system predicts interference effects when action and perception attempt to access shared representations simultaneously.

Evidence for common coding

From the year 2000 onwards, a growing number of results have been interpreted in favor of the common coding theory.

For instance, one functional MRI study demonstrated that the brain's response to the 2/3 power law of motion (i.e., which dictates a strong coupling between movement curvature and velocity) is much stronger and more widespread than to other types of motion. Compliance with this law was reflected in the activation of a large network of brain areas subserving motor production, visual motion processing, and action observation functions. These results support the common coding and the notion of similar neural coding for motion perception and production. [7]

One of the most direct evidence for common coding in the brain now stems from the fact that pattern classifiers that can differentiate based on brain activity whether someone has performed action A or B can also classify, above chance, whether that person heard the sound of action A or B, thereby demonstrating that action execution and perception are represented using a common code. [8]

In the early 21st century, the common coding theory received increased interest from researchers in developmental psychology, [9] cognitive neuroscience, [10] robotics, [11] and social psychology. [12]

Commensurate representation

Common coding posits, on top of separate coding, further domains of representation in which afferent and efferent information share the same format and dimensionality of representation. Common coding refers to 'late' afferent representations (referring to events in the environment) and 'early' efferent representations (referring to intended events). Such representations are commensurate since they both exhibit distal reference. [13] [14] They permit creating linkages between perception and action that do not rely on arbitrary mappings. Common coding conceives action planning in terms of operations that determine intended future events from given current events (matching between event codes and action codes). In particular perception and action may modulate each other by virtue of similarity. Unlike rule-based mapping of incommensurate codes which requires preceding acquisition of mapping rules, similarity-based matching of commensurate codes requires no such preceding rule acquisition.

Ideomotor principle

In line with the ideomotor theory of William James (1890) and Hermann Lotze (1852), the common coding theory posits that actions are represented in terms of their perceptual consequences. Actions are represented like any other events, the sole distinctive feature being that they are (or can be) generated through bodily movements. Perceivable action consequences may vary on two major dimensions: resident vs. remote effects, and 'cool' versus 'hot' outcomes (i.e., reward values associated with action outcomes). [15]

When individuals perform actions they learn what their movements lead to (Ideomotor learning). The ideomotor theory claims that these associations can also be used in the reverse order (cf. William James, 1890 II, p. 526): When individuals perceive events of which they know (from previous learning) that they may result from certain movements, perception of these events may evoke the movements leading to them (Ideomotor control). The distinction between learning and control is equivalent to the distinction between forward and inverse computation in motor learning and control. [16] Ideomotor learning supports prediction and anticipation of action outcomes, given current action. Ideomotor control supports selection and control of action, given intended outcomes.

While most traditional approaches tend to stress the relative independence of perception and action, some theories have argued for closer links. Motor theories of speech and action perception have made a case for motor contributions to perception. [17] [18] Close non-representational connections between perception and action have also been claimed by ecological approaches. [19] [20] Today common coding theory is closely related to research and theory in two intersecting fields of study: Mirror neurons systems and embodied cognition. As concerns mirror systems, common coding seems to reflect the functional logic of mirror neurons and mechanisms in the brain. [21] As concerns embodied cognition, common coding is compatible with the claim that meaning is embodied, i.e. grounded in perception and action. [22] [23] Common coding theory has further sparked refined theoretical frameworks that build on its notion of a shared representational format for action and perception. A recent example for these refinements is the Binding and retrieval in action control (BRAC) framework. [24]

See also

Related Research Articles

<span class="mw-page-title-main">Cognitive science</span> Interdisciplinary scientific study of cognitive processes

Cognitive science is the interdisciplinary, scientific study of the mind and its processes with input from linguistics, psychology, neuroscience, philosophy, computer science/artificial intelligence, and anthropology. It examines the nature, the tasks, and the functions of cognition. Cognitive scientists study intelligence and behavior, with a focus on how nervous systems represent, process, and transform information. Mental faculties of concern to cognitive scientists include language, perception, memory, attention, reasoning, and emotion; to understand these faculties, cognitive scientists borrow from fields such as linguistics, psychology, artificial intelligence, philosophy, neuroscience, and anthropology. The typical analysis of cognitive science spans many levels of organization, from learning and decision to logic and planning; from neural circuitry to modular brain organization. One of the fundamental concepts of cognitive science is that "thinking can best be understood in terms of representational structures in the mind and computational procedures that operate on those structures."

<span class="mw-page-title-main">Perception</span> Interpretation of sensory information

Perception is the organization, identification, and interpretation of sensory information in order to represent and understand the presented information or environment. All perception involves signals that go through the nervous system, which in turn result from physical or chemical stimulation of the sensory system. Vision involves light striking the retina of the eye; smell is mediated by odor molecules; and hearing involves pressure waves.

In the philosophy of mind, neuroscience, and cognitive science, a mental image is an experience that, on most occasions, significantly resembles the experience of "perceiving" some object, event, or scene but occurs when the relevant object, event, or scene is not actually present to the senses. There are sometimes episodes, particularly on falling asleep and waking up, when the mental imagery may be dynamic, phantasmagoric, and involuntary in character, repeatedly presenting identifiable objects or actions, spilling over from waking events, or defying perception, presenting a kaleidoscopic field, in which no distinct object can be discerned. Mental imagery can sometimes produce the same effects as would be produced by the behavior or experience imagined.

The consciousness and binding problem is the problem of how objects, background and abstract or emotional features are combined into a single experience.

<span class="mw-page-title-main">Dual-coding theory</span> Theory of cognition

Dual-coding theory is a theory of cognition that suggests that the mind processes information along two different channels; verbal and nonverbal. It was hypothesized by Allan Paivio of the University of Western Ontario in 1971. In developing this theory, Paivio used the idea that the formation of mental imagery aids learning through the picture superiority effect.

The cognitive revolution was an intellectual movement that began in the 1950s as an interdisciplinary study of the mind and its processes, from which emerged a new field known as cognitive science. The preexisting relevant fields were psychology, linguistics, computer science, anthropology, neuroscience, and philosophy. The approaches used were developed within the then-nascent fields of artificial intelligence, computer science, and neuroscience. In the 1960s, the Harvard Center for Cognitive Studies and the Center for Human Information Processing at the University of California, San Diego were influential in developing the academic study of cognitive science. By the early 1970s, the cognitive movement had surpassed behaviorism as a psychological paradigm. Furthermore, by the early 1980s the cognitive approach had become the dominant line of research inquiry across most branches in the field of psychology.

Music psychology, or the psychology of music, may be regarded as a branch of both psychology and musicology. It aims to explain and understand musical behaviour and experience, including the processes through which music is perceived, created, responded to, and incorporated into everyday life. Modern music psychology is primarily empirical; its knowledge tends to advance on the basis of interpretations of data collected by systematic observation of and interaction with human participants. Music psychology is a field of research with practical relevance for many areas, including music performance, composition, education, criticism, and therapy, as well as investigations of human attitude, skill, performance, intelligence, creativity, and social behavior.

Lawrence W. Barsalou is an American psychologist and a cognitive scientist, currently working at the University of Glasgow.

The concept of motor cognition grasps the notion that cognition is embodied in action, and that the motor system participates in what is usually considered as mental processing, including those involved in social interaction. The fundamental unit of the motor cognition paradigm is action, defined as the movements produced to satisfy an intention towards a specific motor goal, or in reaction to a meaningful event in the physical and social environments. Motor cognition takes into account the preparation and production of actions, as well as the processes involved in recognizing, predicting, mimicking, and understanding the behavior of other people. This paradigm has received a great deal of attention and empirical support in recent years from a variety of research domains including embodied cognition, developmental psychology, cognitive neuroscience, and social psychology.

<span class="mw-page-title-main">Wolfgang Prinz</span> German cognitive psychologist

Wolfgang Prinz is a German cognitive psychologist. He is the director of the Max Planck Institute for Human Cognitive and Brain Sciences in Leipzig, Germany, and an internationally recognized expert in experimental psychology, cognitive psychology and philosophy of mind. He is the founder of the common coding theory between perception and action that has a significant impact in cognitive neuroscience and social cognition.

<span class="mw-page-title-main">Vittorio Gallese</span> Italian physiologist (1959–)

Vittorio Gallese is professor of Psychobiology at the University of Parma, Italy, and was professor in Experimental Aesthetics at the University of London, UK (2016–2018). He is an expert in neurophysiology, cognitive neuroscience, social neuroscience, and philosophy of mind. Gallese is one of the discoverers of mirror neurons. His research attempts to elucidate the functional organization of brain mechanisms underlying social cognition, including action understanding, empathy, language, mindreading and aesthetic experience.

Motor imagery is a mental process by which an individual rehearses or simulates a given action. It is widely used in sport training as mental practice of action, neurological rehabilitation, and has also been employed as a research paradigm in cognitive neuroscience and cognitive psychology to investigate the content and the structure of covert processes that precede the execution of action. In some medical, musical, and athletic contexts, when paired with physical rehearsal, mental rehearsal can be as effective as pure physical rehearsal (practice) of an action.

Bayesian approaches to brain function investigate the capacity of the nervous system to operate in situations of uncertainty in a fashion that is close to the optimal prescribed by Bayesian statistics. This term is used in behavioural sciences and neuroscience and studies associated with this term often strive to explain the brain's cognitive abilities based on statistical principles. It is frequently assumed that the nervous system maintains internal probabilistic models that are updated by neural processing of sensory information using methods approximating those of Bayesian probability.

The motor theory of speech perception is the hypothesis that people perceive spoken words by identifying the vocal tract gestures with which they are pronounced rather than by identifying the sound patterns that speech generates. It originally claimed that speech perception is done through a specialized module that is innate and human-specific. Though the idea of a module has been qualified in more recent versions of the theory, the idea remains that the role of the speech motor system is not only to produce speech articulations but also to detect them.

Cognitive musicology is a branch of cognitive science concerned with computationally modeling musical knowledge with the goal of understanding both music and cognition.

<span class="mw-page-title-main">Embodied cognition</span> Interdisciplinary theory

Embodied cognition is the concept suggesting that many features of cognition are shaped by the state and capacities of the organism. The cognitive features include a wide spectrum of cognitive functions, such as perception biases, memory recall, comprehension and high-level mental constructs and performance on various cognitive tasks. The bodily aspects involve the motor system, the perceptual system, the bodily interactions with the environment (situatedness), and the assumptions about the world built the functional structure of organism's brain and body.

In neuroscience, predictive coding is a theory of brain function which postulates that the brain is constantly generating and updating a "mental model" of the environment. According to the theory, such a mental model is used to predict input signals from the senses that are then compared with the actual input signals from those senses. With the rising popularity of representation learning, the theory is being actively pursued and applied in machine learning and related fields.

Social cognitive neuroscience is the scientific study of the biological processes underpinning social cognition. Specifically, it uses the tools of neuroscience to study "the mental mechanisms that create, frame, regulate, and respond to our experience of the social world". Social cognitive neuroscience uses the epistemological foundations of cognitive neuroscience, and is closely related to social neuroscience. Social cognitive neuroscience employs human neuroimaging, typically using functional magnetic resonance imaging (fMRI). Human brain stimulation techniques such as transcranial magnetic stimulation and transcranial direct-current stimulation are also used. In nonhuman animals, direct electrophysiological recordings and electrical stimulation of single cells and neuronal populations are utilized for investigating lower-level social cognitive processes.

Ensemble coding, also known as ensemble perception or summary representation, is a theory in cognitive neuroscience about the internal representation of groups of objects in the human mind. Ensemble coding proposes that such information is recorded via summary statistics, particularly the average or variance. Experimental evidence tends to support the theory for low-level visual information, such as shapes and sizes, as well as some high-level features such as face gender. Nonetheless, it remains unclear the extent to which ensemble coding applies to high-level or non-visual stimuli, and the theory remains the subject of active research.

Binding and Retrieval in Action Control (BRAC) is a theoretical framework to explain basic psychological functions at the intersection of perception and motor control. It takes a cognitive approach by capturing how events are represented in the cognitive system. Its two core mechanisms – binding and retrieval of feature codes – explain a variety of observations in basic psychological experiments within a compact and parsimonious framework.

References

  1. Prinz, W. (1984). "Modes of linkage between perception and action". In Prinz, W.; Sanders, A.-F. (eds.). Cognition and motor processes. New York: Springer. pp. 185–193. ISBN   0-387-12855-7.
  2. Sperry, R.W. (1952). "Neurology and the mind-body problem". American Scientist. 40: 291–312.
  3. Massaro, D. W. (1990). "An information-processing analysis of perception and action". In Neumann, O.; Prinz, W. (eds.). Relationships between perception and action: Current approaches. New York: Springer. pp. 133–166. ISBN   0-387-52069-4.
  4. Prinz, W. (2005). "Experimental approaches to action". In Roessler, J.; Eilan, N. (eds.). Agency and self-awareness. New York: Oxford University Press. pp. 165–187. ISBN   0-19-924561-4.
  5. Prinz, W. (1997). "Perception and action planning". European Journal of Cognitive Psychology. 9 (2): 129–154. doi:10.1080/713752551.
  6. Knoblich, G.; Flach, R. (2001). "Predicting the effects of actions: interactions of perception and action". Psychological Science. 12 (6): 467–472. doi:10.1111/1467-9280.00387. PMID   11760133. S2CID   9061659.
  7. Eran Dayan, E.; Casile, A.; Levit-Binnun, N.; Giese, M.A.; Hendler, T.; Flash, T. (2007). "Neural representations of kinematic laws of motion: Evidence for action-perception coupling". PNAS. 104 (51): 20582–20587. Bibcode:2007PNAS..10420582D. doi: 10.1073/pnas.0710033104 . PMC   2154474 . PMID   18079289.
  8. Etzel, J. A.; Gazzola, V.; Keysers, C. (2008). "Testing Simulation Theory with Cross-Modal Multivariate Classification of fMRI Data". PLOS ONE. 3 (11): e3690. Bibcode:2008PLoSO...3.3690E. doi: 10.1371/journal.pone.0003690 . PMC   2577733 . PMID   18997869.
  9. Sommerville, J. A.; Decety, J. (2006). "Weaving the fabric of social interaction: Articulating developmental psychology and cognitive neuroscience in the domain of motor cognition". Psychonomic Bulletin & Review. 13 (2): 179–200. doi:10.3758/bf03193831. PMID   16892982. S2CID   14689479.
  10. Jackson, P.L.; Decety, J. (2004). "Motor cognition: A new paradigm to investigate social interactions". Current Opinion in Neurobiology. 14 (2): 1–5. doi:10.1016/j.conb.2004.01.020. PMID   15082334. S2CID   36205586.
  11. Proctor & Vu (2006). Stimulus-response compatibility: Data, theory and application. Taylor & Francis. ISBN   0-415-31536-0.
  12. Dijksterhuis, A.; Bargh, J.A. (2001). "The perception-behavior expressway: automatic effects of social perception on social behavior". Advances in Experimental Social Psychology. 33: 1–40. doi:10.1016/S0065-2601(01)80003-4. ISBN   9780120152339.
  13. Prinz, W (1992). "Why don't we perceive our brain states?". European Journal of Cognitive Psychology. 4: 1–20. doi:10.1080/09541449208406240.
  14. Hommel, B.; Müsseler, J.; Aschersleben, G.; Prinz, W. (2001). "The theory of event coding (TEC): A framework for perception and action planning". Behavioral and Brain Sciences. 24 (5): 849–878. doi:10.1017/s0140525x01000103. PMID   12239891.
  15. Dickinson, A.; Balleine, B. W. (2002). "The role of learning in the operation of motivational systems". In Pashler, H.; Gallistel, R. (eds.). Stevens' handbook of experimental psychology. Vol. 3. New York: John Wiley. pp. 497–533. ISBN   0-471-38047-4.
  16. Wolpert, D.; Ghahramani, Z. (2004). "Computational motor control". In Gazzaniga, M. S. (ed.). The cognitive neurosciences (3rd ed.). Cambridge, MA: MIT Press. pp. 485–494. ISBN   0-262-07254-8.
  17. Viviani, P. (2002). "Motor competence in the perception of dynamic events: A tutorial". In Prinz, W.; Hommel, B. (eds.). Common mechanisms in perception and action: Attention and Performance. Vol. XIX. Oxford: Oxford University Press. pp. 406–442. ISBN   0-19-851069-1.
  18. Liberman, A. M. (1982). "On finding that speech is special". American Psychologist. 37 (2): 148–167. doi:10.1037/0003-066x.37.2.148. S2CID   14469676.
  19. Fowler, C. A.; Turvey, M. T. (1982). "Observational perspective and descriptive level in perceiving and acting". In Weimer, W. B.; Palermo, D. S. (eds.). Cognition and the symbolic processes. Vol. 2. Hillsdale, NJ: Lawrence Erlbaum. pp. 1–19. ISBN   0-89859-066-3.
  20. Gibson, J. J. (1979). The ecological approach to visual perception. Boston: Houghton Mifflin. ISBN   0-395-27049-9.
  21. Rizzolatti, G.; Craighero, L. (2004). "The mirror-neuron system". Annual Review of Neuroscience. 27: 169–192. doi:10.1146/annurev.neuro.27.070203.144230. PMID   15217330. S2CID   1729870.
  22. Noë, A. (2004). Action in Perception. MIT Press. ISBN   0-262-14088-8.
  23. Barsalou, L. W. (2008). "Grounded cognition". Annual Review of Psychology. 59: 617–645. doi:10.1146/annurev.psych.59.103006.093639. PMID   17705682. S2CID   22345373.
  24. Frings, C., Hommel, B., Koch, I., Rothermund, K., Dignath, D., Giesen, C., Kiesel, A., Kunde, W., Mayr, S., Moeller, B., Möller, M., Pfister, R., & Philipp, A.: Binding and retrieval in action control (BRAC). In: Trends in Cognitive Sciences, Nr. 24, 2020, p. 375–387.

Further reading