Object Action Complex


Object-Action Complexes (OACs) are proposed as a universal representation enabling efficient planning and execution of purposeful action at all levels of a cognitive architecture (Krüger 2009, Wörgötter 2008, Geib 2006, Piater 2009). OACs combine three ideas: the representational and computational efficiency of STRIPS rules for search (which addresses the frame problem), the object- and situation-oriented concept of affordance, and the logical clarity of the event calculus. An affordance is the relation between a situation, usually including an object of a defined type, and the actions that it allows. While affordances have mostly been analyzed in their purely perceptual aspect, the OAC concept defines them more generally as state transition functions suited to prediction. Such functions can be used for efficient forward-chaining planning, learning, and execution of actions represented simultaneously at multiple levels in an embodied agent architecture.
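The idea of an action as a state transition function used for forward-chaining planning can be illustrated with a minimal sketch. It assumes a STRIPS-like encoding in which a state is a set of symbolic facts; the class `Oac`, the function `forward_plan`, and the facts and action names in the example are hypothetical and chosen for illustration only. It is not the formal OAC definition from the cited literature, and it omits learning and execution entirely.

```python
from collections import deque
from dataclasses import dataclass
from typing import FrozenSet, List, Optional

# Assumption: a state is a set of symbolic facts that currently hold.
State = FrozenSet[str]


@dataclass(frozen=True)
class Oac:
    """Hypothetical STRIPS-like stand-in for an object-action complex."""
    name: str
    preconditions: FrozenSet[str]
    add_effects: FrozenSet[str]
    delete_effects: FrozenSet[str]

    def applicable(self, state: State) -> bool:
        # The action is afforded when all preconditions hold in the state.
        return self.preconditions <= state

    def predict(self, state: State) -> State:
        # State transition function: the predicted state after execution.
        return (state - self.delete_effects) | self.add_effects


def forward_plan(start: State, goal: FrozenSet[str],
                 oacs: List[Oac]) -> Optional[List[str]]:
    """Breadth-first forward chaining from start until all goal facts hold."""
    frontier = deque([(start, [])])
    visited = {start}
    while frontier:
        state, plan = frontier.popleft()
        if goal <= state:
            return plan
        for oac in oacs:
            if oac.applicable(state):
                nxt = oac.predict(state)
                if nxt not in visited:
                    visited.add(nxt)
                    frontier.append((nxt, plan + [oac.name]))
    return None  # no sequence of actions reaches the goal


grasp = Oac("grasp_cup",
            frozenset({"hand_empty", "cup_on_table"}),
            frozenset({"holding_cup"}),
            frozenset({"hand_empty", "cup_on_table"}))
place = Oac("place_cup_on_shelf",
            frozenset({"holding_cup"}),
            frozenset({"cup_on_shelf", "hand_empty"}),
            frozenset({"holding_cup"}))

start = frozenset({"hand_empty", "cup_on_table"})
plan = forward_plan(start, frozenset({"cup_on_shelf"}), [grasp, place])
print(plan)  # ['grasp_cup', 'place_cup_on_shelf']
```

Because each action only lists the facts it adds and deletes, everything else in the state carries over unchanged, which is how STRIPS-style rules sidestep the frame problem during search.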


The PACO+ project is an Integrated Project funded by the European Commission through its Cognition Unit under the Information Society Technologies programme of the Sixth Framework Programme (FP6). Launched on 1 February 2006, it brings together an interdisciplinary research team to design and build cognitive robots capable of developing perceptual, behavioural and cognitive categories that can be used, communicated and shared with humans and other artificial agents. The project hypothesizes that such understanding can only be attained by embodied agents and requires the simultaneous consideration of perception and action. This hypothesis rests on three foundational assumptions.

Natural language grounding

Cognitive robotics tries to recognize manipulation tasks that are demonstrated by humans and other robots. In its simplest form, this amounts to a video parsing system [1], but it can be extended with learning capabilities [2].

Before a robot can execute tasks, the environment has to be perceived with robotic sensors. The raw data are converted into machine-readable information, which is then enriched with semantic information. Natural language grounding (see the symbol grounding problem) amounts to converting the actions observed in the robot's environment into textual information. Semantic Event Chains [3] and Object-Action Complexes are used to store this information in a database.
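As a rough illustration of turning perceived relations into textual information, the sketch below keeps only the moments at which a relation between two objects changes and renders each change as a sentence. The relation labels, the "absent" default, the object names, and the functions `compress_to_events` and `describe` are assumptions made for this example; it is loosely inspired by the event-chain idea and is not the published Semantic Event Chain algorithm.

```python
from typing import Dict, List, Tuple

Pair = Tuple[str, str]
# Assumption: each frame maps an object pair to a relation label
# such as "touching" or "not_touching".
Frame = Dict[Pair, str]
Event = Tuple[int, Pair, str, str]


def compress_to_events(frames: List[Frame]) -> List[Event]:
    """Keep only the frames at which some pairwise relation changes."""
    events: List[Event] = []
    for i in range(1, len(frames)):
        prev, curr = frames[i - 1], frames[i]
        for pair in sorted(curr):
            # A pair missing from the previous frame counts as "absent".
            before = prev.get(pair, "absent")
            after = curr[pair]
            if before != after:
                events.append((i, pair, before, after))
    return events


def describe(event: Event) -> str:
    """Render one relation change as a short textual description."""
    index, (a, b), before, after = event
    return f"frame {index}: {a} and {b} change from {before} to {after}"


frames = [
    {("hand", "cup"): "not_touching", ("cup", "table"): "touching"},
    {("hand", "cup"): "touching", ("cup", "table"): "touching"},
    {("hand", "cup"): "touching", ("cup", "table"): "not_touching"},
]

events = compress_to_events(frames)
descriptions = [describe(e) for e in events]
for line in descriptions:
    print(line)
```

The resulting event tuples are compact enough to store as database rows, while the generated sentences give a textual form of the same observations.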



References

  1. Norbert Krüger, Christopher Geib, Justus Piater, Ronald Petrick, Mark Steedman, Florentin Wörgötter (2011). "Object-Action Complexes: Grounded abstractions of sensory-motor processes". Robotics and Autonomous Systems. Elsevier BV. 59 (10): 740–757. doi:10.1016/j.robot.2011.05.009. hdl:2117/15586.
  2. Norbert Krüger, Christopher Geib, Justus Piater, Ronald Petrick, Mark Steedman, Florentin Wörgötter (2011). "Object-Action Complexes: Grounded abstractions of sensory-motor processes". Robotics and Autonomous Systems. Elsevier BV. 59 (10): 740–757. doi:10.1016/j.robot.2011.05.009. hdl:2117/15586.
  3. Eren Erdal Aksoy, Alexey Abramov, Johannes Dörr, Kejun Ning, Babette Dellen, Florentin Wörgötter (2011). "Learning the semantics of object action relations by observation". The International Journal of Robotics Research. SAGE Publications. 30 (10): 1229–1249. doi:10.1177/0278364911410459. hdl:2117/14016.