Neuro-symbolic AI

Neuro-symbolic AI is a type of artificial intelligence that integrates neural and symbolic AI architectures to address the weaknesses of each, providing a robust AI capable of reasoning, learning, and cognitive modeling. As argued by Leslie Valiant [1] and others, [2] [3] the effective construction of rich computational cognitive models demands the combination of symbolic reasoning and efficient machine learning. Gary Marcus argued, "We cannot construct rich cognitive models in an adequate, automated way without the triumvirate of hybrid architecture, rich prior knowledge, and sophisticated techniques for reasoning." [4] He added, "To build a robust, knowledge-driven approach to AI we must have the machinery of symbol manipulation in our toolkit. Too much useful knowledge is abstract to proceed without tools that represent and manipulate abstraction, and to date, the only known machinery that can manipulate such abstract knowledge reliably is the apparatus of symbol manipulation." [5]

Henry Kautz, [6] Francesca Rossi, [7] and Bart Selman [8] have also argued for such a synthesis. Their arguments appeal to the two kinds of thinking discussed in Daniel Kahneman's book Thinking, Fast and Slow, which describes cognition as encompassing two components: System 1 is fast, reflexive, intuitive, and unconscious, while System 2 is slower, step-by-step, and explicit. System 1 is used for pattern recognition; System 2 handles planning, deduction, and deliberative thinking. In this view, deep learning best handles the first kind of cognition while symbolic reasoning best handles the second. Both are needed for a robust, reliable AI that can learn, reason, and interact with humans to accept advice and answer questions. Such dual-process models, with explicit references to the two contrasting systems, have been worked on since the 1990s, both in AI and in cognitive science, by multiple researchers. [9]

Approaches

Approaches for integration are diverse. Henry Kautz's taxonomy of neuro-symbolic architectures, [10] along with some examples, follows:

- Symbolic Neural symbolic: the approach of many current neural models in natural language processing, where words or subword tokens are both the input and the output, with neural processing in between.
- Symbolic[Neuro]: exemplified by AlphaGo, where a symbolic technique (Monte Carlo tree search) invokes neural techniques to evaluate game positions.
- Neuro | Symbolic: a neural architecture interprets perceptual data as symbols, which are then reasoned about symbolically, as in the Neuro-Symbolic Concept Learner. [11]
- Neuro: Symbolic → Neuro: symbolic reasoning is used to generate or label training data that is subsequently learned by a deep learning model.
- Neuro_{Symbolic}: a neural network is generated from symbolic rules, as in Neural Theorem Provers [12] and Logic Tensor Networks. [13]
- Neuro[Symbolic]: a neural model directly calls a symbolic reasoning engine, e.g., to perform an action or evaluate a state.

These categories are not exhaustive, as they do not consider multi-agent systems. In 2005, Bader and Hitzler presented a more fine-grained categorization that considered, e.g., whether the use of symbols included logic and if it did, whether the logic was propositional or first-order logic. [14] The 2005 categorization and Kautz's taxonomy above are compared and contrasted in a 2021 article. [10] Recently, Sepp Hochreiter argued that Graph Neural Networks "...are the predominant models of neural-symbolic computing" [15] since "[t]hey describe the properties of molecules, simulate social networks, or predict future states in physical and engineering applications with particle-particle interactions." [16]
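
The sequential "neural perception feeds symbolic reasoning" pattern can be illustrated with a minimal sketch. Everything here (the toy classifier, the predicate names, the rules) is illustrative and not taken from any particular system; a real pipeline would use a trained neural network in place of the stand-in classifier:

```python
# Minimal sketch of a neural-then-symbolic pipeline: a toy classifier maps raw
# inputs to symbols, and a forward-chaining rule engine reasons over them.
# All names and rules are illustrative, not drawn from any real system.

def perceive(point):
    """Stand-in for a neural classifier: label a 2D point by nearest centroid."""
    centroids = {"circle": (0.0, 0.0), "square": (5.0, 5.0)}
    return min(centroids,
               key=lambda s: sum((a - b) ** 2 for a, b in zip(point, centroids[s])))

RULES = [
    # (premises, conclusion): if all premises hold, assert the conclusion.
    ({"circle"}, "has_no_corners"),
    ({"square"}, "has_corners"),
    ({"has_corners"}, "is_polygon"),
]

def reason(facts):
    """Forward-chain over RULES until no rule adds a new fact."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in RULES:
            if premises <= facts and conclusion not in facts:
                facts.add(conclusion)
                changed = True
    return facts

symbol = perceive((4.2, 5.1))   # "neural" stage outputs a symbol
derived = reason({symbol})      # symbolic stage deduces its consequences
print(sorted(derived))
```

The design point the sketch makes is the interface: the neural stage commits to discrete symbols, after which the symbolic stage can apply ordinary logical machinery such as forward chaining.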

Artificial general intelligence

Gary Marcus argues that "...hybrid architectures that combine learning and symbol manipulation are necessary for robust intelligence, but not sufficient", [17] and that there are

...four cognitive prerequisites for building robust artificial intelligence: hybrid architectures that combine large-scale learning with the effectiveness of symbolic representation; large-scale bases of knowledge, likely leveraging innate frameworks, that incorporate symbolic knowledge along with other forms of knowledge; reasoning mechanisms capable of leveraging those knowledge bases in tractable ways; and rich cognitive models that work together with those mechanisms and knowledge bases. [18]

This echoes calls for hybrid models dating back to the 1990s. [19] [20]

History

Garcez and Lamb described research in this area as ongoing at least since the 1990s. [21] [22] At that time, the terms symbolic and sub-symbolic AI were popular.

A series of workshops on neuro-symbolic AI has been held annually since 2005. [23] An initial set of workshops on the topic was organized in the early 1990s. [19]

Research

Key research questions remain, [24] such as:

- How should neural and symbolic architectures best be integrated?
- How should symbolic structures be represented within neural networks, and extracted from them?
- How should common-sense knowledge be learned and reasoned about?
- How can abstract knowledge that is hard to encode logically be handled?

Implementations

Implementations of neuro-symbolic approaches include:

- Scallop: a language based on Datalog that supports differentiable logical and relational reasoning; it can be integrated in Python. [25]
- Logic Tensor Networks: encode logical formulas as neural networks and simultaneously learn term encodings, term weights, and formula weights. [13]
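
In the differentiable-logic family exemplified by Logic Tensor Networks, [13] logical connectives are relaxed to real-valued operations so that "how well a rule holds" becomes a smooth quantity a learner can optimize. The toy sketch below shows one common semantics (the product t-norm); it is only an illustration of the idea, not the API of any actual library:

```python
# Toy sketch of differentiable fuzzy logic in the spirit of Logic Tensor
# Networks: truth values live in [0, 1] and connectives become arithmetic.
# Product t-norm semantics is one common choice among several.

def t_and(a, b):          # conjunction: product t-norm
    return a * b

def t_not(a):             # negation
    return 1.0 - a

def t_implies(a, b):      # implication via  a -> b  ==  not(a and not b)
    return t_not(t_and(a, t_not(b)))

def forall(values):       # universal quantifier as the mean truth value
    return sum(values) / len(values)

# Suppose learned predicates give graded memberships for "smokes(x)" and
# "cancer_risk(x)" over three individuals (hypothetical numbers):
smokes = [0.9, 0.1, 0.8]
risk   = [0.8, 0.2, 0.9]

# Degree to which the rule  forall x. smokes(x) -> cancer_risk(x)  is satisfied:
sat = forall([t_implies(s, r) for s, r in zip(smokes, risk)])
print(round(sat, 3))
```

Because every operation is differentiable, the satisfaction score can serve as (part of) a loss function, letting gradient descent adjust the underlying neural predicates toward consistency with the logical rules.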

Citations

  1. Valiant 2008.
  2. Garcez et al. 2015.
  3. D'Avila Garcez, Artur S.; Lamb, Luis C.; Gabbay, Dov M. (2009). Neural-Symbolic Cognitive Reasoning. Cognitive Technologies. Springer. ISBN 978-3-540-73245-7.
  4. Marcus 2020, p. 44.
  5. Marcus & Davis 2019, p. 17.
  6. Kautz 2020.
  7. Rossi 2022.
  8. Selman 2022.
  9. Sun 1995.
  10. Sarker, Md Kamruzzaman; Zhou, Lu; Eberhart, Aaron; Hitzler, Pascal (2021). "Neuro-symbolic artificial intelligence: Current trends". AI Communications. 34 (3): 197–209. doi:10.3233/AIC-210084. S2CID 239199144.
  11. Mao et al. 2019.
  12. Rocktäschel, Tim; Riedel, Sebastian (2016). "Learning Knowledge Base Inference with Neural Theorem Provers". Proceedings of the 5th Workshop on Automated Knowledge Base Construction. San Diego, CA: Association for Computational Linguistics. pp. 45–50. doi:10.18653/v1/W16-1309.
  13. Serafini, Luciano; Garcez, Artur d'Avila (2016). "Logic Tensor Networks: Deep Learning and Logical Reasoning from Data and Knowledge". arXiv:1606.04422 [cs.AI].
  14. Bader & Hitzler 2005.
  15. Lamb, L. C.; Garcez, A. S. d'Avila; Gori, M.; Prates, M. O. R.; Avelar, P. H. C.; Vardi, M. Y. (2020). "Graph Neural Networks Meet Neural-Symbolic Computing: A Survey and Perspective". CoRR abs/2003.00330.
  16. Hochreiter, Sepp (April 2022). "Toward a broad AI". Communications of the ACM. 65 (4): 56–57. doi:10.1145/3512715. ISSN 0001-0782.
  17. Marcus 2020, p. 50.
  18. Marcus 2020, p. 48.
  19. Sun & Bookman 1994.
  20. Honavar 1995.
  21. Garcez & Lamb 2020, p. 2.
  22. Garcez et al. 2002.
  23. "Neuro-Symbolic Artificial Intelligence". people.cs.ksu.edu. Retrieved 2023-09-11.
  24. Sun 2001.
  25. Li, Ziyang; Huang, Jiani; Naik, Mayur (2023). "Scallop: A Language for Neurosymbolic Programming". arXiv:2304.04812 [cs.PL].
  26. "Model Induction Method for Explainable AI". USPTO. 2021-05-06.

Related Research Articles

Artificial intelligence (AI), in its broadest sense, is intelligence exhibited by machines, particularly computer systems. It is a field of research in computer science that develops and studies methods and software that enable machines to perceive their environment and uses learning and intelligence to take actions that maximize their chances of achieving defined goals. Such machines may be called AIs.

Cognitive science is the interdisciplinary, scientific study of the mind and its processes. It examines the nature, the tasks, and the functions of cognition. Mental faculties of concern to cognitive scientists include language, perception, memory, attention, reasoning, and emotion; to understand these faculties, cognitive scientists borrow from fields such as linguistics, psychology, artificial intelligence, philosophy, neuroscience, and anthropology. The typical analysis of cognitive science spans many levels of organization, from learning and decision to logic and planning; from neural circuitry to modular brain organization. One of the fundamental concepts of cognitive science is that "thinking can best be understood in terms of representational structures in the mind and computational procedures that operate on those structures."

Connectionism is an approach to the study of human mental processes and cognition that uses mathematical models known as connectionist networks or artificial neural networks. Connectionism has had many 'waves' since its beginnings.

In artificial intelligence, symbolic artificial intelligence is the term for the collection of all methods in artificial intelligence research that are based on high-level symbolic (human-readable) representations of problems, logic and search. Symbolic AI used tools such as logic programming, production rules, semantic nets and frames, and it developed applications such as knowledge-based systems, symbolic mathematics, automated theorem provers, ontologies, the semantic web, and automated planning and scheduling systems. The Symbolic AI paradigm led to seminal ideas in search, symbolic programming languages, agents, multi-agent systems, the semantic web, and the strengths and limitations of formal knowledge and reasoning systems.

Neuromorphic computing is an approach to computing that is inspired by the structure and function of the human brain. A neuromorphic computer/chip is any device that uses physical artificial neurons to do computations. In recent times, the term neuromorphic has been used to describe analog, digital, mixed-mode analog/digital VLSI, and software systems that implement models of neural systems. The implementation of neuromorphic computing on the hardware level can be realized by oxide-based memristors, spintronic memories, threshold switches, transistors, among others. Training software-based neuromorphic systems of spiking neural networks can be achieved using error backpropagation, e.g., using Python based frameworks such as snnTorch, or using canonical learning rules from the biological learning literature, e.g., using BindsNet.

In artificial intelligence (AI), commonsense reasoning is a human-like ability to make presumptions about the type and essence of ordinary situations humans encounter every day. These assumptions include judgments about the nature of physical objects, taxonomic properties, and peoples' intentions. A device that exhibits commonsense reasoning might be capable of drawing conclusions that are similar to humans' folk psychology and naive physics.

The expression computational intelligence (CI) usually refers to the ability of a computer to learn a specific task from data or experimental observation. Even though it is commonly considered a synonym of soft computing, there is still no commonly accepted definition of computational intelligence.

A cognitive architecture refers to both a theory about the structure of the human mind and to a computational instantiation of such a theory used in the fields of artificial intelligence (AI) and computational cognitive science. The formalized models can be used to further refine a comprehensive theory of cognition and as a useful artificial intelligence program. Successful cognitive architectures include ACT-R and SOAR. The research on cognitive architectures as software instantiation of cognitive theories was initiated by Allen Newell in 1990.

The Sally–Anne test is a psychological test, used in developmental psychology to measure a person's social cognitive ability to attribute false beliefs to others. The flagship implementation of the Sally–Anne test was by Simon Baron-Cohen, Alan M. Leslie, and Uta Frith (1985); in 1988, Leslie and Frith repeated the experiment with human actors and found similar results.

Hybrid intelligent system denotes a software system which employs, in parallel, a combination of methods and techniques from artificial intelligence subfields, such as:

Dov M. Gabbay is an Israeli logician. He is Augustus De Morgan Professor Emeritus of Logic at the Group of Logic, Language and Computation, Department of Computer Science, King's College London.

The following outline is provided as an overview of and topical guide to artificial intelligence:

Ron Sun is a cognitive scientist who made significant contributions to computational psychology and other areas of cognitive science and artificial intelligence. He is currently professor of cognitive sciences at Rensselaer Polytechnic Institute, and formerly the James C. Dowell Professor of Engineering and Professor of Computer Science at University of Missouri. He received his Ph.D. in 1992 from Brandeis University.

Artur d'Avila Garcez is a researcher in the field of computational logic and neural computation, in particular hybrid systems with application in software verification and information extraction. His contributions include neural-symbolic learning systems and nonclassical models of computation combining robust learning and reasoning. He is a Professor of Computer Science at City, University of London.

In the philosophy of artificial intelligence, GOFAI is classical symbolic AI, as opposed to other approaches, such as neural networks, situated robotics, narrow symbolic AI or neuro-symbolic AI. The term was coined by philosopher John Haugeland in his 1985 book Artificial Intelligence: The Very Idea.

This glossary of artificial intelligence is a list of definitions of terms and concepts relevant to the study of artificial intelligence, its sub-disciplines, and related fields. Related glossaries include Glossary of computer science, Glossary of robotics, and Glossary of machine vision.

In artificial intelligence, a differentiable neural computer (DNC) is a memory augmented neural network architecture (MANN), which is typically recurrent in its implementation. The model was published in 2016 by Alex Graves et al. of DeepMind.

Explainable AI (XAI), often overlapping with Interpretable AI, or Explainable Machine Learning (XML), either refers to an artificial intelligence (AI) system over which it is possible for humans to retain intellectual oversight, or refers to the methods to achieve this. The main focus is usually on the reasoning behind the decisions or predictions made by the AI which are made more understandable and transparent. XAI counters the "black box" tendency of machine learning, where even the AI's designers cannot explain why it arrived at a specific decision.

Pascal Hitzler is a German American computer scientist specializing in Semantic Web and Artificial Intelligence. He is endowed Lloyd T. Smith Creativity in Engineering Chair, one of the Directors of the Institute for Digital Agriculture and Advanced Analytics (ID3A) and Director of the Center for Artificial Intelligence and Data Science (CAIDS) at Kansas State University, and the founding Editor-in-Chief of the Semantic Web journal and the IOS Press book series Studies on the Semantic Web.
