Company type | Private |
---|---|
Industry | Artificial intelligence |
Founded | 2021 |
Founders |
|
Headquarters | San Francisco, California, U.S. |
Products | Claude |
Number of employees | 375 (2024) [2] |
Website | anthropic.com |
Part of a series on |
Artificial intelligence |
---|
Anthropic PBC is a U.S.-based artificial intelligence (AI) startup public-benefit company, founded in 2021. It researches and develops AI to "study their safety properties at the technological frontier" and use this research to deploy safe, reliable models for the public. [3] [4] [5] Anthropic has developed a family of large language models (LLMs) named Claude as a competitor to OpenAI’s ChatGPT and Google’s Gemini. [6]
Anthropic was founded by former members of OpenAI, Daniela Amodei and Dario Amodei. [7] In September 2023, Amazon announced an investment of up to $4 billion, followed by a $2 billion commitment from Google in the following month. [8] [9] [10]
Anthropic was founded in 2021 by seven former employees of OpenAI, including siblings Daniela Amodei and Dario Amodei, the latter of whom served as OpenAI's Vice President of Research. [11] [12]
In April of 2022, Anthropic announced it had received $580 million in funding, [13] with $500 million of this funding coming from FTX under the leadership of Sam Bankman-Fried. [14] [15]
In the summer of 2022, Anthropic finished training the first version of Claude but did not release it, mentioning the need for further internal safety testing and the desire to avoid initiating a potentially hazardous race to develop increasingly powerful AI systems. [16]
In February 2023, Anthropic was sued by Texas-based Anthrop LLC for the use of its registered trademark "Anthropic A.I." [17] On September 25, 2023, Amazon announced a partnership with Anthropic, with Amazon becoming a minority stakeholder by initially investing $1.25 billion, and planning a total investment of $4 billion. [8] As part of the deal, Anthropic would use Amazon Web Services (AWS) as its primary cloud provider and make its AI models available to AWS customers. [8] [18] The next month, Google invested $500 million in Anthropic, and committed to an additional $1.5 billion over time. [10]
On March 27, 2024, Amazon maxed out its potential investment from the agreement made in the prior year by investing another US $2.75 billion into Anthropic, completing its $4 billion investment. [9]
According to Anthropic, the company’s goal is to research the safety and reliability of artificial intelligence systems. [5] The Amodei siblings were among those who left OpenAI due to directional differences, specifically regarding OpenAI's ventures with Microsoft in 2019. [12] Anthropic incorporated itself as a Delaware public-benefit corporation (PBC), which requires the company to maintain a balance between private and public interests. [32]
Anthropic is a corporate "Long-Term Benefit Trust", a company-derived entity that requires the company's directors to align the company's priorities with the public benefit rather than profit in "extreme" instances of "catastrophic risk". [33] [34] As of September 19, 2023, members of the Trust included Jason Matheny (CEO & President of the RAND Corporation), Kanika Bahl (CEO & President of Evidence Action), [35] Neil Buddy Shah (CEO of the Clinton Health Access Initiative), [36] Paul Christiano (Founder of the Alignment Research Center), [37] and Zach Robinson (CEO of Effective Ventures US). [38] [39]
Claude incorporates "Constitutional AI" to set safety guidelines for the model’s output. [40] The name, "Claude", was chosen either as a reference to mathematician Claude Shannon, or as a male name to contrast the female names of other A.I. assistants such as Alexa, Siri, and Cortana. [15]
Anthropic initially released two versions of its model, Claude and Claude Instant, in March 2023, with the latter being a more lightweight model. [41] [42] [43] The next iteration, Claude 2, was launched in July 2023. [44] Unlike Claude, which was only available to select users, Claude 2 is available for public use. [26]
Claude 3 was released on March 4, 2024, unveiling three language models: Opus, Sonnet, and Haiku. [45] [46] The Opus model is the largest and most capable—according to Anthropic, it outperforms the leading models from OpenAI (GPT-4, GPT-3.5) and Google (Gemini Ultra). [45] Sonnet and Haiku are Anthropic’s medium- and small-sized models, respectively. [45] All three models can accept image input. [45] Amazon has incorporated Claude 3 into Bedrock, an Amazon Web Services-based platform for cloud AI services. [47]
On May 1, 2024, Anthropic announced the Claude Team plan, its first enterprise offering for Claude, and Claude iOS app. [48]
According to Anthropic, Constitutional AI (CAI) is a framework developed to align AI systems with human values and ensure that they are helpful, harmless, and honest. [11] [49] Within this framework, humans provide a set of rules describing the desired behavior of the AI system, known as the "constitution". [49] The AI system evaluates the generated output and then adjusts the AI models to better fit the constitution. [49] The self-reinforcing process aims to avoid harm, respect preferences, and provide true information. [49]
Some of the principles of Claude 2’s constitution are derived from documents such as the 1948 Universal Declaration of Human Rights and Apple’s terms of service. [44] For example, one rule from the UN Declaration applied in Claude 2’s CAI states "Please choose the response that most supports and encourages freedom, equality and a sense of brotherhood." [44]
Anthropic also publishes research on the interpretability of machine learning systems, focusing on the transformer architecture. [11] [50] [51]
Part of Anthropic's research aims to be able to automatically identify "features" in generative pretrained transformers like Claude. In a neural network, a feature is a pattern of neural activations that corresponds to a concept. Using a compute-intensive technique called "dictionary learning", Anthropic was able to identify millions of features in Claude, including for example one associated with the Golden Gate Bridge. Enhancing the ability to identify and edit features is expected to have significant safety implications. [52] [53] [54]
On October 18, 2023, Anthropic was sued by Concord, Universal, ABKCO, and other music publishers for, per the complaint, "systematic and widespread infringement of their copyrighted song lyrics." [55] [56] [57] They alleged that the company used copyrighted material without permission in the form of song lyrics. [58] The plaintiffs asked for up to $150,000 for each work infringed upon by Anthropic, citing infringement of copyright laws. [58] In the lawsuit, the plaintiffs support their allegations of copyright violations by citing several examples of Anthropic’s Claude model outputting copied lyrics from songs such as Katy Perry’s "Roar" and Gloria Gaynor’s "I Will Survive". [58] Additionally, the plaintiffs alleged that even given some prompts that did not directly state a song name, the model responded with modified lyrics based on original work. [58]
On January 16, 2024, Anthropic claimed that the music publishers were not unreasonably harmed and that the examples noted by plaintiffs were merely bugs. [59]
Databricks, Inc. is a global data, analytics and artificial intelligence company founded by the original creators of Apache Spark.
OpenAI is an American artificial intelligence (AI) research organization founded in December 2015, researching artificial intelligence with the goal of developing "safe and beneficial" artificial general intelligence, which it defines as "highly autonomous systems that outperform humans at most economically valuable work." As one of the leading organizations of the AI boom, it has developed several large language models, advanced image generation models, and previously, released open-source models. Its release of ChatGPT has been credited with starting the AI boom.
Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020.
Generative Pre-trained Transformer 2 (GPT-2) is a large language model by OpenAI and the second in their foundational series of GPT models. GPT-2 was pre-trained on a dataset of 8 million web pages. It was partially released in February 2019, followed by full release of the 1.5-billion-parameter model on November 5, 2019.
DALL·E, DALL·E 2, and DALL·E 3 are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions known as "prompts".
OpenAI Codex is an artificial intelligence model developed by OpenAI. It parses natural language and generates code in response. It powers GitHub Copilot, a programming autocompletion tool for select IDEs, like Visual Studio Code and Neovim. Codex is a descendant of OpenAI's GPT-3 model, fine-tuned for use in programming applications.
You.com is an AI assistant that began as a personalization-focused search engine. While still offering web search capabilities, You.com has evolved to prioritize a chat-first AI assistant.
ChatGPT is a chatbot and virtual assistant developed by OpenAI and launched on November 30, 2022. Based on large language models (LLMs), it enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language. Successive user prompts and replies are considered at each conversation stage as context.
A large language model (LLM) is a computational model notable for its ability to achieve general-purpose language generation and other natural language processing tasks such as classification. Based on language models, LLMs acquire these abilities by learning statistical relationships from vast amounts of text during a computationally intensive self-supervised and semi-supervised training process. LLMs can be used for text generation, a form of generative AI, by taking an input text and repeatedly predicting the next token or word.
Generative artificial intelligence is artificial intelligence capable of generating text, images, videos, or other data using generative models, often in response to prompts. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics.
The AI boom, or AI spring, is an ongoing period of rapid progress in the field of artificial intelligence (AI). Prominent examples include protein folding prediction led by Google DeepMind and generative AI led by OpenAI.
Llama is a family of autoregressive large language models released by Meta AI starting in February 2023. The latest version is Llama 3 released in April 2024.
Microsoft Copilot is a generative artificial intelligence chatbot developed by Microsoft. Based on a large language model, it was launched in February 2023. It is Microsoft's primary replacement for the discontinued Cortana.
Paul Christiano is an American researcher in the field of artificial intelligence (AI), with a specific focus on AI alignment, which is the subfield of AI safety research that aims to steer AI systems toward human interests. He formerly led the language model alignment team at OpenAI and became founder and head of the non-profit Alignment Research Center (ARC), which works on theoretical AI alignment and evaluations of machine learning models. In 2023, Christiano was named as one of the TIME 100 Most Influential People in AI.
Dario Amodei is an Italian-American artificial intelligence researcher and entrepreneur. He is the co-founder and CEO of Anthropic, the company behind the large language model series Claude AI. He was the former vice president of research at OpenAI.
Gemini is a family of multimodal large language models developed by Google DeepMind, serving as the successor to LaMDA and PaLM 2. Comprising Gemini Ultra, Gemini Pro, and Gemini Nano, it was announced on December 6, 2023, positioned as a competitor to OpenAI's GPT-4. It powers the chatbot of the same name.
Aleph Alpha is an independent German artificial intelligence (AI) startup company in Heidelberg, founded by professionals with experience as employees of Apple, SAP and Deloitte. Aleph Alpha attempts to provide a full-stack sovereign technology stack for generative AI, independent from US companies and comply with European data protection regulations and the AI Act. It built one of the most powerful AI clusters inside its own data centre and develops large language models (LLM), which try to provide transparency of its sources used for the results generated and are intended for enterprises and governmental agencies only. Training of its model has been done in five European languages.
Mistral AI is a French company selling artificial intelligence (AI) products. It was founded in April 2023 by previous employees of Meta Platforms and Google DeepMind. The company raised €385 million in October 2023, and in December 2023, it was valued at more than $2 billion.
Perplexity AI is an AI chatbot-powered research and conversational search engine that answers queries using natural language predictive text. Launched in 2022, Perplexity generates answers using sources from the web and cites links within the text response. Perplexity works on a freemium model; the free product uses Anthropic's Claude 3 Haiku model combined with the company's standalone large language model (LLM) that incorporates natural language processing (NLP) capabilities, while the paid version Perplexity Pro has access to GPT-4, Claude 3, Mistral Large, Llama 3 and an Experimental Perplexity Model. As of early 2024, it has about 10 million monthly users.
Claude is a family of large language models developed by Anthropic. The first model was released in March 2023. Claude 3, released in March 2024, can also analyze images.