Blazegraph

Blazegraph
Developer(s)	Systap
Stable release	2.1.5 / 19 March 2019
Preview release	2.1.6rc / 3 February 2020
Repository	github.com/blazegraph/database ;
Written in	Java
Type	Graph database
License	GNU GPL (version 2)
Website	blazegraph.com

Last updated May 17, 2024

Blazegraph is an open source triplestore and graph database,^[3]^[4]^[5]^[6] developed by Systap, which is used in the Wikidata SPARQL endpoint and by other large customers. It is licensed under the GNU GPL (version 2).

Early history

The system was first known as Bigdata. Since release of version 1.5 (12 February 2015), it is named Blazegraph.^[8]^[9]

Prominent users

The Wikimedia Foundation uses Blazegraph for the Wikidata Query Service, which is a SPARQL endpoint.^[10]
The Datatourisme project uses Blazegraph as the database platform; however, GraphQL is used as the query language instead of SPARQL.^[11]

Notable features

RDF* — an alternative approach to RDF reification,^[12] which gives RDF graphs capabilities of LPG graphs;^[13]
as the consequence of the previous, ability of querying graphs both in SPARQL and Gremlin;^[14]^[15]
as an alternative to Gremlin querying, GAS abstraction over RDF graphs support in SPARQL;^[16]
The SERVICE syntax of federated queries for functionality extending;^[17]
Managed behavior of the query plan generator;^[18]
Reusable named subqueries.^[19]

Acqui-hiring by Amazon Web Service (AWS)

It was alleged^[20] that Amazon Neptune is based on Blazegraph, as evidenced by:

acquiring of the Blazegraph trademark by AWS;^[21]
acquiring of the blazegraph.com domain name by AWS;^[22]
transition of many employees (including CEO) to AWS.^[23]

Related Research Articles

The Resource Description Framework (RDF) is a World Wide Web Consortium (W3C) standard originally designed as a data model for metadata. It has come to be used as a general method for description and exchange of graph data. RDF provides a variety of syntax notations and data serialization formats, with Turtle currently being the most widely used notation.

A query language, also known as data query language or database query language (DQL), is a computer language used to make queries in databases and information systems. In database systems, query languages rely on strict theory to retrieve information. A well known example is the Structured Query Language (SQL).

RDF Schema (Resource Description Framework Schema, variously abbreviated as RDFS, RDF(S), RDF-S, or RDF/S) is a set of classes with certain properties using the RDF extensible knowledge representation data model, providing basic elements for the description of ontologies. It uses various forms of RDF vocabularies, intended to structure RDF resources. RDF and RDFS can be saved in a triplestore, then one can extract some knowledge from them using a query language, like SPARQL.

SPARQL is an RDF query language—that is, a semantic query language for databases—able to retrieve and manipulate data stored in Resource Description Framework (RDF) format. It was made a standard by the RDF Data Access Working Group (DAWG) of the World Wide Web Consortium, and is recognized as one of the key technologies of the semantic web. On 15 January 2008, SPARQL 1.0 was acknowledged by W3C as an official recommendation, and SPARQL 1.1 in March, 2013.

RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information. This library contains parsers/serializers for almost all of the known RDF serializations, such as RDF/XML, Turtle, N-Triples, & JSON-LD, many of which are now supported in their updated form. The library also contains both in-memory and persistent Graph back-ends for storing RDF information and numerous convenience functions for declaring graph namespaces, lodging SPARQL queries and so on. It is in continuous development with the most recent stable release, rdflib 6.1.1 having been released on 20 December 2021. It was originally created by Daniel Krech with the first release in November, 2002.

Oracle Spatial and Graph, formerly Oracle Spatial, is a free option component of the Oracle Database. The spatial features in Oracle Spatial and Graph aid users in managing geographic and location-data in a native type within an Oracle database, potentially supporting a wide range of applications — from automated mapping, facilities management, and geographic information systems (AM/FM/GIS), to wireless location services and location-enabled e-business. The graph features in Oracle Spatial and Graph include Oracle Network Data Model (NDM) graphs used in traditional network applications in major transportation, telcos, utilities and energy organizations and RDF semantic graphs used in social networks and social interactions and in linking disparate data sets to address requirements from the research, health sciences, finance, media and intelligence communities.

An RDF query language is a computer language, specifically a query language for databases, able to retrieve and manipulate data stored in Resource Description Framework (RDF) format.

Apache Jena is an open source Semantic Web framework for Java. It provides an API to extract data from and write to RDF graphs. The graphs are represented as an abstract "model". A model can be sourced with data from files, databases, URLs or a combination of these. A model can also be queried through SPARQL 1.1.

DBpedia is a project aiming to extract structured content from the information created in the Wikipedia project. This structured information is made available on the World Wide Web using OpenLink Virtuoso. DBpedia allows users to semantically query relationships and properties of Wikipedia resources, including links to other related datasets.

A triplestore or RDF store is a purpose-built database for the storage and retrieval of triples through semantic queries. A triple is a data entity composed of subject–predicate–object, like "Bob is 35" or "Bob knows Fred".

Freebase was a large collaborative knowledge base consisting of data composed mainly by its community members. It was an online collection of structured data harvested from many sources, including individual, user-submitted wiki contributions. Freebase aimed to create a global resource that allowed people to access common information more effectively. It was developed by the American software company Metaweb and run publicly beginning in March 2007. Metaweb was acquired by Google in a private sale announced on 16 July 2010. Google's Knowledge Graph is powered in part by Freebase.

A graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph. The graph relates the data items in the store to a collection of nodes and edges, the edges representing the relationships between the nodes. The relationships allow data in the store to be linked together directly and, in many cases, retrieved with one operation. Graph databases hold the relationships between data as a priority. Querying relationships is fast because they are perpetually stored in the database. Relationships can be intuitively visualized using graph databases, making them useful for heavily inter-connected data.

<span class="mw-page-title-main">Named graph</span> Extension of the RDF data model

Named graphs are a key concept of Semantic Web architecture in which a set of Resource Description Framework statements are identified using a URI, allowing descriptions to be made of that set of statements such as context, provenance information or other such metadata.

Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, is able to use under the CC0 public domain license. Wikidata is a wiki powered by the software MediaWiki, including its extension for semi-structured data, the Wikibase. As of early 2023, Wikidata had 1.54 billion item statements.

GeoSPARQL is a standard for representation and querying of geospatial linked data for the Semantic Web from the Open Geospatial Consortium (OGC). The definition of a small ontology based on well-understood OGC standards is intended to provide a standardized exchange basis for geospatial RDF data which can support both qualitative and quantitative spatial reasoning and querying with the SPARQL database query language.

Semantic queries allow for queries and analytics of associative and contextual nature. Semantic queries enable the retrieval of both explicitly and implicitly derived information based on syntactic, semantic and structural information contained in data. They are designed to deliver precise results or to answer more fuzzy and wide open questions through pattern matching and digital reasoning.

Shapes Constraint Language (SHACL) is a World Wide Web Consortium (W3C) standard language for describing Resource Description Framework (RDF) graphs. SHACL has been designed to enhance the semantic and technical interoperability layers of ontologies expressed as RDF graphs.

Amazon Neptune is a managed graph database product published by Amazon.com. It is used as a web service and is part of Amazon Web Services (AWS). It was announced on November 29, 2017. Amazon Neptune supports popular graph models property graph and W3C's RDF, and their respective query languages Apache TinkerPop's Gremlin, openCypher, and SPARQL, including other Amazon Web Services products.

NitrosBase is a Russian high-performance multi-model database system. The database system supports relational, graph and document database models.

Ontotext GraphDB is a graph database and knowledge discovery tool compliant with RDF and SPARQL and available as a high-availability cluster. Ontotext GraphDB is used in various European research projects.

References

1 2 "Releases - blazegraph/database" . Retrieved 8 March 2020– via GitHub.
↑ "LICENSE.txt file" . Retrieved 8 March 2020– via GitHub.
↑ Kiryakov, Atanas; Mariana, Damova (2011). "Chapter 7: Storing the Semantic Web: Repositories". In Domingue, John; Fensel, Dieter; Hendler, James A. (eds.). Handbook of Semantic Web Technologies . Springer-Verlag Berlin Heidelberg. pp. 271–272. ISBN 978-3-540-92912-3.
↑ Thompson, Bryan; Personick, Mike; Cutcher, Martin (2014). "Chapter 8: The bigdata RDF Graph Database". In Harth, Andreas; Hose, Katja; Schenkel, Ralf (eds.). Linked data management. CRC Press. pp. 193–237. ISBN 978-1-466-58240-8.
↑ Sikos, Leslie (2015). Mastering structured data on the Semantic Web. APress. p. 171. ISBN 978-1-4842-1049-9.
↑ Sakr, Sh.; Wylot, M.; Mutharaju, R.; Le Phuoc, D.; Fundulaki, I. (2018). Linked Data: Storing, Querying, and Reasoning. Springer International Publishing Imprint Springer. pp. 70–71. ISBN 978-3-319-73515-3.
↑ "Blazegraph fork?". GitHub. 12 April 2018.
↑ "Blazegraph - Semantic Web Standards". W3.org. Retrieved 29 August 2018.
↑ "Announcing Blazegraph Release 1.5.0 | Blazegraph(tm)". blog.blazegraph.com. Retrieved 17 September 2018.
↑ "[Wikidata-tech] Wikidata Query Backend Update (take two!)". lists.wikimedia.org. Retrieved 29 August 2018. (The message also contains a link to the graph databases comparison performed by Wikimedia.)
↑ "Principes de l'API". Framagit.io (in French). Retrieved 29 August 2018.
↑ Hartig, Olaf; Thompson, Bryan (2014). "Foundations of an Alternative Approach to Reification in RDF". arXiv: 1406.3399 [cs.DB].
↑ Hartig, Olaf (2014). "Reconciliation of RDF* and Property Graphs". arXiv: 1409.3288 [cs.DB].
↑ "TinkerPop-enabled Providers". tinkerpop.apache.org. Archived from the original on 10 February 2018. Retrieved 29 August 2018. (Currently, the Blazegraph logo is replaced with the Amazon Neptune logo.)
↑ "Blazegraph TinkerPop Implementation". GitHub.com. Retrieved 29 August 2018.
↑ Fu, Zhisong; Personick, Michael (2014). MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs. pp. 1–6. ISBN 978-1-4503-2982-8.{{cite book}}: |journal= ignored (help)
↑ "Wikidata Query Service: User manual. Extensions". MediaWiki.org. Retrieved 29 August 2018.
↑ "Query Hints". Blazegraph Wiki. Retrieved 29 August 2018.
↑ "Named Subquery". Blazegraph Wiki. Retrieved 29 August 2018.
↑ "SPARQL and Amazon Web Service's Neptune database". Bob DuCharme's weblog. Retrieved 29 August 2018.
↑ "BLAZEGRAPH Trademark of Amazon Technologies, Inc.. Serial Number: 86498414". Trademarkia.com. Retrieved 29 August 2018.
↑ "blazegraph.com". ICANN WHOIS. Retrieved 29 August 2018.
↑ "Bradley Bebee". LinkedIn.com. Retrieved 29 August 2018.

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[github-releases-1] 1 2 "Releases - blazegraph/database" . Retrieved 8 March 2020– via GitHub.

[github-license-2] "LICENSE.txt file" . Retrieved 8 March 2020– via GitHub.

[3] Kiryakov, Atanas; Mariana, Damova (2011). "Chapter 7: Storing the Semantic Web: Repositories". In Domingue, John; Fensel, Dieter; Hendler, James A. (eds.). Handbook of Semantic Web Technologies . Springer-Verlag Berlin Heidelberg. pp. 271–272. ISBN 978-3-540-92912-3.

[4] Thompson, Bryan; Personick, Mike; Cutcher, Martin (2014). "Chapter 8: The bigdata RDF Graph Database". In Harth, Andreas; Hose, Katja; Schenkel, Ralf (eds.). Linked data management. CRC Press. pp. 193–237. ISBN 978-1-466-58240-8.

[5] Sikos, Leslie (2015). Mastering structured data on the Semantic Web. APress. p. 171. ISBN 978-1-4842-1049-9.

[6] Sakr, Sh.; Wylot, M.; Mutharaju, R.; Le Phuoc, D.; Fundulaki, I. (2018). Linked Data: Storing, Querying, and Reasoning. Springer International Publishing Imprint Springer. pp. 70–71. ISBN 978-3-319-73515-3.

[7] "Blazegraph fork?". GitHub. 12 April 2018.

[sw-8] "Blazegraph - Semantic Web Standards". W3.org. Retrieved 29 August 2018.

[9] "Announcing Blazegraph Release 1.5.0 | Blazegraph(tm)". blog.blazegraph.com. Retrieved 17 September 2018.

[10] "[Wikidata-tech] Wikidata Query Backend Update (take two!)". lists.wikimedia.org. Retrieved 29 August 2018. (The message also contains a link to the graph databases comparison performed by Wikimedia.)

[11] "Principes de l'API". Framagit.io (in French). Retrieved 29 August 2018.

[12] Hartig, Olaf; Thompson, Bryan (2014). "Foundations of an Alternative Approach to Reification in RDF". arXiv: 1406.3399 [cs.DB].

[13] Hartig, Olaf (2014). "Reconciliation of RDF* and Property Graphs". arXiv: 1409.3288 [cs.DB].

[14] "TinkerPop-enabled Providers". tinkerpop.apache.org. Archived from the original on 10 February 2018. Retrieved 29 August 2018. (Currently, the Blazegraph logo is replaced with the Amazon Neptune logo.)

[15] "Blazegraph TinkerPop Implementation". GitHub.com. Retrieved 29 August 2018.

[16] Fu, Zhisong; Personick, Michael (2014). MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs. pp. 1–6. ISBN 978-1-4503-2982-8.{{cite book}}: |journal= ignored (help)

[17] "Wikidata Query Service: User manual. Extensions". MediaWiki.org. Retrieved 29 August 2018.

[18] "Query Hints". Blazegraph Wiki. Retrieved 29 August 2018.

[19] "Named Subquery". Blazegraph Wiki. Retrieved 29 August 2018.

[20] "SPARQL and Amazon Web Service's Neptune database". Bob DuCharme's weblog. Retrieved 29 August 2018.

[21] "BLAZEGRAPH Trademark of Amazon Technologies, Inc.. Serial Number: 86498414". Trademarkia.com. Retrieved 29 August 2018.

[22] "blazegraph.com". ICANN WHOIS. Retrieved 29 August 2018.

[23] "Bradley Bebee". LinkedIn.com. Retrieved 29 August 2018.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]