NEPOMUK (software)

NEPOMUK
Written in C++, Java
Type Semantic desktop
License Various (BSD-style preferred) [1]

NEPOMUK (Networked Environment for Personal, Ontology-based Management of Unified Knowledge) is an open-source software specification for a social semantic desktop, which enriches and interconnects data from different desktop applications using semantic metadata stored as RDF. Between 2006 and 2008 it was funded by a European Union research project of the same name, [2] which brought together industrial and academic partners to develop Semantic Desktop technologies.

Implementations

Three main implementations of NEPOMUK have been developed: a C++/KDE-based variant, a Java-based variant, and a commercial version. Further versions were created during the EU project between 2006 and 2008, some of which remained active beyond the project. [3]

KDE

NEPOMUK-KDE was originally featured as one of the newer technologies in KDE Software Compilation 4. [4] It used Soprano as the main RDF data storage and parsing library, and handled ontology imports through the Raptor parser plugin and the Redland storage plugin; all RDF data was stored in Virtuoso, which also handled full-text indexing. [5] On a technical level, NEPOMUK-KDE allowed metadata to be associated with items on a user's desktop, such as files, bookmarks, e-mails, and calendar entries. The metadata could be arbitrary RDF; tagging was its most user-visible application.
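The idea of attaching tags to desktop items as RDF metadata can be illustrated with a minimal sketch. It does not use Soprano or any NEPOMUK API: the graph is a plain Python set of (subject, predicate, object) triples, and the helper functions `tag_resource` and `resources_with_tag`, the `urn:example:tag:` URI scheme, and the example resources are all hypothetical. The property names `hasTag`, `prefLabel`, and `Tag` and their namespace are assumed to follow the NEPOMUK Annotation Ontology.

```python
# Minimal sketch: RDF-style metadata held as (subject, predicate, object) triples.
# The namespace and property names are assumptions for illustration only.
NAO = "http://www.semanticdesktop.org/ontologies/2007/08/15/nao#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def tag_resource(graph, resource_uri, label):
    """Attach a tag with the given label to a resource and return the tag URI."""
    # Hypothetical URI scheme: one tag resource per distinct label.
    tag_uri = "urn:example:tag:" + label.lower().replace(" ", "-")
    graph.add((tag_uri, RDF_TYPE, NAO + "Tag"))
    graph.add((tag_uri, NAO + "prefLabel", label))
    graph.add((resource_uri, NAO + "hasTag", tag_uri))
    return tag_uri


def resources_with_tag(graph, label):
    """Return the sorted URIs of all resources carrying a tag with this label."""
    tag_uris = {s for (s, p, o) in graph if p == NAO + "prefLabel" and o == label}
    return sorted(s for (s, p, o) in graph if p == NAO + "hasTag" and o in tag_uris)


# Items of different kinds (a file, a bookmark, an image) share one graph.
graph = set()
tag_resource(graph, "file:///home/alice/report.odt", "Project X")
tag_resource(graph, "https://example.org/bookmark", "Project X")
tag_resource(graph, "file:///home/alice/holiday.jpg", "Holiday")

print(resources_with_tag(graph, "Project X"))
```

Because the tag is itself a resource in the graph rather than a string attached to one file, the same tag can link items from different applications, which is the interconnection the semantic desktop aims for.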

As the KDE SC 4 series of releases progressed, it became apparent that NEPOMUK was not delivering the performance and user experience that had been anticipated. As a result, KDE SC 4.13 introduced Baloo, a new indexing and semantic search technology, with a short transition period that allowed applications to be ported and data to be migrated before NEPOMUK was removed. [6] [7] Baloo initially used SQLite but currently uses LMDB [8] for storage, and Xapian for searching.

Zeitgeist

The Zeitgeist framework, used by GNOME and Ubuntu's Unity user interface, uses the NEPOMUK ontology, as does the Tracker search engine.

Java

The Java-based implementation of NEPOMUK [9] was finished at the end of 2008 and served as a proof-of-concept environment for several novel semantic desktop techniques. It features its own frontend (PSEW) that integrates search, browsing, recommendation, and peer-to-peer functionality. The Java implementation uses the Sesame RDF store and the Aperture [10] framework for integrating with other desktop applications such as mail clients and browsers.

A number of artifacts have been created in the context of the Java research implementation:

Refinder by Gnowsis

Development of the commercial software-as-a-service product Refinder [12] started in 2009, and a limited beta version was released in December 2010. [13] Refinder was developed by Gnowsis, a spin-off company of the German Research Centre for Artificial Intelligence (DFKI), which led the NEPOMUK EU project. The start-up was shut down in late 2013, [14] with no plans to make the implementation code available.

Refinder used the same data formats as the other implementations but was delivered as software as a service rather than as a desktop application.

Data formats

References

  1. "Nepomuk License". Archived from the original on 2009-04-29.
  2. "European Commission : CORDIS : Projects & Results Service : Networked environment for personal ontology-based management of unified knowledge".
  3. Sauermann, Leo (20 November 2010). "How Does Gnowsis Relate to NEPOMUK and Others?". Archived from the original on 2012-02-17. Blog post by a contributor to the NEPOMUK project; lists six implementations of NEPOMUK.
  4. "NEPOMUK-KDE project". Archived from the original on 2013-03-17. Retrieved 2008-05-09.
  5. Trüg, Sebastian (22 September 2011). "About Strigi, Soprano, Virtuoso, CLucene, and Libstreamanalyzer".
  6. "KDE's Next Generation Semantic Search". 24 February 2014. Retrieved 2014-06-28.
  7. "KDE Community Wiki - Baloo". Retrieved 2014-06-28.
  8. "Baloo 5.15". Retrieved 2015-10-11.
  9. Groza, Tudor. "Semantic Desktop[.]org". Archived from the original on 2007-12-01.
  10. Sauermann, Leo; Fluit, Christiaan. "Aperture Framework".
  11. "Welcome to the framework!".
  12. "Home - Refinder".
  13. Steinschaden, Jakob (3 December 2010). "Refinder: Das nächste Level des Hyperlinks" [Refinder: The next level of the hyperlink]. Futurezone.
  14. "We are shutting Refinder down - Refinder".
  15. "Personal Information Model (PIMO)". Archived from the original on 2008-12-28.
  16. NEPOMUK Information Element Ontology Archived 2008-12-30 at the Wayback Machine