Logical Foundations of Privacy-Preserving Publishing of Linked Data

Bernardo Cuenca Grau; Egor Kostylev

doi:10.1609/aaai.v30i1.10105

Authors

Bernardo Cuenca Grau University of Oxford
Egor Kostylev University of Oxford

DOI:

https://doi.org/10.1609/aaai.v30i1.10105

Keywords:

Linked Data, RDF Data, privacy, Semantic Web, Logic, Complexity of Reasoning

Abstract

The widespread adoption of Linked Data has been driven by the increasing demand for information exchange between organisations, as well as by data publishing regulations in domains such as health care and governance. In this setting, sensitive information is at risk of disclosure since published data can be linked with arbitrary external data sources. In this paper we lay the foundations of privacy-preserving data publishing (PPDP) in the context of Linked Data. We consider anonymisations of RDF graphs (and, more generally, relational datasets with labelled nulls) and define notions of safe and optimal anonymisations. Safety ensures that the anonymised data can be published with provable protection guarantees against linking attacks, whereas optimality ensures that it preserves as much information from the original data as possible, while satisfying the safety requirement. We establish the complexity of the underpinning decision problems both under open-world semantics inherent to RDF and a closed-world semantics, where we assume that an attacker has complete knowledge over some part of the original data.

Logical Foundations of Privacy-Preserving Publishing of Linked Data

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription