The scope of DPV has been expanded to include non-personal data and AI technologies - though the focus of the group remains on privacy and data protection. The structure of the repo has also been changed to incorporate multiple jurisdictions and regulations, and their names have been changed e.g.
dpv-gdpr
islegal/eu/gdpr
. The article Data Privacy Vocabulary (DPV) – Version 2 by Pandit et al. (2024), accepted for presentation at the 23rd International Semantic Web Conference (ISWC 2024), describes DPV v2 in terms of its contents, methodology, current adoptions and uses, and future potential. It also describes the relevance and role of DPV in acting as a common vocabulary to support various regulatory (e.g. EU’s DGA and AI Act) and community initiatives (e.g. Solid) emerging across the globe. A Search Index of all concepts from DPV and extensions is available.
is available under a new versioned IRI for continued use - though the DPVCG recommends using the latest version of DPV. Versioned IRIs have been created to refer to specific versions, with https://w3id.org/dpv/1.0 for v1 and https://w3id.org/dpv/2.0 for v2. The versionless IRI https://w3id.org/dpv will always point to the latest version. See the v2 changelog for more details.
License: All work produced by DPVCG and provided through this repo or elsewhere is provided by contributors under the W3C Document License. A copy of the license is provided in the LICENSE.md file.
Newcomers to the DPV are recommended to start with the Primer to familiarise themselves with the concepts, semantics, and usefulness of the DPV. A Concise Primer is also available for a quick (2-pager) introduction to DPV. ### Data Privacy Vocabulary (DPV) The Data Privacy Vocabulary (DPV) provides an ontology (classes and properties) and taxonomies of concepts to represent information regarding how personal data is processed in the form of an ontology or a knowledge graph. For example, it provides taxonomies associated with:
The namespace for DPV terms is http://w3id.org/dpv#
with
suggested prefix dpv
, and serialisations are provided in
RDF/XML, Turtle, JSON-LD, and N3 formats. The default serialisations are
defined using RDFS/SKOS semantics, with an alternate serialisation defined
using OWL2 semantics.
These extensions provide additional concepts that extend the concepts and scope of the main DPV specification: - Personal Data (PD) provides a taxonomy of personal data categories - Location (LOC) provides a taxonomy of location concepts based on ISO 3166 (countries, regions) - Technology (TECH) provides a taxonomy of technology concepts - AI provides a taxonomy of AI concepts extending the TECH extension - Justifications provides concepts for representing justifications i.e. why something must be done or could not be done - Risk provides concepts for risk assessment and management
The legal extensions provide concepts associated with specific jurisdictions and the laws, authorities, and treaties within them. The Legal page provides an overview of these. The jurisdictions are represented by using their ISO 3166-2 codes.
The NACE Taxonomy serialised in RDFS provides a serialisation of the NACE v2 taxonomy in RDFS for use with DPV terms. Since then, NACE v2.1 has been published by the EU Commission. The DPVCG has decided to retire/not provide an alternative serialisation of NACE as it provided no significant benefit and the best practice for using NACE is to always utilise the official authoritative version.
The following are final reports i.e. formally published by the W3C:
The DPVCG was established as part of the SPECIAL H2020 Project, which received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 731601 from 2017 to 2019.
Harshvardhan J. Pandit was funded to work as the chair of DPVCG from 2020 to 2022 by the Irish Research Council's Government of Ireland Postdoctoral Fellowship Grant#GOIPD/2020/790, and through the ADAPT SFI Centre for Digital Media Technology is funded by Science Foundation Ireland through the SFI Research Centres Programme and is co-funded under the European Regional Development Fund (ERDF) through Grant#13/RC/2106 (2018 to 2020) and Grant#13/RC/2106_P2 (2021 onwards).
Further funding acknowledgements for individual members are provided within relevant specifications.