DWBP Implementation Report

Abstract

This document reports on evidence and implementations of the Data on the Web Best Practices Candidate Recommendation. In particular, it demonstrates that the DWBP are already in use and are also implementable.

1. Introduction

One of the main goals of the Data on the Web Best Practices (DWBP) is to facilitate interaction between publishers and consumers of data on the Web. A set of 35 Best Practices were created to cover different challenges related to data publishing and consumption, such as Metadata, Data licenses, Data provenance, Data quality, Data versioning, Data identification, Data formats, Data vocabularies, Data access and APIs, Data preservation, Feedback, Data enrichment and Data republication.

To show that the DWBP are implementable as well broadly adopted and referenced by well-known organizations, we collected evidence in the form of datasets, data portals, documents, references and guidelines (Section 2). We used two forms to collect this evidence: (DWBP evidence form and DWBP template form). The results are summarized in this report.

Besides the results collected from the surveys, in order to strengthen the DWBP adoption evidence, we also present our evaluation of how DWBP are currently being adopted by the major data catalog solutions, including CKAN, Socrata, DKAN, JUNAR, ArcGIS Open Data and OPENDATASOFT (Section 3). Finally, we also present some examples to illustrate that each one of the DWBP is implementable (Section 4).

1.1 Methodology

We followed the steps described below to collect evidence for the DWBP:

A standard email was sent to several organizations around the world asking for contributions to DWBP implementations.
Implementations of the DWBP were collected using the standard forms: DWBP evidence form and DWBP template form.
A review of the collected implementations was made in order to check which best practices should have more implementations.
A detailed review of the implementations as well as the comments received through the surveys were made in order to prepare the implementation report.
The Implementation Report was developed.

As noted, to have a broader coverage of the DWBP adoption we considered different types of evidence:

Datasets, Data Portals and Vocabularies: this type of evidence shows that the DWBP were already considered by organizations responsible for publishing data on the Web.
Documents and References: this category includes Web sites, Web pages, blogs, published papers, APIs documentation, projects and wikis. This type of evidence shows the adoption of DWBP in more general scenarios.
Guidelines: these guidelines were proposed by governmental organizations to help data publishers to make data available on the Web. Each guideline discusses and proposes practices that makes an explicit) reference to a DWBP or where the advice offered is fully consistent with the relevant DWBP Best Practice.

1.2 Meeting the exit criteria

As described in the DWBP charter, to move on to Proposed Recommendation, evidence will be adduced in order to demonstrate that each of the best practices has been recommended or adopted in at least two environments, such as data portals and formal policies. Evidence of implementation was gathered from existing datasets and data portals, which already implement the proposed best practices, as well as from national or sector-specific guidelines that reference the DWBP and documents available on the Web.

2. DWBP Evidence

The table below shows the evidence collected for each one of the DWBP.

BP	Evidence	Total
BP1	D02, D03, D04, D06, D08, D11, D12, D13, D17, D18, D19, D20, D21, D22, D31, D38, D45, D49, D50, D51, D52, D53, D55, D55, D56, D57, D58, D59, R10, G03, G04, G07, G08, G09, G11, G13, G14, G15, G16	39
BP2	D02, D06, D08, D11, D12, D13, D17, D19, D20, D21, D22, D23, D38, D45, D49, D50, D51, D52, D53, D54, D56, D57, D58, D59, R10, G01, G07, G08, G11, G13, G14, G15, G16	33
BP3	D04, D06, D08, D17, D20, D21, D31, D45, D49, D50, D52, D59, R10, G01, G02, G03, G07, G08, G16	19
BP4	D02, D03, D04, D05, D08, D11, D12, D13, D17, D20, D22, D25, D38, D45, D49, D51, D52, D53, D55, D56, D58, D59, G01, G02, G03, G08, G10, G13, G15, G16	30
BP5	D01, D06, D11, D12, D13, D16, D17, D20, D21, D24, D38, D52, D53, D58, R11, G01, G03, G08, G13, G16	20
BP6	D06, D08, D13, D16, D49, D50, D52, D54, D58, D59, R11, G03, G10, G15, G16	15
BP7	D01, D02, D05, D08, D12, D13, D17, D20, D41, D45, D46, D47, D50, D51, D59, R11, R12, G07, G16	19
BP8	D12, D13, D17, D20, D45, D46, R04, R10, R12, G08, G11, G16	12
BP9	D03, D04, D05, D11, D12, D13, D14, D17, D20, D21, D22, D41, D49, D51, D52, D53, D55, D56, D57, D58, D59, R10, G01, G02, G03, G04, G06, G07, G11, G13, G14, G15, G16	33
BP10	D02, D03, D04, D05, D06, D08, D11, D13, D14, D16, D17, D19, D20, D21, D37, D49, D51, D52, D53, D55, D58, G04, G08, G16	24
BP11	D06, D11, D13, D14, D17, D20, D32, G01, G02, G07, G15, G16	12
BP12	D01, D02, D03, D04, D05, D08, D11, D12, D13, D16, D17, D19, D20, D21, D22, D28, D30, D37, D45, D49, D51, D52, D53, D55, D56, D57, D58, D59, G02, G04, G07, G08, G10, G11, G13, G14, G15, G16	38
BP13	D03, D04, D06, D08, D11, D17, D18, D20, D21, D51, D53, D55, D56, G13	14
BP14	D01, D02, D03, D04, D05, D06, D12, D13, D14, D17, D19, D22, D23, D37, D45, D49, D51, D52, D53, D55, D56, D57, R10, G02, G07, G08, G16	27
BP15	D02, D03, D04, D06, D08, D11, D12, D13, D17, D19, D20, D21, D22, D42, D49, D50, D51, D52, D53, D55, D58, D59, R10, G01, G02, G07, G08, G10, G11, G13, G14, G15, G16	33
BP16	D03, D04, D06, D11, D16, D17, D19, D20, D21, D37, D45, D49, D50, D51, D52, D53, D55, D58, G07, G08	20
BP17	D01, D05, D11, D13, D17, D18, D20, D44, D48, D49, D52, D53, D56, D57, D58, D59, G04, G08, G16	19
BP18	D03, D04, D05, D11, D13, D16, D17, D29, D33, D44, D45, D49, D52, D53, D55, R05, G08, G16	18
BP19	D03, D04, D08, D13, D17, D19, D20, D21, D49, D51, D53, D55, D56, D57, D58, R06, R07, G07, G15, G16	20
BP20	D05, D17, D22, D45, R01, R14, G16	7
BP21	D01, D05, D06, D11, D12, D13, D17, D18, D19, D21, D22, D38, D45, D49, D51, D53, D57, D58, D59, G08, G12, G15, G16	23
BP22	D05, D06, D43, D45, D54, D56, D59, G16	8
BP23	D02, D03, D04, D11, D12, D13, D14, D15, D16, D17, D19, D20, D21, D22, D28, D44, D45, D48, D49, D52, D53, D55, D56, D57, D59, R15, G04, G05, G08	29
BP24	D03, D04, D12, D13, D14, D15, D16, D17, D19, D20, D21, D22, D44, D45, D49, D53, D55, R16	18
BP25	D02, D07, D12, D13, D14, D15, D16, D17, D19, D21, D22, D27, D37, D45, D48, D49, D52, D53, D56, D57, R15	21
BP26	D03, D04, D11, D14, D15, D17, D21, D45, D53, D55, R17	11
BP27	D04, D11, D17, D59, R10, G01, G11, G14, G16, R22	10
BP28	D11, D58, R08, R10	4
BP29	D01, D02, D04, D05, D06, D12, D13, D14, D15, D17, D19, D20, D21, D36, D45, D51, D52, D53, D56, D57, D58, G01, G04, G08, G11, G12, G16	27
BP30	D05, D13, D14, D15, D17, D20, D34, D51, D59, R13, G04, G16	12
BP31	D04, D05, D16, D17, D19, D20, D49, R03, G16	9
BP32	D01, D02, D05, D16, D17, D19, D20, D26, D45, D52, D53, D55, D57, D58, R02	15
BP33	D06, D19, D35, D45, D49, D52, R09, G16	8
BP34	D04, D05, D09, D11, D16, D17, D20, D40, D45, D49, D52, D55, D56, D58, D59, G04, G08, G09, G15, G16	20
BP35	D01, D04, D06, D11, D16, D17, D19, D20, D21, D38, D45, D49, D52, D57, D58, D59, D59, G16	18

2.1 Datasets, Data portals and Vocabularies

The following table shows organizations and implementers that contributed with DWBP evidence in the form of Datasets, Data Portals and Vocabularies.

ID	Organization Name	Evidence URI	Category	Domain	Data Catalog?^*
D01	Activist	https://data.gg/developers/health	Dataset	Healthcare	proprietary
D02	Auckland War Memorial Museum Open Data	https://datahub.io/en/dataset/am-collections-online	Dataset	Cultural Heritage	CKAN (datahub)
D03	BBC	http://shakespeare.acropolis.org.uk/	Dataset	Literature and Folklore	no
D04	BBC	http://acropolis.org.uk/	Dataset	Education	no
D05	Center for Open Data Enterprise	http://opendataimpactmap.org/map.html	Dataset	Impact Analysis	no
D06	Cetic.br	http:/cetic.br/tics/usuarios/2014/total-brasil/A/	Dataset	Digital Inclusion	no
D07	CNR-IMATI	http://linkeddata.ge.imati.cnr.it/services.jsp	Dataset	Environment	no
D08	CNR-IMATI	http://linkeddata.ge.imati.cnr.it/resource/data/dcat-void/EARTh20140604?output=text/turtle	Dataset	Environment	no
D09	Crunchbase	http://www0.cs.ucl.ac.uk/staff/w.zhang/cb.html	Dataset	Finance	proprietary
D10	Crunchbase	http://www0.cs.ucl.ac.uk/staff/w.zhang/cb.html	Dataset	Finance	proprietary
D11	Data Archiving and Networked Services (DANS)	https://easy.dans.knaw.nl/ui/home	Dataset	Archive Documents	proprietary
D12	Data.gov	http://catalog.data.gov/dataset/consumer-complaint-database	Dataset	Government Data	CKAN
D13	Data.gov.uk	https://data.gov.uk/dataset/land-registry-monthly-price-paid-data	Dataset	Government Data	CKAN
D14	Data.gov.uk	https://data.gov.uk/dataset/land-registry-monthly-price-paid-data	Dataset	Government Data	CKAN
D15	Data.gov.uk	https://data.gov.uk/dataset/uk-civil-service-high-earners/	Dataset	Government data	CKAN
D16	Datawheel, Deloitte	https://datausa.io/about/datasets/	Dataset	Government Data	no
D17	DBpedia	https://dbpedia.org	Dataset	Cross-domain	proprietary (uses DCAT)
D18	EMPREL	http://dados.recife.pe.gov.br/dataset/monitoramento-das-areas-de-riscos	Dataset	Government Data	CKAN
D19	Europeana	http://labs.europeana.eu/	Data Portal	Cultural Heritage	proprietary
D20	Faculty of Computer Science and Engineering - Skopje, Macedonia	https://datahub.io/dataset/linked-drugs	Dataset	Pharmaceutical Consumption	CKAN (datahub)
D21	FAO	http://ring.ciard.net/chinese-crop-germplasm-information-system-cgris	Dataset	Agriculture and Rural Development	proprietary
D22	Gijon City Council	http://transparencia.gijon.es/risp_datasets/show/busgijontr	Dataset	Government Data	proprietary
D23	Governo de Alagoas	http://transparencia.al.gov.br/portal/api/exportacao	Data Portal	Government data	no
D24	Governo de Alagoas	http://transparencia.al.gov.br/portal/duvidas-frequentes	Data Portal	Government Data	proprietary
D25	Governo de Alagoas	http://transparencia.al.gov.br/portal/api/licenca-de-uso	Data Portal	Government Data	proprietary
D26	Governo de Alagoas	http://transparencia.al.gov.br/pessoal/servidores-ativos/	Dataset	Government Data	proprietary
D27	Governo de Alagoas	http://transparencia.al.gov.br/portal/api/pessoal/servidores-ativos/lista-de-servidores	Dataset	Government Data	proprietary
D28	Governo de Alagoas	http://transparencia.al.gov.br/despesa/json-despesa-acao/	Dataset	Government Data	proprietary
D29	Governo de Alagoas	http://transparencia.al.gov.br/portal/download-de-dados/pessoal/servidor-ativo	Dataset	Government Data	proprietary
D30	Governo de Alagoas	http://transparencia.al.gov.br/despesa/json-despesa-acao/	Dataset	Government Data	proprietary
D31	Governo de Alagoas	http://transparencia.al.gov.br/portal/download-de-dados/despesas/comparativo-de-despesa	Dataset	Government Data	proprietary
D32	IGN - Institut National de Línformation Géographique et Forestiére	http://data.ign.fr/set/ignf/20140409.trig	Dataset	Geographic Data	no
D33	IGN - Institut National de Línformation Géographique et Forestiére	http://data.ign.fr/endpoint.html	SPARQL endpoint	Geographic Data	no
D34	Kaggle	https://www.kaggle.com/txtrouble/carbon-emissions	Dataset	Environment	proprietary
D35	Kaggle	https://www.kaggle.com/njitram/d/hugomathien/soccer/exploring-the-incident-data/comments	Dataset	Sports	proprietary
D36	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/contato/	Dataset	Government Data	CKAN
D37	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/dataset/compras-publicas-do-governo-federal	Dataset	Government Data	CKAN
D38	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/dataset/dominios-gov-br	Dataset	Government Data	CKAN
D39	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/dataset/dominios-gov-br/resource/e2ec4c92-bad8-4739-a9e3-42dad967c2cb	Dataset	Government Data	CKAN
D40	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/dataset/imoveis-dominiais-da-uniao	Dataset	Government Data	CKAN
D41	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/dataset/ocorrencias-aeronauticas-da-aviacao-civil-brasileira	Dataset	Government Data	CKAN
D42	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/dataset/orcamento-federal	Dataset	Government Data	CKAN
D43	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/dataset/promocao-e-apoio-a-eventos-nacionais-de-turismo	Dataset	Government Data	CKAN
D44	Ministério do Planejamento, Desenvolvimento e Gestão	http://dados.gov.br/dataset/siconv	Dataset	Government Data	CKAN
D45	National Transport Authority	https://data.dublinked.ie/dataset/real-time-passenger-information-rtpi-for-dublin-bus-bus-eireann-luas-and-irish-rail	Dataset	Transport	CKAN
D46	NCBI Consensus CDS database	https://www.ncbi.nlm.nih.gov/projects/CCDS/CcdsBrowse.cgi?REQUEST=PAST_ANNOUNCEMENTS	Data Portal	Biological Data	proprietary
D47	NOAA data catalog	http://data.noaa.gov	Data Portal	Environment	CKAN
D48	OpenStreetMap	http://wiki.openstreetmap.org/wiki/Planet.osm	Dataset	Geographic Data	no
D49	Ordnance Survey Ireland & ADAPT, Trinity College Dublin	https://data.gov.ie/dataset/osi-national-statutory-boundary-linked-data	Dataset	Geographic Data	CKAN
D50	Pacific Northwest National Laboratory	https://rdesc.org/metadata.php?uri=http://rdesc.org/arm/datastream/sgpswatsE25.b1	Dataset	Scientific Research	no
D51	Schema.org	http://schema.org/Organization	Vocabulary	Cross-domain	no
D52	Scottish Government	http://statistics.gov.scot/data/carbon-footprint	Dataset	Government Data	PublishMyData
D53	Scottish Governmet	http://statistics.gov.scot/data/age-at-first-birth	Dataset	Government Data	PublishMyData
D54	US Department of Energy Atmospheric Radiation Measurement (ARM) Data Archive, Microwave Radiometer Dataset	http://www.archive.arm.gov/arm/Thumbnail2.jsp?datastream=sgpmwrlosB5.b1&startDate=04/21/2009&varName=vap	Dataset	Environment	proprietary
D55	Wellcome Trust	http://wellcomelibrary.org/resource/collections/	Dataset	Digital Libraries	proprietary
D56	Wikidata	https://www.wikidata.org/wiki/Wikidata:Data_access	Dataset	Cross-domain	no
D57	World Bank	http://data.worldbank.org/data-catalog/ed-stats	Dataset	Education	proprietary
D58	Southampton Open Data Service	http://id.southampton.ac.uk/dataset/university-timetable	Data Portal	Educational Administration	proprietary
D59	CEDA - Center for Environmenatl Data Analysis	http://catalogue.ceda.ac.uk/uuid/8fb58cd1a37b4ade8cb5a3f62a240574	Dataset	Environment	yes

* This column indicates if a data catalog solution is used to provide the data. The data catalog can be based on an existing solution like CKAN or can be a proprietary one.

2.2 Documents and References

The following table shows organizations and implementers that contributed with DWBP evidence in the form Documents and References.

ID	Organization	Evidence URI	Category
R01	NextBus	https://www.nextbus.com/#!/sf-muni/E/E____I_F00/4532/4503	Page
R02	CKAN	http://docs.ckan.org/en/latest/maintaining/data-viewer.html	Page
R03	null	https://www.researchgate.net/publication/262254329_Fashion_10000_An_enriched_social_image_dataset_for_fashion_and_clothing	Paper
R04	IGN - Institut National de Línformation Géographique et Forestiére	https://www.w3.org/2016/11/sdsvoc/SDSVoc16_paper_6	Paper
R05	20th Century Reanalysis Project	http://portal.nersc.gov/project/20C_Reanalysis/	Project
R06	RESTLET	http://restlet.com/blog/2015/12/10/understanding-http-content-negotiation/	Blog
R07	W3C	https://www.w3.org/blog/2006/02/content-negotiation/	Blog
R08	David Rosenthal Blog	http://blog.dshr.org/2008/01/does-preserving-context-matter.html	Blog
R09	Universal Protein Resource	http://www.uniprot.org/contact	Site
R10	Earth System Grid/ Accelerated Climate Modeling for Energy	https://pcmdi.llnl.gov/search/acme-llnl/	Site
R11	collaboration between Lawrence Berkeley Natl Lab; MIT; and other insitutions	http://materialsproject.org	Site
R12	British Cardiovascular Intervention Society	http://www.bcis.org.uk	Site
R13	WikiPATHWAYS	http://wikipathways.org/index.php	Wiki
R14	Twitter	https://dev.twitter.com/streaming/public	Page
R15	Twitter	https://dev.twitter.com/overview/api	Page
R16	Twitter	https://dev.twitter.com/rest/public	Page
R17	Twitter	https://dev.twitter.com/overview/api/upcoming-changes-to-tweets	Page
R18	The National Archives	https://www.nationalarchives.gov.uk/documents/information-management/redirection-technical-guidance-for-departments-v4.2-web-version.pdf	Document
R19	Sunlight Foundation	http://sunlightfoundation.com/opendataguidelines/#permanent-access	Page
R20	Johns Hopkins University Data Management Services	https://dmp.data.jhu.edu/preserve-share-research-data/preserve-archive/	Page
R21	Dataverse project	http://guides.dataverse.org/en/latest/user/dataset-management.html	Page
R22	CEOS Data Stewardship Interest Group	http://ceos.org/document_management/Working_Groups/WGISS/Interest_Groups/Data_Stewardship/Best_Practices/CEOS%20Persistent%20Identifier%20Best%20Practices_v1.1.pdf	Page

2.3 Guidelines

The following table shows organizations and implementers that contributed with DWBP evidence in the form Guidelines.

ID	Guide	Creator	Country	Year
G01	DCAT-AP guidelines	European Commission	Europe	2016
G02	Open Data Support training material	not available	not available	not available
G03	Linee Guida Nazionali per la Valorizzazione del Patrimonio Informativo Pubblico	Agenzia per l'Italia Digitale	Italy	2016
G04	Romanian Open Data Guide	Chancellery of PM	Romania	2016
G05	Vidareutnyttjande.se	not available	Sweden	not available
G06	Ramverk för öppna data (municipalities and regions)	not available	Sweden	not available
G07	ELI implementation methodology: Good practices and guidelines	ELI Task Force/Publications Office of the European Union	Europe	2015
G08	Standardy publikace a katalogizace otevřených dat veřejné správy ČR	Ministry of the Interior of the Czech Republic	Czech Republic	2016
G09	Open Data Resource Pack	Scottish Government	Scotland	2015
G10	Government Data Openness and Re-Use	Government of Catalonia	Spain	2014
G11	Open Data Decalogue	Open Data Spain Community Group	Spain	2012
G12	Guía metodológica para planes open data sectoriales (Methodological Guide for Sectorial Open Data Plans)	Spanish Government	Spain	2014
G13	Guía para el desarrollo de la Universidad Abierta (Open University Development Guide)	CRUE-TIC Spain	Spain	2014
G14	Castilla y León Open Data Guidelines	Government of Castille and León	Spain	2012
G15	Open Data Support	European Commission	Europe	2014
G16	Guía de aplicación de la Norma Técnica de Interoperabilidad de Reutilización de Recursos de Información	Spanish Government - Ministerio de Hacienda y Administraciones Públicas	Spain	2016

3. General analysis

One of our main concerns when we started to collect evidence for each one of the DWBP was to have implementations from well-known organizations as well as high profile datasets and data portals worldwide, like DBpedia, Data.gov.uk, Data.gov and World Bank. Analyzing the tables presented in the previous section, we can say that we accomplished this goal. The DWBP evidence were collected from well-known organizations and projects including the ones mentioned before as well as BBC, Twitter, Europeana, Pacific Northwest National Laboratory and OpenStreetMaps. Considering the geographical coverage, we collected implementations from several countries, including Brazil, France, Ireland, New Zealand, Spain, UK, USA and Italy. It is also important to notice that evidence in the form of guidelines concerns several governmental organizations from Europe. Other important characteristic from the DWBP implementations is their broad domain coverage, e.g. they refer to different domains, like Government, Environment and Healthcare, as described in the graphic below.

Evidence count per domain

As we can observe in the graphic below, there is a broad adoption of DWBP related to Metadata (BP1 and BP2), Data Licenses (BP4), Data Identification (BP9 and BP10), Data Formats (BP12 and BP14), Vocabularies (BP15 and BP16), Data Access (BP23, BP24, BP25 and BP26) and Feedback (BP29). On the other hand, for others, such as Preserve identifiers (BP27), Assess dataset coverage (BP28), Provide real-time access (BP20) and Provide an explanation for data that is not available (BP22), collection of evidence was more difficult, especially related to datasets and data portals. This can be justified by comments received during the evidence gathering process and also available in the DWBP evidence form. Bill Roberts from the SWIRRL, for example, made the following comment about one of the Data Preservation best practices: "Too difficult to test in a meaningful way. In this system, no datasets have yet been taken offline, so the archiving process has not been developed." In the same way, he made a comment about the Best Practice Provide real-time access: "The system does not currently hold dataset collected in 'real time'. Generally the data is statistical in nature and goes through a slower collection and processing cycle."

Evidence count per Best Practices

4. DWBP and Data Catalogs

In this section we present some more evidence that shows the adoption of the DWBP. Rather than specific datasets or data portals, we use the following data catalog solutions as evidence: CKAN, Socrata, DKAN, JUNAR, ArcGIS Open Data and OPENDATASOFT. For each one of the DWBP, we show the list of data catalog solutions that implement it.

BP	Data Catalogs	Total
BP1	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP2	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP3	CKAN (partial), SOCRATA, JUNAR (partial), ARCGIS OPEN DATA	4
BP4	CKAN, SOCRATA, DKAN, ARCGIS OPEN DATA (partial), OPENDATASOFT (partial)	5
BP5	CKAN, SOCRATA, DKAN, ARCGIS OPEN DATA (partial), OPENDATASOFT (partial)	5
BP6	SOCRATA	1
BP7	CKAN, DKAN	2
BP8	SOCRATA (partial), DKAN	2
BP9	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP10		0
BP11	SOCRATA	1
BP12	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP13	OPENDATASOFT (partial)	1
BP14	SOCRATA, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	4
BP15	CKAN, SOCRATA, DKAN, JUNAR, OPENDATASOFT	5
BP16		0
BP17	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP18	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA	5
BP19	SOCRATA, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	4
BP20	ARCGIS OPEN DATA	1
BP21	SOCRATA	1
BP22		0
BP23	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP24	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP25	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP26	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP27		0
BP28		0
BP29	CKAN, SOCRATA, DKAN	3
BP30	CKAN, SOCRATA, OPENDATASOFT	3
BP31		0
BP32	CKAN, SOCRATA, DKAN, JUNAR, ARCGIS OPEN DATA, OPENDATASOFT	6
BP33	OPENDATASOFT	1
BP34		0
BP35		0

As we may notice, there is no evidence for some of the DWBP. This happens because these Best Practices do not concern the solution used for making the data available on the Web, e.g. the data catalog solution, as explained below.

BP10, BP16, BP22, BP28: these BP apply to the data itself rather than the data catalog solution used to publish the data.
BP33, B34, BP35: these BP apply to situations of data republication, i.e. it depends from the consumer rather than the data catalog solution used to publish the data.
BP31: this BP concerns processes that can be used to enhance, refine or otherwise improve raw or previously processed data, which are not part of the basic data catalog functions.

Concerning BP27 none of the data catalog solutions implement it. In general, when a dataset is not available then just a 404 error message is returned.

Some Best Practices related to metadata are partially implemented by the data catalog solutions. Note that almost all data catalog solutions are compatible with DCAT, which means that metadata covered by DCAT may be completely or partially available both in human-readable and machine-readable formats. In general, it means that just a human-readable or a machine-readable version of the metadata is available, as detailed in the following.

BP3 is partially implemented by CKAN and JUNAR because they do not offer an explicit way to present human-readable structural metadata.
BP4 and BP5 are partially implemented by ARCGIS OPEN DATA and OPENDATASOFT because it does not offer a way to represent machine-readable license metadata and machine-readable provenance metadata.
BP8 is partially implemented by SOCRATA because it does not offer a way to represent machine-readable version history metadata.
BP13 is partially implemented by OPENDATASOFT because it does not offer a way to represent machine-readable language metadata.

As a general analysis with regards to the Data on the Web Challenges, we can say that Metadata, Data Licenses and Data Formats challenges are a main concern of the data catalog solutions. The Data Access challenge has also been recognized as an important one except when it concerns real-time data. The use of Data Access APIs is a consensus. The major data catalog solutions also deal with the Data Identification challenge, however just part of the problem has been solved. The Data Vocabularies challenge has also been considered as an important one since data catalog solutions reuse existing vocabularies, e.g. DCAT, when publishing metadata about the data catalogs. Other challenges like Data Provenance, Data Versioning and Feedback have been superficially dealt with in the data catalog solutions. In general, Data Quality, Data Preservation, Data Enrichment and Data Republications are challenges still not explored by the major data catalog solutions.

5. Set of Best Practices

The following list shows the set of best practices linked to the DWBP document:

Best Practice 1: Provide metadata
Best Practice 2: Provide descriptive metadata
Best Practice 3: Provide structural metadata
Best Practice 4: Provide data license information
Best Practice 5: Provide data provenance information
Best Practice 6: Provide data quality information
Best Practice 7: Provide a version indicator
Best Practice 8: Provide version history
Best Practice 9: Use persistent URIs as identifiers of datasets
Best Practice 10: Use persistent URIs as identifiers within datasets
Best Practice 11: Assign URIs to dataset versions and series
Best Practice 12: Use machine-readable standardized data formats
Best Practice 13: Use locale-neutral data representations
Best Practice 14: Provide data in multiple formats
Best Practice 15: Reuse vocabularies, preferably standardized ones
Best Practice 16: Choose the right formalization level
Best Practice 17: Provide bulk download
Best Practice 18: Provide Subsets for Large Datasets
Best Practice 19: Use content negotiation for serving data available in multiple formats
Best Practice 20: Provide real-time access
Best Practice 21: Provide data up to date
Best Practice 22: Provide an explanation for data that is not available
Best Practice 23: Make data available through an API
Best Practice 24: Use Web Standards as the foundation of APIs
Best Practice 25: Provide complete documentation for your API
Best Practice 26: Avoid Breaking Changes to Your API
Best Practice 27: Preserve identifiers
Best Practice 28: Assess dataset coverage
Best Practice 29: Gather feedback from data consumers
Best Practice 30: Make feedback available
Best Practice 31: Enrich data by generating new data
Best Practice 32: Provide Complementary Presentations
Best Practice 33: Provide Feedback to the Original Publisher
Best Practice 34: Follow Licensing Terms
Best Practice 35: Cite the Original Publication