This specification describes a Data Integrity Cryptosuite for use when generating digital signatures using the BBS signature scheme. The Signature Suite utilizes BBS signatures to provide selective disclosure and unlinkable derived proofs.

The Working Group is actively seeking implementation feedback for this specification. In order to exit the Candidate Recommendation phase, the Working Group has set the requirement of at least two independent implementations for each feature, both mandatory and optional, in the specification. For details on the conformance testing process, see the test suites listed in the implementation report.

Introduction

This specification defines a cryptographic suite for the purpose of creating, verifying, and deriving proofs using the BBS Signature Scheme in conformance with the Data Integrity [[VC-DATA-INTEGRITY]] specification. The BBS signature scheme directly provides for selective disclosure and unlinkable proofs. It provides four high-level functions that work within the issuer, holder, verifier model. Specifically, an issuer uses the BBS `Sign` function to create a cryptographic value known as a "BBS signature" which is used in signing the original credential. A holder, on receipt of a credential signed with BBS, then verifies the credential with the BBS `Verify` function.

The holder then chooses information to selectively disclose from the received credential and uses the BBS `ProofGen` function to generate a cryptographic value, known as a "BBS proof", which is used in creating a proof for this "derived credential". The cryptographic "BBS proof" value is not linkable to the original "BBS signature" and a different, unlinkable "BBS proof" can be generated by the holder for additional "derived credentials", including any containing the exact same information. Finally, a verifier uses the BBS `ProofVerify` function to verify the derived credential received from the holder.

Applying the BBS signature scheme to verifiable credentials involves the processing specified in this document. In general, the suite uses the RDF Dataset Canonicalization Algorithm [[RDF-CANON]] to transform an input document into its canonical form. An issuer then uses selective disclosure primitives to separate the canonical form into mandatory and non-mandatory statements. These are processed separately with other information to serve as the inputs to the BBS `Sign` function along with appropriate key material. This output is used to generate a secured credential. A holder uses a set of selective disclosure functions and the BBS `Verify` function on receipt of the credential to ascertain validity.

Similarly, on receipt of a BBS signed credential, a holder uses the RDF Dataset Canonicalization Algorithm [[RDF-CANON]] to transform an input document into its canonical form, and then applies selective disclosure primitives to separate the canonical form into mandatory and selectively disclosed statements, which are appropriately processed and serve as inputs to the BBS `ProofGen` function. Suitably processed, the output of this function becomes the signed selectively disclosed credential sent to a verifier. Using canonicalization and selective disclosure primitives, the verifier can then use the BBS `ProofVerify` function to validate the credential.

Terminology

Terminology used throughout this document is defined in the Terminology section of the [[[VC-DATA-INTEGRITY]]] specification.

A conforming proof is any concrete expression of the data model that complies with the normative statements in this specification. Specifically, all relevant normative statements in Sections and of this document MUST be enforced.

A conforming processor is any algorithm realized as software and/or hardware that generates or consumes a conforming proof. Conforming processors MUST produce errors when non-conforming documents are consumed.

This document contains examples of JSON and JSON-LD data. Some of these examples are invalid JSON, as they include features such as inline comments (`//`) explaining certain portions and ellipses (`...`) indicating the omission of information that is irrelevant to the example. Such parts need to be removed if implementers want to treat the examples as valid JSON or JSON-LD.

Data Model

The following sections outline the data model that is used by this specification for verification methods and data integrity proof formats.

Verification Methods

These verification methods are used to verify Data Integrity Proofs [[VC-DATA-INTEGRITY]] produced using BLS12-381 cryptographic key material that is compliant with [[CFRG-BBS-SIGNATURE]]. The encoding formats for these key types are provided in this section. Lossless cryptographic key transformation processes that result in equivalent cryptographic key material MAY be used during the processing of digital signatures.

Multikey

The Multikey format, as defined in [[VC-DATA-INTEGRITY]], is used to express public keys for the cryptographic suites defined in this specification.

The `publicKeyMultibase` property represents a Multibase-encoded Multikey expression of a BLS12-381 public key in the G2 group. The encoding of this field is the two-byte prefix `0xeb01` followed by the 96-byte compressed public key data. The resulting 98-byte value is then encoded using base58-btc, with `z` as the multibase prefix. Any other encodings MUST NOT be allowed.
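The encoding and decoding described above can be sketched as follows. This is a non-normative JavaScript (Node.js) illustration; the minimal BigInt-based base58-btc codec and the function names are included only to keep the sketch self-contained, and a real implementation would use a tested multibase library.

```javascript
// Non-normative sketch of Multikey encoding for a BLS12-381 G2 public key:
// two-byte prefix 0xeb01, then the 96-byte compressed key, base58-btc
// encoded with a "z" multibase prefix.
const B58 = '123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz';

function base58btcEncode(bytes) {
  let n = 0n;
  for (const b of bytes) n = n * 256n + BigInt(b);
  let s = '';
  while (n > 0n) { s = B58[Number(n % 58n)] + s; n /= 58n; }
  for (const b of bytes) { if (b !== 0) break; s = '1' + s; } // preserve leading zeros
  return s;
}

function base58btcDecode(s) {
  let n = 0n;
  for (const c of s) {
    const i = B58.indexOf(c);
    if (i < 0) throw new Error('invalid base58-btc character');
    n = n * 58n + BigInt(i);
  }
  const out = [];
  while (n > 0n) { out.unshift(Number(n % 256n)); n /= 256n; }
  for (const c of s) { if (c !== '1') break; out.unshift(0); } // preserve leading zeros
  return Uint8Array.from(out);
}

function encodePublicKeyMultibase(compressedG2Key) {
  if (compressedG2Key.length !== 96) throw new Error('expected a 96-byte compressed G2 public key');
  const prefixed = new Uint8Array(98);
  prefixed.set([0xeb, 0x01]);            // Multikey prefix for BLS12-381 G2
  prefixed.set(compressedG2Key, 2);
  return 'z' + base58btcEncode(prefixed);
}

function decodePublicKeyMultibase(publicKeyMultibase) {
  if (!publicKeyMultibase.startsWith('z')) throw new Error('base58-btc ("z") encoding required');
  const bytes = base58btcDecode(publicKeyMultibase.slice(1));
  if (bytes.length !== 98 || bytes[0] !== 0xeb || bytes[1] !== 0x01) {
    throw new Error('expected the 0xeb01 Multikey prefix');
  }
  return bytes.subarray(2);              // the 96-byte compressed public key
}
```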

Developers are advised to take care not to accidentally publish a representation of a private key. Implementations of this specification will raise an error if a value other than `0xeb01` is used as the prefix in a `publicKeyMultibase` value.

{
  "id": "https://example.com/issuer/123#key-0",
  "type": "Multikey",
  "controller": "https://example.com/issuer/123",
  "publicKeyMultibase": "zUC7EK3ZakmukHhuncwkbySmomv3FmrkmS36E4Ks5rsb6VQSRpoCrx6
  Hb8e2Nk6UvJFSdyw9NK1scFXJp21gNNYFjVWNgaqyGnkyhtagagCpQb5B7tagJu3HDbjQ8h
  5ypoHjwBb"
}
          
{
  "@context": [
    "https://www.w3.org/ns/did/v1",
    "https://w3id.org/security/data-integrity/v1"
  ],
  "id": "https://example.com/issuer/123",
  "verificationMethod": [{
    "id": "https://example.com/issuer/123#key-1",
    "type": "Multikey",
    "controller": "https://example.com/issuer/123",
    "publicKeyMultibase": "zUC7EK3ZakmukHhuncwkbySmomv3FmrkmS36E4Ks5rsb6VQSRpoCr
    x6Hb8e2Nk6UvJFSdyw9NK1scFXJp21gNNYFjVWNgaqyGnkyhtagagCpQb5B7tagJu3HDbjQ8h
    5ypoHjwBb"
  }]
}
          

Proof Representations

DataIntegrityProof

A proof contains the attributes specified in the Proofs section of [[VC-DATA-INTEGRITY]] with the following restrictions.

The `verificationMethod` property of the proof MUST be a URL. Dereferencing the `verificationMethod` MUST result in an object containing a `type` property with the value set to `Multikey`.

The `type` property of the proof MUST be `DataIntegrityProof`.

The `cryptosuite` property of the proof MUST be `bbs-2023`.

The value of the `proofValue` property of the proof MUST be a BBS signature or BBS proof produced according to [[CFRG-BBS-SIGNATURE]] that is serialized and encoded according to procedures in section .

Algorithms

The following algorithms describe how to use verifiable credentials with the BBS Signature Scheme [[CFRG-BBS-SIGNATURE]]. When using the BBS signature scheme the SHA-256 variant SHOULD be used.

Implementations SHOULD fetch and cache verification method information as early as possible when adding or verifying proofs. Parameters passed to functions in this section use information from the verification method, such as the public key size, to determine function parameters, such as the cryptographic hashing algorithm.

When the RDF Dataset Canonicalization Algorithm [[RDF-CANON]] is used, implementations of that algorithm will detect dataset poisoning by default, and abort processing upon detection.

Instantiate Cryptosuite

This algorithm is used to configure a cryptographic suite to be used by the Add Proof and Verify Proof functions in [[[VC-DATA-INTEGRITY]]]. The algorithm takes an options object ([=map=] |options|) as input and returns a [=data integrity cryptographic suite instance|cryptosuite instance=] ([=struct=] |cryptosuite|).

  1. Initialize |cryptosuite| to an empty [=struct=].
  2. If |options|.|type| does not equal `DataIntegrityProof`, return |cryptosuite|.
  3. If |options|.|cryptosuite| is `bbs-2023` then:
    1. Set |cryptosuite|.|createProof| to the algorithm in Section [[[#create-base-proof-bbs-2023]]].
    2. Set |cryptosuite|.|verifyProof| to the algorithm in Section [[[#verify-derived-proof-bbs-2023]]].
  4. Return |cryptosuite|.

Selective Disclosure Functions

createShuffledIdLabelMapFunction

The following algorithm creates a label map factory function that uses an HMAC to shuffle canonical blank node identifiers. The required input is an HMAC (previously initialized with a secret key), |HMAC|. A function, labelMapFactoryFunction, is produced as output.

  1. Create a function, |labelMapFactoryFunction|, with one required input (a canonical node identifier map, |canonicalIdMap|), that will return a blank node identifier map, bnodeIdMap, as output. Set the function's implementation to:
    1. Generate a new empty bnode identifier map, bnodeIdMap.
    2. For each map entry, entry, in |canonicalIdMap|:
      1. Perform an HMAC operation on the canonical identifier from the value in entry to get an HMAC digest, digest.
      2. Generate a new string value, b64urlDigest, and initialize it to "u" followed by the base64url-no-pad-encoded value of the digest.
      3. Add a new entry, |newEntry|, to bnodeIdMap using the key from entry and b64urlDigest as the value.
    3. Derive the shuffled mapping from the `bnodeIdMap` as follows:
      1. Set `hmacIds` to be the sorted array of values from the `bnodeIdMap`, and set `bnodeKeys` to be the ordered array of keys from the `bnodeIdMap`.
      2. For each key in `bnodeKeys`, replace the `bnodeIdMap` value for that key with the index position of the value in the `hmacIds` array prefixed by "b", i.e., `bnodeIdMap.set(bkey, 'b' + hmacIds.indexOf(bnodeIdMap.get(bkey)))`.
    4. Return bnodeIdMap.
  2. Return |labelMapFactoryFunction|.

It should be noted that step 1.2 in the above algorithm is identical to step 1.2 in Section 3.3.4 `createHmacIdLabelMapFunction` of [[DI-ECDSA]], so developers might be able to reuse the code or call the function if implementing both.

bbs-2023 Functions

serializeBaseProofValue

The following algorithm serializes the base proof value, including the BBS signature, HMAC key, and mandatory pointers. The required inputs are a base signature |bbsSignature|, a BBS header |bbsHeader|, a public key |publicKey|, an HMAC key |hmacKey|, and an array of |mandatoryPointers|. A single base proof string value is produced as output.

  1. Initialize a byte array, `proofValue`, that starts with the BBS base proof header bytes `0xd9`, `0x5d`, and `0x02`.
  2. Initialize |components| to an array with five elements containing the values of: |bbsSignature|, |bbsHeader|, |publicKey|, |hmacKey|, and |mandatoryPointers|.
  3. CBOR-encode |components| per [[RFC8949]] where CBOR tagging MUST NOT be used on any of the |components|. Append the produced encoded value to |proofValue|.
  4. Initialize |baseProof| to a string with the multibase-base64url-no-pad-encoding of `proofValue`. That is, return a string starting with "`u`" and ending with the base64url-no-pad-encoded value of |proofValue|.
  5. Return |baseProof| as base proof.
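The serialization above can be sketched non-normatively as follows. The hand-rolled CBOR encoder covers only the types this structure needs (byte strings, text strings, and arrays of those); a real implementation would use a full, tested CBOR library.

```javascript
// Non-normative sketch of serializeBaseProofValue.
function cborHead(majorType, length) {
  if (length < 24) return Buffer.from([(majorType << 5) | length]);
  if (length < 256) return Buffer.from([(majorType << 5) | 24, length]);
  return Buffer.from([(majorType << 5) | 25, length >> 8, length & 0xff]); // < 65536
}

function cborEncode(value) {
  if (value instanceof Uint8Array) {           // byte string (major type 2)
    return Buffer.concat([cborHead(2, value.length), Buffer.from(value)]);
  }
  if (typeof value === 'string') {             // text string (major type 3)
    const utf8 = Buffer.from(value, 'utf8');
    return Buffer.concat([cborHead(3, utf8.length), utf8]);
  }
  if (Array.isArray(value)) {                  // array (major type 4)
    return Buffer.concat([cborHead(4, value.length), ...value.map(cborEncode)]);
  }
  throw new TypeError('type not supported by this sketch');
}

function serializeBaseProofValue({ bbsSignature, bbsHeader, publicKey, hmacKey, mandatoryPointers }) {
  const header = Buffer.from([0xd9, 0x5d, 0x02]);  // BBS base proof header
  const components = [bbsSignature, bbsHeader, publicKey, hmacKey, mandatoryPointers];
  const proofValue = Buffer.concat([header, cborEncode(components)]);
  return 'u' + proofValue.toString('base64url');   // multibase-base64url-no-pad
}
```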

parseBaseProofValue

The following algorithm parses the components of a `bbs-2023` selective disclosure base proof value. The required input is a proof value (|proofValue|). A single object, parsed base proof, containing five or seven elements, using the names "bbsSignature", "bbsHeader", "publicKey", "hmacKey", "mandatoryPointers", and optional feature parameters "pid" and "signer_blind" is produced as output.

  1. Ensure the `proofValue` string starts with u (U+0075 LATIN SMALL LETTER U), indicating that it is a `multibase-base64url-no-pad-encoded` value, and throw an error if it does not.
  2. Initialize `decodedProofValue` to the result of base64url-no-pad-decoding the substring following the leading `u` in `proofValue`.
  3. Ensure that the `decodedProofValue` starts with the BBS base proof header bytes `0xd9`, `0x5d`, and `0x02`, and throw an error if it does not.
  4. Initialize `components` to an array that is the result of CBOR-decoding the bytes that follow the three-byte BBS base proof header.
  5. Return an object with properties set to the elements of `components`, using the names "bbsSignature", "bbsHeader", "publicKey", "hmacKey", "mandatoryPointers", and, if present, the optional feature parameters "pid" and "signer_blind", respectively.

createDisclosureData

The following algorithm creates data to be used to generate a derived proof. The inputs include a JSON-LD document (|document|), a BBS base proof (|proof|), an array of JSON pointers to use to selectively disclose statements (|selectivePointers|), an OPTIONAL BBS |presentationHeader| (byte array that defaults to an empty byte array if not present), an OPTIONAL |commitment_with_proof| (a byte array), an OPTIONAL |pid| value (a byte array), and any custom JSON-LD API options (such as a document loader). A single object, disclosure data, is produced as output, which contains the `bbsProof`, `labelMap`, `mandatoryIndexes`, `selectiveIndexes`, `presentationHeader`, and `revealDocument` fields.

  1. Initialize `bbsSignature`, `bbsHeader`, `publicKey`, `hmacKey`, `mandatoryPointers`, and the optional feature parameters `pid` and `signer_blind` to the values of the associated properties in the object returned when calling the algorithm in Section , passing the `proofValue` from `proof`.
  2. Initialize `hmac` to an HMAC API using `hmacKey`. The HMAC uses the same hash algorithm used in the signature algorithm, i.e., SHA-256.
  3. Initialize `labelMapFactoryFunction` to the result of calling the `createShuffledIdLabelMapFunction` algorithm passing `hmac` as `HMAC`.
  4. Initialize `combinedPointers` to the concatenation of `mandatoryPointers` and `selectivePointers`.
  5. Initialize `groupDefinitions` to a map with the following entries: key of the string `"mandatory"` and value of `mandatoryPointers`; key of the string `"selective"` and value of `selectivePointers`; and key of the string `"combined"` and value of `combinedPointers`.
  6. Initialize `groups` and `labelMap` to the result of calling the algorithm in Section 3.3.16 canonicalizeAndGroup of the [[DI-ECDSA]] specification, passing `document`, `labelMapFactoryFunction`, `groupDefinitions`, and any custom JSON-LD API options. Note: This step transforms the document into an array of canonical N-Quads whose order has been shuffled based on HMAC-applied blank node identifiers, and groups the N-Quad strings according to selections based on JSON pointers.
  7. Compute the mandatory indexes relative to their positions in the combined statement list, i.e., find the position at which a mandatory statement occurs in the list of combined statements. One method for doing this is given below.
    1. Initialize `mandatoryIndexes` to an empty array. Set `mandatoryMatch` to the `groups.mandatory.matching` map; set `combinedMatch` to the `groups.combined.matching` map; and set `combinedIndexes` to the ordered array of just the keys of the `combinedMatch` map.
    2. For each key in the `mandatoryMatch` map, find its index in the `combinedIndexes` array (e.g., `combinedIndexes.indexOf(key)`), and add this value to the `mandatoryIndexes` array.
  8. Compute the selective indexes relative to their positions in the non-mandatory statement list, i.e., find the position at which a selected statement occurs in the list of non-mandatory statements. One method for doing this is given below.
    1. Initialize `selectiveIndexes` to an empty array. Set `selectiveMatch` to the `groups.selective.matching` map; set `mandatoryNonMatch` to the `groups.mandatory.nonMatching` map; and set `nonMandatoryIndexes` to the ordered array of just the keys of the `mandatoryNonMatch` map.
    2. For each key in the `selectiveMatch` map, find its index in the `nonMandatoryIndexes` array (e.g., `nonMandatoryIndexes.indexOf(key)`), and add this value to the `selectiveIndexes` array.
  9. Initialize `bbsMessages` to an array of byte arrays containing the values in the `nonMandatory` array of strings (the ordered values of the `groups.mandatory.nonMatching` map), encoded using the UTF-8 character encoding.
  10. Set `bbsProof` to the value computed by the appropriate procedure given below based on the values of the |commitment_with_proof| and |pid| options.
    1. If both |commitment_with_proof| and |pid| options are empty, set `bbsProof` to the value computed by the `ProofGen` procedure from [[CFRG-BBS-SIGNATURE]], i.e., `ProofGen(PK, signature, header, ph, messages, disclosed_indexes)`, where `PK` is the original issuer's public key, `signature` is the `bbsSignature`, `header` is the `bbsHeader`, `ph` is the `presentationHeader`, `messages` is `bbsMessages`, and `disclosed_indexes` is `selectiveIndexes`.
    2. If |commitment_with_proof| is not empty and |pid| is empty, set `bbsProof` to the value computed by the `ProofGen` procedure from [[CFRG-Blind-BBS-Signature]], where `PK` is the original issuer's public key, `signature` is the `bbsSignature`, `header` is the `bbsHeader`, `ph` is the `presentationHeader`, `messages` is `bbsMessages`, `disclosed_indexes` is `selectiveIndexes`, and `commitment_with_proof` and `signer_blind` are passed as given. The holder also furnishes the "secret value" that was used to compute the `commitment_with_proof`. This is the "anonymous holder binding" option.
    3. If |pid| is not empty, compute the |pseudonym| according to the procedures given in [[CFRG-Pseudonym-BBS-Signature]], and set `bbsProof` to the value computed by the `ProofGen` procedure from [[CFRG-Pseudonym-BBS-Signature]], where `PK` is the original issuer's public key, `signature` is the `bbsSignature`, `header` is the `bbsHeader`, `ph` is the `presentationHeader`, `messages` is `bbsMessages`, `disclosed_indexes` is `selectiveIndexes`, and `pseudonym` is the computed |pseudonym|. This covers both the "pseudonym with issuer known pid" and "pseudonym with hidden pid" cases.
  11. Initialize |revealDocument| to the result of the "selectJsonLd" algorithm, passing `document`, and `combinedPointers` as `pointers`.
  12. Run the RDF Dataset Canonicalization Algorithm [[RDF-CANON]] on the joined `deskolemizedNQuads` of the combined group (|combinedGroup|, i.e., `groups.combined`), passing any custom options, and get the canonical bnode identifier map, |canonicalIdMap|. Note: This map includes the canonical blank node identifiers that a verifier will produce when they canonicalize the reveal document.
  13. Initialize |verifierLabelMap| to an empty map. This map will map the canonical blank node identifiers produced by the verifier when they canonicalize the revealed document, to the blank node identifiers that were originally signed in the base proof.
  14. For each key (`inputLabel`) and value (`verifierLabel`) in `canonicalIdMap`:
    1. Add an entry to `verifierLabelMap`, using `verifierLabel` as the key, and the value associated with `inputLabel` as a key in `labelMap` as the value.
  15. Return an object with properties matching `bbsProof`; `verifierLabelMap`, as `labelMap`; `mandatoryIndexes`; `selectiveIndexes`; `presentationHeader`; `revealDocument`; and, if computed, |pseudonym|.
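Steps 7 and 8 above both compute the position of each statement in a "matching" group relative to a reference ordering (the combined statements for the mandatory indexes, or the non-mandatory statements for the selective indexes). A non-normative sketch of this shared computation, with an illustrative function name:

```javascript
// Non-normative sketch of the index computations in steps 7 and 8: map
// each matching statement's (absolute) key to its position within a
// reference ordering of keys.
function relativeIndexes(matchingKeys, referenceKeys) {
  const reference = [...referenceKeys];
  return [...matchingKeys].map(key => reference.indexOf(key));
}

// Example: combined statements occupy absolute indexes 0, 1, 3, and 4;
// the mandatory statements are those at absolute indexes 1 and 4, so
// their positions relative to the combined list are 1 and 3.
const mandatoryIndexes = relativeIndexes([1, 4], [0, 1, 3, 4]);
```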

compressLabelMap

The following algorithm compresses a label map. The required input is label map (|labelMap|). The output is a compressed label map.

  1. Initialize `map` to an empty map.
  2. For each entry (`k`, `v`) in `labelMap`:
    1. Add an entry to `map`, with a key that is a base-10 integer parsed from the characters following the "c14n" prefix in `k`, and a value that is a base-10 integer parsed from the characters following the "b" prefix in `v`.
  3. Return `map` as compressed label map.

decompressLabelMap

The following algorithm decompresses a label map. The required input is a compressed label map (|compressedLabelMap|). The output is a decompressed label map.

  1. Initialize `map` to an empty map.
  2. For each entry (`k`, `v`) in `compressedLabelMap`:
    1. Add an entry to `map`, with a key that adds the prefix "c14n" to `k`, and a value that adds a prefix of "b" to `v`.
  3. Return `map` as decompressed label map.
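The two label map algorithms above are inverses of one another and can be sketched non-normatively as follows. Keys such as `c14n3` compress to the integer `3`, and values such as `b7` compress to the integer `7`.

```javascript
// Non-normative sketch of compressLabelMap and decompressLabelMap.
function compressLabelMap(labelMap) {
  const map = new Map();
  for (const [k, v] of labelMap) {
    // strip the "c14n" and "b" prefixes and parse the remainders as base-10
    map.set(parseInt(k.slice('c14n'.length), 10), parseInt(v.slice('b'.length), 10));
  }
  return map;
}

function decompressLabelMap(compressedLabelMap) {
  const map = new Map();
  for (const [k, v] of compressedLabelMap) {
    // restore the "c14n" and "b" prefixes
    map.set(`c14n${k}`, `b${v}`);
  }
  return map;
}
```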

serializeDerivedProofValue

The following algorithm serializes a derived proof value. The required inputs are a BBS proof (|bbsProof|), a label map (|labelMap|), an array of mandatory indexes (|mandatoryIndexes|), an array of selective indexes (|selectiveIndexes|), and a BBS presentation header (|presentationHeader|). Optional input is |pseudonym|. A single derived proof value, serialized as a byte string, is produced as output.

  1. Initialize `compressedLabelMap` to the result of calling the algorithm in Section , passing `labelMap` as the parameter.
  2. Initialize a byte array, `proofValue`, that starts with the BBS disclosure proof header bytes `0xd9`, `0x5d`, and `0x03`.
  3. Initialize |components| to an array with elements containing the values of |bbsProof|, |compressedLabelMap|, |mandatoryIndexes|, |selectiveIndexes|, |presentationHeader|, and, if provided, |pseudonym|.
  4. CBOR-encode |components| per [[RFC8949]] where CBOR tagging MUST NOT be used on any of the |components|. Append the produced encoded value to |proofValue|.
  5. Return the derived proof as a string with the multibase-base64url-no-pad-encoding of |proofValue|. That is, return a string starting with "`u`" and ending with the base64url-no-pad-encoded value of `proofValue`.

parseDerivedProofValue

The following algorithm parses the components of the derived proof value. The required input is a derived proof value (|proofValue|). A single derived proof value object is produced as output, which contains a set of five or six elements, having the names `bbsProof`, `labelMap`, `mandatoryIndexes`, `selectiveIndexes`, `presentationHeader`, and the optional `pseudonym` parameter.

  1. Ensure the `proofValue` string starts with u (U+0075, LATIN SMALL LETTER U), indicating that it is a `multibase-base64url-no-pad-encoded` value, and throw an error if it does not.
  2. Initialize `decodedProofValue` to the result of base64url-no-pad-decoding the substring that follows the leading `u` in `proofValue`.
  3. Ensure that the `decodedProofValue` starts with the BBS disclosure proof header bytes `0xd9`, `0x5d`, and `0x03`, and throw an error if it does not.
  4. Initialize `components` to an array that is the result of CBOR-decoding the bytes that follow the three-byte BBS disclosure proof header. Ensure the result is an array of five or six elements: a byte array, a map of integers to integers, an array of integers, another array of integers, a byte array, and, if present, a sixth element that is a byte array for the pseudonym; otherwise, throw an error.
  5. Replace the second element in `components` using the result of calling the algorithm in Section , passing the existing second element of `components` as `compressedLabelMap`.
  6. Return derived proof value as an object with properties set to the elements of `components`, using the names `bbsProof`, `labelMap`, `mandatoryIndexes`, `selectiveIndexes`, `presentationHeader`, and, if present, `pseudonym`, respectively.

createVerifyData

The following algorithm creates the data needed to perform verification of a BBS-protected verifiable credential. The inputs include a JSON-LD document (|document|), a BBS disclosure proof (|proof|), and any custom JSON-LD API options (such as a document loader). A single verify data object value is produced as output containing the following fields: `bbsProof`, `proofHash`, `mandatoryHash`, `selectiveIndexes`, `presentationHeader`, and `nonMandatory`.

  1. Initialize `proofHash` to the result of performing RDF Dataset Canonicalization [[RDF-CANON]] on the proof options, i.e., the proof portion of the document with the `proofValue` removed. The hash used is the same as that used in the signature algorithm, i.e., SHA-256. Note: This step can be performed in parallel; it only needs to be completed before this algorithm needs to use the `proofHash` value.
  2. Initialize `bbsProof`, `labelMap`, `mandatoryIndexes`, `selectiveIndexes`, `presentationHeader`, and `pseudonym` to the values associated with their property names in the object returned when calling the algorithm in Section , passing `proofValue` from `proof`.
  3. Initialize `labelMapFactoryFunction` to the result of calling the "`createLabelMapFunction`" algorithm of [[DI-ECDSA]], passing `labelMap`.
  4. Initialize `nquads` to the result of calling the "`labelReplacementCanonicalize`" algorithm of [[DI-ECDSA]], passing `document`, `labelMapFactoryFunction`, and any custom JSON-LD API options. Note: This step transforms the document into an array of canonical N-Quads with pseudorandom blank node identifiers based on `labelMap`.
  5. Initialize `mandatory` to an empty array.
  6. Initialize `nonMandatory` to an empty array.
  7. For each entry (`index`, `nq`) in `nquads`, separate the N-Quads into mandatory and non-mandatory categories:
    1. If `mandatoryIndexes` includes `index`, add `nq` to `mandatory`.
    2. Otherwise, add `nq` to `nonMandatory`.
  8. Initialize `mandatoryHash` to the result of calling the "`hashMandatory`" primitive, passing `mandatory`.
  9. Return an object with properties matching `bbsProof`, `proofHash`, `nonMandatory`, `mandatoryHash`, `selectiveIndexes`, `presentationHeader`, and `pseudonym`.
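Step 7 above partitions the canonical N-Quads using the mandatory indexes recovered from the disclosure proof. A non-normative sketch, with an illustrative function name:

```javascript
// Non-normative sketch of step 7 of createVerifyData: separate the
// N-Quads into mandatory and non-mandatory categories by index.
function splitMandatory(nquads, mandatoryIndexes) {
  const mandatory = [];
  const nonMandatory = [];
  nquads.forEach((nq, index) => {
    // If mandatoryIndexes includes this index, the quad is mandatory.
    (mandatoryIndexes.includes(index) ? mandatory : nonMandatory).push(nq);
  });
  return { mandatory, nonMandatory };
}
```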

bbs-2023

The `bbs-2023` cryptographic suite takes an input document, canonicalizes the document using the Universal RDF Dataset Canonicalization Algorithm [[RDF-CANON]], and then applies a number of transformations and cryptographic operations resulting in the production of a data integrity proof. The algorithms in this section also include the verification of such a data integrity proof.

Create Base Proof (bbs-2023)

The following algorithm specifies how to create a [=data integrity proof=] given an unsecured data document. Required inputs are an unsecured data document ([=map=] |unsecuredDocument|), and a set of proof options ([=map=] |options|). A [=data integrity proof=] ([=map=]), or an error, is produced as output.

  1. Let |proof| be a clone of the proof options, |options|.
  2. Let |proofConfig| be the result of running the algorithm in Section [[[#base-proof-configuration-bbs-2023]]] with |options| passed as a parameter.
  3. Let |transformedData| be the result of running the algorithm in Section [[[#base-proof-transformation-bbs-2023]]] with |unsecuredDocument|, |proofConfig|, and |options| passed as parameters.
  4. Let |hashData| be the result of running the algorithm in Section [[[#base-proof-hashing-bbs-2023]]] with |transformedData| and |proofConfig| passed as parameters.
  5. Let |proofBytes| be the result of running the algorithm in Section [[[#base-proof-serialization-bbs-2023]]] with |hashData| and |options| passed as parameters.
  6. Let |proof|.|proofValue| be a base64url-encoded Multibase value of the |proofBytes|.
  7. Return |proof| as the [=data integrity proof=].

Base Proof Transformation (bbs-2023)

The following algorithm specifies how to transform an unsecured input document into a transformed document that is ready to be provided as input to the hashing algorithm in Section .

Required inputs to this algorithm are an unsecured data document (|unsecuredDocument|) and transformation options (|options|). The transformation options MUST contain a type identifier for the cryptographic suite (|type|), a cryptosuite identifier (|cryptosuite|), and a verification method (|verificationMethod|). The transformation options MUST contain an array of mandatory JSON pointers (|mandatoryPointers|) and MAY contain additional options, such as a JSON-LD document loader. A transformed data document is produced as output. Whenever this algorithm encodes strings, it MUST use UTF-8 encoding.

  1. Initialize |hmac| to an HMAC API using a locally generated and exportable HMAC key. The HMAC uses the same hash algorithm used in the signature algorithm, i.e., SHA-256. Per the recommendations of [[RFC2104]], the HMAC key MUST be the same length as the digest size; for SHA-256, this is 256 bits or 32 bytes.
  2. Initialize `labelMapFactoryFunction` to the result of calling the `createShuffledIdLabelMapFunction` algorithm passing `hmac` as `HMAC`.
  3. Initialize `groupDefinitions` to a map with an entry with a key of the string "`mandatory`" and a value of |mandatoryPointers|.
  4. Initialize `groups` to the result of calling the algorithm in Section 3.3.16 canonicalizeAndGroup of the [[DI-ECDSA]] specification, passing `labelMapFactoryFunction`, `groupDefinitions`, `unsecuredDocument` as `document`, and any custom JSON-LD API options. Note: This step transforms the document into an array of canonical N-Quads whose order has been shuffled based on HMAC-applied blank node identifiers, and groups the N-Quad strings according to selections based on JSON pointers.
  5. Initialize `mandatory` to the values in the `groups.mandatory.matching` map.
  6. Initialize `nonMandatory` to the values in the `groups.mandatory.nonMatching` map.
  7. Initialize `hmacKey` to the result of exporting the HMAC key from `hmac`.
  8. Return an object with "`mandatoryPointers`" set to `mandatoryPointers`, "`mandatory`" set to `mandatory`, "`nonMandatory`" set to `nonMandatory`, and "`hmacKey`" set to `hmacKey`.

Base Proof Hashing (bbs-2023)

The following algorithm specifies how to cryptographically hash a transformed data document and proof configuration into cryptographic hash data that is ready to be provided as input to the algorithms in Section .

The required inputs to this algorithm are a transformed data document (|transformedDocument|) and canonical proof configuration (|canonicalProofConfig|). A hash data value represented as an object is produced as output.

  1. Initialize `proofHash` to the result of calling the RDF Dataset Canonicalization algorithm [[RDF-CANON]] on `canonicalProofConfig` and then cryptographically hashing the result using the same hash that is used by the signature algorithm, i.e., SHA-256. Note: This step can be performed in parallel; it only needs to be completed before this algorithm terminates, as the result is part of the return value.
  2. Initialize `mandatoryHash` to the result of calling the algorithm in Section 3.3.17 hashMandatoryNQuads of the [[DI-ECDSA]] specification, passing |transformedDocument|.`mandatory` and using the SHA-256 algorithm.
  3. Initialize `hashData` as a deep copy of |transformedDocument|, and add `proofHash` as "`proofHash`" and `mandatoryHash` as "`mandatoryHash`" to that object.
  4. Return `hashData` as hash data.

Base Proof Configuration (bbs-2023)

The following algorithm specifies how to generate a proof configuration from a set of proof options that is used as input to the base proof hashing algorithm.

The required inputs to this algorithm are proof options (|options|). The proof options MUST contain a type identifier for the cryptographic suite (|type|) and MUST contain a cryptosuite identifier (|cryptosuite|). A proof configuration object is produced as output.

  1. Let |proofConfig| be a clone of the |options| object.
  2. If |proofConfig|.|type| is not set to `DataIntegrityProof` or |proofConfig|.|cryptosuite| is not set to `bbs-2023`, an `INVALID_PROOF_CONFIGURATION` error MUST be raised.
  3. If |proofConfig|.|created| is set and if the value is not a valid [[XMLSCHEMA11-2]] datetime, an `INVALID_PROOF_DATETIME` error MUST be raised.
  4. Set |proofConfig|.|@context| to |unsecuredDocument|.|@context|.
  5. Let |canonicalProofConfig| be the result of applying the Universal RDF Dataset Canonicalization Algorithm [[RDF-CANON]] to the |proofConfig|.
  6. Return |canonicalProofConfig|.
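The validation in steps 2 and 3 above can be sketched non-normatively as follows. `Date.parse` is used here as a loose stand-in for full [[XMLSCHEMA11-2]] datetime validation; a conforming implementation needs a stricter check, and the function name is illustrative only.

```javascript
// Non-normative sketch of proof configuration validation (steps 2 and 3).
function validateProofConfig(proofConfig) {
  if (proofConfig.type !== 'DataIntegrityProof' ||
      proofConfig.cryptosuite !== 'bbs-2023') {
    throw new Error('INVALID_PROOF_CONFIGURATION');
  }
  // Date.parse is looser than the XML Schema datetime grammar; shown here
  // only to illustrate where the datetime check occurs.
  if (proofConfig.created !== undefined &&
      Number.isNaN(Date.parse(proofConfig.created))) {
    throw new Error('INVALID_PROOF_DATETIME');
  }
}
```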

Base Proof Serialization (bbs-2023)

The following algorithm, to be called by an issuer of a BBS-protected Verifiable Credential, specifies how to create a base proof. The base proof is to be given only to the holder, who is responsible for generating a derived proof from it, exposing only selectively disclosed details in the proof to a verifier. This algorithm is designed to be used in conjunction with the algorithms defined in the Data Integrity [[VC-DATA-INTEGRITY]] specification, Section 4: Algorithms. Required inputs are cryptographic hash data (|hashData|) and proof options (|options|). Optional inputs include a |commitment_with_proof| byte array and/or a |use_pseudonyms| boolean. The proof options MUST contain a type identifier for the cryptographic suite (|type|) and MAY contain a cryptosuite identifier (|cryptosuite|). A single digital proof value represented as a series of bytes is produced as output.

  1. Initialize `proofHash`, `mandatoryPointers`, `mandatoryHash`, `nonMandatory`, and `hmacKey` to the values associated with their property names in |hashData|.
  2. Initialize `bbsHeader` to the concatenation of `proofHash` and `mandatoryHash` in that order.
  3. Initialize `bbsMessages` to an array of byte arrays containing the values in the `nonMandatory` array of strings encoded using the UTF-8 character encoding.
  4. Compute the `bbsSignature` using the procedures below, dependent on the values of |commitment_with_proof| and |use_pseudonyms| options.
    1. If |commitment_with_proof| is empty and |use_pseudonyms| is false, compute the `bbsSignature` using the `Sign` procedure of [[CFRG-BBS-Signature]], with appropriate key material, `bbsHeader` for the `header`, and `bbsMessages` for the `messages`.
    2. If |commitment_with_proof| is not empty and |use_pseudonyms| is false, compute the `bbsSignature` using the `Sign` procedure of [[CFRG-Blind-BBS-Signature]], with appropriate key material, `bbsHeader` for the `header`, and `bbsMessages` for the `messages`. If the signing procedure uses the optional |signer_blind| parameter, retain this value for use in the serialization step below. This provides for the "anonymous holder binding" feature.
    3. If |commitment_with_proof| is empty and |use_pseudonyms| is true, generate a cryptographically random 32 byte |pid| value. Compute the `bbsSignature` using the `Sign` procedure of [[CFRG-Pseudonym-BBS-Signature]], with appropriate key material, `bbsHeader` for the `header`, `bbsMessages` for the `messages`, and |pid| for the `pid`. Retain the |pid| value for use in the serialization step below. This provides for the "pseudonym with issuer known pid" feature.
    4. If |commitment_with_proof| is not empty and |use_pseudonyms| is true, compute the `bbsSignature` using the `Sign` procedure of [[CFRG-Pseudonym-BBS-Signature]], with appropriate key material, `bbsHeader` for the `header`, `bbsMessages` for the `messages`, and |commitment_with_proof| for the `commitment_with_proof`. If the signing procedure uses the optional |signer_blind| parameter, retain this value for use in the serialization step below. This provides for the "pseudonym with hidden pid" feature.
  5. Initialize `proofValue` to the result of calling the algorithm in Section , passing `bbsSignature`, `bbsHeader`, `publicKey`, `hmacKey`, `mandatoryPointers`, `pid`, and `signer_blind` values as parameters. Use empty byte arrays for `pid` and `signer_blind` if they are not used. Note that `publicKey` is a byte array of the public key, encoded according to [[CFRG-BBS-SIGNATURE]].
  6. Return `proofValue` as digital proof.
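Steps 2 and 3 above can be sketched as follows. The function name and the layout of the `hashData` dictionary are illustrative assumptions; the actual `Sign` invocation (step 4) would be provided by a BBS library.

```python
def assemble_sign_inputs(hash_data: dict) -> tuple[bytes, list[bytes]]:
    # Step 2: bbsHeader is the concatenation of proofHash and mandatoryHash,
    # in that order.
    bbs_header = hash_data["proofHash"] + hash_data["mandatoryHash"]
    # Step 3: each non-mandatory statement becomes a UTF-8 encoded BBS message.
    bbs_messages = [s.encode("utf-8") for s in hash_data["nonMandatory"]]
    return bbs_header, bbs_messages
```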

Add Derived Proof (bbs-2023)

The following algorithm, to be called by a holder of a `bbs-2023`-protected verifiable credential, creates a selective disclosure derived proof. The derived proof is to be given to the verifier. The inputs include a JSON-LD document (|document|), a BBS base proof (|proof|), an array of JSON pointers to use to selectively disclose statements (|selectivePointers|), an OPTIONAL BBS |presentationHeader| (a byte array), an OPTIONAL |commitment_with_proof| (a byte array), an OPTIONAL |pid| value (a byte array), and any custom JSON-LD API options, such as a document loader. A single selectively revealed document value, represented as an object, is produced as output.

  1. Initialize `bbsProof`, `labelMap`, `mandatoryIndexes`, `selectiveIndexes`, and `revealDocument` to the values associated with their property names in the object returned when calling the algorithm in Section , passing the `document`, `proof`, `selectivePointers`, `presentationHeader`, and any custom JSON-LD API options, such as a document loader.
  2. Initialize `newProof` to a shallow copy of `proof`.
  3. Replace `proofValue` in `newProof` with the result of calling the algorithm in Section , passing `bbsProof`, `labelMap`, `mandatoryIndexes`, `selectiveIndexes`, |commitment_with_proof|, and |pid|.
  4. Set the value of the "`proof`" property in `revealDocument` to `newProof`.
  5. Return `revealDocument` as the selectively revealed document.
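The copy-and-replace steps above might look like the following sketch. The function name is hypothetical, and the derived `proofValue` is assumed to have already been computed by the serialization algorithm referenced in step 3.

```python
def add_derived_proof(reveal_document: dict, proof: dict,
                      derived_proof_value: str) -> dict:
    # Step 2: shallow copy of the base proof options.
    new_proof = dict(proof)
    # Step 3: the derived proofValue (computed by the serialization
    # algorithm referenced above) replaces the base proofValue.
    new_proof["proofValue"] = derived_proof_value
    # Steps 4-5: attach the new proof and return the revealed document.
    reveal_document["proof"] = new_proof
    return reveal_document
```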

Verify Derived Proof (bbs-2023)

The following algorithm specifies how to verify a [=data integrity proof=] given a secured data document. The required input is a secured data document ([=map=] |securedDocument|). This algorithm returns a verification result, which is a [=struct=] whose [=struct/items=] are:

verified
`true` or `false`
verifiedDocument
Null, if [=verification result/verified=] is `false`; otherwise, an [=unsecured data document=]

To verify a derived proof, perform the following steps:

  1. Let |unsecuredDocument| be a copy of |securedDocument| with the `proof` value removed.
  2. Let |proofConfig| be a copy of |securedDocument|.|proof| with `proofValue` removed.
  3. Let |proof| be the value of |securedDocument|.|proof|.
  4. Initialize `bbsProof`, `proofHash`, `mandatoryHash`, `selectiveIndexes`, `presentationHeader`, `pseudonym`, and `nonMandatory` to the values associated with their property names in the object returned when calling the algorithm in Section , passing the |unsecuredDocument|, |proof|, and any custom JSON-LD API options (such as a document loader).
  5. Initialize `bbsHeader` to the concatenation of `proofHash` and `mandatoryHash` in that order. Initialize `disclosedMessages` to an array of byte arrays obtained from the UTF-8 encoding of the elements of the `nonMandatory` array.
  6. Initialize |verified| to the result of applying the verification algorithm below, depending on whether the |pseudonym| value is empty.
    1. If the |pseudonym| value is empty, initialize |verified| to the result of applying the verification algorithm `ProofVerify(PK, proof, header, ph, disclosed_messages, disclosed_indexes)` of [[CFRG-BBS-SIGNATURE]] with `PK` set as the public key of the original issuer, `proof` set as `bbsProof`, `header` set as `bbsHeader`, `disclosed_messages` set as `disclosedMessages`, `ph` set as `presentationHeader`, and `disclosed_indexes` set as `selectiveIndexes`. This applies to the regular BBS proof case as well as "anonymous holder binding" case.
    2. If the |pseudonym| value is not empty, initialize |verified| to the result of applying the verification algorithm `PseudonymProofVerify(PK, proof, header, ph, disclosed_messages, disclosed_indexes, pseudonym)` of [[CFRG-Pseudonym-BBS-Signature]], with `PK` set as the public key of the original issuer, `proof` set as `bbsProof`, `header` set as `bbsHeader`, `disclosed_messages` set as `disclosedMessages`, `ph` set as `presentationHeader`, `disclosed_indexes` set as `selectiveIndexes`, and `pseudonym`. This applies to the "pseudonym with issuer known pid" and "pseudonym with hidden pid" cases.
  7. Return a [=verification result=] with [=struct/items=]:
    [=verified=]
    |verified|
    [=verifiedDocument=]
    |unsecuredDocument| if |verified| is `true`, otherwise Null
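The dispatch in step 6 can be sketched as below. Here `proof_verify` and `pseudonym_proof_verify` stand in for library implementations of `ProofVerify` [[CFRG-BBS-SIGNATURE]] and `PseudonymProofVerify` [[CFRG-Pseudonym-BBS-Signature]]; all names are illustrative.

```python
def verify_derived(pk: bytes, bbs_proof: bytes, bbs_header: bytes,
                   presentation_header: bytes, disclosed_messages: list,
                   selective_indexes: list, pseudonym: bytes,
                   proof_verify, pseudonym_proof_verify) -> bool:
    # Step 6.1: empty pseudonym -> plain BBS ProofVerify (also covers the
    # "anonymous holder binding" case).
    if not pseudonym:
        return proof_verify(pk, bbs_proof, bbs_header, presentation_header,
                            disclosed_messages, selective_indexes)
    # Step 6.2: non-empty pseudonym -> PseudonymProofVerify, covering the
    # "issuer known pid" and "hidden pid" cases.
    return pseudonym_proof_verify(pk, bbs_proof, bbs_header,
                                  presentation_header, disclosed_messages,
                                  selective_indexes, pseudonym)
```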

Optional Features

The cryptographic properties of BBS signatures permit variants that can support advanced functionalities. This specification is limited to supporting only the most relevant of these enhancements, which we explain in the following sections. The variables |commitment_with_proof|, |use_pseudonyms|, |pid|, and |pseudonym| are associated with these features and are not otherwise needed for BBS signatures and proofs.

The optional BBS features described in this section, and included in the algorithms in this specification, are at risk and will be removed before the finalization of this specification if their respective specifications at the IETF do not reach RFC status on the same timeline or if there are not at least two independent implementations for each optional feature.

Anonymous Holder Binding

This feature binds, at the time of issuance, a document with base proof to a secret known only to a holder, in such a way that only that holder can generate a revealed document with derived proof that will verify. For example, if an adversary obtained the document with base proof, they could not create a revealed document with derived proof that can verify.

To provide for this functionality, a holder generates a |holder_secret| value which should generally be at least 32 bytes long and cryptographically randomly generated. This value is never shared by the holder. Instead, the holder generates a commitment along with a zero knowledge proof of knowledge of this value, using the "Commitment Generation" procedure of [[CFRG-Blind-BBS-Signature]]. This computation involves cryptographically random values and computes the |commitment_with_proof| and |secret_prover_blind| values. The |commitment_with_proof| is conveyed to the issuer while the |secret_prover_blind| is kept secret and is retained by the holder for use in generation of derived proofs. Note that a holder can run the "Commitment Generation" procedure multiple times to produce unlinkable |commitment_with_proof| values for use with different issuers.

The issuer, on receipt of the |commitment_with_proof|, follows the procedures of [[CFRG-Blind-BBS-Signature]] to produce a base proof (signature) over the document with the commitment furnished by the holder. If the issuer chooses to use the |signer_blind| parameter when creating the signature in [[CFRG-Blind-BBS-Signature]], this value needs to be conveyed to the holder as part of the base proof value.

When the holder wants to create a selectively disclosed document with derived proof, they use their |holder_secret| (as a "committed message"), the |secret_prover_blind|, and, if supplied in the base proof, the |signer_blind| in the proof generation procedure of [[CFRG-Blind-BBS-Signature]].

Verification of the revealed document with derived proof uses the "regular" BBS proof verification procedures of [[CFRG-BBS-SIGNATURE]].

Pseudonyms with Issuer-known PID

This feature is a privacy preserving enhancement that allows a verifier that has seen a selectively revealed document with derived proof from a holder to recognize that the same holder is presenting a new selectively revealed document with derived proof. Note that this may just be a new unlinkable proof (derived proof) on the same selectively revealed information. By "privacy preserving," we mean that no uniquely identifiable information is added that would allow tracking between different verifiers that may share information amongst themselves. This variant does allow for the issuer to monitor usage if verifiers share information with the issuer.

To furnish this capability, before creating the base proof for a document, an issuer generates a value known as a |pid| (prover id) which should be cryptographically random and at least 32 bytes long. This value is shared with the holder but otherwise kept secret. This value is then used in creating the base proof via the signing procedure in [[CFRG-Pseudonym-BBS-Signature]].

The holder receives the document with base proof which includes the |pid| value from the issuer. The holder obtains a |verifier_id| associated with the verifier for which they intend to create a revealed document with derived proof. Using the procedures of [[CFRG-Pseudonym-BBS-Signature]], a cryptographic |pseudonym| value is generated. The derived proof value is generated via the proof generation procedure of [[CFRG-Pseudonym-BBS-Signature]], and this value along with the |pseudonym| are given to the verifier. Note that the |pid| value cannot be recovered from the |pseudonym|.

When the verifier receives the revealed document with derived proof and |pseudonym|, they use the proof verification procedures of [[CFRG-Pseudonym-BBS-Signature]].

Pseudonyms with Hidden PID

This feature is a privacy preserving enhancement that allows a verifier that has seen a selectively revealed document with derived proof from a holder to recognize that the same holder is presenting a new selectively revealed document with derived proof. Note that this may just be a new unlinkable proof (derived proof) on the same selectively revealed information. By "privacy preserving," we mean that no uniquely identifiable information is added that would allow tracking between different verifiers that may share information amongst themselves and/or with the issuer.

To provide for this capability, a holder needs to generate a secret |pid| value that should be at least 32 bytes long and generated in a cryptographically random manner. The holder then uses the "Commitment Generation" procedure of [[CFRG-Blind-BBS-Signature]] to generate a |commitment_with_proof| value and a private |secret_prover_blind| value. The |commitment_with_proof| value needs to be conveyed to the issuer, who will use it in the issuance of a document with base proof, in accordance with [[CFRG-Pseudonym-BBS-Signature]], which is sent to the holder. The |pid| value is never shared by the holder. If the issuer chooses to use the optional |signer_blind| parameter when creating the signature, this value needs to be conveyed to the holder as part of the base proof value.

The holder obtains a |verifier_id| associated with the verifier for which they intend to create a revealed document with derived proof. Using the procedures of [[CFRG-Pseudonym-BBS-Signature]], a cryptographic |pseudonym| value is generated from their |pid| value and the |verifier_id|. The derived proof value is generated via the proof generation using the |pid|, |secret_prover_blind|, |verifier_id|, and |signer_blind| using the procedures of [[CFRG-Pseudonym-BBS-Signature]], and this value is given to the verifier along with the |pseudonym|. Note that the |pid| value cannot be recovered from the |pseudonym|.

When the verifier receives the revealed document with derived proof and |pseudonym|, they use the proof verification procedures of [[CFRG-Pseudonym-BBS-Signature]].

Security Considerations

Before reading this section, readers are urged to familiarize themselves with general security advice provided in the Security Considerations section of the Data Integrity specification.

Base Proof Security Properties

The security of the base proof is dependent on the security properties of the associated BBS signature. Digital signatures might exhibit a number of desirable cryptographic properties [[Taming_EdDSAs]]; among these are:

EUF-CMA (existential unforgeability under chosen message attacks) is usually the minimal security property required of a signature scheme. It guarantees that any efficient adversary who has the public key `pk` of the signer and has received an arbitrary number of signatures on messages of its choice (in an adaptive manner), `{(m_i, σ_i)}, i = 1, ..., N`, cannot output a valid signature `σ` for a new message `m ∉ {m_i}` (except with negligible probability). In case the attacker outputs a valid signature on a new message, `(m, σ)`, it is called an existential forgery.

SUF-CMA (strong unforgeability under chosen message attacks) is a stronger notion than EUF-CMA. It guarantees that any efficient adversary who has the public key `pk` of the signer and has received an arbitrary number of signatures on messages of its choice, `{(m_i, σ_i)}, i = 1, ..., N`, cannot output a new valid message/signature pair `(m, σ)` such that `(m, σ) ∉ {(m_i, σ_i)}` (except with negligible probability). Strong unforgeability implies that an adversary not only cannot sign new messages, but also cannot find a new signature on an old message.

In [[CDL2016]], under some reasonable assumptions, BBS signatures were proven to be EUF-CMA. Furthermore, in [[TZ2023]], under similar assumptions, BBS signatures were proven to be SUF-CMA. In both cases the assumptions are related to the hardness of the discrete logarithm problem, which is not considered secure against large-scale quantum computers.

Under non-quantum computing conditions [[CFRG-BBS-SIGNATURE]] provides additional security guidelines to BBS signature suite implementors. Further security considerations related to pairing friendly curves are discussed in [[CFRG-PAIRING-FRIENDLY]].

Derived Proof Security Properties

The security of the derived proof is dependent on the security properties of the associated BBS proof. Both [[CDL2016]] and [[TZ2023]] prove that a BBS proof is a zero knowledge proof of knowledge of a BBS signature.

As explained in [[CFRG-BBS-SIGNATURE]] this means:

a verifying party in receipt of a proof is unable to determine which signature was used to generate the proof, removing a common source of correlation. In general, each proof generated is indistinguishable from random even for two proofs generated from the same signature.

and

The proofs generated by the scheme prove to a verifier that the party who generated the proof (holder/prover or an agent of theirs) was in possession of a signature without revealing it.

More precisely, verification of a BBS proof requires the original issuer's public key as well as the unaltered, revealed BBS messages in the proper order.

Privacy Considerations

Selective Disclosure and Data Leakage

Selective disclosure permits a holder to minimize the information revealed to a verifier to achieve a particular purpose. In prescribing an overall system that enables selective disclosure, care has to be taken that additional information that was not meant to be disclosed to the verifier is minimized. Such leakage can occur through artifacts of the system. Such artifacts can come from higher layers of the system, such as in the structure of data or from the lower level cryptographic primitives.

For example, the BBS signature scheme is an extremely space efficient scheme for producing a signature on multiple messages, i.e., the cryptographic signature sent to the holder is a constant size regardless of the number of messages. The holder can then selectively disclose any of these messages to a verifier; however, as part of the signature scheme, the total number of messages signed by the issuer has to be revealed to the verifier. If such information leakage needs to be avoided, it is recommended to pad the number of messages out to a common length, as suggested in the privacy considerations section of [[CFRG-BBS-SIGNATURE]].

At the higher levels, how data gets mapped into individual statements suitable for selective disclosure, i.e., BBS messages, is a potential source of data leakage. This cryptographic suite is able to eliminate many structural artifacts used to express JSON data that might leak information (nesting, map, or array position, etc.) by using JSON-LD processing to transform inputs into RDF. RDF can then be expressed as a canonical, flat format of simple subject, property, value statements (referred to as claims in the Verifiable Credentials Data Model [[VC-DATA-MODEL-2.0]]). In the following, we examine RDF canonicalization, a general scheme for mapping a verifiable credential in JSON-LD format into a set of statements (BBS messages), for selective disclosure. We show that after this process is performed, there remains a possible source of information leakage, and we show how this leakage is mitigated via the use of a keyed pseudo random function (PRF).

RDF canonicalization can be used to flatten a JSON-LD VC into a set of statements. The algorithm is dependent on the content of the VC and also employs a cryptographic hash function to help in ordering the statements. In essence, how this happens is that each JSON object that represents the subject of claims within a JSON-LD document will be assigned an id, if it doesn't have an `@id` field defined. Such ids are known as blank node ids. These ids are needed to express claims as simple subject, property, value statements such that the subject in each claim can be differentiated. The id values are deterministically set per [[RDF-CANON]] and are based on the data in the document and the output of a cryptographic hash function such as SHA-256.

Below we show two slightly different VCs for a set of windsurf sails and their canonicalization into a set of statements that can be used for selective disclosure. By changing the year of the 6.1 size sail, we see a major change in statement ordering between these two VCs. If the holder disclosed information about just their larger sails (the 7.0 and 7.8), the verifier could tell something changed about the set of sails, i.e., information leakage.

{
  "@context": [
    "https://www.w3.org/ns/credentials/v2",
    {
      "@vocab": "https://windsurf.grotto-networking.com/selective#"
    }
  ],
  "type": [
    "VerifiableCredential"
  ],
  "credentialSubject": {
    "sails": [
      {
        "size": 5.5,
        "sailName": "Kihei",
        "year": 2023
      },
      {
        "size": 6.1,
        "sailName": "Lahaina",
        "year": 2023 // Will change this to see the effect on canonicalization
      },
      {
        "size": 7.0,
        "sailName": "Lahaina",
        "year": 2020
      },
      {
        "size": 7.8,
        "sailName": "Lahaina",
        "year": 2023
      }
    ]
  }
}
        

Canonical form of the above VC. The assignment of blank node ids, i.e., the _:c14nX labels, is dependent upon the content of the VC, and this also affects the ordering of the statements.

_:c14n0 <https://windsurf.grotto-networking.com/selective#sailName> "Lahaina" .
_:c14n0 <https://windsurf.grotto-networking.com/selective#size> "7.8E0"^^<http://www.w3.org/2001/XMLSchema#double> .
_:c14n0 <https://windsurf.grotto-networking.com/selective#year> "2023"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n1 <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <https://www.w3.org/2018/credentials#VerifiableCredential> .
_:c14n1 <https://www.w3.org/2018/credentials#credentialSubject> _:c14n4 .
_:c14n2 <https://windsurf.grotto-networking.com/selective#sailName> "Lahaina" .
_:c14n2 <https://windsurf.grotto-networking.com/selective#size> "7"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n2 <https://windsurf.grotto-networking.com/selective#year> "2020"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n3 <https://windsurf.grotto-networking.com/selective#sailName> "Kihei" .
_:c14n3 <https://windsurf.grotto-networking.com/selective#size> "5.5E0"^^<http://www.w3.org/2001/XMLSchema#double> .
_:c14n3 <https://windsurf.grotto-networking.com/selective#year> "2023"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n4 <https://windsurf.grotto-networking.com/selective#sails> _:c14n0 .
_:c14n4 <https://windsurf.grotto-networking.com/selective#sails> _:c14n2 .
_:c14n4 <https://windsurf.grotto-networking.com/selective#sails> _:c14n3 .
_:c14n4 <https://windsurf.grotto-networking.com/selective#sails> _:c14n5 .
_:c14n5 <https://windsurf.grotto-networking.com/selective#sailName> "Lahaina" .
_:c14n5 <https://windsurf.grotto-networking.com/selective#size> "6.1E0"^^<http://www.w3.org/2001/XMLSchema#double> .
_:c14n5 <https://windsurf.grotto-networking.com/selective#year> "2023"^^<http://www.w3.org/2001/XMLSchema#integer> .
        

Updated windsurf sail collection, i.e., the 6.1 size sail has been updated to the 2024 model. This changes the ordering of statements via the assignment of blank node ids.

{
  "@context": [
    "https://www.w3.org/ns/credentials/v2",
    {
      "@vocab": "https://windsurf.grotto-networking.com/selective#"
    }
  ],
  "type": [
    "VerifiableCredential"
  ],
  "credentialSubject": {
    "sails": [
      {
        "size": 5.5,
        "sailName": "Kihei",
        "year": 2023
      },
      {
        "size": 6.1,
        "sailName": "Lahaina",
        "year": 2024 // New sail to update older model, changes canonicalization
      },
      {
        "size": 7.0,
        "sailName": "Lahaina",
        "year": 2020
      },
      {
        "size": 7.8,
        "sailName": "Lahaina",
        "year": 2023
      }
    ]
  }
}
        

Canonical form of the previous VC. Note the difference in blank node id assignment and ordering of statements.

_:c14n0 <https://windsurf.grotto-networking.com/selective#sailName> "Lahaina" .
_:c14n0 <https://windsurf.grotto-networking.com/selective#size> "6.1E0"^^<http://www.w3.org/2001/XMLSchema#double> .
_:c14n0 <https://windsurf.grotto-networking.com/selective#year> "2024"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n1 <https://windsurf.grotto-networking.com/selective#sailName> "Lahaina" .
_:c14n1 <https://windsurf.grotto-networking.com/selective#size> "7.8E0"^^<http://www.w3.org/2001/XMLSchema#double> .
_:c14n1 <https://windsurf.grotto-networking.com/selective#year> "2023"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n2 <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <https://www.w3.org/2018/credentials#VerifiableCredential> .
_:c14n2 <https://www.w3.org/2018/credentials#credentialSubject> _:c14n5 .
_:c14n3 <https://windsurf.grotto-networking.com/selective#sailName> "Lahaina" .
_:c14n3 <https://windsurf.grotto-networking.com/selective#size> "7"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n3 <https://windsurf.grotto-networking.com/selective#year> "2020"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n4 <https://windsurf.grotto-networking.com/selective#sailName> "Kihei" .
_:c14n4 <https://windsurf.grotto-networking.com/selective#size> "5.5E0"^^<http://www.w3.org/2001/XMLSchema#double> .
_:c14n4 <https://windsurf.grotto-networking.com/selective#year> "2023"^^<http://www.w3.org/2001/XMLSchema#integer> .
_:c14n5 <https://windsurf.grotto-networking.com/selective#sails> _:c14n0 .
_:c14n5 <https://windsurf.grotto-networking.com/selective#sails> _:c14n1 .
_:c14n5 <https://windsurf.grotto-networking.com/selective#sails> _:c14n3 .
_:c14n5 <https://windsurf.grotto-networking.com/selective#sails> _:c14n4 .
        

To prevent such information leakage from the assignment of these blank node ids and the ordering they impose on the statements, an HMAC-based PRF is run on the blank node ids. The HMAC secret key is only shared between the issuer and holder, and each base proof generated by the issuer uses a new HMAC key. An example of this can be seen in the canonical HMAC test vector of [[DI-ECDSA]]. As discussed in the next section, for BBS to preserve unlinkability, we do not use HMAC-based blank node ids, but instead produce a shuffled version of the ordering based on the HMAC, as shown in test vector . Note that this furnishes less information hiding concerning blank node ids than the ECDSA-SD approach, since information about the number of blank node ids can leak, but it prevents linkage attacks via the essentially unique identifiers that would be produced by applying an HMAC to blank node ids.
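One way to realize such an HMAC-based shuffle is sketched below. This is an illustrative sketch, not the normative algorithm of this specification, and it assumes SHA-256 as the HMAC hash; the function name is our own.

```python
import hashlib
import hmac

def shuffled_label_order(blank_node_ids: list[str],
                         hmac_key: bytes) -> list[str]:
    # Sort the canonical blank node ids by the HMAC-SHA-256 digest of each
    # id. The resulting permutation depends on the secret per-proof HMAC
    # key, so a verifier cannot recover the content-derived canonical
    # ordering, while (unlike HMAC-renamed ids) no essentially unique
    # identifiers are exposed.
    return sorted(
        blank_node_ids,
        key=lambda bid: hmac.new(hmac_key, bid.encode("utf-8"),
                                 hashlib.sha256).digest())
```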

Selective Disclosure and Unlinkability

In some uses of VCs it can be important to the privacy of a holder to prevent the tracking or linking of multiple different verifier interactions. In particular we consider two important cases (i) verifier to issuer collusion, and (ii) verifier to verifier collusion. In the first case, shown in , a verifier reports back to the original issuer of the credential on an interaction with a holder. In this situation, the issuer could track all the holder interactions with various verifiers using the issued VC. In the second situation, shown in , multiple verifiers collude to share information about holders with whom they have interacted.


Diagram showing multiple verifiers sending data back to the issuer.
The diagram is laid out top to bottom with a circle labeled issuer at the top,
connected to a circle labeled holder below. From the circle labeled holder there
are multiple arrows to additional circles labeled verifiers. From the circles
labeled verifiers there are dashed arrows back to the circle labeled issuer
showing collusion data flow.
Verifier to issuer collusion.

Diagram showing multiple verifiers sharing with each other.
The diagram is laid out top to bottom with a circle labeled issuer at the top,
connected to a circle labeled holder below. From the circle labeled holder there
are multiple arrows to additional circles labeled verifiers. From the circles
labeled verifiers there are dashed arrows to other circles labeled verifiers
to show verifier to verifier collusion data flows.
Verifier to verifier collusion.

We use the term unlinkability to describe the property of a VC system to prevent such "linkage attacks" on holder privacy. Although the term unlinkability is relatively new, section 3.3 of [[NISTIR8053]] discusses and gives a case study of re-identification through linkage attacks. A systemization of knowledge on linkage attacks on data privacy can be found in [[Powar2023]]. The most widespread use of linkage attacks on user privacy occurs via the practice of web browser fingerprinting, a survey of which can be found in [[Pugliese2020]].

To quantify the notion of linkage, [[Powar2023]] introduces the idea of an anonymity set. In the VC case we are concerned with here, the anonymity set would contain the holder of a particular VC and other holders associated with a particular issuer. The smaller the anonymity set, the more likely the holder can be tracked across verifiers. Since a signed VC contains a reference to a public key of the issuer, the starting size of the anonymity set for a holder possessing a VC from a particular issuer is the number of VCs issued by that issuer with that particular public/private key pair. Non-malicious issuers are expected to minimize the number of public/private key pairs used to issue VCs. Note that the anonymity set idea is similar to the group privacy concept in [[vc-bitstring-status-list]]. When we use the term linkage here, we generally mean any mechanism that results in a reduction in size of the anonymity set.

Sources of linkage in a VC system supporting selective disclosure:

  1. Artifacts from cryptographic primitives.
  2. Artifacts from mapping a VC into a set of statements suitable for selective disclosure.
  3. Artifacts from proof options and mandatory revealed information in the VC.
  4. Selectively revealed information in the VC.
  5. External VC system based linkage.

We discuss each of these below.

Linkage via Cryptographic Artifacts

Cryptographic hashes, HMACs, and digital signatures by their nature generate highly unique identifiers. The output of a hash function such as SHA-256, by its collision resistance properties, is guaranteed to be essentially unique for different inputs and results in a strong linkage, i.e., it reduces the anonymity set size to one. Similarly, deterministic signature algorithms such as Ed25519 and deterministic ECDSA will produce essentially unique outputs for different inputs and lead to strong linkages.

This implies that holders can be easily tracked across verifiers via digital signature, HMAC, or hash artifacts inside VCs and hence are vulnerable to verifier-verifier collusion and verifier-issuer collusion. Randomized signature algorithms such as some forms of ECDSA can permit the issuer to generate many distinct signatures on the same inputs and send these to the holder for use with different verifiers. Such an approach could be used to prevent verifier-verifier collusion based tracking but cannot help with verifier-issuer collusion.

Achieving unlinkability requires specially designed cryptographic signature schemes that allow the holder to generate what is called a zero-knowledge proof of knowledge of a signature (ZKPKS). What this means is that the holder can take a signature from the issuer in such a scheme and compute a ZKPKS to send to a verifier. This ZKPKS cannot be linked back to the original signature, but has all the desirable properties of a signature, i.e., the verifier can use it to verify that the messages were signed by the issuer's public key and that the messages have not been altered. In addition, the holder can generate as many ZKPKSs as desired for different verifiers, and these are essentially independent and unlinkable. BBS is one such signature scheme that supports this capability.

Although the ZKPKS, known as a BBS proof in this document, has guaranteed unlinkability properties, BBS, when used with selective disclosure, has two artifacts that can contribute to linkability: the total number of messages originally signed, and the index values for the revealed statements. See the privacy considerations in [[CFRG-BBS-SIGNATURE]] for a discussion and mitigation techniques.

As mentioned in the section on Issuer's Public Keys of [[CFRG-BBS-SIGNATURE]], there is the potential threat that an issuer might use multiple public keys, with some of them used to track a specific subset of users via verifier-issuer collusion. Since the issuer's public key has to be visible to the verifier, i.e., it is referenced in the BBS proof (derived proof), it can serve as a linkage point if the issuer has many different public keys, and particularly if it uses a subset of those keys with a small subset of users (holders).

Linkage via VC Processing

We saw in the section on information leakage that RDF canonicalization uses a hash function to order statements, and that a further shuffle of the statement order is performed based on an HMAC. This can leave a fingerprint that might allow for some linkage. How strong a linkage depends on the number of blank nodes (essentially JSON objects within the VC) and the number of indexes revealed. Given n blank nodes and k disclosed indexes, in the worst case this is a reduction in anonymity set size by a factor of C(n, k), i.e., the number of combinations of size k chosen from a set of n elements. One can keep this number quite low by reducing the number of blank nodes in the VC, e.g., by keeping the VC short and simple.
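The worst-case reduction factor is easy to compute; the values of n and k below are illustrative, not drawn from any particular VC:

```python
from math import comb

# With n blank nodes and k disclosed indexes, the worst-case reduction
# in anonymity set size is C(n, k): the number of k-element subsets of n.
n_blank_nodes = 20
k_disclosed = 5
reduction_factor = comb(n_blank_nodes, k_disclosed)
print(reduction_factor)  # 15504

# A shorter, simpler VC keeps this factor much smaller:
print(comb(6, 2))  # 15
```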

Linkage via JSON-LD Node Identifiers

JSON-LD is a JSON-based format for serialization of Linked Data. As such, it supports assigning a globally unambiguous `@id` attribute (node identifier) to each object ("node", in JSON-LD terminology) within a document. This allows for the linking of linked data, enabling information about the same entity to be correlated. This correlation can be desirable or undesirable, depending on the use case.

When using BBS for its unlinkability feature, globally unambiguous node identifiers cannot be used for individuals or for their personally identifiable information, since the strong linkage they provide is undesirable. Note that the use of such identifiers is acceptable when expressing statements about non-personal information (e.g., using a globally unambiguous identifier to identify a large country or a concert event). Also note that JSON-LD's use of `@context`, which maps terms to IRIs, does not generally affect unlinkability.

Linkage via Proof Options and Mandatory Reveal

In the [[vc-data-integrity]] specification, a number of properties of the `proof` attribute of a VC are given. Care has to be taken that the optional fields do not provide strong linkage across verifiers. The optional fields include: `id`, `created`, `expires`, `domain`, `challenge`, and `nonce`. For example, the optional `created` field is a `dateTimeStamp` value which can specify the creation date of the proof down to an arbitrary sub-second granularity. Such information, if present, could greatly reduce the size of the anonymity set. If the issuer wants to include such information, they ought to make it as coarse-grained as possible relative to the number of VCs being issued over time.
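As a non-normative sketch of coarse-graining, the following truncates a precise timestamp to day granularity; the date and the chosen granularity are arbitrary illustrative values:

```python
from datetime import datetime, timezone

# A sub-second `created` timestamp can nearly identify one proof among
# many; truncating it to day granularity keeps the anonymity set large.
precise = datetime(2025, 3, 14, 9, 26, 53, 589793, tzinfo=timezone.utc)

coarse = precise.replace(hour=0, minute=0, second=0, microsecond=0)
print(coarse.isoformat())  # 2025-03-14T00:00:00+00:00
```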

The issuer can also compel a holder to reveal certain statements to a verifier via the `mandatoryPointers` input used in the creation of the Base Proof. See section , , and . By compel we mean that a generated Derived Proof will not verify unless these statements are revealed to the verifier. If such information is required to be disclosed, care should be taken that the anonymity set remains sufficiently large.

Linkage via Holder Selective Reveal

As discussed in [[Powar2023]], there are many documented cases of re-identification of individuals via linkage attacks. Hence the holder is urged to reveal as little information as possible to help keep the anonymity set large. In addition, it has been shown a number of times that innocuous-seeming information can be highly unique and thus lead to re-identification or tracking. See [[NISTIR8053]] for a walkthrough of a particularly famous case involving a former governor of Massachusetts, and [[Powar2023]] for further analysis and categorization of 94 such public cases.

External VC System Based Linkage

It ought to be pointed out that maintaining unlinkability, i.e., anonymity, requires care in the systems holding and communicating the VCs. Networking artifacts such as IP addresses (layer 3) or Ethernet/MAC addresses (layer 2) are well-known sources of linkage. For example, mobile phone MAC addresses can be used to track users who revisit a particular access point; this led mobile phone manufacturers to provide a MAC address randomization feature. Public IP addresses generally provide enough information to geolocate an individual to a city or region within a country, potentially greatly reducing the anonymity set.

Test Vectors

Demonstration of selective disclosure features including mandatory disclosure, selective disclosure, and overlap between those, requires an input credential document with more content than previous test vectors. To avoid excessively long test vectors, the starting document test vector is based on a purely fictitious windsurfing (sailing) competition scenario. In addition, we break the test vectors into two groups, based on those that would be generated by the issuer (base proof) and those that would be generated by the holder (derived proof).

Base Proof

To add a selective disclosure base proof to a document, the issuer needs the following cryptographic key material:

  1. The issuer's private/public key pair, i.e., the key pair corresponding to the verification method that will be part of the proof.
  2. An HMAC key. This is used to randomize the order of the blank node IDs to avoid potential information leakage via the blank node ID ordering. This is used only once, and is shared between issuer and holder. The HMAC in this case is functioning as a pseudorandom function (PRF).
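As a rough sketch of how an HMAC can serve as a PRF for relabeling blank node IDs, the following derives replacement labels from canonical IDs; sorting under the new labels shuffles the original order. The key value, label format, and IDs here are illustrative assumptions, not the normative algorithm or real test-vector values:

```python
import base64
import hashlib
import hmac

def hmac_id(hmac_key: bytes, canonical_id: str) -> str:
    # Derive a pseudorandom replacement label for a canonical blank
    # node ID using HMAC-SHA-256 as a PRF.
    digest = hmac.new(hmac_key, canonical_id.encode(), hashlib.sha256).digest()
    return "u" + base64.urlsafe_b64encode(digest).decode().rstrip("=")

key = bytes.fromhex("00" * 32)  # illustrative key, not a test-vector value
canonical_ids = ["c14n0", "c14n1", "c14n2"]
relabeled = {cid: hmac_id(key, cid) for cid in canonical_ids}
# Statements are then re-sorted under the new labels, hiding the
# information carried by the original canonical ordering.
```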

The key material used for generating the test vectors for the add base proof algorithm is shown below. Hexadecimal representation is used for the BBS key pairs and the HMAC key.

          

In our scenario, a sailor is registering with a race organizer for a series of windsurfing races to be held over a number of days on Maui. The organizer will inspect the sailor's equipment to certify that what has been declared is accurate. The sailor's unsigned equipment inventory is shown below.


          

In addition to letting other sailors know what kinds of equipment their competitors may be sailing on, it is mandatory that each sailor disclose the year of their most recent windsurfing board and full details on two of their sails. Note that all sailors are identified by a sail number that is printed on all their equipment. This mandatory information is specified via an array of JSON pointers as shown below.
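The selection mechanics of such pointers can be sketched with a minimal JSON Pointer (RFC 6901) resolver; the document and pointers below are simplified stand-ins for the windsurfing test vectors, not the real values:

```python
def resolve(doc, pointer: str):
    # Walk a JSON Pointer, unescaping ~1 -> "/" and ~0 -> "~",
    # indexing lists by integer and objects by key.
    node = doc
    for token in pointer.lstrip("/").split("/"):
        token = token.replace("~1", "/").replace("~0", "~")
        node = node[int(token)] if isinstance(node, list) else node[token]
    return node

doc = {
    "credentialSubject": {
        "sailNumber": "Earth101",
        "sails": [{"size": 5.5, "year": 2023}, {"size": 6.1, "year": 2024}],
    }
}
mandatory_pointers = ["/credentialSubject/sailNumber",
                      "/credentialSubject/sails/1"]

selected = [resolve(doc, p) for p in mandatory_pointers]
print(selected)  # ['Earth101', {'size': 6.1, 'year': 2024}]
```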


          

The result of applying the above JSON pointers to the sailor's equipment document is shown below.


          

Transformation of the unsigned document begins with canonicalizing the document, as shown below.


          

To prevent possible information leakage from the ordering of the blank node IDs these are processed through a PRF (i.e., the HMAC) to give the canonicalized HMAC document shown below. This represents an ordered list of statements that will be subject to mandatory and selective disclosure, i.e., it is from this list that statements are grouped.


          

The above canonical document gets grouped into mandatory and non-mandatory statements. The final output of the selective disclosure transformation process is shown below. Each statement is now grouped as mandatory or non-mandatory, and its index in the previous list of statements is remembered.


          

The next step is to create the base proof configuration and canonicalize it. This is shown in the following two examples.


          

          

In the hashing step, we compute the SHA-256 hash of the canonicalized proof options to produce the `proofHash`, and we compute the SHA-256 hash of the join of all the mandatory N-Quads to produce the `mandatoryHash`. These are shown below in hexadecimal format.
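A sketch of this step, using placeholder N-Quad strings rather than the real canonicalized data:

```python
import hashlib

# proofHash is SHA-256 over the canonicalized proof options;
# mandatoryHash is SHA-256 over the join of all mandatory N-Quads.
canonical_proof_options = '_:b0 <http://example/p> "o" .\n'
mandatory_nquads = [
    '_:b1 <http://example/q> "m0" .\n',
    '_:b2 <http://example/q> "m1" .\n',
]

proof_hash = hashlib.sha256(canonical_proof_options.encode()).hexdigest()
mandatory_hash = hashlib.sha256("".join(mandatory_nquads).encode()).hexdigest()
print(len(proof_hash), len(mandatory_hash))  # 64 64
```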


          

Shown below are the computed `bbsSignature` in hexadecimal, and the `mandatoryPointers`. These are fed to the final serialization step with the `hmacKey`.


          

Finally, the values above are run through the algorithm of Section , to produce the `proofValue` which is used in the signed base document shown below.


        

Derived Proof

The creation of BBS proofs uses random numbers and can take an optional `presentationHeader` as an input. To furnish a deterministic set of test vectors, we used the Mocked Random Scalars procedure from [[CFRG-BBS-SIGNATURE]]. The `seed` and `presentationHeader` values we used for generating the derived proof test vectors are given in hexadecimal below.


          

To create a derived proof, a holder starts with a signed document containing a base proof. The base document we will use for these test vectors is the final example from Section , above. The first step is to run the algorithm of Section to recover `bbsSignature`, `hmacKey`, and `mandatoryPointers`, as shown below.


          

Next, the holder needs to indicate what else, if anything, they wish to reveal to the verifiers, by specifying JSON pointers for selective disclosure. In our windsurfing competition scenario, a sailor (the holder) has just completed their first day of racing, and wishes to reveal to the general public (the verifiers) all the details of the windsurfing boards they used in the competition. These are shown below. Note that this slightly overlaps with the mandatorily disclosed information, which included only the year of their most recent board.


          

To produce the `revealDocument` (i.e., the unsigned document that will eventually be signed and sent to the verifier), we append the selective pointers to the mandatory pointers, and input these combined pointers, along with the document without its proof, to the `selectJsonLd` algorithm of [[DI-ECDSA]], to obtain the result shown below.


          

Now that we know what the revealed document looks like, we need to furnish appropriately updated information to the verifier about which statements are mandatory, and the indexes of the selected non-mandatory statements. Running step 6 of the algorithm yields an abundance of information about various statement groups relative to the original document. Below we show a portion of the indexes for those groups.


          

The verifier needs to be able to aggregate and hash the mandatory statements. To enable this, we furnish them with a list of indexes of the mandatory statements adjusted to their positions in the reveal document (i.e., relative to the `combinedIndexes`), while the `selectiveIndexes` need to be adjusted relative to their positions within the `nonMandatoryIndexes`. These "adjusted" indexes are shown below.
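The adjustment can be sketched as follows; the index values are illustrative, not taken from the actual test vectors:

```python
# An index is "adjusted" by replacing it with its position within the
# containing sorted list of indexes.
mandatory_indexes = [0, 1, 5]
selective_indexes = [5, 6, 7]   # raw indexes chosen by the holder
non_mandatory_indexes = [2, 3, 4, 5, 6, 7, 8]
combined_indexes = sorted(set(mandatory_indexes) | set(selective_indexes))

# Mandatory indexes relative to their positions in combinedIndexes:
adjusted_mandatory = [combined_indexes.index(i) for i in mandatory_indexes]
# Selective indexes relative to their positions in nonMandatoryIndexes:
adjusted_selective = [non_mandatory_indexes.index(i) for i in selective_indexes]
print(adjusted_mandatory, adjusted_selective)  # [0, 1, 2] [3, 4, 5]
```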



          

The last important piece of disclosure data is a mapping of canonical blank node IDs to HMAC-based shuffled IDs, the `labelMap`, computed according to Section . This is shown below along with the rest of the disclosure data minus the reveal document.


          

Finally, using the disclosure data above with the algorithm of Section , we obtain the signed derived (reveal) document shown below.


        

Acknowledgements

Portions of the work on this specification have been funded by the United States Department of Homeland Security's (US DHS) Silicon Valley Innovation Program under contracts 70RSAT20T00000003, and 70RSAT20T00000033. The content of this specification does not necessarily reflect the position or the policy of the U.S. Government and no official endorsement should be inferred.