WebDriver BiDi

Editor’s Draft,

More details about this document
This version:
https://w3c.github.io/webdriver-bidi/
Latest published version:
https://www.w3.org/TR/webdriver-bidi/
Implementation Report:
https://wpt.fyi/results/webdriver/tests/bidi
Test Suite:
https://github.com/web-platform-tests/wpt/tree/master/webdriver/tests/bidi
Feedback:
GitHub
Inline In Spec
Editors:
James Graham (Mozilla)
Alex Rudenko (Google)
Maksim Sadym (Google)
Channel:
#webdriver on irc.w3.org
Wiki:
W3C WebDriver Wiki

Abstract

This document defines the BiDirectional WebDriver Protocol, a mechanism for remote control of user agents.

Status of this document

This section describes the status of this document at the time of its publication. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at https://www.w3.org/TR/.

GitHub Issues are preferred for discussion of this specification. Alternatively, you can send comments to the Browser Testing and Tools Working Group’s mailing list, public-browser-tools-testing@w3.org (archives).

This document was published by the Browser Testing and Tools Working Group as an Editor’s Draft.

Publication as an Editor’s Draft does not imply endorsement by W3C and its Members.

This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.

This document was produced by a group operating under the W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.

This document is governed by the 03 November 2023 W3C Process Document.

1. Introduction

This section is non-normative.

WebDriver defines a protocol for introspection and remote control of user agents. This specification extends WebDriver by introducing bidirectional communication. In place of the strict command/response format of WebDriver, this permits events to stream from the user agent to the controlling software, better matching the evented nature of the browser DOM.

2. Infrastructure

This specification depends on the Infra Standard. [INFRA]

Network protocol messages are defined using CDDL. [RFC8610]

This specification defines a wait queue which is a map.

Surely there’s a better mechanism for doing this "wait for an event" thing.

When an algorithm algorithm running in parallel awaits a set of events events, and resume id:

  1. Pause the execution of algorithm.

  2. Assert: wait queue does not contain resume id.

  3. Set wait queue[resume id] to (events, algorithm).

To resume given name, id and parameters:
  1. If wait queue does not contain id, return.

  2. Let (events, algorithm) be wait queue[id]

  3. For each event in events:

    1. If event equals name:

      1. Remove id from wait queue.

      2. Resume running the steps in algorithm from the point at which they were paused, passing name and parameters as the result of the await.

        Should we have something like microtasks to ensure this runs before any other tasks on the event loop?

3. Protocol

This section defines the basic concepts of the WebDriver BiDi protocol. These terms are distinct from their representation at the transport layer.

The protocol is defined using a CDDL definition. For the convenience of implementers two separate CDDL definitions are defined; the remote end definition which defines the format of messages produced on the local end and consumed on the remote end, and the local end definition which defines the format of messages produced on the remote end and consumed on the local end

3.1. Definition

Should this be an appendix?

This section gives the initial contents of the remote end definition and local end definition. These are augmented by the definition fragments defined in the remainder of the specification.

Remote end definition

Command = {
  id: js-uint,
  CommandData,
  Extensible,
}

CommandData = (
  BrowserCommand //
  BrowsingContextCommand //
  InputCommand //
  NetworkCommand //
  ScriptCommand //
  SessionCommand //
  StorageCommand //
  WebExtensionCommand
)

EmptyParams = {
   Extensible
}

Local end definition

Message = (
  CommandResponse /
  ErrorResponse /
  Event
)

CommandResponse = {
  type: "success",
  id: js-uint,
  result: ResultData,
  Extensible
}

ErrorResponse = {
  type: "error",
  id: js-uint / null,
  error: ErrorCode,
  message: text,
  ? stacktrace: text,
  Extensible
}

ResultData = (
  BrowsingContextResult /
  EmptyResult /
  NetworkResult /
  ScriptResult /
  SessionResult /
  StorageResult /
  WebExtensionResult
)

EmptyResult = {
  Extensible
}

Event = {
  type: "event",
  EventData,
  Extensible
}

EventData = (
  BrowsingContextEvent //
  LogEvent //
  NetworkEvent //
  ScriptEvent
)

Remote end definition and Local end definition

Extensible = (*text => any)

js-int = -9007199254740991..9007199254740991
js-uint = 0..9007199254740991

3.2. Session

WebDriver BiDi extends the session concept from WebDriver.

A session has a BiDi flag, which is false unless otherwise stated.

A BiDi session is a session which has the BiDi flag set to true.

The list of active BiDi sessions is given by:
  1. Let BiDi sessions be a new list.

  2. For each session in active sessions:

    1. If session is a BiDi session append session to BiDi sessions.

  3. Return BiDi sessions.

3.3. Modules

The WebDriver BiDi protocol is organized into modules.

Each module represents a collection of related commands and events pertaining to a certain aspect of the user agent. For example, a module might contain functionality for inspecting and manipulating the DOM, or for script execution.

Each module has a module name which is a string. The command name and event name for commands and events defined in the module start with the module name followed by a period ".".

Modules which contain commands define remote end definition fragments. These provide choices in the CommandData group for the module’s commands, and can also define additional definition properties. They can also define local end definition fragments that provide additional choices in the ResultData group for the results of commands in the module.

Modules which contain events define local end definition fragments that are choices in the Event group for the module’s events.

An implementation may define extension modules. These must have a module name that contains a single colon ":" character. The part before the colon is the prefix; this is typically the same for all extension modules specific to a given implementation and should be unique for a given implementation.

Other specifications may define their own WebDriver-BiDi modules that extend the protocol. Such modules must not have a name which contains a colon (:) character, nor must they define command names, event names, or property names that contain that character.

Authors of external specifications are encouraged to to add new modules rather than extending existing ones. Where it is desired to extend an existing module, it is preferred to integrate the extension directly into the specification containing the original module definition.

3.4. Commands

A command is an asynchronous operation, requested by the local end and run on the remote end, resulting in either a result or an error being returned to the local end. Multiple commands can run at the same time, and commands can potentially be long-running. As a consequence, commands can finish out-of-order.

Each command is defined by:

A command that can run without an active session is a static command. Commands are not static commands unless stated in their definition.

When commands are sent from the local end they have a command id. This is an identifier used by the local end to identify the response from a particular command. From the point of view of the remote end this identifier is opaque and cannot be used internally to identify the command.

Note: This is because the command id is entirely controlled by the local end and isn’t necessarily unique over the course of a session. For example a local end which ignores all responses could use the same command id for each command.

The set of all command names is a set containing all the defined command names, including any belonging to extension modules.

3.5. Errors

WebDriver BiDi extends the set of error codes from WebDriver with the following additional codes:

invalid web extension
Tried to install an invalid web extension.
no such client window
Tried to interact with an unknown client window.
no such handle
Tried to deserialize an unknown RemoteObjectReference.
no such history entry
Tried to havigate to an unknown session history entry.
no such intercept
Tried to remove an unknown network intercept.
no such node
Tried to deserialize an unknown SharedReference.
no such request
Tried to continue an unknown request.
no such script
Tried to remove an unknown preload script.
no such storage partition
Tried to access data in a non-existent storage partition.
no such user context
Tried to reference an unknown user context.
no such web extension
Tried to reference an unknown web extension.
unable to close browser
Tried to close the browser, but failed to do so.
unable to set cookie
Tried to create a cookie, but the user agent rejected it.
underspecified storage partition
Tried to interact with data in a storage partition which was not adequately specified.
unable to set file input
Tried to set a file input, but failed to do so.
ErrorCode = "invalid argument" /
            "invalid selector" /
            "invalid session id" /
            "invalid web extension" /
            "move target out of bounds" /
            "no such alert" /
            "no such element" /
            "no such frame" /
            "no such handle" /
            "no such history entry" /
            "no such intercept" /
            "no such node" /
            "no such request" /
            "no such script" /
            "no such storage partition" /
            "no such user context" /
            "no such web extension" /
            "session not created" /
            "unable to capture screen" /
            "unable to close browser" /
            "unable to set cookie" /
            "unable to set file input" /
            "underspecified storage partition" /
            "unknown command" /
            "unknown error" /
            "unsupported operation"

3.6. Events

An event is a notification, sent by the remote end to the local end, signaling that something of interest has occurred on the remote end.

A BiDi session has subscriptions which is a list of subscriptions.

A BiDi session has a known subscription ids which is a set of all subscription ids that have been issued to the local end but which have not yet been unsubscribed.

A subscription is a struct consisting of a subscription id (a string), event names (a set of event names) and top-level traversable ids (a set of IDs of top-level traversables).

A subscription subscription is global if subscription’s top-level traversable ids is an empty set.

The set of sessions for which an event is enabled given event name and navigables is:

  1. Let sessions be a new set.

  2. For each session in active BiDI sessions:

    1. If event is enabled with session, event name and navigables, append session to sessions.

  3. Return sessions.

To determine if an event is enabled given session, event name and navigables:

Note: navigables is a set because a shared worker can be associated with multiple contexts.

  1. Let top-level traversables be get top-level traversables with navigables.

  2. For each subscription in session’s subscriptions:

    1. If subscription’s event names do not contains event name, continue.

    2. If subscription is global return true.

    3. Let subscription top-level traversables be get navigables by ids with subscription’s top-level traversable ids.

    4. If the intersection of top-level traversables and subscription top-level traversables is not empty return true.

  3. Return false.

The set of top-level traversables for which an event is enabled given event name and session is:

  1. Let result be a new set.

  2. For each subscription in session’s subscriptions:

    1. If subscription’s event names does not contain event name, continue.

    2. If subscription’s is global:

      1. For each traversable in remote end’s top-level traversables:

        1. Append traversable to result.

      2. Break.

    3. Otherwise:

      1. Let top-level traversables be get navigables by ids with subscription’s top-level traversable ids.

      2. Append each item of top-level traversables to result.

  3. Return result.

4. Transport

Message transport is provided using the WebSocket protocol. [RFC6455]

Note: In the terms of the WebSocket protocol, the local end is the client and the remote end is the server / remote host.

Note: The encoding of commands and events as messages is similar to JSON-RPC, but this specification does not normatively reference it. [JSON-RPC] The normative requirements on remote ends are instead given as a precise processing model, while no normative requirements are given for local ends.

A WebSocket listener is a network endpoint that is able to accept incoming WebSocket connections.

A WebSocket listener has a host, a port, a secure flag, and a list of WebSocket resources.

When a WebSocket listener listener is created, a remote end must start to listen for WebSocket connections on the host and port given by listener’s host and port. If listener’s secure flag is set, then connections established from listener must be TLS encrypted.

A remote end has a set of WebSocket listeners active listeners, which is initially empty.

A remote end has a set of WebSocket connections not associated with a session, which is initially empty.

A WebSocket connection is a network connection that follows the requirements of the WebSocket protocol

A BiDi session has a set of session WebSocket connections whose elements are WebSocket connections. This is initially empty.

A BiDi session session is associated with connection connection if session’s session WebSocket connections contains connection.

Note: Each WebSocket connection is associated with at most one BiDi session.

When a client establishes a WebSocket connection connection by connecting to one of the set of active listeners listener, the implementation must proceed according to the WebSocket server-side requirements, with the following steps run when deciding whether to accept the incoming connection:

  1. Let resource name be the resource name from reading the client’s opening handshake. If resource name is not in listener’s list of WebSocket resources, then stop running these steps and act as if the requested service is not available.

  2. If resource name is the byte string "/session", and the implementation supports BiDi-only sessions:

    1. Run any other implementation-defined steps to decide if the connection should be accepted, and if it is not stop running these steps and act as if the requested service is not available.

    2. Add the connection to WebSocket connections not associated with a session.

    3. Return.

  3. Get a session ID for a WebSocket resource with resource name and let session id be that value. If session id is null then stop running these steps and act as if the requested service is not available.

  4. If there is a session in the list of active sessions with session id as its session ID then let session be that session. Otherwise stop running these steps and act as if the requested service is not available.

  5. Run any other implementation-defined steps to decide if the connection should be accepted, and if it is not stop running these steps and act as if the requested service is not available.

  6. Otherwise append connection to session’s session WebSocket connections, and proceed with the WebSocket server-side requirements when a server chooses to accept an incoming connection.

Do we support > 1 connection for a single session?

When a WebSocket message has been received for a WebSocket connection connection with type type and data data, a remote end must handle an incoming message given connection, type and data.

When the WebSocket closing handshake is started or when the WebSocket connection is closed for a WebSocket connection connection, a remote end must handle a connection closing given connection.

Note: Both conditions are needed because it is possible for a WebSocket connection to be closed without a closing handshake.

To construct a WebSocket resource name given a session session:

  1. If session is null, return "/session"

  2. Return the result of concatenating the string "/session/" with session’s session ID.

To construct a WebSocket URL given a WebSocket listener listener and session session:

  1. Let resource name be the result of construct a WebSocket resource name with session.

  2. Return a WebSocket URI constructed with host set to listener’s host, port set to listener’s port, path set to resource name, following the wss-URI construct if listener’s secure flag is set and the ws-URL construct otherwise.

To get a session ID for a WebSocket resource given resource name:

  1. If resource name doesn’t begin with the byte string "/session/", return null.

  2. Let session id be the bytes in resource name following the "/session/" prefix.

  3. If session id is not the string representation of a UUID, return null.

  4. Return session id.

To start listening for a WebSocket connection given a session session:
  1. If there is an existing WebSocket listener in active listeners which the remote end would like to reuse, let listener be that listener. Otherwise let listener be a new WebSocket listener with implementation-defined host, port, secure flag, and an empty list of WebSocket resources.

  2. Let resource name be the result of construct a WebSocket resource name with session.

  3. Append resource name to the list of WebSocket resources for listener.

  4. Append listener to the remote end's active listeners.

  5. Return listener.

Note: An intermediary node handling multiple sessions can use one or many WebSocket listeners. WebDriver defines that an endpoint node supports at most one session at a time, so it’s expected to only have a single listener.

Note: For an endpoint node the host in the above steps will typically be "localhost".

To handle an incoming message given a WebSocket connection connection, type type and data data:
  1. If type is not text, send an error response given connection, null, and invalid argument, and finally return.

  2. Assert: data is a scalar value string, because the WebSocket handling errors in UTF-8-encoded data would already have failed the WebSocket connection otherwise.

    Nothing seems to define what status code is used for UTF-8 errors.

  3. If there is a BiDi Session associated with connection connection, let session be that session. Otherwise if connection is in WebSocket connections not associated with a session, let session be null. Otherwise, return.

  4. Let parsed be the result of parsing JSON into Infra values given data. If this throws an exception, then send an error response given connection, null, and invalid argument, and finally return.

  5. If session is not null and not in active sessions then return.

  6. Match parsed against the remote end definition. If this results in a match:

    1. Let matched be the map representing the matched data.

    2. Assert: matched contains "id", "method", and "params".

    3. Let command id be matched["id"].

    4. Let method be matched["method"]

    5. Let command be the command with command name method.

    6. If session is null and command is not a static command, then send an error response given connection, command id, and invalid session id, and return.

    7. Run the following steps in parallel:

      1. Let result be the result of running the remote end steps for command given session and command parameters matched["params"]

      2. If result is an error, then send an error response given connection, command id, and result’s error code, and finally return.

      3. Let value be result’s data.

      4. Assert: value matches the definition for the result type corresponding to the command with command name method.

      5. If method is "session.new", let session be the entry in the list of active sessions whose session ID is equal to the "sessionId" property of value, append connection to session’s session WebSocket connections, and remove connection from the WebSocket connections not associated with a session.

      6. Let response be a new map matching the CommandResponse production in the local end definition with the id field set to command id and the value field set to value.

      7. Let serialized be the result of serialize an infra value to JSON bytes given response.

      8. Send a WebSocket message comprised of serialized over connection.

  7. Otherwise:

    1. Let command id be null.

    2. If parsed is a map and parsed["id"] exists and is an integer greater than or equal to zero, set command id to that integer.

    3. Let error code be invalid argument.

    4. If parsed is a map and parsed["method"] exists and is a string, but parsed["method"] is not in the set of all command names, set error code to unknown command.

    5. Send an error response given connection, command id, and error code.

To get related navigables given an settings object settings:

  1. Let related navigables be an empty set.

  2. If settingsrelevant global object is a Window:

    1. Let navigable be relevant global object's associated Document’s node navigable.

    2. If navigable is not null, append navigable to related navigables.

  3. Otherwise if the global object specified by settings is a WorkerGlobalScope, for each owner in the global object's owner set:

    1. Let navigable be null.

    2. If owner is a Document, set navigable to owner’s node navigable.

    3. If navigable is not null, append navigable to related navigables.

  4. Return related navigables.

To get navigables by ids given a list of context ids navigable ids:

  1. Let result be an empty set.

  2. For each navigable id in navigable ids:

    1. Let navigable be the navigable with id navigable id if such navigable exists, and null otherwise.

    2. Append navigable to result if navigable is not null.

  3. Return result.

To get top-level traversables given a list of navigables navigables:

  1. Let result be an empty set.

  2. For each navigable in navigables:

    1. Append navigable’s top-level traversable to result.

  3. Return result.

To emit an event given session, and body:
  1. Assert: body has size 2 and contains "method" and "params".

  2. Let serialized be the result of serialize an infra value to JSON bytes given body.

  3. For each connection in session’s session WebSocket connections:

    1. Send a WebSocket message comprised of serialized over connection.

To send an error response given a WebSocket connection connection, command id, and error code:
  1. Let error data be a new map matching the ErrorResponse production in the local end definition, with the id field set to command id, the error field set to error code, the message field set to an implementation-defined string containing a human-readable definition of the error that occurred and the stacktrace field optionally set to an implementation-defined string containing a stack trace report of the active stack frames at the time when the error occurred.

  2. Let response be the result of serialize an infra value to JSON bytes given error data.

    Note: command id can be null, in which case the id field will also be set to null, not omitted from response.

  3. Send a WebSocket message comprised of response over connection.

To handle a connection closing given a WebSocket connection connection:

  1. If there is a BiDi session associated with connection connection:

    1. Let session be the BiDi session associated with connection connection.

    2. Remove connection from session’s session WebSocket connections.

  2. Otherwise, if WebSocket connections not associated with a session contains connection, remove connection from that set.

Note: This does not end any session.

Need to hook in to the session ending to allow the UA to close the listener if it wants.

To close the WebSocket connections given session:

  1. For each connection in session’s session WebSocket connections:

    1. Start the WebSocket closing handshake with connection.

      Note: this will result in the steps in handle a connection closing being run for connection, which will clean up resources associated with connection.

4.1. Establishing a Connection

WebDriver clients opt in to a bidirectional connection by requesting the WebSocket URL capability with value true.

The WebDriver new session algorithm defined by this specification, with parameters session, capabilities, and flags is:
  1. If flags contains "bidi", return.

  2. Let webSocketUrl be the result of getting a property named "webSocketUrl" from capabilities.

  3. If webSocketUrl is undefined, return.

  4. Assert: webSocketUrl is true.

  5. Let listener be the result of start listening for a WebSocket connection given session.

  6. Set webSocketUrl to the result of construct a WebSocket URL with listener and session.

  7. Set a property on capabilities named "webSocketUrl" to webSocketUrl.

  8. Set session’s BiDi flag to true.

  9. Append "bidi" to flags.

Implementations should also allow clients to establish a BiDi Session which is not a HTTP Session. In this case the URL to the WebSocket server is communicated out-of-band. An implementation that allows this supports BiDi-only sessions. At the time such an implementation is ready to accept requests to start a WebDriver session, it must:

  1. Start listening for a WebSocket connection given null.

5. Sandboxed Script Execution

A common requirement for automation tools is to execute scripts which have access to the DOM of a document, but don’t have information about any changes to the DOM APIs made by scripts running in the navigable containing the document.

A BiDi session has a sandbox map which is a weak map in which the keys are Window objects, and the values are maps between strings and SandboxWindowProxy objects.

Note: The definition of sandboxes here is an attempt to codify the behaviour of existing implementations. It exposes parts of the implementations that have previously been considered internal by specifications, in particular the distinction between the internal state of platform objects (which is typically implemented as native objects in the main implementation language of the browser engine) and the ECMAScript-visible state. Because existing sandbox implementations happen at a low level in the engine, implementations converging toward the specification in all details might be a slow process. In the meantime, implementers are encouraged to provide detailed documentation on any differences with the specification, and users of this feature are encouraged to explicitly test that scripts running in sandboxes work in all implementations.

5.1. Sandbox Realms

Each sandbox is a unique ECMAScript Realm. However the sandbox realm provides access to platform objects in an existing Window realm via SandboxProxy objects.

To get or create a sandbox realm given name and navigable:
  1. If name is an empty string, then return error with error code invalid argument.

  2. Let window be navigable’s active window.

  3. If sandbox map does not contain window, set sandbox map[window] to a new map.

  4. Let sandboxes be sandbox map[window].

  5. If sandboxes does not contain name, set sandboxes[name] to create a sandbox realm with navigable.

  6. Return success with data sandboxes[name].

To create a sandbox realm with window:

Define creation of sandbox realm. This is going to return a SandboxWindowProxy wrapping window.

To get a sandbox name given target realm:

  1. Let realms maps be get the values of sandbox map.

  2. For each realms map in realms maps:

    1. For each namerealm in realms map:

      1. If realm is target realm, return name.

  3. Return null.

5.2. Sandbox Proxy Objects

A SandboxProxy object is an exotic object that mediates sandboxed access to objects from another realm. Sandbox proxy objects are designed to enforce the following restrictions:

There is no SandboxProxy interface object.

Define in detail how SandboxProxy works

To get unwrapped object:
  1. While object is SandboxProxy or SandboxWindowProxy, set object to it’s wrapped object.

  2. Return object.

5.3. SandboxWindowProxy

A SandboxWindowProxy is an exotic object that represents a Window object wrapped by a SandboxProxy object. This provides sandboxed access to that data in a Window global.

Define how this works.

6. User Contexts

A user context represents a collection of zero or more top-level traversables within a remote end. Each user context has an associated storage partition, so that remote end data is not shared between different user contexts.

Unclear that this is the best way to formally define the concept of a user context or the interaction with storage.

Note: The infra spec uses the term "user agent" to refer to the same concept as user contexts. However, this is not compatible with usage of the term "user agent" to mean the entire web client with multiple user contexts. Although this difference is not visible to web content, it is observed via WebDriver, so we avoid using this terminology.

A user context has a user context id, which is a unique string set upon the user context creation.

A navigable has an associated user context, which is a user context.

When a new top-level traversable is created its associated user context is set to a user context in the set of user contexts.

Note: In some cases the user context is set by specification when the top-level traversable is created, however in cases where no such requirements are present, the associated user context for a top-level traversable is implemenation-defined.

Should we specify that top-level traversables with a non-null opener have the same associated user context as their opener? Need to check if this is something existing implementations enforce.

A child navigable's associated user context is it’s parent's associated user context.

A user context which isn’t the associated user context for any top-level traversable is an empty user context.

The default user context is a user context with user context id "default".

An implementation has a set of user contexts, which is a set of user contexts. Initially this contains the default user context.

Implementations may append new user contexts to the set of user contexts at any time, for example in response to user actions.

Note: "At any time" here includes during implementation startup, so a given implementation might always have multiple entries in the set of user contexts.

Implementations may remove any empty user context, with exception of the default user context, from the set of user contexts at any time. However they are not required to remove such user contexts. User contexts that are not empty user contexts must not be removed from the set of user contexts.

To get user context given user context id:
  1. For each user context in the set of user contexts:

  2. If user context’s user context id equals user context id:

    1. Return user context.

  3. Return null.

7. Modules

7.1. The session Module

The session module contains commands and events for monitoring the status of the remote end.

7.1.1. Definition

remote end definition

SessionCommand = (
  session.End //
  session.New //
  session.Status //
  session.Subscribe //
  session.Unsubscribe
)

local end definition

SessionResult = (
   session.NewResult /
   session.StatusResult
)
To end the session given session:
  1. Remove session from active sessions.

  2. If active sessions is empty, set the webdriver-active flag to false.

To cleanup the session given session:

  1. Close the WebSocket connections with session.

  2. If active sessions is empty, cleanup remote end state.

  3. Perform any implementation-specific cleanup steps.

To cleanup remote end state.
  1. Clear the before request sent map.

  2. Set the default cache behavior to "default".

  3. Clear the navigable cache behavior map.

  4. Perform implementation-defined steps to enable any implementation-specific resource caches that are usually enabled in the current remote end configuration.

7.1.2. Types

7.1.2.1. The session.CapabilitiesRequest Type
session.CapabilitiesRequest = {
  ? alwaysMatch: session.CapabilityRequest,
  ? firstMatch: [*session.CapabilityRequest]
}

The session.CapabilitiesRequest type represents the capabilities requested for a session.

7.1.2.2. The session.CapabilityRequest Type

remote end definition and local end definition

session.CapabilityRequest = {
  ? acceptInsecureCerts: bool,
  ? browserName: text,
  ? browserVersion: text,
  ? platformName: text,
  ? proxy: session.ProxyConfiguration,
  ? unhandledPromptBehavior: session.UserPromptHandler,
  Extensible
};

The session.CapabilityRequest type represents a specific set of requested capabilities.

WebDriver BiDi defines additional WebDriver capabilities. The following tables enumerates the capabilities each implementation must support for WebDriver BiDi.

Capability WebSocket URL
Key "webSocketUrl"
Value type boolean
Description Defines the current session’s support for bidirectional connection.
The additional capability deserialization algorithm for the "webSocketUrl" capability, with parameter value is:
  1. If value is not a boolean, return error with code invalid argument.

  2. Return success with data value.

The matched capability serialization algorithm for the "webSocketUrl" capability, with parameter value is:
  1. If value is false, return success with data null.

  2. Return success with data true.

7.1.2.3. The session.ProxyConfiguration Type

remote end definition and local end definition

session.ProxyConfiguration = {
   session.AutodetectProxyConfiguration //
   session.DirectProxyConfiguration //
   session.ManualProxyConfiguration //
   session.PacProxyConfiguration //
   session.SystemProxyConfiguration //
}

session.AutodetectProxyConfiguration = (
   proxyType: "autodetect",
   Extensible
)

session.DirectProxyConfiguration = (
   proxyType: "direct",
   Extensible
)

session.ManualProxyConfiguration = (
   proxyType: "manual",
   ? ftpProxy: text,
   ? httpProxy: text,
   ? sslProxy: text,
   ? session.SocksProxyConfiguration,
   ? noProxy: [*text],
   Extensible
)

session.SocksProxyConfiguration = (
   socksProxy: text,
   socksVersion: 0..255,
)

session.PacProxyConfiguration = (
   proxyType: "pac",
   proxyAutoconfigUrl: text,
   Extensible
)

session.SystemProxyConfiguration = (
   proxyType: "system"
   Extensible
)

7.1.2.4. The session.UserPromptHandler Type

Remote end definition and local end definition

session.UserPromptHandler = {
  ? alert: session.UserPromptHandlerType,
  ? beforeUnload: session.UserPromptHandlerType,
  ? confirm: session.UserPromptHandlerType,
  ? default: session.UserPromptHandlerType,
  ? prompt: session.UserPromptHandlerType,
};

The session.UserPromptHandler type represents the configuration of the user prompt handler.

7.1.2.5. The session.UserPromptHandlerType Type

Remote end definition and local end definition

session.UserPromptHandlerType = "accept" / "dismiss" / "ignore";

The session.UserPromptHandlerType type represents the behavior of the user prompt handler.

7.1.2.6. The session.Subscription Type
session.Subscription = text

The session.Subscription type represents a unique subscription identifier.

7.1.2.7. The session.SubscriptionRequest Type
session.SubscriptionRequest = {
  events: [+text],
  ? contexts: [+browsingContext.BrowsingContext],
}

The session.SubscriptionRequest type represents a request to subscribe to or unsubscribe from a specific set of events.

7.1.2.8. The session.UnsubscribeByIDRequest Type
session.UnsubscribeByIDRequest = {
  subscriptions: [+session.Subscription],
}

The session.UnsubscribeByIDRequest type represents a request to remove event subscriptions identified by subscription IDs.

7.1.2.9. The session.UnsubscribeByAttributesRequest Type

Note: contexts param is deprecated and will be removed in the future versions.

session.UnsubscribeByAttributesRequest = {
  events: [+text],
  ? contexts: [+browsingContext.BrowsingContext],
}

The session.UnsubscribeByAttributesRequest type represents a request to unsubscribe using subscription attributes.

7.1.3. Commands

7.1.3.1. The session.status Command

The session.status command returns information about whether a remote end is in a state in which it can create new sessions, but may additionally include arbitrary meta information that is specific to the implementation.

This is a static command.

Command Type
session.Status = (
  method: "session.status",
  params: EmptyParams,
)
Result Type
session.StatusResult = {
  ready: bool,
  message: text,
}

The remote end steps given session, and command parameters are:

  1. Let body be a new map with the following properties:

    "ready"
    The remote end’s readiness state.
    "message"
    An implementation-defined string explaining the remote end’s readiness state.
  2. Return success with data body

7.1.3.2. The session.new Command

The session.new command allows creating a new BiDi session.

Note: A session created this way will not be accessible via HTTP.

This is a static command.

Command Type
session.New = (
  method: "session.new",
  params: session.NewParameters
)

session.NewParameters = {
  capabilities: session.CapabilitiesRequest
}
Result Type
session.NewResult = {
  sessionId: text,
  capabilities: {
    acceptInsecureCerts: bool,
    browserName: text,
    browserVersion: text,
    platformName: text,
    setWindowRect: bool,
    userAgent: text,
    ? proxy: session.ProxyConfiguration,
    ? unhandledPromptBehavior: session.UserPromptHandler,
    ? webSocketUrl: text,
    Extensible
  }
}

The remote end steps given session and command parameters are:

  1. If session is not null, return an error with error code session not created.

  2. If the implementation is unable to start a new session for any reason, return an error with error code session not created.

  3. Let flags be a set containing "bidi".

  4. Let capabilities json be the result of trying to process capabilities with command parameters and flags.

  5. Let capabilities be convert a JSON-derived JavaScript value to an Infra value with capabilities json.

  6. Let session be the result of trying to create a session with capabilities and flags.

  7. Set session’s BiDi flag to true.

    Note: the connection for this session will be set to the current connection by the caller.

  8. Let body be a new map matching the session.NewResult production, with the sessionId field set to session’s session ID, and the capabilities field set to capabilities.

  9. Return success with data body.

7.1.3.3. The session.end Command

The session.end command ends the current session.

Command Type
session.End = (
  method: "session.end",
  params: EmptyParams
)

Result Type
EmptyResult

The remote end steps given session and command parameters are:

  1. End the session with session.

  2. Return success with data null, and in parallel run the following steps:

    1. Wait until the Send a WebSocket message steps have been called with the response to this command.

      this is rather imprecise language, but hopefully it’s clear that the intent is that we send the response to the command before starting shutdown of the connections.

    2. Cleanup the session with session.

7.1.3.4. The session.subscribe Command

The session.subscribe command enables certain events either globally or for a set of navigables.

This needs to be generalized to work with realms too.

Command Type
session.Subscribe = (
  method: "session.subscribe",
  params: session.SubscriptionRequest
)
Result Type
session.SubscriptionRequestResult = {
  subscription: session.Subscription,
}
The remote end steps with session and command parameters are:
  1. Let the event names be create a set with command parameters["events"].

  2. Let subscription navigables be a set.

  3. Let top-level traversable context ids be a set.

  4. Let input context ids be create a set with command parameters[contexts].

  5. If input context ids is not empty:

    1. Let navigables be get navigables by ids with input context ids.

    2. If size of navigables does not equal size of input context ids:

      1. Return error with error code invalid argument.

    3. Set subscription navigables be get top-level traversables with navigables.

    4. For each navigable in subscription navigables:

      1. Append navigable’s navigable id to top-level traversable context ids.

  6. Otherwise, set subscription navigables to a set of all top-level traversables in the remote end.

  7. Let subscription be a subscription with subscription id set to the string representation of a UUID, event names set to event names, and top-level traversable ids set to top-level traversable context ids.

  8. Let subscribe step events be a new map.

  9. For each event name in the event names:

    1. If the event with event name event name does not define remote end subscribe steps, continue;

    2. Let existing navigables be a set of top-level traversables for which an event is enabled with session and event name.

    3. Set subscribe step events[event name] to difference of subscription navigables and existing navigables.

  10. Append subscription to session’s subscriptions.

  11. Append subscription’s subscription id to session’s known subscription ids.

  12. Sort in ascending order subscribe step events using the following less than algorithm given two entries with keys event name one and event name two:

    1. Let event one be the event with name event name one

    2. Let event two be the event with name event name two

    3. Return true if event one’s subscribe priority is less than event two’s subscribe priority, or false otherwise.

  13. If subscription is global, let include global be true, otherwise let include global be false.

  14. For each event namenavigables in subscribe step events:

    1. Run the remote end subscribe steps for the event with event name event name given session, navigables and include global.

  15. Return success with data null.

7.1.3.5. The session.unsubscribe Command

The session.unsubscribe command disables events either globally or for a set of navigables.

This needs to be generalised to work with realms too.

Note: contexts parameter in UnsubscribeByAttributesRequest is deprecated and will be removed in the future versions.

Command Type
session.Unsubscribe = (
  method: "session.unsubscribe",
  params: session.UnsubscribeByAttributesRequest / session.UnsubscribeByIDRequest,
)
Result Type
EmptyResult
The remote end steps with session and command parameters are:
  1. If command parameters matches the session.UnsubscribeByAttributesRequest production:

    1. Let event names be create a set with command parameters["events"].

    2. Let top-level traversable context ids be a set.

    3. Let input context ids be create a set with command parameters[contexts].

    4. If input context ids is not empty:

      1. Let navigables be get navigables by ids with input context ids.

      2. If size of navigables does not equal size of input context ids:

        1. Return error with error code invalid argument.

      3. Set top-level traversables be get top-level traversables with navigables.

      4. For each navigable in top-level traversables:

        1. Append navigable’s navigable id to top-level traversable context ids.

    5. Let new subscriptions to be a list.

    6. For each subscription of session’s subscriptions:

      1. If intersection of subscription’s event names and event names is an empty set:

        1. append subscription to new subscriptions.

        2. Continue.

      2. If top-level traversable context ids is an empty set:

        1. Remove all items contained in event names from subscription’s event names.

        2. If subscription’s event names is not empty:

          1. append subscription to new subscriptions.

      3. Otherwise:

        1. If subscription is global:

          1. append subscription to new subscriptions.

        2. Otherwise:

          Note: unsubscribe by contexts is deprecated and will be removed in the future versions.

          1. Let event map be an empty map.

          2. For each event name in subscription’s event names:

            1. Set event map[event name] to clone of subscription’s top-level traversable ids.

          3. For each event name in event names:

            1. If event map[event name] does not exist, continue.

            2. Set event map[event name] to the difference between event map[event name] and top-level traversable context ids.

            3. If event map[event name] is an empty set,

              1. Remove event map[event name].

          4. For each event nameremaining top-level traversable ids in event map:

            1. Let partial subscription be a subscription with subscription id set to subscription’s subscription id, event names set to a new set containing event name, top-level traversable ids set to remaining top-level traversable ids.

            2. append partial subscription to new subscriptions.

    7. Set session’s subscriptions to new subscriptions.

  2. Otherwise:

    1. Let subscriptions be create a set with command parameters[subscriptions].

    2. Let unknown subscription ids to difference between subscriptions and session’s known subscription ids.

    3. If unknown subscription ids is not empty:

      1. Return error with error code invalid argument.

    4. Let subscriptions to remove be an empty set.

    5. For each subscription in session’s subscriptions:

      1. If subscriptions contains subscription’s subscription id:

        1. Append subscription to subscriptions to remove.

    6. Set session’s known subscription ids to difference between session’s known subscription ids and subscriptions.

    7. Remove each item in subscriptions to remove from session’s subscriptions.

  3. Return success with data null.

7.2. The browser Module

The browser module contains commands for managing the remote end browser process.

7.2.1. Definition

remote end definition

BrowserCommand = (
  browser.Close //
  browser.CreateUserContext //
  browser.GetClientWindows //
  browser.GetUserContexts //
  browser.RemoveUserContext //
  browser.SetClientWindowState //
)

local end definition

BrowserResult = (
   browser.CreateUserContextResult /
   browser.GetUserContextsResult
)

7.2.2. Windows

Each top-level traversable is associated with a single client window which represents a rectangular area containing the viewport that will be used to render that top-level traversable's active document when its visibility state is "visible", as well as any browser-specific user interface elements associated with displaying the traversable (e.g. any URL bar, toolbars, or OS window decorations).

A client window has a client window id which is a string uniquely identifying that window.

A client window has an x-coordinate, which is the number of CSS pixels between the left edge of the web-exposed screen area and the left edge of the window, or zero if that doesn’t make sense for a particular window.

A client window has a y-coordinate, which is the number of CSS pixels between the top edge of the web-exposed screen area and the top edge of the window, or zero if that doesn’t make sense for a particular window.

A client window has a width, which is the width of the window’s rectangle in CSS pixels.

A client window has a height, which is the height of the window’s rectangle in CSS pixels.

To maximize the client window window an implementation should either perform steps corresponding to the platform notion of maximizing window, or position window such that its x-coordinate is as close as possible to 0, its y-coordinate is as close as possible to 0, its width is as close as possible to the width of the web-exposed screen area and its height is as close as possible to the height of the web-exposed screen area. If either of these options are supported then maximize client window is supported.

To minimize the client window window an implementation should either perform steps corresponding to the platform notion of minimizing window, or otherwise hide window such that all the active documents in top-level traversables associated with window have visibility state "hidden" and window’s width and height are both as close as possible to 0. If either of these options are supported then minimize client window is supported.

To restore the client window window an implementation should ensure that it’s neither in a platform-defined maximized state, nor in a platform-defined minimized state, and that if there is one or more top-level traversable associated with window, at least one of those has an active document in the "visible" state. If this is supported then restore client window is supported.

To get the client window state given window:

  1. Let documents be an empty list.

  2. Let visible documents be an empty list.

  3. For each top-level traversable traversable:

    1. If traversable’s client window is not window then continue.

    2. Let document be traversable’s active document.

    3. Append document to documents.

    4. If document’s visibility state is "visible", Append document to visible documents.

  4. For each document in visible documents:

    1. If document’s fullscreen element is not null, return "fullscreen".

  5. If visible documents is empty but documents is not empty, or if window is otherwise in an OS-specific minimized state, return "minimized".

    Note: This will usually, but not necessarily, mean that window’s width and height are equal to 0.

  6. If window is in an OS-specific maximized state return "maximized".

    Note: This will usually, but not necessarily, mean that window’s width is equal to the width of the web-exposed screen area and window’s height is equal to the height of the web-exposed screen area.

  7. Return "normal".

To set the client window state given window and state:

  1. Let current state be get the client window state with window.

  2. If current state is equal to state, return success with data null.

  3. In the following list of conditions and associated steps, run the first set of steps for which the associated condition is true:

    "fullscreen"
    If not fullscreen is supported return error with error code unsupported operation.
    "normal"
    If not restore client window is supported for window return error with error code unsupported operation.
    "maximize"
    If not maximize client window is supported for window return error with error code unsupported operation.
    "minimize"
    If not minimize client window is supported for window return error with error code unsupported operation.
  4. Let documents be an empty list.

  5. For each top-level traversable traversable:

    1. If traversable’s associated client window is not window then continue.

    2. Let document be traversable’s active document.

    3. Append document to documents.

  6. If documents is empty return error with error code no such client window.

  7. If current state is "fullscreen":

    1. For each document in documents:

      1. Fully exit fullscreen with document.

        Note: This is a no-op for documents in window that are not fullscreen.

  8. Switch on the value of state:

    "fullscreen"
    1. For each document in documents:

      1. If document’s visibility state is "visible", fullscreen an element with document’s document element.

      2. Break.

    "normal"
    1. Restore the client window window.
    "maximize
    1. Maximize the client window window.
    "minimize
    1. Minimize the client window window.
  9. Return success with data null.

7.2.3. Types

7.2.3.1. The browser.ClientWindow Type
browser.ClientWindow = text;

The browser.ClientWindow uniquely identifies a client window.

7.2.3.2. The browser.ClientWindowInfo Type
browser.ClientWindowInfo = {
  active: bool,
  clientWindow: browser.ClientWindow,
  height: js-uint,
  state: "fullscreen" / "maximized" / "minimized" / "normal",
  width: js-uint,
  x: js-int,
  y: js-int,
}

The browser.ClientWindowInfo type represents properties of a client window.

To get the client window info given client window:
  1. Let client window id be the client window id for client window.

  2. Let state be get the client window state with client window.

  3. If client window can receive keyboard input channeled from the operating system, let active be true, otherwise let active be false.

    Note: This could mean that a top-level traversable whose client window is client window has system focus, or it could mean that the user interface of the browser itself currently has focus.

  4. Let client window info be a map matching the browser.ClientWindowsInfo production with the clientWindow field set to client window id, state field set to state, the x field set to client window’s x-coordinate, the y field set to client window’s y-coordinate, the width field set to client window’s width, the height field set to client window’s height, and the active field set to active.

  5. Return client window info

7.2.3.3. The browser.UserContext Type
browser.UserContext = text;

The browser.UserContext unique identifies a user context.

7.2.3.4. The browser.UserContextInfo Type
browser.UserContextInfo = {
  userContext: browser.UserContext
}

The browser.UserContextInfo type represents properties of a user context.

7.2.4. Commands

7.2.4.1. The browser.close Command

The browser.close command terminates all WebDriver sessions and cleans up automation state in the remote browser instance.

Command Type
browser.Close = (
  method: "browser.close",
  params: EmptyParams,
)
Return Type
EmptyResult
The remote end steps with session and command parameters are:
  1. End the session with session.

  2. If active sessions is not empty an implementation may return error with error code unable to close browser, and then run the following steps in parallel:

    1. Wait until the Send a WebSocket message steps have been called with the response to this command.

    2. Cleanup the session with session.

    Note: The behaviour in cases where the browser has multiple automation sessions is currently unspecified. It might be that any session can close the browser, or that only the final open session can actually close the browser, or only the first session started can. This behaviour might be fully specified in a future version of this specification.

  3. For each active session in active sessions:

    1. End the session active session.

    2. Cleanup the session with active session

  4. Return success with data null, and run the following steps in parallel.

    1. Wait until the Send a WebSocket message steps have been called with the response to this command.

    2. Cleanup the session with session.

    3. Close any top-level traversables without prompting to unload.

    4. Perform implementation defined steps to clean up resources associated with the remote end under automation.

      Note: For example this might include cleanly shutting down any OS-level processes associated with the browser under automation, removing temporary state, such as user profile data, created by the remote end while under automation, or shutting down the WebSocket Listener. Because of differences between browsers and operating systems it is not possible to specify in detail precise invariants local ends can depend on here.

7.2.4.2. The browser.createUserContext Command

The browser.createUserContext command creates a user context.

Command Type
browser.CreateUserContext = (
  method: "browser.createUserContext",
  params: EmptyParams,
)
Return Type
browser.CreateUserContextResult = browser.UserContextInfo

The remote end steps are:

  1. Let user context be a new user context.

  2. Append user context to the set of user contexts.

  3. Let user context info be a map matching the browser.UserContextInfo production with the userContext field set to user context’s user context id.

  4. Return success with data user context info.

7.2.4.3. The browser.getClientWindows Command

The browser.getClientWindows command returns a list of client windows.

Command Type
browser.GetClientWindows = (
  method: "browser.getClientWindows",
  params: EmptyParams,
)
Return Type
browser.GetClientWindowsResult = {
  clientWindows: [ * browser.ClientWindowInfo]
}

The remote end steps are:

  1. Let client window ids be an empty set.

  2. Let client windows be an empty list.

  3. For each top-level traversable traversable:

    1. Let client window be traversable’s associated client window

    2. Let client window id be the client window id for client window.

    3. If client window ids contains client window id, continue.

    4. Append client window id to client window ids.

    5. Let client window info be get the client window info with client window.

    6. Append client window info to client windows.

  4. Let result be a map matching the browser.GetClientWindowsResult production with the clientWindows field set to client windows.

  5. Return success with data result.

7.2.4.4. The browser.getUserContexts Command

The browser.getUserContexts command returns a list of user contexts.

Command Type
browser.GetUserContexts = (
  method: "browser.getUserContexts",
  params: EmptyParams,
)
Return Type
browser.GetUserContextsResult = {
  userContexts: [ + browser.UserContextInfo]
}

The remote end steps are:

  1. Let user contexts be an empty list.

  2. For each user context in the set of user contexts:

    1. Let user context info be a map matching the browser.UserContextInfo production with the userContext field set to user context’s user context id.

    2. Append user context info to user contexts.

  3. Let result be a map matching the browser.GetUserContextsResult production with the userContexts field set to user contexts.

  4. Return success with data result.

7.2.4.5. The browser.removeUserContext Command

The browser.removeUserContext command closes a user context and all navigables in it without running beforeunload handlers.

Command Type
browser.RemoveUserContext = (
  method: "browser.removeUserContext",
  params: browser.RemoveUserContextParameters
)

browser.RemoveUserContextParameters = {
  userContext: browser.UserContext
}
Return Type
EmptyResult

The remote end steps with command parameters are:

  1. Let user context id be command parameters["userContext"].

  2. If user context id is "default", return error with error code invalid argument.

  3. Set user context to get user context with user context id.

  4. If user context is null, return error with error code no such user context.

  5. For each top-level traversable navigable:

    1. If navigable’s associated user context is user context:

      1. Close navigable without prompting to unload.

  6. Remove user context for the set of user contexts.

  7. Return success with data null.

7.2.4.6. The browser.setClientWindowState Command

The browser.setClientWindowState command sets the dimensions of a client window.

Command Type
browser.SetClientWindowState = (
  method: "browser.setClientWindowState",
  params: browser.SetClientWindowStateParameters
)

browser.SetClientWindowStateParameters = {
  clientWindow: browser.ClientWindow,
  browser.ClientWindowNamedState // browser.ClientWindowRectState
}

browser.ClientWindowNamedState = (
  state: "fullscreen" / "maximized" / "minimized"
)

browser.ClientWindowRectState = (
  state: "normal",
  ? width: js-uint,
  ? height: js-uint,
  ? x: js-int,
  ? y: js-int,
)
Return Type
browser.ClientWindowInfo

The remote end steps with session and command parameters are:

  1. If the implementation does not support setting the client window state at all, then return error with error code unsupported operation.

  2. If there is a client window with client window id command parameters["clientWindow"], let client window be that client window. Otherwise return error with error code no such client window.

  3. Try to set the client window state with client window and command parameters["state"].

  4. If command parameters["state"] is "normal":

    1. If command parameters contains "x" and the implementation supports positioning client windows, set the x-coordinate of client window to a value that is as close as possible command parameters["x"].

    2. If command parameters contains "y" and the implementation supports positioning client windows, set the y-coordinate of client window to a value that is as close as possible command parameters["y"].

    3. If command parameters contains "width" and the implementation supports resizing client windows, set the width of client window to a value that is as close as possible command parameters["width"].

    4. If command parameters contains "width" and the implementation supports resizing client windows, set the width of client window to a value that is as close as possible command parameters["width"].

  5. Let client window info be get the client window info with client window.

  6. Return success with data client window info.

Note: For simplicity this models all client window operations as synchronous. Therefore the returned client window dimensions are expected to be those after the window has reached its new state.

7.3. The browsingContext Module

The browsingContext module contains commands and events relating to navigables.

Note: For historic reasons this module is called browsingContext rather than navigable, and the protocol uses the term context to refer to navigables, particularly as a field in command and response parameters.

The progress of navigation is communicated using an immutable WebDriver navigation status struct, which has the following items:

id
The navigation id for the navigation, or null when the navigation is canceled before making progress.
status
A status code that is either "canceled", "pending", or "complete".
url
The URL which is being loaded in the navigation

7.3.1. Definition

remote end definition

BrowsingContextCommand = (
  browsingContext.Activate //
  browsingContext.CaptureScreenshot //
  browsingContext.Close //
  browsingContext.Create //
  browsingContext.GetTree //
  browsingContext.HandleUserPrompt //
  browsingContext.LocateNodes //
  browsingContext.Navigate //
  browsingContext.Print //
  browsingContext.Reload //
  browsingContext.SetViewport //
  browsingContext.TraverseHistory
)

local end definition

BrowsingContextResult = (
  browsingContext.CaptureScreenshotResult /
  browsingContext.CreateResult /
  browsingContext.GetTreeResult /
  browsingContext.LocateNodesResult /
  browsingContext.NavigateResult /
  browsingContext.PrintResult /
  browsingContext.TraverseHistoryResult
)

BrowsingContextEvent = (
  browsingContext.ContextCreated //
  browsingContext.ContextDestroyed //
  browsingContext.DomContentLoaded //
  browsingContext.DownloadWillBegin //
  browsingContext.FragmentNavigated //
  browsingContext.HistoryUpdated //
  browsingContext.Load //
  browsingContext.NavigationAborted //
  browsingContext.NavigationFailed //
  browsingContext.NavigationStarted //
  browsingContext.UserPromptClosed //
  browsingContext.UserPromptOpened
)

A remote end has a device pixel ratio overrides which is a weak map between navigables and device pixel ratio overrides. It is initially empty.

Note: this map is not cleared when the final session ends i.e. device pixel ratio overrides outlive any WebDriver session.

7.3.2. Types

7.3.2.1. The browsingContext.BrowsingContext Type

remote end definition and local end definition

browsingContext.BrowsingContext = text;

Each navigable has an associated navigable id, which is a string uniquely identifying that navigable. This is implicitly set when the navigable is created. For navigables with an associated WebDriver window handle the navigable id must be the same as the window handle.

Each navigable also has an associated storage partition, which is the storage partition it uses to persist data.

Each navigable also has an associated original opener, which is a navigable that caused the navigable to open or null, initially set to null.

To get a navigable given navigable id:
  1. If navigable id is null, return success with data null.

  2. If there is no navigable with navigable id navigable id return error with error code no such frame

  3. Let navigable be the navigable with id navigable id.

  4. Return success with data navigable.

7.3.2.2. The browsingContext.Info Type

local end definition

browsingContext.InfoList = [*browsingContext.Info]

browsingContext.Info = {
  children: browsingContext.InfoList / null,
  clientWindow: browser.ClientWindow,
  context: browsingContext.BrowsingContext,
  originalOpener: browsingContext.BrowsingContext / null,
  url: text,
  userContext: browser.UserContext,
  ? parent: browsingContext.BrowsingContext / null,
}

The browsingContext.Info type represents the properties of a navigable.

To get the child navigables given navigable:

TODO: make this return a list in document order

  1. Let child navigables be a set containing all navigables that are a child navigable of navigable.

  2. Return child navigables.

To get the navigable info given navigable, max depth and include parent id:
  1. Let navigable id be the navigable id for navigable.

  2. Let navigable be navigable’s parent.

  3. If navigable is not null let parent id be the navigable id of navigable. Otherwise let parent id be null.

  4. Let document be navigable’s active document.

  5. Let url be the result of running the URL serializer, given document’s URL.

    Note: This includes the fragment component of the URL.

  6. Let child infos be null.

  7. If max depth is null, or max depth is greater than 0:

    1. Let child navigables be get the child navigables given navigable.

    2. Let child depth be max depth - 1 if max depth is not null, or null otherwise.

    3. Set child infos to an empty list.

    4. For each child navigable of child navigables:

      1. Let info be the result of get the navigable info given child navigable, child depth, and false.

      2. Append info to child infos

  8. Let user context be navigable’s associated user context.

  9. Let opener id be the navigable id for navigable’s original opener, if navigable’s original opener is not null, and null otherwise.

  10. Let top-level traversable be navigable’s top-level traversable.

  11. Let client window id be the client window id for top-level traversable’s associated client window.

  12. Let navigable info be a map matching the browsingContext.Info production with the context field set to navigable id, the parent field set to parent id if include parent id is true, or unset otherwise, the url field set to url, the userContext field set to user context’s user context id, originalOpener field set to opener id, the children field set to child infos, and the clientWindow field set to client window id.

  13. Return navigable info.

To await a navigation given navigable, request, wait condition, and optionally history handling (default: "default") and ignore cache (default: false):
  1. Let navigation id be the string representation of a UUID based on truly random, or pseudo-random numbers.

  2. Navigate navigable with resource request, and using navigable’s active document as the source Document, with navigation id navigation id, and history handling behavior history handling. If ignore cache is true, the navigation must not load resources from the HTTP cache.

    property specify how the ignore cache flag works. This needs to consider whether only the first load of a resource bypasses the cache (i.e. whether this is like initially clearing the cache and proceeding like normal), or whether resources not directly loaded by the HTML parser (e.g. loads initiated by scripts or stylesheets) also bypass the cache.

  3. Let (event received, navigate status) be await given «"navigation started", "navigation failed", "fragment navigated"», and navigation id.

  4. Assert: navigate status’s id is navigation id.

  5. If navigate status’s status is "complete":

    1. Let body be a map matching the browsingContext.NavigateResult production, with the navigation field set to navigation id, and the url field set to the result of the URL serializer given navigate status’s url.

    2. Return success with data body.

    Note: this is the case if the navigation only caused the fragment to change.

  6. If navigate status’s status is "canceled" return error with error code unknown error.

    TODO: is this the right way to handle errors here?

  7. Assert: navigate status’s status is "pending" and navigation id is not null.

  8. If wait condition is "none":

    1. Let body be a map matching the browsingContext.NavigateResult production, with the navigation field set to navigation id, and the url field set to the result of the URL serializer given navigate status’s url.

    2. Return success with data body.

  9. If wait condition is "interactive", let event name be "domContentLoaded", otherwise let event name be "load".

  10. Let (event received, status) be await given «event name, "download started", "navigation aborted", "navigation failed"» and navigation id.

  11. If event received is "navigation failed" return error with error code unknown error.

    Are we surfacing enough information about what failed and why with an error here? What error code do we want? Is there going to be a problem where local ends parse the implementation-defined strings to figure out what actually went wrong?

  12. Let body be a map matching the browsingContext.NavigateResult production, with the navigation field set to status’s id, and the url field set to the result of the URL serializer given status’s url.

  13. Return success with data body.

7.3.2.3. The browsingContext.Locator Type

remote end definition and local end definition

browsingContext.Locator = (
   browsingContext.AccessibilityLocator /
   browsingContext.CssLocator /
   browsingContext.InnerTextLocator /
   browsingContext.XPathLocator
)

browsingContext.AccessibilityLocator = {
   type: "accessibility",
   value: {
    ? name: text,
    ? role: text,
   }
}

browsingContext.CssLocator = {
   type: "css",
   value: text
}

browsingContext.InnerTextLocator = {
   type: "innerText",
   value: text,
   ? ignoreCase: bool
   ? matchType: "full" / "partial",
   ? maxDepth: js-uint,
}

browsingContext.XPathLocator = {
   type: "xpath",
   value: text
}

The browsingContext.Locator type provides details on the strategy for locating a node in a document.

7.3.2.4. The browsingContext.Navigation Type

remote end definition and local end definition

browsingContext.Navigation = text;

The browsingContext.Navigation type is a unique string identifying an ongoing navigation.

TODO: Link to the definition in the HTML spec.

7.3.2.5. The browsingContext.NavigationInfo Type

local end definition:

browsingContext.NavigationInfo = {
  context: browsingContext.BrowsingContext,
  navigation: browsingContext.Navigation / null,
  timestamp: js-uint,
  url: text,
}

The browsingContext.NavigationInfo type provides details of an ongoing navigation.

To get the navigation info, given navigable and navigation status:
  1. Let navigable id be the navigable id for navigable.

  2. Let navigation id be navigation status’s id.

  3. Let timestamp be a time value representing the current date and time in UTC.

  4. Let url be navigation status’s url.

  5. Return a map matching the browsingContext.NavigationInfo production, with the context field set to navigable id, the navigation field set to navigation id, the timestamp field set to timestamp, and the url field set to the result of the URL serializer given url.

7.3.2.6. The browsingContext.ReadinessState Type
browsingContext.ReadinessState = "none" / "interactive" / "complete"

The browsingContext.ReadinessState type represents the stage of document loading at which a navigation command will return.

7.3.2.7. The browsingContext.UserPromptType Type

Remote end definition and local end definition

browsingContext.UserPromptType = "alert" / "beforeunload" / "confirm" / "prompt";

The browsingContext.UserPromptType type represents the possible user prompt types.

7.3.3. Commands

7.3.3.1. The browsingContext.activate Command

The browsingContext.activate command activates and focuses the given top-level traversable.

Command Type
browsingContext.Activate = (
  method: "browsingContext.activate",
  params: browsingContext.ActivateParameters
)

browsingContext.ActivateParameters = {
  context: browsingContext.BrowsingContext
}
Result Type
EmptyResult

The remote end steps with command parameters are:

  1. Let navigable id be the value of the command parameters["context"] field.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. If navigable is not a top-level traversable, return error with error code invalid argument.

  4. Return activate a navigable with navigable.

To activate a navigable given navigable:
  1. Run implementation-specific steps so that navigable’s system visibility state becomes visible. If this is not possible return error with error code unsupported operation.

    Note: This can have the side effect of making currently visible navigables hidden.

    Note: This can change the underlying OS state by causing the window to become unminimized or by other side effects related to changing the system visibility state.

  2. Run implementation-specific steps to set the system focus on the navigable if it is not focused.

    Note: This does not change the focused area of the document except as mandated by other specifications.

  3. Return success with data null.

7.3.3.2. The browsingContext.captureScreenshot Command

The browsingContext.captureScreenshot command captures an image of the given navigable, and returns it as a Base64-encoded string.

Command Type
browsingContext.CaptureScreenshot = (
  method: "browsingContext.captureScreenshot",
  params: browsingContext.CaptureScreenshotParameters
)

browsingContext.CaptureScreenshotParameters = {
  context: browsingContext.BrowsingContext,
  ? origin: ("viewport" / "document") .default "viewport",
  ? format: browsingContext.ImageFormat,
  ? clip: browsingContext.ClipRectangle,
}

browsingContext.ImageFormat = {
   type: text,
   ? quality: 0.0..1.0,
}

browsingContext.ClipRectangle = (
  browsingContext.BoxClipRectangle /
  browsingContext.ElementClipRectangle
)

browsingContext.ElementClipRectangle = {
  type: "element",
  element: script.SharedReference
}

browsingContext.BoxClipRectangle = {
   type: "box",
   x: float,
   y: float,
   width: float,
   height: float
}
Result Type
browsingContext.CaptureScreenshotResult = {
  data: text
}
To normalize rect given rect:

Note: This ensures that the resulting rect has positive width dimension and height dimension.

  1. Let x be rect’s x coordinate.

  2. Let y be rect’s y coordinate.

  3. Let width be rect’s width dimension.

  4. Let height be rect’s height dimension.

  5. If width is less than 0, set x to x + width and then set width to -width.

  6. If height is less than 0, set y to y + height and then set height to -height.

  7. Return a new DOMRectReadOnly with x coordinate x, y coordinate y, width dimension width and height dimension height.

To rectangle intersection given rect1 and rect2
  1. Let rect1 be normalize rect with rect1.

  2. Let rect2 be normalize rect with rect2.

  3. Let x1_0 be rect1’s x coordinate.

  4. Let x2_0 be rect2’s x coordinate.

  5. Let x1_1 be rect1’s x coordinate plus rect1’s width dimension.

  6. Let x2_1 be rect2’s x coordinate plus rect2’s width dimension.

  7. Let x_0 be the maximum element of «x1_0, x2_0».

  8. Let x_1 be the minimum element of «x1_1, x2_1».

  9. Let y1_0 be rect1’s y coordinate.

  10. Let y2_0 be rect2’s y coordinate.

  11. Let y1_1 be rect1’s y coordinate plus rect1’s height dimension.

  12. Let y2_1 be rect2’s y coordinate plus rect2’s height dimension.

  13. Let y_0 be the maximum element of «y1_0, y2_0».

  14. Let y_1 be the minimum element of «y1_1, y2_1».

  15. If x_1 is less than x_0, let width be 0. Otherwise let width be x_1 - x_0.

  16. If y_1 is less than y_0, let height be 0. Otherwise let height be y_1 - y_0.

  17. Return a new DOMRectReadOnly with x coordinate x_0, y coordinate y_0, width dimension width and height dimension height.

To render document to a canvas given document and rect:
  1. Let ratio be determine the device pixel ratio given document’s default view.

  2. Let paint width be rect’s width dimension multiplied by ratio, rounded to the nearest integer, so it matches the width of rect in device pixels.

  3. Let paint height be rect’s height dimension multiplied by ratio, rounded to the nearest integer, so it matches the height of rect in device pixels.

  4. Let canvas be a new HTMLCanvasElement with width paint width and height paint height.

  5. Let canvas context be the result of running the 2D context creation algorithm with canvas and null.

  6. Set canvas’s context mode to 2D.

  7. Complete implementation specific steps equivalent to drawing the region of the framebuffer representing the region of document covered by rect to canvas context, such that each pixel in the framebuffer corresponds to a pixel in canvas context with (rect’s x coordinate, rect’s y coordinate) in viewport coordinates corresponding to (0,0) in canvas context and (rect’s x coordinate + rect’s width dimension, rect’s y coordinate + rect’s height dimension) corresponding to (paint width, paint height).

  8. Return canvas.

To encode a canvas as Base64 given canvas and format:
  1. If format is not null, let type be the type field of format, and let quality be the quality field of format.

  2. Otherwise, let type be "image/png" and let quality be undefined.

  3. Let file be a serialization of the bitmap as a file for canvas with type and quality.

  4. Let encoded string be the forgiving-base64 encode of file.

  5. Return success with data encoded string.

To get the origin rectangle given document and origin:
  1. If origin is "viewport":

    1. Let viewport be document’s visual viewport.

    2. Let viewport rect be a DOMRectReadOnly with x coordinate viewport page left, y coordinate viewport page top, width dimension viewport width, and height dimension viewport height.

    3. Return success with data viewport rect.

  2. Assert: origin is "document".

  3. Let document element be the document element for document.

  4. Let document rect be a DOMRectReadOnly with x coordinate 0, y coordinate 0, width dimension document element scroll height, and height dimension document element scroll width.

  5. Return success with data document rect.

The remote end steps with session and command parameters are:

  1. Let navigable id be the value of the context field of command parameters if present, or null otherwise.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. If the implementation is unable to capture a screenshot of navigable for any reason then return error with error code unsupported operation.

  4. Let document be navigable’s active document.

  5. Immediately after the next invocation of the run the animation frame callbacks algorithm for document:

    This ought to be integrated into the update rendering algorithm in some more explicit way.

  6. Let origin be the value of the context field of command parameters if present, or "viewport" otherwise.

  7. Let origin rect be the result of trying to get the origin rectangle given origin and document.

  8. Let clip rect be origin rect.

  9. If command parameters contains "clip":

    1. Let clip be command parameters["clip"].

    2. Run the steps under the first matching condition:

      clip matches the browsingContext.ElementClipRectangle production:
      1. Let environment settings be the environment settings object whose relevant global object's associated Document is document.

      2. Let realm be environment settingsrealm execution context's Realm component.

      3. Let element be the result of trying to deserialize remote reference with clip["element"], realm, and session.

      4. If element doesn’t implement Element return error with error code no such element.

      5. If element’s node document is not document, return error with error code no such element.

      6. Let viewport rect be get the origin rectangle given "viewport" and document.

      7. Let element rect be get the bounding box for element.

      8. Let clip rect be a DOMRectReadOnly with x coordinate element rect["x"] + viewport rect["x"]|, y coordinate element rect["y"] + viewport rect["y"], width element rect["width"], and height element rect["height"].

      clip matches the browsingContext.BoxClipRectangle production:
      1. Let clip x be clip["x"] plus origin rect’s x coordinate.

      2. Let clip y be clip["y"] plus origin rect’s y coordinate.

      3. Let clip rect be a DOMRectReadOnly with x coordinate clip x, y coordinate clip y, width clip["width"], and height clip["height"].

  10. Note: All coordinates are now measured from the origin of the document.

  11. Let rect be the rectangle intersection of origin rect and clip rect.

  12. If rect’s width dimension is 0 or rect’s height dimension is 0, return error with error code unable to capture screen.

  13. Let canvas be render document to a canvas with document and rect.

  14. Let format be the format field of command parameters.

  15. Let encoding result be the result of trying to encode a canvas as Base64 with canvas and format.

  16. Let body be a map matching the browsingContext.CaptureScreenshotResult production, with the data field set to encoding result.

  17. Return success with data body.

7.3.3.3. The browsingContext.close Command

The browsingContext.close command closes a top-level traversable.

Command Type
browsingContext.Close = (
  method: "browsingContext.close",
  params: browsingContext.CloseParameters
)

browsingContext.CloseParameters = {
  context: browsingContext.BrowsingContext,
  ? promptUnload: bool .default false
}
Result Type
EmptyResult
The remote end steps with command parameters are:
  1. Let navigable id be the value of the context field of command parameters.

  2. Let prompt unload be the value of the promptUnload field of command parameters.

  3. Let navigable be the result of trying to get a navigable with navigable id.

  4. Assert: navigable is not null.

  5. If navigable is not a top-level traversable, return error with error code invalid argument.

  6. If prompt unload is true:

    1. Close navigable.

  7. Otherwise:

    1. Close navigable without prompting to unload.

  8. Return success with data null.

There is an open discussion about the behavior when closing the last top-level traversable. We could expect to close the browser, close the session or leave this up to the implementation. [Issue #w3c/webdriver-bidi#170]

7.3.3.4. The browsingContext.create Command

The browsingContext.create command creates a new navigable, either in a new tab or in a new window, and returns its navigable id.

Command Type
browsingContext.Create = (
  method: "browsingContext.create",
  params: browsingContext.CreateParameters
)

browsingContext.CreateType = "tab" / "window"

browsingContext.CreateParameters = {
  type: browsingContext.CreateType,
  ? referenceContext: browsingContext.BrowsingContext,
  ? background: bool .default false,
  ? userContext: browser.UserContext
}
Result Type
browsingContext.CreateResult = {
  context: browsingContext.BrowsingContext
}
The remote end steps with command parameters are:
  1. Let type be the value of the type field of command parameters.

  2. Let reference navigable id be the value of the referenceContext field of command parameters, if present, or null otherwise.

  3. If reference navigable id is not null, let reference navigable be the result of trying to get a navigable with reference navigable id. Otherwise let reference navigable be null.

  4. If reference navigable is not null and is not a top-level traversable, return error with error code invalid argument.

  5. If the implementation is unable to create a new top-level traversable for any reason then return error with error code unsupported operation.

  6. Let user context be the default user context if reference navigable is null, and reference navigableassociated user context otherwise.

  7. Let user context id be the value of the userContext field of command parameters if present, or null otherwise.

  8. If user context id is not null, set user context to the result of trying to get user context with user context id.

  9. If user context is null, return error with error code no such user context.

  10. If the implementation is unable to create a new top-level traversable with associated user context user context for any reason, return error with error code unsupported operation.

  11. Let traversable be the result of trying to create a new top-level traversable steps with null and empty string, and setting the associated user context for the newly created top-level traversable to user context. Which OS window the new top-level traversable is created in depends on type and reference navigable:

  12. If the value of the command parametersbackground field is false:

    1. Let activate result be the result of activate a navigable with the newly created navigable.

    2. If activate result is an error, return activate result.

    Note: Do not invoke the focusing steps for the created navigable if background is true.

  13. Let body be a map matching the browsingContext.CreateResult production, with the context field set to traversable’s navigable id.

  14. Return success with data body.

7.3.3.5. The browsingContext.getTree Command

The browsingContext.getTree command returns a tree of all descendent navigables including the given parent itself, or all top-level contexts when no parent is provided.

Command Type
browsingContext.GetTree = (
  method: "browsingContext.getTree",
  params: browsingContext.GetTreeParameters
)

browsingContext.GetTreeParameters = {
  ? maxDepth: js-uint,
  ? root: browsingContext.BrowsingContext,
}
Result Type
browsingContext.GetTreeResult = {
  contexts: browsingContext.InfoList
}
The remote end steps with session and command parameters are:
  1. Let root id be the value of the root field of command parameters if present, or null otherwise.

  2. Let max depth be the value of the maxDepth field of command parameters if present, or null otherwise.

  3. Let navigables be an empty list.

  4. If root id is not null, append the result of trying to get a navigable given root id to navigables. Otherwise append all top-level traversables to navigables.

  5. Let navigables infos be an empty list.

  6. For each navigable of navigables:

    1. Let info be the result of get the navigable info given navigable, max depth, and true.

    2. Append info to navigables infos

  7. Let body be a map matching the browsingContext.GetTreeResult production, with the contexts field set to navigables infos.

  8. Return success with data body.

7.3.3.6. The browsingContext.handleUserPrompt Command

The browsingContext.handleUserPrompt command allows closing an open prompt

Command Type
browsingContext.HandleUserPrompt = (
  method: "browsingContext.handleUserPrompt",
  params: browsingContext.HandleUserPromptParameters
)

browsingContext.HandleUserPromptParameters = {
  context: browsingContext.BrowsingContext,
  ? accept: bool,
  ? userText: text,
}
Result Type
EmptyResult

The remote end steps with session and command parameters are:

  1. Let navigable id be the value of the context field of command parameters.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. Let accept be the value of the accept field of command parameters if present, or true otherwise.

  4. Let userText be the value of the userText field of command parameters if present, or the empty string otherwise.

  5. If navigable is currently showing a simple dialog from a call to alert then acknowledge the prompt.

    Otherwise if navigable is currently showing a simple dialog from a call to confirm, then respond positively if accept is true, or respond negatively if accept is false.

    Otherwise if navigable is currently showing a simple dialog from a call to prompt, then respond with the string value userText if accept is true, or abort if accept is false.

    Otherwise, if navigable is currently showing a prompt as part of the prompt to unload steps, then confirm the navigation if accept is true, otherwise refuse the navigation.

    Otherwise return error with error code no such alert.

  6. Return success with data null.

7.3.3.7. The browsingContext.locateNodes Command

The browsingContext.locateNodes command returns a list of all nodes matching the specified locator.

Command Type
browsingContext.LocateNodes = (
  method: "browsingContext.locateNodes",
  params: browsingContext.LocateNodesParameters
)

browsingContext.LocateNodesParameters = {
   context: browsingContext.BrowsingContext,
   locator: browsingContext.Locator,
   ? maxNodeCount: (js-uint .ge 1),
   ? serializationOptions: script.SerializationOptions,
   ? startNodes: [ + script.SharedReference ]
}
Result Type
browsingContext.LocateNodesResult = {
    nodes: [ * script.NodeRemoteValue ]
}
To locate nodes using CSS with given navigable, context nodes, selector, maximum returned node count, and session:
  1. Let returned nodes be an empty list.

  2. Let parse result be the result of parse a selector given selector.

  3. If parse result is failure, return error with error code invalid selector.

  4. For each context node of context nodes:

    1. Let elements be the result of match a selector against a tree with parse result and navigable’s active document root using scoping root context node.

    2. For each element in elements:

      1. Append element to returned nodes.

      2. If maximum returned node count is not null and size of returned nodes is equal to maximum returned node count, return success with data returned nodes.

  5. Return success with data returned nodes.

To locate nodes using XPath with given navigable, context nodes, selector, and maximum returned node count:

Note: Owing to the unmaintained state of the XPath specification, this algorithm is phrased as if making calls to the XPath DOM APIs. However this is to be understood as equivalent to spec-internal calls directly accessing the underlying algorithms, without going via the ECMAScript runtime.

  1. Let returned nodes be an empty list.

  2. For each context node of context nodes:

    1. Let evaluate result be the result of calling evaluate on navigable’s active document, with arguments selector, context node, null, ORDERED_NODE_SNAPSHOT_TYPE, and null. If this throws a "SyntaxError" DOMException, return error with error code invalid selector; otherwise, if this throws any other exception return error with error code unknown error.

    2. Let index be 0.

    3. Let length be the result of getting the snapshotLength property from evaluate result.

    4. Repeat, while index is less than length:

      1. Let node be the result of calling snapshotItem with evaluate result as this and index as the argument.

      2. Append node to returned nodes.

      3. If maximum returned node count not null and size of returned nodes is equal to maximum returned node count, return success with data returned nodes.

      4. Set index to index + 1.

  3. Return success with data returned nodes.

To locate nodes using inner text with given context nodes, selector, max depth, match type, ignore case, and maximum returned node count:
  1. If selector is the empty string, return error with error code invalid selector.

  2. Let returned nodes be an empty list.

  3. If ignore case is false, let search text be selector. Otherwise, let search text be the result of toUppercase with selector according to the Unicode Default Case Conversion algorithm.

  4. For each context node in context nodes:

    1. If context node implements Document or DocumentFragment:

      Note: when traversing the document or document fragment, max depth is not decreased intentionally to make the search result with document and document.documentElement equivalent.

      1. Let child nodes be an empty list.

      2. For each node child in the children of context node.

        1. Append child to child nodes.

      3. Extend returned nodes with the result of trying to locate nodes using inner text with child nodes, selector, max depth, match type, ignore case, and maximum returned node count.

    2. If context node does not implement HTMLElement then continue.

    3. Let node inner text be the result of calling the innerText getter steps with context node as the this value.

    4. If ignore case is false, let node text be node inner text. Otherwise, let node text be the result of toUppercase with node inner text according to the Unicode Default Case Conversion algorithm.

    5. If search text is a code point substring of node text, perform the following steps:

      1. Let child nodes be an empty list and, for each node child in the children of context node:

        1. Append child to child nodes.

      2. If size of child nodes is equal to 0 or max depth is equal to 0, perform the following steps:

        1. If match type is "full" and node text is search text, append context node to returned nodes.

        2. Otherwise, if match type is "partial", append context node to returned nodes.

      3. Otherwise, perform the following steps:

        1. Let child max depth be null if max depth is null, or max depth - 1 otherwise.

        2. Let child node matches be the result of locate nodes using inner text with child nodes, selector, child max depth , match type, ignore case, and maximum returned node count.

        3. If size of child node matches is equal to 0 and match type is "partial", append context node to returned nodes. Otherwise, extend returned nodes with child node matches.

  5. If maximum returned node count is not null, remove all entries in returned nodes with an index greater than or equal to maximum returned node count.

  6. Return success with data returned nodes.

To collect nodes using accessibility attributes with given context nodes, selector, maximum returned node count, and returned nodes:
  1. If returned nodes is null:

    1. Set returned nodes to an empty list.

  2. For each context node in context nodes:

    1. Let match be true.

    2. If context node implements Element:

      1. If selector contains "role":

        1. Let role be the computed role of context node.

        2. If selector["role"] is not role:

          1. Set match to false.

      2. If selector contains "name":

        1. Let name be the accessible name of context node.

        2. If selector["name"] is not name:

          1. Set match to false.

    3. Otherwise, set match to false.

    4. If match is true:

      1. If maximum returned node count is not null and size of returned nodes is equal to maximum returned node count, break.

      2. Append context node to returned nodes.

    5. Let child nodes be an empty list and, for each node child in the children of context node:

      1. If child implements Element, append child to child nodes.

    6. Try to collect nodes using accessibility attributes with child nodes, selector, maximum returned node count, and returned nodes.

  3. Return returned nodes.

To locate nodes using accessibility attributes with given context nodes, selector, and maximum returned node count:
  1. If selector does not contain "role" and selector does not contain "name", return error with error code invalid selector.

  2. Return the result of collect nodes using accessibility attributes with context nodes, selector, maximum returned node count, and null.

The remote end steps with session and command parameters are:
  1. Let navigable id be command parameters["context"].

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. Assert: navigable is not null.

  4. Let realm be the result of trying to get a realm from a navigable with navigable id of navigable and null.

  5. Let locator be command parameters["locator"].

  6. If command parameters contains "startNodes", let start nodes parameter be command parameters["startNodes"]. Otherwise let start nodes parameter be null.

  7. If command parameters contains "maxNodeCount", let maximum returned node count be command parameters["maxNodeCount"]. Otherwise, let maximum returned node count be null.

  8. Let context nodes be an empty list.

  9. If start nodes parameter is null, append the document element of navigable’s active document to context nodes. Otherwise, for each serialized start node in start nodes parameter:

    1. Let start node be the result of trying to deserialize shared reference given serialized start node, realm and session.

    2. Append start node to context nodes.

  10. Assert size of context nodes is greater than 0.

  11. Let type be locator["type"].

  12. In the following list of conditions and associated steps, run the first set of steps for which the associated condition is true:

    type is the string "css"
    1. Let selector be locator["value"].

    2. Let result nodes be a result of trying to locate nodes using css given navigable, context nodes, selector and maximum returned nodes.

    type is the string "xpath"
    1. Let selector be locator["value"].

    2. Let result nodes be a result of trying to locate nodes using xpath given navigable, context nodes, selector and maximum returned nodes.

    type is the string "innerText"
    1. Let selector be locator["value"].

    2. If locator contains maxDepth, let max depth be locator["maxDepth"]. Otherwise, let max depth be null.

    3. If locator contains ignoreCase, let ignore case be locator["ignoreCase"]. Otherwise, let ignore case be false.

    4. If locator contains matchType, let match type be locator["matchType"]. Otherwise, let match type be "full".

    5. Let result nodes be a result of trying to locate nodes using inner text given context nodes, selector, max depth, match type, ignore case and maximum returned node count.

    type is the string "accessibility"
    1. Let selector be locator["value"].

    2. Let result nodes be locate nodes using accessibility attributes given context nodes, selector, and maximum returned node count.

  13. Assert: maximum returned node count is null or size of result nodes is less than or equal to maximum returned node count.

  14. If command parameters contains "serializationOptions", let serialization options be command parameters["serializationOptions"]. Otherwise, let serialization options be a map matching the script.SerializationOptions production with the fields set to their default values.

  15. Let result ownership be "none".

  16. Let serialized nodes be an empty list.

  17. For each result node in result nodes:

    1. Let serialized node be the result of serialize as a remote value with result node, serialization options, result ownership, a new map as serialization internal map, realm and session.

    2. Append serialized node to serialized nodes.

  18. Let result be a map matching the browsingContext.LocateNodesResult production, with the nodes field set serialized nodes.

  19. Return success with data result.

7.3.3.8. The browsingContext.navigate Command

The browsingContext.navigate command navigates a navigable to the given URL.

Command Type
browsingContext.Navigate = (
  method: "browsingContext.navigate",
  params: browsingContext.NavigateParameters
)

browsingContext.NavigateParameters = {
  context: browsingContext.BrowsingContext,
  url: text,
  ? wait: browsingContext.ReadinessState,
}
Result Type
browsingContext.NavigateResult = {
  navigation: browsingContext.Navigation / null,
  url: text,
}
The remote end steps with session and command parameters are:
  1. Let navigable id be the value of the context field of command parameters.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. Assert: navigable is not null.

  4. Let wait condition be the value of the wait field of command parameters if present, or "none" otherwise.

  5. Let url be the value of the url field of command parameters.

  6. Let document be navigable’s active document.

  7. Let base be document’s base URL.

  8. Let url record be the result of applying the URL parser to url, with base URL base.

  9. If url record is failure, return error with error code invalid argument.

  10. Let request be a new request whose URL is url record.

  11. Return the result of await a navigation with navigable, request and wait condition.

7.3.3.9. The browsingContext.print Command

The browsingContext.print command creates a paginated representation of a document, and returns it as a PDF document represented as a Base64-encoded string.

Command Type
browsingContext.Print = (
  method: "browsingContext.print",
  params: browsingContext.PrintParameters
)

browsingContext.PrintParameters = {
  context: browsingContext.BrowsingContext,
  ? background: bool .default false,
  ? margin: browsingContext.PrintMarginParameters,
  ? orientation: ("portrait" / "landscape") .default "portrait",
  ? page: browsingContext.PrintPageParameters,
  ? pageRanges: [*(js-uint / text)],
  ? scale: (0.1..2.0) .default 1.0,
  ? shrinkToFit: bool .default true,
}

browsingContext.PrintMarginParameters = {
  ? bottom: (float .ge 0.0) .default 1.0,
  ? left: (float .ge 0.0) .default 1.0,
  ? right: (float .ge 0.0) .default 1.0,
  ? top: (float .ge 0.0) .default 1.0,
}

; Minimum size is 1pt x 1pt. Conversion follows from
; https://www.w3.org/TR/css3-values/#absolute-lengths
browsingContext.PrintPageParameters = {
  ? height: (float .ge 0.0352) .default 27.94,
  ? width: (float .ge 0.0352) .default 21.59,
}
Result Type
browsingContext.PrintResult = {
  data: text
}

The remote end steps with session and command parameters are:

  1. Let navigable id be the value of the context field of command parameters.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. If the implementation is unable to provide a paginated representation of navigable for any reason then return error with error code unsupported operation.

  4. Let margin be the value of the margin field of command parameters if present, or otherwise a map matching the browsingContext.PrintMarginParameters with the fields set to their default values.

  5. Let page size be the value of the page field of command parameters if present, or otherwise a map matching the browsingContext.PrintPageParameters with the fields set to their default values.

Note: The minimum page size is 1 point, which is (2.54 / 72) cm as per absolute lengths.

  1. Let page ranges be the value of the pageRanges field of command parameters if present or an empty list otherwise.

  2. Let document be navigable’s active document.

  3. Immediately after the next invocation of the run the animation frame callbacks algorithm for document:

    This ought to be integrated into the update rendering algorithm in some more explicit way.

    1. Let pdf data be the result taking UA-specific steps to generate a paginated representation of document, with the CSS media type set to print, encoded as a PDF, with the following paper settings:

      Property Value
      Width in cm page size["width"] if command parameters["orientation"] is "portrait" otherwise page size["height"]
      Height in cm page size["height"] if command parameters["orientation"] is "portrait" otherwise page size["width"]
      Top margin, in cm margin["top"]
      Bottom margin, in cm margin["bottom"]
      Left margin, in cm margin["left"]
      Right margin, in cm margin["right"]

      In addition, the following formatting hints should be applied by the UA:

      If command parameters["scale"] is not equal to 1:
      Zoom the size of the content by a factor command parameters["scale"]
      If command parameters["background"] is false:
      Suppress output of background images
      If command parameters["shrinkToFit"] is true:
      Resize the content to match the page width, overriding any page width specified in the content
    2. If page ranges is not empty, let pages be the result of trying to parse a page range with page ranges and the number of pages contained in pdf data, then remove any pages from pdf data whose one-based index is not contained in pages.

    3. Let encoding result be the result of calling Base64 Encode on pdf data.

    4. Let encoded data be encoding result’s data.

    5. Let body be a map matching the browsingContext.PrintResult production, with the data field set to encoded data.

    6. Return success with data body.

7.3.3.10. The browsingContext.reload Command

The browsingContext.reload command reloads a navigable.

Command Type
browsingContext.Reload = (
  method: "browsingContext.reload",
  params: browsingContext.ReloadParameters
)

browsingContext.ReloadParameters = {
  context: browsingContext.BrowsingContext,
  ? ignoreCache: bool,
  ? wait: browsingContext.ReadinessState,
}
Return Type
browsingContext.NavigateResult
The remote end steps with command parameters are:
  1. Let navigable id be the value of the context field of command parameters.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. Assert: navigable is not null.

  4. Let ignore cache be the the value of the ignoreCache field of command parameters if present, or false otherwise.

  5. Let wait condition be the value of the wait field of command parameters if present, or "none" otherwise.

  6. Let document be navigable’s active document.

  7. Let url be document’s URL.

  8. Let request be a new request whose URL is url.

  9. Return the result of await a navigation with navigable, request, wait condition, history handling "reload", and ignore cache ignore cache.

7.3.3.11. The browsingContext.setViewport Command

The browsingContext.setViewport command modifies specific viewport characteristics (e.g. viewport width and viewport height) on the given top-level traversable.

Command Type
browsingContext.SetViewport = (
  method: "browsingContext.setViewport",
  params: browsingContext.SetViewportParameters
)

browsingContext.SetViewportParameters = {
  context: browsingContext.BrowsingContext,
  ? viewport: browsingContext.Viewport / null,
  ? devicePixelRatio: (float .gt 0.0) / null,
}

browsingContext.Viewport = {
  width: js-uint,
  height: js-uint,
}
Result Type
EmptyResult

The remote end steps with command parameters are:

  1. Let navigable id be the value of the context field of command parameters.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. If navigable is not a top-level traversable, return error with error code invalid argument.

  4. If the implementation is unable to adjust the navigable’s layout viewport parameters with the given command parameters for any reason, return error with error code unsupported operation.

  5. If command parameters contains the viewport field:

    1. Let viewport be the command parameters["viewport"].

    2. If viewport is not null, set the width of navigable’s layout viewport to be the width of viewport in CSS pixels and set the height of the navigable’s layout viewport to be the height of viewport in CSS pixels.

    3. Otherwise, set the navigable’s layout viewport to the implementation-defined default.

  6. Run the CSSOM View § 13.1 Resizing viewports steps.

  7. If command parameters contains the devicePixelRatio field:

    1. Let device pixel ratio be the command parameters["devicePixelRatio"].

    2. For the navigable and all descendant navigables:

      1. Let navigable be the navigable whose active document is navigable’s active document.

      2. If device pixel ratio is not null:

        1. When the select an image source from a source set are run, act as if the implementation’s pixel density was set to device pixel ratio when selecting an image.

        2. For the purposes of the resolution media feature, act as if the implementation’s resolution is device pixel ratio dppx scaled by the page zoom.

        3. Set device pixel ratio overrides[navigable] to device pixel ratio.

          Note: This will take an effect because of the patch of § 8.3.1 Determine the device pixel ratio.

      3. Otherwise:

        1. When the select an image source from a source set steps are run, use the implementation’s default behavior, without any changes made by previous invocations of these steps.

        2. For the purposes of the resolution media feature, use the implementation’s default behavior, without any changes made by previous invocations of these steps.

        3. Remove navigable from device pixel ratio overrides.

      4. Run evaluate media queries and report changes for document currently loaded in a specified navigable.

  8. Return success with data null.

7.3.3.12. The browsingContext.traverseHistory Command

The browsingContext.traverseHistory command traverses the history of a given navigable by a delta.

Command Type
browsingContext.TraverseHistory = (
  method: "browsingContext.traverseHistory",
  params: browsingContext.TraverseHistoryParameters
)

browsingContext.TraverseHistoryParameters = {
  context: browsingContext.BrowsingContext,
  delta: js-int,
}
Return Type
browsingContext.TraverseHistoryResult = {
}
The remote end steps with command parameters are:
  1. Let navigable be the result of trying to get a navigable with command parameters["context"].

  2. If navigable is not a top-level traversable, return error with error code invalid argument.

  3. Assert: navigable is not null.

  4. Let delta be command parameters["delta"].

  5. Let resume id be a unique string.

  6. Queue a task on navigable’s session history traversal queue to run the following steps:

    1. Let all steps be the result of getting all used history steps for navigable.

    2. Let current index be the index of navigable’s current session history step within all steps.

    3. Let target index be current index plus delta.

    4. Let valid entry be false if all steps[target index] does not exist, or true otherwise.

    5. Resume with "check history", resume id, and valid entry.

  7. Let is valid entry be await with «"check history"», and resume id.

  8. If is valid entry is false, return error with error code no such history entry.

  9. Traverse the history by a delta given delta and navigable.

    There is a race condition in the algorithm as written because by the time we try to navigate the target session history entry might not exist. Once we support waiting for history to navigate we can handle this more robustly.

  10. TODO: Support waiting for the history traversal to complete.

  11. Let body be a map matching the browsingContext.TraverseHistoryResult production.

  12. Return success with data body.

The WebDriver BiDi page show steps given context and navigation status are:

Do we want to expose a `browsingContext.pageShow event? In that case we’d need to call this whenever `pageshow` is going to be emitted, not just on bfcache restore, and also add the persisted status to the data.

  1. Let navigation id be navigation status’s id.

  2. Resume with "page show", navigation id, and navigation status.

The WebDriver BiDi pop state steps given context and navigation status are:

  1. Let navigation id be navigation status’s id.

  2. Resume with "pop state", navigation id, and navigation status.

7.3.4. Events

7.3.4.1. The browsingContext.contextCreated Event
Event Type
browsingContext.ContextCreated = (
 method: "browsingContext.contextCreated",
 params: browsingContext.Info
)

To Recursively emit context created events given session and navigable:

  1. Emit a context created event with session and navigable.

  2. For each child navigable, child, of navigable:

    1. Recursively emit context created events given session and child.

To Emit a context created event given session and navigable:

  1. Let params be the result of get the navigable info given navigable, 0, and true.

  2. Let body be a map matching the browsingContext.ContextCreated production, with the params field set to params.

  3. Emit an event with session and body.

The remote end event trigger is the WebDriver BiDi navigable created steps given navigable and opener navigable:

  1. Set navigable’s original opener to opener navigable, if opener navigable is provided.

  2. If the navigable cache behavior with navigable is "bypass", then perform implementation-defined steps to disable any implementation-specific resource caches for network requests originating from navigable.

  3. Let related navigables be a set containing navigable.

  4. For each session in the set of sessions for which an event is enabled given "browsingContext.contextCreated" and related navigables:

    1. Emit a context created event given session and navigable.

The remote end subscribe steps, with subscribe priority 1, given session, navigables and include global are:

  1. For each navigable in navigables:

    1. Recursively emit context created events given session and navigable.

7.3.4.2. The browsingContext.contextDestroyed Event
Event Type
browsingContext.ContextDestroyed = (
 method: "browsingContext.contextDestroyed",
 params: browsingContext.Info
)
The remote end event trigger is:

The remote end event trigger is the WebDriver BiDi navigable destroyed steps given navigable:

  1. Let params be the result of get the navigable info, given navigable, null, and true.

  2. Let body be a map matching the browsingContext.ContextDestroyed production, with the params field set to params.

  3. Let related navigables be a set containing navigable’s parent, if that is not null, or an empty set otherwise.

  4. For each session in the set of sessions for which an event is enabled given "browsingContext.contextDestroyed" and related navigables:

    1. Emit an event with session and body.

    2. Let subscriptions to remove be a set.

    3. For each subscription in session’s subscriptions:

      1. If subscription’s top-level traversable ids contains navigable’s navigable id;

        1. Remove navigable’s navigable id from subscription’s top-level traversable ids.

        2. If subscription’s top-level traversable ids is empty:

          1. Append subscription to subscriptions to remove.

    4. Remove subscriptions to remove from session’s subscriptions.

It’s unclear if we ought to only fire this event for browsing contexts that have active documents; navigation can also cause contexts to become inaccessible but not yet get discarded because bfcache.

7.3.4.3. The browsingContext.navigationStarted Event
Event Type
browsingContext.NavigationStarted = (
 method: "browsingContext.navigationStarted",
 params: browsingContext.NavigationInfo
)
The remote end event trigger is the WebDriver BiDi navigation started steps given navigable and navigation status:
  1. Let params be the result of get the navigation info given navigable and navigation status.

  2. Let body be a map matching the browsingContext.NavigationStarted production, with the params field set to params.

  3. Let navigation id be navigation status’s id.

  4. Let related navigables be a set containing navigable.

  5. Resume with "navigation started", navigation id, and navigation status.

  6. For each session in the set of sessions for which an event is enabled given "browsingContext.navigationStarted" and related navigables:

    1. Emit an event with session and body.

7.3.4.4. The browsingContext.fragmentNavigated Event
Event Type
browsingContext.FragmentNavigated = (
 method: "browsingContext.fragmentNavigated",
 params: browsingContext.NavigationInfo
)
The remote end event trigger is the WebDriver BiDi fragment navigated steps given navigable and navigation status:
  1. Let params be the result of get the navigation info given navigable and navigation status.

  2. Let body be a map matching the browsingContext.FragmentNavigated production, with the params field set to params.

  3. Let navigation id be navigation status’s id.

  4. Let related navigable be a set containing navigable.

  5. Resume with "fragment navigated", navigation id, and navigation status.

  6. For each session in the set of sessions for which an event is enabled given "browsingContext.fragmentNavigated" and related navigable:

    1. Emit an event with session and body.

7.3.4.5. The browsingContext.historyUpdated Event
Event Type
browsingContext.HistoryUpdated = (
  method: "browsingContext.historyUpdated",
  params: browsingContext.HistoryUpdatedParameters
)

browsingContext.HistoryUpdatedParameters = {
  context: browsingContext.BrowsingContext,
  url: text
}
The remote end event trigger is the WebDriver BiDi history updated steps given navigable:
  1. Let url be the result of running the URL serializer, given navigable’s active browsing context's active document's URL.

  2. Let params be a map matching the browsingContext.HistoryUpdatedParameters production, with the url field set to url and the context field set to navigable’s navigable id.

  3. Let body be a map matching the browsingContext.HistoryUpdated production, with the params field set to params.

  4. Let related browsing contexts be a set containing navigable’s active browsing context.

  5. For each session in the set of sessions for which an event is enabled given "browsingContext.historyUpdated" and related browsing contexts:

    1. Emit an event with session and body.

7.3.4.6. The browsingContext.domContentLoaded Event
Event Type
browsingContext.DomContentLoaded = (
 method: "browsingContext.domContentLoaded",
 params: browsingContext.NavigationInfo
)
The remote end event trigger is the WebDriver BiDi DOM content loaded steps given navigable and navigation status:
  1. Let params be the result of get the navigation info given navigable and navigation status.

  2. Let body be a map matching the browsingContext.DomContentLoaded production, with the params field set to params.

  3. Let related navigables be a set containing navigable.

  4. Let navigation id be navigation status’s id.

  5. Resume with "domContentLoaded", navigation id, and navigation status.

  6. For each session in the set of sessions for which an event is enabled given "browsingContext.domContentLoaded" and related navigables:

    1. Emit an event with session and body.

7.3.4.7. The browsingContext.load Event
Event Type
browsingContext.Load = (
 method: "browsingContext.load",
 params: browsingContext.NavigationInfo
)
The remote end event trigger is the WebDriver BiDi load complete steps given navigable and navigation status:
  1. Let params be the result of get the navigation info given navigable and navigation status.

  2. Let body be a map matching the browsingContext.Load production, with the params field set to params.

  3. Let related navigables be a set containing navigable.

  4. Let navigation id be navigation status’s id.

  5. Resume with "load", navigation id and navigation status.

  6. For each session in the set of sessions for which an event is enabled given "browsingContext.load" and related navigables:

    1. Emit an event with session and body.

7.3.4.8. The browsingContext.downloadWillBegin Event
Event Type
browsingContext.DownloadWillBegin = (
 method: "browsingContext.downloadWillBegin",
 params: browsingContext.NavigationInfo
)
The remote end event trigger is the WebDriver BiDi download started steps given navigable and navigation status:
  1. Let params be the result of get the navigation info given navigable and navigation status.

  2. Let body be a map matching the browsingContext.DownloadWillBegin production, with the params field set to params.

  3. Let navigation id be navigation status’s id.

  4. Let related navigables be a set containing navigable.

  5. Resume with "download started", navigation id, and navigation status.

  6. For each session in the set of sessions for which an event is enabled given "browsingContext.downloadWillBegin" and related navigables:

    1. Emit an event with session and body.

7.3.4.9. The browsingContext.navigationAborted Event
Event Type
browsingContext.NavigationAborted = (
 method: "browsingContext.navigationAborted",
 params: browsingContext.NavigationInfo
)
The remote end event trigger is the WebDriver BiDi navigation aborted steps given navigable and navigation status:
  1. Let params be the result of get the navigation info given navigable and navigation status.

  2. Let body be a map matching the browsingContext.NavigationAborted production, with the params field set to params.

  3. Let navigation id be navigation status’s id.

  4. Let related navigables be a set containing navigable.

  5. Resume with "navigation aborted", navigation id, and navigation status.

  6. For each session in the set of sessions for which an event is enabled given "browsingContext.navigationAborted" and related navigables:

    1. Emit an event with session and body.

7.3.4.10. The browsingContext.navigationFailed Event
Event Type
browsingContext.NavigationFailed = (
 method: "browsingContext.navigationFailed",
 params: browsingContext.NavigationInfo
)
The remote end event trigger is the WebDriver BiDi navigation failed steps given navigable and navigation status:
  1. Let params be the result of get the navigation info given navigable and navigation status.

  2. Let body be a map matching the browsingContext.NavigationFailed production, with the params field set to params.

  3. Let navigation id be navigation status’s id.

  4. Let related navigables be a set containing navigable.

  5. Resume with "navigation failed", navigation id, and navigation status.

  6. For each session in the set of sessions for which an event is enabled given "browsingContext.navigationFailed" and related navigables:

    1. Emit an event with session and body.

7.3.4.11. The browsingContext.userPromptClosed Event
Event Type
browsingContext.UserPromptClosed = (
  method: "browsingContext.userPromptClosed",
  params: browsingContext.UserPromptClosedParameters
)

browsingContext.UserPromptClosedParameters = {
  context: browsingContext.BrowsingContext,
  accepted: bool,
  type: browsingContext.UserPromptType,
  ? userText: text
}
The remote end event trigger is the WebDriver BiDi user prompt closed steps given window, type, accepted and optional user text (default: null).
  1. Let navigable be window’s navigable.

  2. Let navigable id be the navigable id for navigable.

  3. Let params be a map matching the browsingContext.UserPromptClosedParameters production with the context field set to navigable id, the accepted field set to accepted, the type field set to type, and the userText field set to user text if user text is not null or omitted otherwise.

  4. Let body be a map matching the BrowsingContextUserPromptClosedEvent production, with the params field set to params.

  5. Let related navigables be a set containing navigable.

  6. For each session in the set of sessions for which an event is enabled given "browsingContext.userPromptClosed" and related navigables:

    1. Emit an event with session and body.

7.3.4.12. The browsingContext.userPromptOpened Event
Event Type
browsingContext.UserPromptOpened = (
  method: "browsingContext.userPromptOpened",
  params: browsingContext.UserPromptOpenedParameters
)

browsingContext.UserPromptOpenedParameters = {
  context: browsingContext.BrowsingContext,
  handler: session.UserPromptHandlerType,
  message: text,
  type: browsingContext.UserPromptType,
  ? defaultValue: text
}
The remote end event trigger is the WebDriver BiDi user prompt opened steps given window, type, message, and optional default value (default: null).
  1. Let navigable be window’s navigable.

  2. Let navigable id be the navigable id for navigable.

  3. Let handler configuration be get the prompt handler with type.

  4. Let handler be handler configuration’s handler.

  5. Let params be a map matching the browsingContext.UserPromptOpenedParameters production with the context field set to navigable id, the type field set to type, the message field set to message, the defaultValue field set to default value if default value is not null or omitted otherwise, and the handler field set to handler.

  6. Let body be a map matching the browsingContext.UserPromptOpened production, with the params field set to params.

  7. Let related navigables be a set containing navigable.

  8. For each session in the set of sessions for which an event is enabled given "browsingContext.userPromptOpened" and related navigables:

    1. Emit an event with session and body.

  9. If handler is "ignore", set handler to "none".

  10. Return handler.

7.4. The network Module

The network module contains commands and events relating to network requests.

7.4.1. Definition

remote end definition

NetworkCommand = (
  network.AddIntercept //
  network.ContinueRequest //
  network.ContinueResponse //
  network.ContinueWithAuth //
  network.FailRequest //
  network.ProvideResponse //
  network.RemoveIntercept //
  network.SetCacheBehavior
)

local end definition

NetworkResult = (
   network.AddInterceptResult
)

NetworkEvent = (
    network.AuthRequired //
    network.BeforeRequestSent //
    network.FetchError //
    network.ResponseCompleted //
    network.ResponseStarted
)

A remote end has a before request sent map which is initially an empty map. It’s used to track the network events for which a network.beforeRequestSent event has already been sent.

A remote end has a default cache behavior which is a string. It is initially "default".

A remote end has a navigable cache behavior map which is a weak map between top-level traversables and strings representing cache behavior. It is initially empty.

7.4.2. Network Intercepts

A network intercept is a mechanism to allow remote ends to intercept and modify network requests and responses.

A BiDi session has an intercept map which is a map between intercept id and a struct with fields url patterns, phases, and browsingContexts that define the properties of active network intercepts. It is initially empty.

A BiDi session has a blocked request map, used to track the requests which are actively being blocked. It is an map between request id and a struct with fields request, phase, and response. It is initially empty.

To get the network intercepts given session, event, request, and navigable id:

  1. Let session intercepts be session’s intercept map.

  2. Let intercepts be an empty list.

  3. Run the steps under the first matching condition:

    event is "network.beforeRequestSent"
    Set phase to "beforeRequestSent".
    event is "network.responseStarted"
    Set phase to "responseStarted".
    event is "network.authRequired"
    Set phase to "authRequired".
    event is "network.responseCompleted"
    Return intercepts.
  4. Let url be the result of running the URL serializer with request’s URL.

  5. For each intercept idintercept of session intercepts:

    1. If intercept’s contexts is not null:

      1. If intercept’s contexts does not contain navigable id:

        1. Continue.

    2. If intercept’s phases contains phase:

      1. Let url patterns be intercept’s url patterns.

      2. If url patterns is empty:

        1. Append intercept id to intercepts.

        2. Continue.

      3. For each url pattern in url patterns:

        1. If match URL pattern with url pattern and url:

          1. Append intercept id to intercepts.

          2. Break.

  6. Return intercepts.

To update the response given session, command and command parameters:
  1. Let blocked requests be session’s blocked request map.

  2. Let request id be command parameters["request"].

  3. If blocked requests does not contain request id then return error with error code no such request.

  4. Let (request, phase, response) be blocked requests[request id].

  5. If phase is "beforeRequestSent" and command is "continueResponse", return error with error code "invalid argument".

    TODO: Consider a different error

  6. If response is null:

    1. Assert: phase is "beforeRequestSent".

    2. Set response to a new response.

  7. If command parameters contains "statusCode":

    1. Set responses’s status be command parameters["statusCode"].

  8. If command parameters contains "reasonPhrase":

    1. Set responses’s status message be UTF-8 encode with command parameters["reasonPhrase"].

  9. If command parameters contains "headers":

    1. Let headers be an empty header list.

    2. For header in command parameters["headers"]:

      1. Let deserialized header be deserialize header with header.

      2. If deserialized header’s name does not match the field-name token production, return error with error code "invalid argument".

      3. If deserialized header’s value does not match the header value production, return error with error code "invalid argument".

      4. Append deserialized header to headers.

    3. Set response’s header list to headers.

  10. If command parameters contains "cookies":

    1. If command parameters contains "headers", let headers be response’s header list.

      Otherwise:

      1. Let headers be an empty header list.

      2. For each header in response’s headers list:

        1. Let name be header’s name.

        2. If byte-lowercase name is not `set-cookie`:

          1. Append header to headers

    2. For cookie in command parameters["cookies"]:

      1. Let header value be serialize set-cookie header with cookie.

      2. Append (`Set-Cookie`, header value) to headers.

      3. Set response’s header list to headers.

  11. If command parameters contains "credentials":

    This doesn’t have a way to cancel the auth.

    1. Let credentials be command parameters["credentials"].

    2. Assert: credentials["type"] is "password".

    3. Set response’s authentication credentials to (credentials["username"], credentials["password"])

  12. Return response

7.4.3. Types

7.4.3.1. The network.AuthChallenge Type
network.AuthChallenge = {
  scheme: text,
  realm: text,
}

To extract challenges given response:

Should we include parameters other than realm?

  1. If response’s status is 401, let header name be `WWW-Authenticate`. Otherwise if response’s status is 407, let header name be `Proxy-Authenticate`. Otherwise return null.

  2. Let challenges be a new list.

  3. For each (name, value) in response’s header list:

    as in Fetch it’s unclear if this is the right way to handle multiple headers, parsing issues, etc.

    1. If name is a byte-case-insensitive match for header name:

      1. Let header challenges be the result of parsing value into a list of challenges, each consisting of a scheme and a list of parameters, each of which is a tuple (name, value), according to the rules of [RFC9110].

      2. For each header challenge in header challenges:

        1. Let scheme be header challenge’s scheme.

        2. Let realm be the empty string.

        3. For each (param name, param value) in header challenge’s parameters:

          1. If param name equals `realm` let realm be UTF-8 decode param value.

        4. Let challenge be a new map matching the network.AuthChallenge production, with the scheme field set to scheme and the realm field set to realm.

      3. Append challenge to challenges.

  4. Return challenges.

7.4.3.2. The network.AuthCredentials Type
network.AuthCredentials = {
  type: "password",
  username: text,
  password: text,
}

The network.AuthCredentials type represents the response to a request for authorization credentials.

7.4.3.3. The network.BaseParameters Type
network.BaseParameters = (
    context: browsingContext.BrowsingContext / null,
    isBlocked: bool,
    navigation: browsingContext.Navigation / null,
    redirectCount: js-uint,
    request: network.RequestData,
    timestamp: js-uint,
    ? intercepts: [+network.Intercept]
)

The network.BaseParameters type is an abstract type representing the data that’s common to all network events.

Consider including the `sharedId` of the document node that initiated the request in addition to the context.

To process a network event given session, event, and request:
  1. Let request data be the result of get the request data with request.

  2. Let navigation be request’s navigation id.

  3. Let navigable id be null.

  4. Let top-level navigable id be null.

  5. If request’s window is an environment settings object:

    1. Let environment settings be request’s window

    2. If there is a navigable whose active window is environment settingsglobal object, set navigable id to the navigable id for that navigable, and set top-level navigable id to be top-level traversable's navigable id for that context.

  6. Let intercepts be the result of get the network intercepts with session, event, request, and top-level navigable id.

  7. Let redirect count be request’s redirect count.

  8. Let timestamp be a time value representing the current date and time in UTC.

  9. If intercepts is not empty, let is blocked be true, otherwise let is blocked be false.

  10. Let params be map matching the network.BaseParameters production, with the request field set to request data, the navigation field set to navigation, the context field set to navigable id, the timestamp field set to timestamp, the redirectCount field set to redirect count, the isBlocked field set to is blocked, and intercepts field set to intercepts if is blocked is true, or omitted otherwise.

  11. Return params

7.4.3.4. The network.BytesValue Type
network.BytesValue = network.StringValue / network.Base64Value;

network.StringValue = {
  type: "string",
  value: text,
}

network.Base64Value = {
  type: "base64",
  value: text,
}

The network.BytesValue type represents binary data sent over the network. Valid UTF-8 is represented with the network.StringValue type, any other data is represented in Base64-encoded form as network.Base64Value.

To deserialize protocol bytes given protocol bytes:

Note: this takes bytes encoded as a network.BytesValue and returns a byte sequence.

  1. If protocol bytes matches the network.StringValue production, let bytes be UTF-8 encode protocol bytes["value"].

  2. Otherwise if protocol bytes matches the network.Base64Value production. Let bytes be forgiving-base64 decode protocol bytes["value"].

  3. Return bytes.

To serialize protocol bytes given bytes:

Note: this takes a byte sequence and returns a network.BytesValue.

  1. Let text be UTF-8 decode without BOM or fail bytes.

  2. If text is failure, return a map matching the network.Base64Value production, with value set to forgiving-base64 encode bytes.

  3. Return a map matching the network.StringValue production, with value set to text.

Remote end definition and local end definition

network.SameSite = "strict" / "lax" / "none"

network.Cookie = {
    name: text,
    value: network.BytesValue,
    domain: text,
    path: text,
    size: js-uint,
    httpOnly: bool,
    secure: bool,
    sameSite: network.SameSite,
    ? expiry: js-uint,
    Extensible,
};

The network.Cookie type represents a cookie.

To serialize cookie given stored cookie:

Note: The definitions of stored cookie’s fields are from [COOKIES], except samesite-flag, which is from [SAME-SITE-COOKIES].

  1. Let name be the result of UTF-8 decode with stored cookie’s name field.

  2. Let value be serialize protocol bytes with stored cookie’s value.

  3. Let domain be stored cookie’s domain field.

  4. Let path be stored cookie’s path field.

  5. Let expiry be stored cookie’s expiry-time field represented as a unix timestamp, if set, or null otherwise.

  6. Let size be the byte length of the result of serializing stored cookie as it would be represented in a Cookie header.

  7. Let http only be true if stored cookie’s http-only-flag is true, or false otherwise.

  8. Let secure be true if stored cookie’s secure-only-flag is true, or false otherwise.

  9. Let same site be "none" if stored cookie’s samesite-flag is "None", "lax" if it is "Lax", or "strict" if it is "Strict".

  10. Return a map matching the network.Cookie production, with the name field set to name, the value field set to value, the domain field set to domain, the path field set to path, the expiry field set to expiry if it’s not null, or omitted otherwise, the size field set to size, the httpOnly field set to http only, the secure field set to secure, and the sameSite field set to same site.

7.4.3.6. The network.CookieHeader Type

Remote end definition

network.CookieHeader = {
    name: text,
    value: network.BytesValue,
};

The network.CookieHeader type represents the subset of cookie data that’s in a Cookie request header.

To serialize cookie header given protocol cookie:
  1. Let name be UTF-8 encode protocol cookie["name"].

  2. Let value be deserialize protocol bytes with protocol cookie["value"].

  3. Let header value be the byte sequence formed by concatenating name, `=`, and value

  4. Return header value.

7.4.3.7. The network.FetchTimingInfo Type

Remote end definition and local end definition

network.FetchTimingInfo = {
    timeOrigin: float,
    requestTime: float,
    redirectStart: float,
    redirectEnd: float,
    fetchStart: float,
    dnsStart: float,
    dnsEnd: float,
    connectStart: float,
    connectEnd: float,
    tlsStart: float,
    requestStart: float,
    responseStart: float,
    responseEnd: float,
};

The network.FetchTimingInfo type represents the time of each part of the request, relative to the time origin of the request's client.

To get the fetch timings given request:
  1. Let global be request’s client.

  2. If global is null, return a map matching the network.FetchTimingInfo production, with all fields set to 0.

  3. Let time origin be get time origin timestamp with global.

  4. Let timings be request’s fetch timing info.

  5. Let connection timing be timingsfinal connection timing info if it’s not null, or a new connection timing info otherwise.

  6. Let request time be convert fetch timestamp given timingsstart time and global.

  7. Let redirect start be convert fetch timestamp given timingsredirect start time and global.

  8. Let redirect end be convert fetch timestamp given timingsredirect end time and global.

  9. Let fetch start be convert fetch timestamp given timingspost-redirect start time and global.

  10. Let DNS start be convert fetch timestamp given connection timing’s domain lookup start time and global.

  11. Let DNS end be convert fetch timestamp given connection timing’s domain lookup end time and global.

  12. Let TLS start be convert fetch timestamp given connection timing’s secure connection start time and global.

  13. Let connect start be convert fetch timestamp given connection timing’s connection start time and global.

  14. Let connect end be convert fetch timestamp given connection timing’s connection end time and global.

  15. Let request start be convert fetch timestamp given timingsfinal network-request start time and global.

  16. Let response start be convert fetch timestamp given timingsfinal network-response start time and global.

  17. Let response end be convert fetch timestamp given timingsend time and global.

  18. Return a map matching the network.FetchTimingInfo production with the timeOrigin field set to time origin, the requestTime field set to request time, the redirectStart field set to redirect start, the redirectEnd field set to redirect end, the fetchStart field set to fetch start, the dnsStart field set to DNS start, the dnsEnd field set to DNS end, the connectStart field set to connect start, the connectEnd field set to connect end, the tlsStart field set to TLS start, the requestStart field set to request start, the responseStart field set to response start, and the responseEnd field set to response end.

TODO: Add service worker fields

7.4.3.8. The network.Header Type

Remote end definition and local end definition

network.Header = {
  name: text,
  value: network.BytesValue,
}

The network.Header type represents a single request header.

To serialize header given name bytes and value bytes:
  1. Let name be the result of UTF-8 decode with name bytes.

    Assert: Since header names are constrained to be ASCII-only this cannot fail.

  2. Let value be serialize protocol bytes with value bytes.

  3. Return a map matching the network.Header production, with the name field set to name, and the value field set to value.

To deserialize header given protocol header:
  1. Let name be UTF-8 encode protocol header["name"].

  2. Let value be deserialize protocol bytes with protocol header["value"].

  3. Return a header (name, value).

7.4.3.9. The network.Initiator Type

Remote end definition and local end definition

network.Initiator = {
    ? columnNumber: js-uint,
    ? lineNumber: js-uint,
    ? request: network.Request,
    ? stackTrace: script.StackTrace,
    ? type: "parser" / "script" / "preflight" / "other"
};

The network.Initiator type represents the source of a network request.

Note: The type field is included in the definition for backwards compatibility, but is no longer set by the get the initiator steps, and will be removed in a future revision of this specification. Its use is expected to be replaced by initiatorType and destination on network.RequestData.

Note: The request field is included in the definition for backwards compatibility, but is no longer set by the get the initiator steps, and will be removed in a future revision of this specification. The network.Initiator is included in the network.BeforeRequestSentParameters which also contain the same request id, making this information redundant. See § 7.4.3.3 The network.BaseParameters Type.

To get the initiator given request:
  1. If request’s initiator type is "fetch" or "xmlhttprequest":

    1. Let stack trace be the current stack trace.

    2. If stack trace has size of 1 or greater, let line number be value of the lineNumber field in stack trace[0], and let column number be the value of the columnNumber field in stack trace[0]. Otherwise let line number and column number be 0.

    Otherwise, let stack trace, column number, and line number all be null.

    TODO: Chrome includes the current parser position as column number / line number for parser-inserted resources.

  2. Return a map matching the network.Initiator production, the columnNumber field set to column number if it’s not null, or omitted otherwise, the lineNumber field set to line number if it’s not null, or omitted otherwise and the stackTrace field set to stack trace if it’s not null, or omitted otherwise.

7.4.3.10. The network.Intercept Type

Remote end definition and local end definition

network.Intercept = text

The network.Intercept type represents the id of a network intercept.

7.4.3.11. The network.Request Type

Remote end definition and local end definition

network.Request = text;

Each network request has an associated request id, which is a string uniquely identifying that request. The identifier for a request resulting from a redirect matches that of the request that initiated it.

7.4.3.12. The network.RequestData Type

Remote end definition and local end definition

network.RequestData = {
    request: network.Request,
    url: text,
    method: text,
    headers: [*network.Header],
    cookies: [*network.Cookie],
    headersSize: js-uint,
    bodySize: js-uint / null,
    destination: text,
    initiatorType: text / null,
    timings: network.FetchTimingInfo,
};

The network.RequestData type represents an ongoing network request.

To get the request data given request:

  1. Let request id be request’s request id.

  2. Let url be the result of running the URL serializer with request’s URL.

  3. Let method be request’s method.

  4. Let body size be null.

  5. Let body be request’s body.

  6. If body is a byte sequence, set body size to the length of that sequence. Otherwise, if body is a body then set body size to that body’s length.

  7. Let headers size be the size in bytes of request’s headers list when serialized as mandated by [HTTP11].

    Note: For protocols which allow header compression, this is the compressed size of the headers, as sent over the network.

  8. Let headers be an empty list.

  9. Let cookies be an empty list.

  10. For each (name, value) in request’s headers list:

    1. Append the result of serialize header with name and value to headers.

    2. If name is a byte-case-insensitive match for "Cookie" then:

      1. For each cookie in the user agent’s cookie store that are included in request:

        Note: [COOKIES] defines some baseline requirements for which cookies in the store can be included in a request, but user agents are free to impose additional constraints.

        1. Append the result of serialize cookie given cookie to cookies.

  11. Let destination be request’s destination.

  12. Let initiator type be request’s initiator type.

  13. Let timings be get the fetch timings with request.

  14. Return a map matching the network.RequestData production, with the request field set to request id, url field set to url, the method field set to method, the headers field set to headers, the cookies field set to cookies, the headersSize field set to headers size, the bodySize field set to body size, the destination field set to destination, the initiatorType field set to initiator type, and the timings field set to timings.

7.4.3.13. The network.ResponseContent Type

Remote end definition and local end definition

network.ResponseContent = {
    size: js-uint
};

The network.ResponseContent type represents the decoded response to a network request.

To get the response content info given response.
  1. Return a new map matching the network.ResponseContent production, with the size field set to response’s response body info's decoded size

7.4.3.14. The network.ResponseData Type

Remote end definition and local end definition

network.ResponseData = {
    url: text,
    protocol: text,
    status: js-uint,
    statusText: text,
    fromCache: bool,
    headers: [*network.Header],
    mimeType: text,
    bytesReceived: js-uint,
    headersSize: js-uint / null,
    bodySize: js-uint / null,
    content: network.ResponseContent,
    ?authChallenges: [*network.AuthChallenge],
};

The network.ResponseData type represents the response to a network request.

To get the protocol given response:

  1. Let protocol be the empty string.

  2. If response’s final connection timing info is not null, set protocol to response’s final connection timing info's ALPN negotiated protocol.

  3. If protocol is the empty string, or is equal to "unknown":

    1. Set protocol to response’s url's scheme

    2. If protocol is equal to either "http" or "https" and response has an associated HTTP Response.

      Note: [FETCH] isn’t clear about the relation between a HTTP network response and a response object.

      1. Let http version be the HTTP Response’s Status line’s HTTP-version [HTTP11].

      2. If http version starts with "HTTP/":

        1. Let version be the code unit substring of http version from 5 to http version’s length.

        2. If version is "0.9", set protocol to "http/0.9", otherwise if version is "1.0", set protocol to "http/1.0", otherwise if version is "1.1", set protocol to "http/1.1".

  4. Return protocol.

To get the response data given response:
  1. Let url be the result of running the URL serializer with response’s URL.

  2. Set protocol to get the protocol given response.

  3. Let status be response’s status.

  4. Let status text be response’s status message.

  5. If response’s cache state is "local", let from cache be true, otherwise let it be false.

  6. Let headers be an empty list.

  7. Let mime type be the essence of the computed mime type for response.

    Note: this is whatever MIME type the browser is actually using, even if it isn’t following the exact algorithm in the [MIMESNIFF] specification.

  8. For each (name, value) in response’s headers list:

    1. Append the result of serialize header with name and value to headers.

  9. Let bytes received be the total number of bytes transmitted as part of the HTTP response associated with response.

  10. Let headers size be the number of bytes transmitted as part of the header fields section of the HTTP response.

  11. Let body size be response’s response body info's encoded size.

  12. Let content be the result of get the response content info with response.

  13. Let auth challenges be the result of extract challenges with response.

  14. Return a map matching the network.ResponseData production, with the url field set to url, the protocol field set to protocol, the status field set to status, the statusText field set to status text, the fromCache field set to from cache, the headers field set to headers, the mimeType field set to mime type, the bytesReceived field set to bytes received, the headersSize field set to headers size, the bodySize field set to body size, content field set to content, and the authChallenges field set to auth challenges if it’s not null, or omitted otherwise.

7.4.3.15. The network.SetCookieHeader Type

Remote end definition

network.SetCookieHeader = {
    name: text,
    value: network.BytesValue,
    ? domain: text,
    ? httpOnly: bool,
    ? expiry: text,
    ? maxAge: js-int,
    ? path: text,
    ? sameSite: network.SameSite,
    ? secure: bool,
}

The network.SetCookieHeader represents the data in a Set-Cookie response header.

To serialize an integer given input that is an integer:

Note: This produces the shortest representation of input as a string of decimal digits.

  1. Let serialized be an empty string.

  2. Let value be input.

  3. While value is greater than 0:

    1. Let x be value divided by 10.

    2. Let most significant digits be the integer part of x.

    3. Let y be most significant digits multiplied by 10.

    4. Let least significant digit be value - y.

    5. Assert: least significant digit is an integer in the range 0 to 9, inclusive.

    6. Let codepoint be the code point whose value is U+0030 DIGIT ZERO’s value + least significant digit.

    7. Prepend codepoint to serialized.

    8. Set value to most significant digits.

  4. Return serialized.

To serialize set-cookie header given protocol cookie:
  1. Let name be UTF-8 encode protocol cookie["name"].

  2. Let value be deserialize protocol bytes with protocol cookie["value"].

  3. Let header value be the byte sequence formed by concatenating name, `=`, and value.

  4. If protocol cookie contains "expiry":

    1. Let attribute be `;Expires=`

    2. Append UTF-8 encode protocol cookie["expiry"] to attribute.

    3. Append attribute to header value.

  5. If protocol cookie contains "maxAge":

    1. Let attribute be `;Max-Age=`

    2. Let max age string be serialize an integer protocol cookie["maxAge"].

    3. Append UTF-8 encode max age string to attribute.

    4. Append attribute to header value.

  6. If protocol cookie contains "domain":

    1. Let attribute be `;Domain=`

    2. Append UTF-8 encode protocol cookie["domain"] to attribute.

    3. Append attribute to header value.

  7. If protocol cookie contains "path":

    1. Let attribute be `;Path=`

    2. Append UTF-8 encode protocol cookie["path"] to attribute.

    3. Append attribute to header value.

  8. If protocol cookie contains "secure" and protocol cookie["secure"] is true:

    1. Append `;Secure` to header value.

  9. If protocol cookie contains "httpOnly" and protocol cookie["httpOnly"] is true:

    1. Append `;HttpOnly` to header value.

  10. If protocol cookie contains "sameSite":

    1. Let attribute be `;SameSite=`

    2. Append UTF-8 encode protocol cookie["sameSite"] to attribute.

    3. Append attribute to header value.

  11. Return header value.

7.4.3.16. The network.UrlPattern Type

Remote end definition

network.UrlPattern = (
  network.UrlPatternPattern /
  network.UrlPatternString
)

network.UrlPatternPattern = {
    type: "pattern",
    ?protocol: text,
    ?hostname: text,
    ?port: text,
    ?pathname: text,
    ?search: text,
}


network.UrlPatternString = {
    type: "string",
    pattern: text,
}

A network.UrlPattern represents a pattern used for matching request URLs for network intercepts.

When URLs are matched against a network.UrlPattern the URL is parsed, and each component is compared for equality with the corresponding field in the pattern, if present. Missing fields from the pattern always match.

Note: This syntax is designed with future extensibility in mind. In particular the syntax forbids characters that are treated specially in the [URLPattern] specification. These can be escaped by prefixing them with a U+005C (\) character.

To unescape URL pattern given pattern
  1. Let forbidden characters be the set of codepoints «U+0028 ((), U+0029 ()), U+002A (*), U+007B ({), U+007D (})»

  2. Let result be the empty string.

  3. Let is escaped character be false.

  4. For each codepoint in pattern:

    1. If is escaped character is false:

      1. If forbidden characters contains codepoint, return error with error code invalid argument.

      2. If codepoint is U+005C (\):

        1. Set is escaped character to true.

        2. Continue.

    2. Append codepoint to result.

    3. Set is escaped character to false.

  5. Return success with data result.

To parse URL pattern, given pattern:
  1. Let has protocol be true.

  2. Let has hostname be true.

  3. Let has port be true.

  4. Let has pathname be true.

  5. Let has search be true.

  6. If pattern matches the network.UrlPatternPattern production:

    1. Let pattern url be the empty string.

    2. If pattern contains "protocol":

      1. If pattern["protocol"] is the empty string, return error with error code invalid argument.

      2. Let protocol be the result of trying to unescape URL Pattern with pattern["protocol"].

      3. For each codepoint in protocol:

        1. If codepoint is not ASCII alphanumeric and «U+002B (+), U+002D (-), U+002E (.)» does not contain codepoint:

          1. Return error with error code invalid argument.

      4. Append protocol to pattern url.

    3. Otherwise:

      1. Set has protocol to false.

      2. Append "http" to pattern url.

    4. Let scheme be ASCII lowercase with pattern url.

    5. Append ":" to pattern url.

    6. If scheme is special, append "//" to pattern url.

    7. If pattern contains "hostname":

      1. If pattern["hostname"] is the empty string, return error with error code invalid argument.

      2. If scheme is "file" return error with error code invalid argument.

      3. Let hostname be the result of trying to unescape URL Pattern with pattern["hostname"].

      4. Let inside brackets be false.

      5. For each codepoint in hostname:

        1. If «U+002F (/), U+003F (?), U+0023 (#)» contains codepoint:

          1. Return error with error code invalid argument.

        2. If inside brackets is false and codepoint is U+003A (:):

          1. Return error with error code invalid argument.

        3. If codepoint is U+005B ([), set inside brackets to true.

        4. If codepoint is U+005D (]), set inside brackets to false.

      6. Append hostname to pattern url.

    8. Otherwise:

      1. If scheme is not "file", append "placeholder" to pattern url.

      2. Set has hostname to false.

    9. If pattern contains "port":

      1. If pattern["port"] is the empty string, return error with error code invalid argument.

      2. Let port be the result of trying to unescape URL Pattern with pattern["port"].

      3. Append ":" to pattern url.

      4. For each codepoint in port:

        1. If codepoint is not an ASCII digit:

          1. Return error with error code invalid argument.

      5. Append port to pattern url.

    10. Otherwise:

      1. Set has port to false.

    11. If pattern contains "pathname":

      1. Let pathname be the result of trying to unescape URL Pattern with pattern["pathname"].

      2. If pathname does not start with U+002F (/), then append "/" to pattern url.

      3. For each codepoint in pathname:

        1. If «U+003F (?), U+0023 (#)» contains codepoint:

          1. Return error with error code invalid argument.

      4. Append pathname to pattern url.

    12. Otherwise:

      1. Set has pathname to false.

    13. If pattern contains "search":

      1. Let search be the result of trying to unescape URL pattern with pattern["search"].

      2. If search does not start with U+003F (?), then append "?" to pattern url.

      3. For each codepoint in search:

        1. If codepoint is U+0023 (#):

          1. Return error with error code invalid argument.

      4. Append search to pattern url.

    14. Otherwise:

      1. Set has search to false.

  7. Otherwise, if pattern matches the network.UrlPatternString production:

    1. Let pattern url be the result of trying to unescape URL pattern with pattern["pattern"].

  8. Let url be the result of parsing pattern url.

  9. If url is failure, return error with error code invalid argument.

  10. Let parsed be a struct with the following fields:

    protocol
    url’s scheme if has protocol is true, or null otherwise.
    hostname
    url’s host if has hostname is true, or null otherwise.
    port
    1. If has port is false:

      1. null.

    2. Otherwise:

      1. If url’s scheme is special and url’s scheme's default port is not null, and url’s port is null or is equal to scheme's default port:

        1. The empty string.

      2. Otherwise, if url’s port is not null:

        1. Serialize an integer with url’s port.

      3. Otherwise:

        1. null.

    pathname
    1. If has pathname is false:

      1. null.

    2. Otherwise:

      1. The result of running the URL path serializer with url, if url’s path is not the empty string and is not empty, or null otherwise.

    search
    1. If has search is false:

      1. null.

    2. Otherwise:

      1. The empty string if url’s query is null, or url’s query otherwise.

  11. Return success with data parsed.

To match URL pattern given url pattern and url string:
  1. Let url be the result of parsing url string.

  2. If url pattern’s protocol is not null and is not equal to url’s scheme, return false.

  3. If url pattern’s hostname is not null and is not equal to url’s host, return false.

  4. If url pattern’s port is not null:

    1. Let port be null.

    2. If url’s scheme is special and url’s scheme's default port is not null, and url’s port is null or is equal to scheme's default port:

      1. Set port to the empty string.

    3. Otherwise, if url’s port, is not null:

      1. Set port to serialize an integer with url’s port.

    4. If url pattern’s port is not equal to port, return false.

  5. If url pattern’s pathname is not null and is not equal to the result of running the URL path serializer with url, return false.

  6. If url pattern’s search is not null:

    1. Let url query be url’s query.

    2. If url query is null, set url query to the empty string.

    3. If url pattern’s search is not equal to url query, return false.

  7. Return true.

7.4.4. Commands

7.4.4.1. The network.addIntercept Command

The network.addIntercept command adds a network intercept.

Command Type
network.AddIntercept = (
  method: "network.addIntercept",
  params: network.AddInterceptParameters
)

network.AddInterceptParameters = {
  phases: [+network.InterceptPhase],
  ? contexts: [+browsingContext.BrowsingContext],
  ? urlPatterns: [*network.UrlPattern],
}

network.InterceptPhase = "beforeRequestSent" / "responseStarted" /
                         "authRequired"
Return Type
network.AddInterceptResult = {
  intercept: network.Intercept
}
The remote end steps given session and command parameters are:
  1. Let intercept be the string representation of a UUID.

  2. Let url patterns be the urlPatterns field of command parameters if present, or an empty list otherwise.

  3. Let navigables be null.

  4. If the contexts field of command parameters is present:

    1. Set navigables to an empty set.

    2. For each navigable id of command parameters["contexts"]

      1. Let navigable be the result of trying to get a navigable with navigable id.

      2. If navigable is not a top-level traversable, return error with error code invalid argument.

      3. Append navigable to navigables.

    3. If navigables is an empty set, return error with error code invalid argument.

  5. Let intercept map be session’s intercept map.

  6. Let parsed patterns be an empty list.

  7. For each url pattern in url patterns:

    1. Let parsed be the result of trying to parse url pattern with url pattern.

    2. Append parsed to parsed patterns.

  8. Set intercept map[intercept] to a struct with url patterns parsed patterns, phases command parameters["phases"] and browsingContexts navigables.

  9. Return a new map matching the network.AddInterceptResult production with the intercept field set to intercept.

7.4.4.2. The network.continueRequest Command

The network.continueRequest command continues a request that’s blocked by a network intercept.

Command Type
network.ContinueRequest = (
  method: "network.continueRequest",
  params: network.ContinueRequestParameters
)

network.ContinueRequestParameters = {
  request: network.Request,
  ?body: network.BytesValue,
  ?cookies: [*network.CookieHeader],
  ?headers: [*network.Header],
  ?method: text,
  ?url: text,
}
Return Type
EmptyResult
The remote end steps given session and command parameters are:
  1. Let blocked requests be session’s blocked request map.

  2. Let request id be command parameters["request"].

  3. If blocked requests does not contain request id then return error with error code no such request.

  4. Let (request, phase, response) be blocked requests[request id].

  5. If phase is not "beforeRequestSent", then return error with error code invalid argument.

    consider a "request already sent" error.

  6. If command parameters contains "url":

    1. Let url record be the result of applying the URL parser to command parameters["url"], with base URL null.

    2. If url record is failure, return error with error code invalid argument.

      TODO: Should we also resume here?

    3. Let request’s url be url record.

  7. If command parameters contains "method":

    1. Let method be command parameters["method"].

    2. If method does not match the method token production, return error with error code "invalid argument".

    3. Let request’s method be method.

  8. If command parameters contains "headers":

    1. Let headers be an empty header list.

    2. For header in command parameters["headers"]:

      1. Let deserialized header be deserialize header with header.

      2. If deserialized header’s name does not match the field-name token production, return error with error code "invalid argument".

      3. If deserialized header’s value does not match the header value production, return error with error code "invalid argument".

      4. Append deserialized header to headers.

    3. Set request’s headers list to headers.

  9. If command parameters contains "cookies":

    1. Let cookie header be an empty byte sequence.

    2. For each cookie in command parameters["cookies"]:

      1. If cookie header is not empty, append `;` to cookie header.

      2. Append serialize cookie header with cookie to cookie header.

    3. Let found cookie header be false.

    4. For each header in request’s headers list:

      1. Let name be header’s name.

      2. If byte-lowercase name is `cookie`:

        1. Set header’s value to cookie header.

        2. Set found cookie header to true.

        3. Break.

    5. If found cookie header is false:

      1. Append the header (`Cookie`, cookie header) to request’s headers list.

  10. If command parameters contains "body":

    1. Let body be deserialize protocol bytes with command parameters["body"].

    2. Set request’s body to body.

  11. Resume with "continue request", request id, and (null, "incomplete").

  12. Return success with data null.

7.4.4.3. The network.continueResponse Command

The network.continueResponse command continues a response that’s blocked by a network intercept. It can be called in the responseStarted phase, to modify the status and headers of the response, but still provide the network response body.

Command Type
network.ContinueResponse = (
  method: "network.continueResponse",
  params: network.ContinueResponseParameters
)

network.ContinueResponseParameters = {
  request: network.Request,
  ?cookies: [*network.SetCookieHeader]
  ?credentials: network.AuthCredentials,
  ?headers: [*network.Header],
  ?reasonPhrase: text,
  ?statusCode: js-uint,
}
Return Type
EmptyResult
The remote end steps given session and command parameters are:
  1. Let request id be command parameters["request"].

  2. Let response be the result of trying to update the response with session, "continueResponse" and command parameters.

  3. Resume with "continue request", request id, and (response, "incomplete").

  4. Return success with data null.

7.4.4.4. The network.continueWithAuth Command

The network.continueWithAuth command continues a response that’s blocked by a network intercept at the authRequired phase.

Command Type
network.ContinueWithAuth = (
  method: "network.continueWithAuth",
  params: network.ContinueWithAuthParameters
)

network.ContinueWithAuthParameters = {
  request: network.Request,
  (network.ContinueWithAuthCredentials // network.ContinueWithAuthNoCredentials)
}

network.ContinueWithAuthCredentials = (
  action: "provideCredentials", 
  credentials: network.AuthCredentials
)

network.ContinueWithAuthNoCredentials = (
  action: "default" / "cancel"
)
Return Type
EmptyResult
The remote end steps given session and command parameters are:
  1. Let blocked requests be session’s blocked request map.

  2. Let request id be command parameters["request"].

  3. If blocked requests does not contain request id then return error with error code no such request.

  4. Let (request, phase, response) be blocked requests[request id].

  5. If phase is not "authRequired", then return error with error code invalid argument.

  6. If command parameters "action" is "cancel", set response’s authentication credentials to "cancelled".

  7. If command parameters "action" is "provideCredentials":

    1. Let credentials be command parameters["credentials"].

    2. Assert: credentials["type"] is "password".

    3. Set response’s authentication credentials to (credentials["username"], credentials["password"])

  8. Resume with "continue request", request id, and (response, "incomplete").

  9. Return success with data null.

7.4.4.5. The network.failRequest Command

The network.failRequest command fails a fetch that’s blocked by a network intercept.

Command Type
network.FailRequest = (
  method: "network.failRequest",
  params: network.FailRequestParameters
)

network.FailRequestParameters = {
  request: network.Request,
}
Return Type
EmptyResult
The remote end steps given session and command parameters are:
  1. Let blocked requests be session’s blocked request map.

  2. Let request id be command parameters["request"].

  3. If blocked requests does not contain request id then return error with error code no such request.

  4. Let (request, phase, response) be blocked requests[request id].

  5. If phase is "authRequired", then return error with error code invalid argument.

  6. Let response be a new network error.

    Allow setting the precise kind of error [Issue #508]

  7. Resume with "continue request", request id, and (response, "complete").

  8. Return success with data null.

7.4.4.6. The network.provideResponse Command

The network.provideResponse command continues a request that’s blocked by a network intercept, by providing a complete response.

Note: This will not prevent the request going through the normal request lifecycle, and therefore emitting other events as it progresses.

Command Type
network.ProvideResponse = (
  method: "network.provideResponse",
  params: network.ProvideResponseParameters
)

network.ProvideResponseParameters = {
  request: network.Request,
  ?body: network.BytesValue,
  ?cookies: [*network.SetCookieHeader],
  ?headers: [*network.Header],
  ?reasonPhrase: text,
  ?statusCode: js-uint,
}
Return Type
EmptyResult
The remote end steps given session and command parameters are:
  1. Let request id be command parameters["request"].

  2. Let response be the result of trying to update the response with session, "provideResponse", and command parameters.

  3. If command parameters contains "body":

    1. Let body be deserialize protocol bytes with command parameters["body"].

    2. Set response’s body to body as a body.

  4. Resume with "continue request", request id, and (response,"complete").

  5. Return success with data null.

7.4.4.7. The network.removeIntercept Command

The network.removeIntercept command removes a network intercept.

Command Type
network.RemoveIntercept = (
  method: "network.removeIntercept",
  params: network.RemoveInterceptParameters
)

network.RemoveInterceptParameters = {
  intercept: network.Intercept
}
Return Type
EmptyResult
The remote end steps given session and command parameters are:
  1. Let intercept be the value of the "intercept" field in command parameters.

  2. Let intercept map be session’s intercept map.

  3. If intercept map does not contain intercept, return error with error code no such intercept.

  4. Remove intercept from intercept map.

Note: removal of an intercept does not affect requests that have been already blocked by this intercept. Only future requests or future phases of existing requests will be affected.

  1. Return success with data null.

7.4.4.8. The network.setCacheBehavior Command

The network.setCacheBehavior command configures the network cache behavior for certain requests.

Command Type
network.SetCacheBehavior = (
  method: "network.setCacheBehavior",
  params: network.SetCacheBehaviorParameters
)

network.SetCacheBehaviorParameters = {
  cacheBehavior: "default" / "bypass",
  ? contexts: [+browsingContext.BrowsingContext]
}
Return Type
EmptyResult
The WebDriver BiDi cache behavior steps given request are:
  1. Let navigable be null.

  2. If request’s window is an environment settings object:

    1. Let environment settings be request’s window

    2. If there is a navigable whose active window is environment settingsglobal object, set navigable to the top-level browsing context for that browsing context.

  3. If navigable is not null and navigable cache behavior map contains navigable, return navigable cache behavior map[navigable].

  4. Return default cache behavior.

The navigable cache behavior steps given navigable are:
  1. Let top-level navigable be navigable’s top-level traversable.

  2. If navigable cache behavior map contains top-level navigable, return navigable cache behavior map[top-level navigable].

  3. Return default cache behavior.

The remote end steps given session and command parameters are:
  1. Let behavior be command parameters["cacheBehavior"].

  2. If command parameters does not contain "contexts":

    1. Set the default cache behavior to behavior.

    2. Clear navigable cache behavior map.

    3. Switch on the value of behavior:

      "bypass"
      Perform implementation-defined steps to disable any implementation-specific resource caches.
      "default"
      Perform implementation-defined steps to enable any implementation-specific resource caches that are usually enabled in the current remote end configuration.
    4. Return success with data null.

  3. Let navigables be an empty set.

  4. For each navigable id of command parameters["contexts"]:

    1. Let context be the result of trying to get a navigable with navigable id.

    2. If context is not a top-level browsing context, return error with error code invalid argument.

    3. Append context to navigables.

  5. For each navigable in navigables:

    1. If navigable cache behavior map contains navigable, and navigable cache behavior map[navigable] is equal to behavior then continue.

    2. Switch on the value of behavior:

      "bypass"
      Perform implementation-defined steps to disable any implementation-specific resource caches for network requests originating from any browsing context for which navigable is the top-level browsing context.
      "default"
      Perform implementation-defined steps to enable any implementation-specific resource caches that are usually enabled in the current remote end configuration for network requests originating from any browsing context for which navigable is the top-level browsing context.
    3. If behavior is equal to default cache behavior:

      1. If navigable cache behavior map contains navigable, remove navigable cache behavior map[navigable].

    4. Otherwise:

      1. Set navigable cache behavior map[navigable] to behavior.

  6. Return success with data null.

7.4.5. Events

7.4.5.1. The network.authRequired Event
Event Type
network.AuthRequired = (
  method: "network.authRequired",
  params: network.AuthRequiredParameters
)

network.AuthRequiredParameters = {
  network.BaseParameters,
  response: network.ResponseData
}
Return Type
EmptyResult

This event is emitted when the user agent is going to prompt for authorization credentials.

The remote end event trigger is the WebDriver BiDi auth required steps given request and response:
  1. Let redirect count be request’s redirect count.

  2. Assert: before request sent map[request] is equal to redirect count.

    Note: This implies that every caller needs to ensure that the WebDriver BiDi before request sent steps are invoked with request before these steps.

  3. If request’s client is not null, let related navigables be the result of get related navigables with request’s client. Otherwise let related navigables be an empty set.

  4. For each session in the set of sessions for which an event is enabled given "network.authRequired" and related navigables:

    1. Let params be the result of process a network event with session "network.authRequired", and request.

    2. Let response data be the result of get the response data with response.

    3. Assert: response data contains "authChallenge".

    4. Set the response field of params to response data.

    5. Assert: params matches the network.AuthRequiredParameters production.

    6. Let body be a map matching the network.AuthRequired production, with the params field set to params.

    7. Emit an event with session and body.

    8. If params["isBlocked"] is true:

      1. Let blocked requests be session’s blocked request map.

      2. Let request id be request’s request id.

      3. Set blocked requests[request id] to (request, "authRequired", response).

      4. Await with «"continue request"», and request id.

      5. Remove blocked requests[request id].

7.4.5.2. The network.beforeRequestSent Event
Event Type
 network.BeforeRequestSent = (
  method: "network.beforeRequestSent",
  params: network.BeforeRequestSentParameters
 )

network.BeforeRequestSentParameters = {
  network.BaseParameters,
  ? initiator: network.Initiator,
}

This event is emitted before a request is sent (either over the network or before it’s handled by a serviceworker or a local cache).

The remote end event trigger is the WebDriver BiDi before request sent steps given request:
  1. If before request sent map does not contain request, set before request sent map[request] to a new set.

  2. Let redirect count be request’s redirect count.

  3. Add redirect count to before request sent map[request].

  4. If request’s client is not null, let related navigables be the result of get related navigables with request’s client. Otherwise let related navigables be an empty set.

  5. Let response be null.

  6. Let response status be "incomplete".

  7. For each session in the set of sessions for which an event is enabled given "network.beforeRequestSent" and related navigables:

    1. Let params be the result of process a network event with session, "network.beforeRequestSent", and request.

    2. If params is null then continue.

    3. Let initiator be the result of get the initiator with request.

    4. If initiator is not empty, set the initiator field of params to initiator.

    5. Assert: params matches the network.BeforeRequestSentParameters production.

    6. Let body be a map matching the network.BeforeRequestSent production, with the params field set to params.

    7. Emit an event with session and body.

    8. If params["isBlocked"] is true, then:

      1. Let blocked requests be session’s blocked request map.

      2. Let request id be request’s request id.

      3. Set blocked requests[request id] to (request, "beforeRequestSent", null).

      4. Let (response, status) be await with «"continue request"», and request’s request id.

      5. If status is "complete" set response status to status.

      6. Remove blocked requests[request id].

      Note: While waiting, no further processing of the request occurs.

  8. Return (response, response status).

7.4.5.3. The network.fetchError Event
Event Type
 network.FetchError = (
  method: "network.fetchError",
  params: network.FetchErrorParameters
 )

network.FetchErrorParameters = {
  network.BaseParameters,
  errorText: text,
}

This event is emitted when a network request ends in an error.

The remote end event trigger is the WebDriver BiDi fetch error steps given request:
  1. If before request sent map[request] does not contain request’s redirect count, then run the WebDriver BiDi before request sent steps with request.

    Note: This ensures that a network.beforeRequestSent can always be emitted before a network.fetchError, without the caller needing to explicitly invoke the WebDriver BiDi before request sent steps on every error path.

  2. If request’s client is not null, let related navigables be the result of get related navigables with request’s client. Otherwise let related navigables be an empty set.

  3. For each session in the set of sessions for which an event is enabled given "network.fetchError" and related navigables:

    1. Let params be the result of process a network event with session "network.fetchError", and request.

    2. TODO: Set the errorText field of params.

    3. Assert: params matches the network.FetchErrorParameters production.

    4. Let body be a map matching the network.FetchError production, with the params field set to params.

    5. Emit an event with session and body.

7.4.5.4. The network.responseCompleted Event
Event Type
 network.ResponseCompleted = (
  method: "network.responseCompleted",
  params: network.ResponseCompletedParameters
 )

network.ResponseCompletedParameters = {
  network.BaseParameters,
  response: network.ResponseData,
}

This event is emitted after the full response body is received.

The remote end event trigger is the WebDriver BiDi response completed steps given request and response:
  1. Let redirect count be request’s redirect count.

  2. Assert: before request sent map[request] contains redirect count.

    Note: This implies that every caller needs to ensure that the WebDriver BiDi before request sent steps are invoked with request before these steps.

  3. If request’s client is not null, let related navigables be the result of get related navigables with request’s client. Otherwise let related navigables be an empty set.

  4. For each session in the set of sessions for which an event is enabled given "network.responseCompleted" and related navigables:

    1. Let params be the result of process a network event with session "network.responseCompleted", and request.

    2. Assert: params["isBlocked"] is false.

    3. Let response data be the result of get the response data with response.

    4. Set the response field of params to response data.

    5. Assert: params matches the network.ResponseCompletedParameters production.

    6. Let body be a map matching the network.ResponseCompleted production, with the params field set to params.

    7. Emit an event with session and body.

7.4.5.5. The network.responseStarted Event
Event Type
 network.ResponseStarted = (
  method: "network.responseStarted",
  params: network.ResponseStartedParameters
 )

network.ResponseStartedParameters = {
  network.BaseParameters,
  response: network.ResponseData,
}

This event is emitted after the response headers are received but before the body is complete.

The remote end event trigger is the WebDriver BiDi response started steps given request and response:
  1. Let redirect count be request’s redirect count.

  2. Assert: before request sent map[request] is equal to redirect count.

    Note: This implies that every caller needs to ensure that the WebDriver BiDi before request sent steps are invoked with request before these steps.

  3. If request’s client is not null, let related navigables be the result of get related navigables with request’s client. Otherwise let related navigables be an empty set.

  4. Let response status be "incomplete".

  5. For each session in the set of sessions for which an event is enabled given "network.responseStarted" and related navigables:

    1. Let params be the result of process a network event with session "network.responseStarted", and request.

    2. Let response data be the result of get the response data with response.

    3. Set the response field of params to response data.

    4. Assert: params matches the network.ResponseStartedParameters production.

    5. Let body be a map matching the network.ResponseStarted production, with the params field set to params.

    6. Emit an event with session and body.

    7. If params["isBlocked"] is true:

      1. Let blocked requests be session’s blocked request map.

      2. Let request id be request’s request id.

      3. Set blocked requests[request id] to (request, "beforeRequestSent", response).

      4. Let (response, status) be await with «"continue request"», and request id.

      5. If status is "complete", set response status to status.

      6. Remove blocked requests[request id].

  6. Return (response, response status).

7.5. The script Module

The script module contains commands and events relating to script realms and execution.

7.5.1. Definition

Remote end definition

ScriptCommand = (
  script.AddPreloadScript //
  script.CallFunction //
  script.Disown //
  script.Evaluate //
  script.GetRealms //
  script.RemovePreloadScript
)

local end definition

ScriptResult = (
  script.AddPreloadScriptResult /
  script.EvaluateResult /
  script.GetRealmsResult
)

ScriptEvent = (
  script.Message //
  script.RealmCreated //
  script.RealmDestroyed
)

7.5.2. Preload Scripts

A Preload script is one which runs on creation of a new Window, before any author-defined script have run.

TODO: Extend this to scripts in other kinds of realms.

A BiDi session has a preload script map which is a map in which the keys are UUIDs, and the values are structs with an item named function declaration, which is a string, arguments, contexts, which is a list or null, and an item named sandbox which is a string or null.

Note: If executing a preload script fails, either due to a syntax error, or a runtime exception, an [ECMAScript] exception is reported in the realm in which it was being executed, and other preload scripts run as normal.

To run WebDriver BiDi preload scripts given environment settings:
  1. Let document be environment settingsrelevant global object's associated Document.

  2. Let navigable be document’s navigable.

  3. For each session in active BiDi sessions:

    1. For each preload script in session’s preload script map's values:

      1. If preload script’s contexts is not null:

        1. Let navigable id be navigable’s top-level traversable's id.

        2. If preload script’s contexts does not contain navigable id, continue.

      2. If preload script’s sandbox is not null, let realm be get or create a sandbox realm with preload script’s sandbox and navigable. Otherwise let realm be environment settingsrealm execution context's Realm component.

      3. Let exception reporting global be be environment settingsrealm execution context's Realm component’s global object.

      4. Let arguments be preload script’s arguments.

      5. Let deserialized arguments be an empty list.

      6. For each argument in arguments:

        1. Let channel be create a channel with session, realm and argument.

        2. Append channel to deserialized arguments.

      7. Let base URL be the API base URL of environment settings.

      8. Let options be the default script fetch options.

      9. Let function declaration be preload script’s function declaration.

      10. Let (script, function body evaluation status) be the result of evaluate function body with function declaration, environment settings, base URL, and options.

      11. If function body evaluation status is an abrupt completion, then report an exception given by function body evaluation status.[[Value]] for exception reporting global.

      12. Let function object be function body evaluation status.[[Value]].

      13. If IsCallable(function object) is false:

        1. Let error be a new TypeError object in realm.

        2. Report an exception error for exception reporting global.

      14. Prepare to run script with environment settings.

      15. Set evaluation status to Call(function object, null, deserialized arguments).

      16. Clean up after running script with environment settings.

      17. If evaluation status is an abrupt completion, then report an exception given by evaluation status.[[Value]] for exception reporting global.

7.5.3. Types

7.5.3.1. The script.Channel Type

Remote end definition and local end definition

script.Channel = text;

The script.Channel type represents the id of a specific channel used to send custom messages from the remote end to the local end.

7.5.3.2. The script.ChannelValue Type

Remote end definition

script.ChannelValue = {
  type: "channel",
  value: script.ChannelProperties,
};

script.ChannelProperties = {
  channel: script.Channel,
  ? serializationOptions: script.SerializationOptions,
  ? ownership: script.ResultOwnership,
}

The script.ChannelValue type represents an ArgumentValue that can be deserialized into a function that sends messages from the remote end to the local end.

To create a channel given session, realm and protocol value:

  1. Let channel properties be protocol value["value"].

  2. Let steps be the following steps given the argument message:

    1. Let current realm be the current Realm Record.

    2. Emit a script message with session, current realm, channel properties and message.

  3. Return CreateBuiltinFunction(steps, 1, "", « », realm).

7.5.3.3. The script.EvaluateResult Type

Remote end definition and local end definition

script.EvaluateResult = (
  script.EvaluateResultSuccess /
  script.EvaluateResultException
)

script.EvaluateResultSuccess = {
  type: "success",
  result: script.RemoteValue,
  realm: script.Realm
}

script.EvaluateResultException = {
  type: "exception",
  exceptionDetails: script.ExceptionDetails
  realm: script.Realm
}

The script.EvaluateResult type indicates the return value of a command that executes script. The script.EvaluateResultSuccess variant is used in cases where the script completes normally and the script.EvaluateResultException variant is used in cases where the script completes with a thrown exception.

7.5.3.4. The script.ExceptionDetails Type

Remote end definition and local end definition

script.ExceptionDetails = {
  columnNumber: js-uint,
  exception: script.RemoteValue,
  lineNumber: js-uint,
  stackTrace: script.StackTrace,
  text: text,
};

The script.ExceptionDetails type represents a JavaScript exception.

To get exception details given a realm, a completion record record, an ownership type and a session:

  1. Assert: record.[[Type]] is throw.

  2. Let text be an implementation-defined textual description of the error represented by record.

    TODO: Tighten up the requirements here; people will probably try to parse this data with regex or something equally bad.

  3. Let serialization options be a map matching the script.SerializationOptions production with the fields set to their default values.

  4. Let exception be the result of serialize as a remote value with record.[[Value]], serialization options, ownership type, a new map as serialization internal map, realm and session.

  5. Let stack trace be the stack trace for an exception given record.

  6. If stack trace has size of 1 or greater, let line number be value of the lineNumber field in stack trace[0], and let column number be the value of the columnNumber field stack trace[0]. Otherwise let line number and column number be 0.

  7. Let exception details be a map matching the script.ExceptionDetails production, with the text field set to text, the exception field set to exception, the lineNumber field set to line number, the columnNumber field set to column number, and the stackTrace field set to stack trace.

  8. Return exception details.

7.5.3.5. The script.Handle Type

Remote end definition and local end definition

script.Handle = text;

The script.Handle type represents a handle to an object owned by the ECMAScript runtime. The handle is only valid in a specific Realm.

Each ECMAScript Realm has a corresponding handle object map. This is a strong map from handle ids to their corresponding objects.

7.5.3.6. The script.InternalId Type

Remote end definition and local end definition

script.InternalId = text;

The script.InternalId type represents the id of a previously serialized script.RemoteValue during serialization.

7.5.3.7. The script.LocalValue Type

Remote end definition

script.LocalValue = (
  script.RemoteReference /
  script.PrimitiveProtocolValue /
  script.ChannelValue /
  script.ArrayLocalValue /
  script.DateLocalValue /
  script.MapLocalValue /
  script.ObjectLocalValue /
  script.RegExpLocalValue /
  script.SetLocalValue
)

script.ListLocalValue = [*script.LocalValue];

script.ArrayLocalValue = {
  type: "array",
  value: script.ListLocalValue,
}

script.DateLocalValue = {
  type: "date",
  value: text
}

script.MappingLocalValue = [*[(script.LocalValue / text), script.LocalValue]];

script.MapLocalValue = {
  type: "map",
  value: script.MappingLocalValue,
}

script.ObjectLocalValue = {
  type: "object",
  value: script.MappingLocalValue,
}

script.RegExpValue = {
  pattern: text,
  ? flags: text,
}

script.RegExpLocalValue = {
  type: "regexp",
  value: script.RegExpValue,
}

script.SetLocalValue = {
  type: "set",
  value: script.ListLocalValue,
}

The script.LocalValue type represents values which can be deserialized into ECMAScript. This includes both primitive and non-primitive values as well as remote references and channels.

To deserialize key-value list given serialized key-value list, realm and session:

  1. Let deserialized key-value list be a new list.

  2. For each serialized key-value in the serialized key-value list:

    1. If size of serialized key-value is not 2, return error with error code invalid argument.

    2. Let serialized key be serialized key-value[0].

    3. If serialized key is a string, let deserialized key be serialized key.

    4. Otherwise let deserialized key be result of trying to given deserialize local value with serialized key, realm and session.

    5. Let serialized value be serialized key-value[1].

    6. Let deserialized value be result of trying to deserialize local value given serialized value, realm and session.

    7. Append CreateArrayFromListdeserialized key, deserialized value») to deserialized key-value list.

  3. Return success with data deserialized key-value list.

To deserialize value list given serialized value list, realm and session:

  1. Let deserialized values be a new list.

  2. For each serialized value in the serialized value list:

    1. Let deserialized value be result of trying to deserialize local value given serialized value, realm and session.

    2. Append deserialized value to deserialized values;

  3. Return success with data deserialized values.

To deserialize local value given local protocol value, realm and session:

  1. If local protocol value matches the script.RemoteReference production, return deserialize remote reference of given local protocol value, realm and session.

  2. If local protocol value matches the script.PrimitiveProtocolValue production, return deserialize primitive protocol value with local protocol value.

  3. If local protocol value matches the script.ChannelValue production, return create a channel with session, realm and local protocol value.

  4. Let type be the value of the type field of local protocol value or undefined if no such a field.

  5. Let value be the value of the value field of local protocol value or undefined if no such a field.

  6. In the following list of conditions and associated steps, run the first set of steps for which the associated condition is true:

    type is the string "array"
    1. Let deserialized value list be a result of trying to deserialize value list given value, realm and session.

    2. Return success with data CreateArrayFromList(deserialized value list).

    type is the string "date"
    1. If value does not match Date Time String Format, return error with error code invalid argument.

    2. Let date result be Construct(Date, value).

    3. Assert: date result is not an abrupt completion.

    4. Return success with data date result.

    type is the string "map"
    1. Let deserialized key-value list be a result of trying to deserialize key-value list with value, realm and session.

    2. Let iterable be CreateArrayFromList(deserialized key-value list)

    3. Return success with data Map(iterable).

    type is the string "object"
    1. Let deserialized key-value list be a result of trying to deserialize key-value list with value, realm and session.

    2. Let iterable be CreateArrayFromList(deserialized key-value list)

    3. Return success with data Object.fromEntries(iterable).

    type is the string "regexp"
    1. Let pattern be the value of the pattern field of local protocol value.

    2. Let flags be the value of the flags field of local protocol value or undefined if no such a field.

    3. Let regex_result be Regexp(pattern, flags). If this throws exception, return error with error code invalid argument.

    4. Return success with data regex_result.

    type is the string "set"
    1. Let deserialized value list be a result of trying to deserialize value list given value, realm and session.

    2. Let iterable be CreateArrayFromList(deserialized key-value list)

    3. Return success with data Set object(iterable).

    otherwise
    Return error with error code invalid argument.
7.5.3.8. The script.PreloadScript Type

Remote end definition and local end definition

script.PreloadScript = text;

The script.PreloadScript type represents a handle to a script that will run on realm creation.

7.5.3.9. The script.Realm Type

Remote end definition and local end definition

script.Realm = text;

Each realm has an associated realm id, which is a string uniquely identifying that realm. This is implicitly set when the realm is created.

The realm id for a realm is opaque and must not be derivable from the handle id of the corresponding global object in the handle object map or, where relevant, from the navigable id of any navigable.

Note: this is to ensure that users do not rely on implementation-specific relationships between different ids.

To get a realm given realm id:
  1. If realm id is null, return success with data null.

  2. If there is no realm with id realm id return error with error code no such frame

  3. Let realm be the realm with id realm id.

  4. Return success with data realm

This has the wrong error code

7.5.3.10. The script.PrimitiveProtocolValue Type

Remote end definition and local end definition

script.PrimitiveProtocolValue = (
  script.UndefinedValue /
  script.NullValue /
  script.StringValue /
  script.NumberValue /
  script.BooleanValue /
  script.BigIntValue
)

script.UndefinedValue = {
  type: "undefined",
}

script.NullValue = {
  type: "null",
}

script.StringValue = {
  type: "string",
  value: text,
}

script.SpecialNumber = "NaN" / "-0" / "Infinity" / "-Infinity";

script.NumberValue = {
  type: "number",
  value: number / script.SpecialNumber,
}

script.BooleanValue = {
  type: "boolean",
  value: bool,
}

script.BigIntValue = {
  type: "bigint",
  value: text,
}

The script.PrimitiveProtocolValue represents values which can only be represented by value, never by reference.

To serialize primitive protocol value given a value:

  1. Let remote value be undefined.

  2. In the following list of conditions and associated steps, run the first set of steps for which the associated condition is true, if any:

    Type(value) is undefined
    Let remote value be a map matching the script.UndefinedValue production in the local end definition.
    Type(value) is Null
    Let remote value be a map matching the script.NullValue production in the local end definition.
    Type(value) is String
    Let remote value be a map matching the script.StringValue production in the local end definition, with the value property set to value.

    This doesn’t handle lone surrogates

    Type(value) is Number
    1. Switch on the value of value:

      NaN
      Let serialized be "NaN"
      -0
      Let serialized be "-0"
      Infinity
      Let serialized be "Infinity"
      -Infinity
      Let serialized be "-Infinity"
      Otherwise:
      Let serialized be value
    2. Let remote value be a map matching the script.NumberValue production in the local end definition, with the value property set to serialized.

    Type(value) is Boolean
    Let remote value be a map matching the script.BooleanValue production in the local end definition, with the value property set to value.
    Type(value) is BigInt
    Let remote value be a map matching the script.BigIntValue production in the local end definition, with the value property set to the result of running the ToString operation on value.
  3. Return remote value

To deserialize primitive protocol value given a primitive protocol value:

  1. Let type be the value of the type field of primitive protocol value.

  2. Let value be undefined.

  3. If primitive protocol value has field value:

    1. Let value be the value of the value field of primitive protocol value.

  4. In the following list of conditions and associated steps, run the first set of steps for which the associated condition is true:

    type is the string "undefined"
    Return success with data undefined.
    type is the string "null"
    Return success with data null.
    type is the string "string"
    Return success with data value.
    type is the string "number"
    1. If Type(value) is Number, return success with data value.

    2. Assert: Type(value) is String.

    3. If value is the string "NaN", return success with data NaN.

    4. Let number_result be StringToNumber(value).

    5. If number_result is NaN, return error with error code invalid argument

    6. Return success with data number_result.

    type is the string "boolean"
    Return success with data value.
    type is the string "bigint"
    1. Let bigint_result be StringToBigInt(value).

    2. If bigint_result is undefined, return error with error code invalid argument

    3. Return success with data bigint_result.

  5. Return error with error code invalid argument

7.5.3.11. The script.RealmInfo Type

Local end definition

script.RealmInfo = (
  script.WindowRealmInfo /
  script.DedicatedWorkerRealmInfo /
  script.SharedWorkerRealmInfo /
  script.ServiceWorkerRealmInfo /
  script.WorkerRealmInfo /
  script.PaintWorkletRealmInfo /
  script.AudioWorkletRealmInfo /
  script.WorkletRealmInfo
)

script.BaseRealmInfo = (
  realm: script.Realm,
  origin: text
)

script.WindowRealmInfo = {
  script.BaseRealmInfo,
  type: "window",
  context: browsingContext.BrowsingContext,
  ? sandbox: text
}

script.DedicatedWorkerRealmInfo = {
  script.BaseRealmInfo,
  type: "dedicated-worker",
  owners: [script.Realm]
}

script.SharedWorkerRealmInfo = {
  script.BaseRealmInfo,
  type: "shared-worker"
}

script.ServiceWorkerRealmInfo = {
  script.BaseRealmInfo,
  type: "service-worker"
}

script.WorkerRealmInfo = {
  script.BaseRealmInfo,
  type: "worker"
}

script.PaintWorkletRealmInfo = {
  script.BaseRealmInfo,
  type: "paint-worklet"
}

script.AudioWorkletRealmInfo = {
  script.BaseRealmInfo,
  type: "audio-worklet"
}

script.WorkletRealmInfo = {
  script.BaseRealmInfo,
  type: "worklet"
}

Note: there’s a 1:1 relationship between the script.RealmInfo variants and values of script.RealmType.

The script.RealmInfo type represents the properties of a realm.

To get the navigable with given realm:
  1. Let global object be the global object of the realm.

  2. Let global object be the unwrapped global object.

  3. If global object is not a Window object, return null.

  4. Let document be global object’s wrapped Window's associated Document.

  5. Return document’s node navigable.

To get the worker’s owners with given global object:
  1. Assert: global object is a WorkerGlobalScope object.

  2. Let owners be an empty list.

  3. For each owner in the global object’s associated owner set:

    1. Let owner environment settings be owner’s relevant settings object.

    2. Let owner realm info be the result of get the realm info given owner environment settings.

    3. If owner realm info is null, continue.

    4. Append owner realm info["id"] to owners.

  4. Return owners.

To get the realm info given environment settings:
  1. Let realm be environment settingsrealm execution context's Realm component.

  2. Let realm id be the realm id for realm.

  3. Let origin be the serialization of an origin given environment settings’s origin.

  4. Let global object be the global object specified by environment settings

  5. Run the steps under the first matching condition:

    global object is a Window object
    1. Let document be environment settingsrelevant global object's associated Document.

    2. Let navigable be document’s node navigable.

    3. If navigable is null, return null.

    4. Let navigable id be the navigable id for navigable.

    5. Let realm info be a map matching the script.WindowRealmInfo production, with the realm field set to realm id, the origin field set to origin, and the context field set to navigable id.

    global object is SandboxWindowProxy object
    TODO: Unclear if this is the right formulation for handling sandboxes.
    1. Let document be global object’s wrapped Window's associated Document.

    2. Let navigable be document’s node navigable.

    3. If navigable is null, return null.

    4. Let navigable id be the navigable id for navigable.

    5. Let sandbox name be the result of get a sandbox name given realm.

    6. Assert: sandbox name is not null.

    7. Let realm info be a map matching the script.WindowRealmInfo production, with the realm field set to realm id, the origin field set to origin, the context field set to navigable id, and the sandbox field set to sandbox name.

    global object is a DedicatedWorkerGlobalScope object
    1. Let owners be the result of get the worker’s owners given global object.

    2. Assert: owners has precisely one item.

    3. Let realm info be a map matching the script.DedicatedWorkerRealmInfo production, with the realm field set to realm id, the origin field set to origin, and the owners field set to owners.

    global object is a SharedWorkerGlobalScope object
    1. Let realm info be a map matching the script.SharedWorkerRealmInfo production, with the realm field set to realm id, and the origin field set to origin.

    global object is a ServiceWorkerGlobalScope object
    1. Let realm info be a map matching the script.ServiceWorkerRealmInfo production, with the realm field set to realm id, and the origin field set to origin.

    global object is a WorkerGlobalScope object
    1. Let realm info be a map matching the script.WorkerRealmInfo production, with the realm field set to realm id, and the origin field set to origin.

    global object is a PaintWorkletGlobalScope object
    1. Let realm info be a map matching the script.PaintWorkletRealmInfo production, with the realm field set to realm id, and the origin field set to origin.

    global object is a AudioWorkletGlobalScope object
    1. Let realm info be a map matching the script.AudioWorkletRealmInfo production, with the realm field set to realm id, and the origin field set to origin.

    global object is a WorkletGlobalScope object
    1. Let realm info be a map matching the script.WorkletRealmInfo production, with the realm field set to realm id, and the origin field set to origin.

    Otherwise:
    1. Let realm info be null.

  6. Return realm info

Note: Future variations of this specification will retain the invariant that the last component of the type name after splitting on "-" will always be "worker" for globals implementing WorkerGlobalScope, and "worklet" for globals implementing WorkletGlobalScope.

7.5.3.12. The script.RealmType Type

Remote end definition and local end definition

script.RealmType = "window" / "dedicated-worker" / "shared-worker" / "service-worker" /
                   "worker" / "paint-worklet" / "audio-worklet" / "worklet"

The script.RealmType type represents the different types of Realm.

7.5.3.13. The script.RemoteReference Type

Remote end definition

script.RemoteReference = (
  script.SharedReference /
  script.RemoteObjectReference
)

script.SharedReference = {
   sharedId: script.SharedId
   ? handle: script.Handle,
   Extensible
}

script.RemoteObjectReference = {
   handle: script.Handle,
   ? sharedId: script.SharedId
   Extensible
}

The script.RemoteReference type is either a script.RemoteObjectReference representing a remote reference to an existing ECMAScript object in handle object map in the given Realm, or is a script.SharedReference representing a reference to a node.

handle "stale object reference" case.

Note: if the provided reference has both handle and sharedId, the algorithm will ignore handle and respect only sharedId.

To deserialize remote reference given remote reference, realm and session:
  1. Assert remote reference matches the script.RemoteReference production.

  2. If remote reference matches the script.SharedReference production, return deserialize shared reference with remote reference, realm and session.

  3. Return deserialize remote object reference with remote reference and realm.

To deserialize remote object reference given remote object reference and realm:
  1. Let handle id be the value of the handle field of remote object reference.

  2. Let handle map be realm’s handle object map

  3. If handle map does not contain handle id, then return error with error code no such handle.

  4. Return success with data handle map[handle id].

To deserialize shared reference given shared reference, realm and session:
  1. Assert shared reference matches the script.SharedReference production.

  2. Let navigable be get the navigable with realm.

  3. If navigable is null, return error with error code no such node.

    Note: This happens when the realm isn’t a Window global.

  4. Let shared id be the value of the sharedId field of shared reference.

  5. Let node be result of trying to get a node with session, navigable and shared id.

  6. If node is null, return error with error code no such node.

  7. Let environment settings be the environment settings object whose realm execution context's Realm component is realm.

  8. If node’s node document's origin is not same origin domain with environment settings’s origin then return error with error code no such node.

    Note: This ensures that WebDriver-BiDi can not be used to pass objects between realms that do not otherwise permit script access.

  9. Let realm global object be the global object of the realm.

  10. If the realm global object is SandboxWindowProxy object, set node to the SandboxProxy wrapping node in the realm.

  11. Return success with data node.

7.5.3.14. The script.RemoteValue Type

Remote end definition and local end definition

script.RemoteValue = (
  script.PrimitiveProtocolValue /
  script.SymbolRemoteValue /
  script.ArrayRemoteValue /
  script.ObjectRemoteValue /
  script.FunctionRemoteValue /
  script.RegExpRemoteValue /
  script.DateRemoteValue /
  script.MapRemoteValue /
  script.SetRemoteValue /
  script.WeakMapRemoteValue /
  script.WeakSetRemoteValue /
  script.GeneratorRemoteValue /
  script.ErrorRemoteValue /
  script.ProxyRemoteValue /
  script.PromiseRemoteValue /
  script.TypedArrayRemoteValue /
  script.ArrayBufferRemoteValue /
  script.NodeListRemoteValue /
  script.HTMLCollectionRemoteValue /
  script.NodeRemoteValue /
  script.WindowProxyRemoteValue
)

script.ListRemoteValue = [*script.RemoteValue];

script.MappingRemoteValue = [*[(script.RemoteValue / text), script.RemoteValue]];

script.SymbolRemoteValue = {
  type: "symbol",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.ArrayRemoteValue = {
  type: "array",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
  ? value: script.ListRemoteValue,
}

script.ObjectRemoteValue = {
  type: "object",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
  ? value: script.MappingRemoteValue,
}

script.FunctionRemoteValue = {
  type: "function",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.RegExpRemoteValue = {
  ? handle: script.Handle,
  ? internalId: script.InternalId,
} .and script.RegExpLocalValue

script.DateRemoteValue = {
  ? handle: script.Handle,
  ? internalId: script.InternalId,
} .and script.DateLocalValue

script.MapRemoteValue = {
  type: "map",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
  ? value: script.MappingRemoteValue,
}

script.SetRemoteValue = {
  type: "set",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
  ? value: script.ListRemoteValue
}

script.WeakMapRemoteValue = {
  type: "weakmap",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.WeakSetRemoteValue = {
  type: "weakset",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.GeneratorRemoteValue = {
  type: "generator",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.ErrorRemoteValue = {
  type: "error",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.ProxyRemoteValue = {
  type: "proxy",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.PromiseRemoteValue = {
  type: "promise",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.TypedArrayRemoteValue = {
  type: "typedarray",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.ArrayBufferRemoteValue = {
  type: "arraybuffer",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
}

script.NodeListRemoteValue = {
  type: "nodelist",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
  ? value: script.ListRemoteValue,
}

script.HTMLCollectionRemoteValue = {
  type: "htmlcollection",
  ? handle: script.Handle,
  ? internalId: script.InternalId,
  ? value: script.ListRemoteValue,
}

script.NodeRemoteValue = {
  type: "node",
  ? sharedId: script.SharedId,
  ? handle: script.Handle,
  ? internalId: script.InternalId,
  ? value: script.NodeProperties,
}

script.NodeProperties = {
  nodeType: js-uint,
  childNodeCount: js-uint,
  ? attributes: {*text => text},
  ? children: [*script.NodeRemoteValue],
  ? localName: text,
  ? mode: "open" / "closed",
  ? namespaceURI: text,
  ? nodeValue: text,
  ? shadowRoot: script.NodeRemoteValue / null,
}

script.WindowProxyRemoteValue = {
  type: "window",
  value: script.WindowProxyProperties,
  ? handle: script.Handle,
  ? internalId: script.InternalId
}

script.WindowProxyProperties = {
  context: browsingContext.BrowsingContext
}

Add WASM types?

Should WindowProxy get attributes in a similar style to Node?

handle String / Number / etc. wrapper objects specially?

Values accessible from the ECMAScript runtime are represented by a mirror object, specified as script.RemoteValue. The value’s type is specified in the type property. In the case of JSON-representable primitive values, this contains the value in the value property; in the case of non-JSON-representable primitives, the value property contains a string representation of the value.

For non-primitive objects, the handle property, when present, contains a unique string handle to the object. The handle is unique for each serialization. The remote end will keep objects with a corresponding handle alive until such a time that script.disown is called with that handle, or the realm itself is to be discarded (e.g. due to navigation).

For some non-primitive types, the value property contains a representation of the data in the ECMAScript object; for container types this can contain further script.RemoteValue instances. The value property can be null or omitted if there is a duplicate object i.e. the object has already been serialized in the current script.RemoteValue, perhaps as part of a cycle, or otherwise when the maximum serialization depth is reached.

In case of duplicated objects in the same script.RemoteValue, the value is provided only for one of the remote values, while the unique-per-ECMAScript-object internalId is provided for all the duplicated objects for a given serialization.

Nodes are also represented by script.RemoteValue instances. These have a partial serialization of the node in the value property.

reconsider mirror objects' lifecycle.

Note: mirror objects do not keep the original object alive in the runtime. If an object is discarded in the runtime, subsequent attempts to access it via the protocol will result in an error.

To get the handle for an object given realm, ownership type and object:
  1. If ownership type is equal "none", return null.

  2. Let handle id be a new, unique, string handle for object.

  3. Let handle map be realm’s handle object map

  4. Set handle map[handle id] to object.

  5. Return handle id as a result.

To get shared id for a node given node, and session:
  1. Let node be unwrapped node.

  2. If node does not implement Node, return null.

  3. Let navigable be node’s node navigable.

  4. If navigable is null, return null.

  5. Return get or create a node reference with session, navigable and node.

To set internal ids if needed given serialization internal map, remote value and object:
  1. If the serialization internal map does not contain object, set serialization internal map[object] to remote value.

  2. Otherwise, run the following steps:

    1. Let previously serialized remote value be serialization internal map[object].

    2. If previously serialized remote value does not have a field internalId, run the following steps:

      1. Let internal id be the string representation of a UUID based on truly random, or pseudo-random numbers.

      2. Set the internalId field of previously serialized remote value to internal id.

    3. Set the internalId field of remote value to a field internalId in previously serialized remote value.

To serialize as a remote value given value, serialization options, an ownership type, a serialization internal map, a realm and a session:

  1. Let remote value be a result of serialize primitive protocol value given a value.

  2. If remote value is not undefined, return remote value.

  3. Let handle id be the handle for an object with realm, ownership type and value.

  4. Set ownership type to "none".

  5. Let known object be true, if value is in the serialization internal map, otherwise false.

  6. In the following list of conditions and associated steps, run the first set of steps for which the associated condition is true:

    Type(value) is Symbol
    Let remote value be a map matching the script.SymbolRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    IsArray(value)
    Let remote value be serialize an Array-like with session, script.ArrayRemoteValue, handle id, known object, value, serialization options, ownership type, serialization internal map, realm, and session.
    IsRegExp(value)
    1. Let pattern be ToString(Get(value, "source")).

    2. Let flags be ToString(Get(value, "flags")).

    3. Let serialized be a map matching the script.RegExpValue production in the local end definition, with the pattern property set to the pattern and the the flags property set to the flags.

    4. Let remote value be a map matching the script.RegExpRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise, and the value property set to serialized.

    value has a [[DateValue]] internal slot.
    1. Set serialized to Call(Date.prototype.toISOString, value).

    2. Assert: serialized is not a throw completion.

    3. Let remote value be a map matching the script.DateRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise, and the value set to serialized.

    value has a [[MapData]] internal slot
    1. Let remote value be a map matching the script.MapRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.

    2. Set internal ids if needed with serialization internal map, remote value and value.

    3. Let serialized be null.

    4. If known object is false, and serialization options["maxObjectDepth"] is not 0, run the following steps:

      1. Let serialized be the result of serialize as a mapping with CreateMapIterator(value, key+value), serialization options, ownership type, serialization internal map, realm, and session.

    5. If serialized is not null, set field value of remote value to serialized.

    value has a [[SetData]] internal slot
    1. Let remote value be a map matching the script.SetRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.

    2. Set internal ids if needed with serialization internal map, remote value and value.

    3. Let serialized be null.

    4. If known object is false, and serialization options["maxObjectDepth"] is not 0, run the following steps:

      1. Let serialized be the result of serialize as a list with CreateSetIterator(value, value), serialization options, ownership type, serialization internal map, realm, and session.

    5. If serialized is not null, set field value of remote value to serialized.

    value has a [[WeakMapData]] internal slot
    Let remote value be a map matching the script.WeakMapRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    value has a [[WeakSetData]] internal slot
    Let remote value be a map matching the script.WeakSetRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    value has a [[GeneratorState]] internal slot or [[AsyncGeneratorState]] internal slot
    Let remote value be a map matching the script.GeneratorRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    value has an [[ErrorData]] internal slot
    Let remote value be a map matching the script.ErrorRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    value has a [[ProxyHandler]] internal slot and a [[ProxyTarget]] internal slot
    Let remote value be a map matching the script.ProxyRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    IsPromise(value)
    Let remote value be a map matching the script.PromiseRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    value has a [[TypedArrayName]] internal slot
    Let remote value be a map matching the script.TypedArrayRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    value has an [[ArrayBufferData]] internal slot
    Let remote value be a map matching the script.ArrayBufferRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    value is a platform object that implements NodeList
    Let remote value be serialize an Array-like with script.NodeListRemoteValue,handle id, known object, value, serialization options, ownership type, serialization internal map, realm, and session.
    value is a platform object that implements HTMLCollection
    Let remote value be serialize an Array-like with script.HTMLCollectionRemoteValue, handle id, known object, value, serialization options, ownership type, known object, serialization internal map, realm, and session.
    value is a platform object that implements Node
    1. Let shared id be get shared id for a node with value and session.

    2. Let remote value be a map matching the script.NodeRemoteValue production in the local end definition, with the sharedId property set to shared id if it’s not null, or omitted otherwise, and the handle property set to handle id if it’s not null, or omitted otherwise.

    3. Set internal ids if needed with serialization internal map, remote value and value.

    4. Let serialized be null.

    5. If known object is false, run the following steps:

      1. Let serialized be a map.

      2. Set serialized["nodeType"] to Get(value, "nodeType").

      3. Set node value to Get(value, "nodeValue").

      4. If node value is not null set serialized["nodeValue"] to node value.

      5. If value implements Element or Attr:

        1. Set serialized["localName"] to Get(value, "localName").

        2. Set serialized["namespaceURI"] to Get(value, "namespaceURI")

      6. Let child node count be the size of value’s children.

      7. Set serialized["childNodeCount"] to child node count.

      8. If serialization options["maxDomDepth"] is equal to 0, or if value implements ShadowRoot and serialization options["includeShadowTree"] is "none", or if serialization options["includeShadowTree"] is "open" and value’s mode is "closed", let children be null.

        Otherwise, let children be an empty list and, for each node child in the children of value:

        1. Let child serialization options be a clone of serialization options.

        2. If child serialization options["maxDomDepth"] is not null, set child serialization options["maxDomDepth"] to child serialization options["maxDomDepth"] - 1.

        3. Let serialized be the result of serialize as a remote value with child, child serialization options, ownership type, serialization internal map, realm, and session.

        4. Append serialized to children.

      9. If children is not null, set serialized["children"] to children.

      10. If value implements Element:

        1. Let attributes be a new map.

        2. For each attribute in value’s attribute list:

          1. Let name be attribute’s qualified name

          2. Let value be attribute’s value.

          3. Set attributes[name] to value

        3. Set serialized["attributes"] to attributes.

        4. Let shadow root be value’s shadow root.

        5. If shadow root is null, let serialized shadow be null. Otherwise run the following substeps:

          1. Let serialized shadow be the result of serialize as a remote value with shadow root, serialization options, ownership type, serialization internal map, realm, and session.

        6. Set serialized["shadowRoot"] to serialized shadow.

      11. If value implements ShadowRoot, set serialized["mode"] to value’s mode.

    6. If serialized is not null, set field value of remote value to serialized.

    value is a platform object that implements WindowProxy
    1. Let window be the value of value’s [[WindowProxy]] internal slot.

    2. Let navigable be window’s navigable.

    3. Let navigable id be the navigable id for navigable.

    4. Let serialized be a map matching the script.WindowProxyProperties production in the local end definition with the context property set to navigable id.

    5. Let remote value be a map matching the script.WindowProxyRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise, and the value property set to serialized.

    value is a platform object
    1. Let remote value be a map matching the script.ObjectRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    IsCallable(value)
    Let remote value be a map matching the script.FunctionRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.
    Otherwise:
    1. Assert: type(value) is Object

    2. Let remote value be a map matching the script.ObjectRemoteValue production in the local end definition, with the handle property set to handle id if it’s not null, or omitted otherwise.

    3. Set internal ids if needed with serialization internal map, remote value and value.

    4. Let serialized be null.

    5. If known object is false, and serialization options["maxObjectDepth"] is not 0, run the following steps:

      1. Let serialized be the result of serialize as a mapping with EnumerableOwnPropertyNames(value, key+value), serialization options, ownership type, serialization internal map, realm, and session.

    6. If serialized is not null, set field value of remote value to serialized.

  7. Return remote value

children and child nodes are different things. Either childNodeCount should reference to childNodes, or it should be renamed to childrenCount.

To serialize an Array-like given production, handle id, known object, value, serialization options, ownership type, serialization internal map, realm, and session:
  1. Let remote value be a map matching production, with the handle property set to handle id if it’s not null, or omitted otherwise.

  2. Set internal ids if needed with serialization internal map, remote value and value.

  3. If known object is false, and serialization options["maxObjectDepth"]| is not 0:

    1. Let serialized be the result of serialize as a list with CreateArrayIterator(value, value), serialization options, ownership type, serialization internal map, realm, and session.

    2. If serialized is not null, set field value of remote value to serialized.

  4. Return remote value

To serialize as a list given iterable, serialization options, ownership type, serialization internal map, realm, and session:
  1. If serialization options["maxObjectDepth"] is not null, assert: serialization options["maxObjectDepth"] is greater than 0.

  2. Let serialized be a new list.

  3. For each child value in IteratorToList(GetIterator(iterable, sync)):

    1. Let child serialization options be a clone of serialization options.

    2. If child serialization options["maxObjectDepth"] is not null, set child serialization options["maxObjectDepth"] to child serialization options["maxObjectDepth"] - 1.

    3. Let serialized child be the result of serialize as a remote value with child value, child serialization options, ownership type, serialization internal map, realm, and session.

    4. Append serialized child to serialized.

  4. Return serialized

To serialize as a mapping given iterable, serialization options, ownership type, serialization internal map, realm, and session:

  1. If serialization options["maxObjectDepth"] is not null, assert: serialization options["maxObjectDepth"] is greater than 0.

  2. Let serialized be a new list.

  3. For item in IteratorToList(GetIterator(iterable, sync)):

    1. Assert: IsArray(item)

    2. Let property be CreateListFromArrayLike(item)

    3. Assert: property is a list of size 2

    4. Let key be property[0] and let value be property[1]

    5. Let child serialization options be a clone of serialization options.

    6. If child serialization options["maxObjectDepth"] is not null, set child serialization options["maxObjectDepth"] to child serialization options["maxObjectDepth"] - 1.

    7. If Type(key) is String, let serialized key be child key, otherwise let serialized key be the result of serialize as a remote value with child key, child serialization options, ownership type, serialization internal map, realm, and session.

    8. Let serialized value be the result of serialize as a remote value with value, child serialization options, ownership type, serialization internal map, realm, and session.

    9. Let serialized child be («serialized key, serialized value»).

    10. Append serialized child to serialized.

  4. Return serialized

7.5.3.15. The script.ResultOwnership Type
script.ResultOwnership = "root" / "none"

The script.ResultOwnership specifies how the serialized value ownership will be treated.

7.5.3.16. The script.SerializationOptions Type

Remote end definition

script.SerializationOptions = {
  ? maxDomDepth: (js-uint / null) .default 0,
  ? maxObjectDepth: (js-uint / null) .default null,
  ? includeShadowTree: ("none" / "open" / "all") .default "none",
}

The script.SerializationOptions allows specifying how ECMAScript objects will be serialized.

7.5.3.17. The script.SharedId Type

Remote end definition and local end definition

script.SharedId = text;

The script.SharedId type represents a reference to a DOM Node that is usable in any realm (including Sandbox Realms).

7.5.3.18. The script.StackFrame Type

Remote end definition and local end definition

script.StackFrame = {
  columnNumber: js-uint,
  functionName: text,
  lineNumber: js-uint,
  url: text,
}

A frame in a stack trace is represented by a StackFrame object. This has a url property, which represents the URL of the script, a functionName property which represents the name of the executing function, and lineNumber and columnNumber properties, which represent the line and column number of the executed code.

7.5.3.19. The script.StackTrace Type

Remote end definition and local end definition

script.StackTrace = {
  callFrames: [*script.StackFrame],
}

The script.StackTrace type represents the javascript stack at a point in script execution.

Note: The details of how to get a list of stack frames, and the properties of that list are underspecified, and therefore the details here are implementation defined.

It is assumed that an implementation is able to generate a list of stack frames, which is a list with one entry for each item in the javascript call stack, starting from the most recent. Each entry is a single stack frame corresponding to execution of a statement or expression in a script script, which contains the following fields:

script url
The url of the resource containing script
function
The name of the function being executed
line number
The zero-based line number of the executed code, relative to the top of the resource containing script.
column number
The zero-based column number of the executed code, relative to the start of the line in the resource containing script.

To construct a stack trace, with a list of stack frames stack:

  1. Let call frames be a new list.

  2. For each stack frame frame in stack, starting from the most recently executed frame, run the following steps:

    1. Let url be the result of running the URL serializer, given the URL of frame’s script url.

    2. Let frame info be a new map matching the script.StackFrame production, with the url field set to url, the functionName field set to frame’s function, the lineNumber field set to frame’s line number and the columnNumber field set to frame’s column number.

  3. Append frame info to call frames.

  4. Let stack trace be a new map matching the script.StackTrace production, with the callFrames property set to call frames.

  5. Return stack trace.

The current stack trace is the result of construct a stack trace given a list of stack frames representing the callstack of the running execution context.

The stack trace for an exception with an exception, or a Completion Record of type throw, exception, is given by:

  1. If exception is a value that has been thrown as an exception, let record be the Completion Record created to throw exception. Otherwise let record be exception.

  2. Let stack be the list of stack frames corresponding to execution at the point record was created.

  3. Return construct a stack trace given stack.

7.5.3.20. The script.Source Type

Local end definition

script.Source = {
  realm: script.Realm,
  ? context: browsingContext.BrowsingContext
}

The script.Source type represents a script.Realm with an optional browsingContext.BrowsingContext in which a script related event occurred.

To get the source given source realm:
  1. Let realm be the realm id for source realm.

  2. Let environment settings be the environment settings object whose realm execution context's Realm component is source realm.

  3. If environment settings has a associated Document:

    1. Let document be environment settings’ associated Document.

    2. Let navigable be document’s node navigable.

    3. Let navigable id be the navigable id for navigable if navigable is not null.

    Otherwise let navigable be null.

  4. Let source be a map matching the script.Source production with the realm field set to realm, and the context field set to navigable id if navigable is not null, or unset otherwise.

  5. Return source.

7.5.3.21. The script.Target Type

Remote end definition

script.RealmTarget = {
  realm: script.Realm
};

script.ContextTarget = {
  context: browsingContext.BrowsingContext,
  ? sandbox: text
}

script.Target = (
  script.ContextTarget /
  script.RealmTarget
);

The script.Target type represents a value that is either a script.Realm or a browsingContext.BrowsingContext. This is useful in cases where a navigable identifier can stand in for the realm associated with the navigable’s active document.

To get a realm from a navigable given navigable id and sandbox:
  1. Let navigable be the result of trying to get a navigable with navigable id.

  2. If sandbox is null or is an empty string:

    1. Let document be navigable’s active document.

    2. Let environment settings be the environment settings object whose relevant global object's associated Document is document.

    3. Let realm be environment settingsrealm execution context's Realm component.

  3. Otherwise: let realm be result of trying to get or create a sandbox realm given sandbox and navigable.

  4. Return success with data realm

This has the wrong error code

To get a realm from a target given target:
  1. If target matches the script.ContextTarget production:

    1. Let sandbox be null.

    2. If target contains "sandbox", set sandbox to target["sandbox"].

    3. Let realm be get a realm from a navigable with target["context"] and sandbox.

  2. Otherwise:

    1. Assert: target matches the script.RealmTarget production.

    2. Let realm id be the value of the realm field of target.

    3. Let realm be get a realm given realm id.

  3. Return success with data realm

This has the wrong error code

7.5.4. Commands

7.5.4.1. The script.addPreloadScript Command

The script.addPreloadScript command adds a preload script.

Command Type
script.AddPreloadScript = (
  method: "script.addPreloadScript",
  params: script.AddPreloadScriptParameters
)

script.AddPreloadScriptParameters = {
  functionDeclaration: text,
  ? arguments: [*script.ChannelValue],
  ? contexts: [+browsingContext.BrowsingContext],
  ? sandbox: text
}
Return Type
script.AddPreloadScriptResult = {
  script: script.PreloadScript
}
The remote end steps given session and command parameters are:
  1. Let function declaration be the functionDeclaration field of command parameters.

  2. Let arguments be the arguments field of command parameters if present, or an empty list otherwise.

  3. Let navigables be null.

  4. If the contexts field of command parameters is present:

    1. Set navigables to an empty set.

    2. For each navigable id of command parameters["contexts"]

      1. Let navigable be the result of trying to get a navigable with navigable id.

      2. If navigable is not a top-level traversable, return error with error code invalid argument.

      3. Append navigable to navigables.

  5. Let sandbox be the value of the "sandbox" field in command parameters, if present, or null otherwise.

  6. Let script be the string representation of a UUID.

  7. Let preload script map be session’s preload script map.

  8. Set preload script map[script] to a struct with function declaration function declaration, arguments arguments, contexts navigables, and sandbox sandbox.

  9. Return a new map matching the script.AddPreloadScriptResult with the script field set to script.

7.5.4.2. The script.disown Command

The script.disown command disowns the given handles. This does not guarantee the handled object will be garbage collected, as there can be other handles or strong ECMAScript references.

Command Type
script.Disown = (
  method: "script.disown",
  params: script.DisownParameters
)

script.DisownParameters = {
  handles: [*script.Handle]
  target: script.Target;
}
Return Type
EmptyResult
The remote end steps with command parameters are:
  1. Let realm be the result of trying to get a realm from a target given the value of the target field of command parameters.

  2. Let handles the value of the handles field of command parameters.

  3. For each handle id of handles:

    1. Let handle map be realm’s handle object map

    2. If handle map contains handle id, remove handle id from the handle map.

  4. Return success with data null.

7.5.4.3. The script.callFunction Command

The script.callFunction command calls a provided function with given arguments in a given realm.

RealmInfo can be either a realm or a navigable.

Note: In case of an arrow function in functionDeclaration, the this argument doesn’t affect function’s this binding.

Command Type
script.CallFunction = (
  method: "script.callFunction",
  params: script.CallFunctionParameters
)

script.CallFunctionParameters = {
  functionDeclaration: text,
  awaitPromise: bool,
  target: script.Target,
  ? arguments: [*script.LocalValue],
  ? resultOwnership: script.ResultOwnership,
  ? serializationOptions: script.SerializationOptions,
  ? this: script.LocalValue,
  ? userActivation: bool .default false,
}
Result Type
script.EvaluateResult

TODO: Add timeout argument as described in the script.evaluate.

To deserialize arguments with given realm, serialized arguments list and session:

  1. Let deserialized arguments list be an empty list.

  2. For each serialized argument of serialized arguments list:

    1. Let deserialized argument be the result of trying to deserialize local value given serialized argument, realm and session.

    2. Append deserialized argument to the deserialized arguments list.

  3. Return success with data deserialized arguments list.

To evaluate function body given function declaration, environment settings, base URL, and options:

Note: the function declaration is parenthesized and evaluated.

  1. Let parenthesized function declaration be concatenate «"(", function declaration, ")

  2. Let function script be the result of create a classic script with parenthesized function declaration, environment settings, base URL, and options.

  3. Prepare to run script with environment settings.

  4. Let function body evaluation status be ScriptEvaluation(function script’s record).

  5. Clean up after running script with environment settings.

  6. Return (function script, function body evaluation status).

The remote end steps with session and command parameters are:

  1. Let realm be the result of trying to get a realm from a target given the value of the target field of command parameters.

  2. Let realm id be realm’s realm id.

  3. Let environment settings be the environment settings object whose realm execution context's Realm component is realm.

  4. Let command arguments be the value of the arguments field of command parameters.

  5. Let deserialized arguments be an empty list.

  6. If command arguments is not null, set deserialized arguments to the result of trying to deserialize arguments given realm, command arguments and session.

  7. Let this parameter be the value of the this field of command parameters.

  8. Let this object be null.

  9. If this parameter is not null, set this object to the result of trying to deserialize local value given this parameter, realm and session.

  10. Let function declaration be the value of the functionDeclaration field of command parameters.

  11. Let await promise be the value of the awaitPromise field of command parameters.

  12. Let serialization options be the value of the serializationOptions field of command parameters, if present, or otherwise a map matching the script.SerializationOptions production with the fields set to their default values.

  13. Let result ownership be the value of the resultOwnership field of command parameters, if present, or none otherwise.

  14. Let base URL be the API base URL of environment settings.

  15. Let options be the default script fetch options.

  16. Let (script, function body evaluation status) be the result of evaluate function body with function declaration, environment settings, base URL, and options.

  17. If function body evaluation status.[[Type]] is throw:

    1. Let exception details be the result of get exception details given realm, function body evaluation status, result ownership and session.

    2. Return a new map matching the script.EvaluateResultException production, with the exceptionDetails field set to exception details.

  18. Let function object be function body evaluation status.[[Value]].

  19. If IsCallable(function object) is false:

    1. Return an error with error code invalid argument

  20. If command parameters["userActivation"] is true, run activation notification steps.

  21. Prepare to run script with environment settings.

  22. Set evaluation status to Call(function object, this object, deserialized arguments).

  23. If evaluation status.[[Type]] is normal, and await promise is true, and IsPromise(evaluation status.[[Value]]):

    1. Set evaluation status to Await(evaluation status.[[Value]]).

  24. Clean up after running script with environment settings.

  25. If evaluation status.[[Type]] is throw:

    1. Let exception details be the result of get exception details given realm, evaluation status, result ownership and session.

    2. Return a new map matching the script.EvaluateResultException production, with the exceptionDetails field set to exception details.

  26. Assert: evaluation status.[[Type]] is normal.

  27. Let result be the result of serialize as a remote value with evaluation status.[[Value]], serialization options, result ownership, a new map as serialization internal map, realm and session.

  28. Return a new map matching the script.EvaluateResultSuccess production, with the realm field set to realm id, and the result field set to result.

7.5.4.4. The script.evaluate Command

The script.evaluate command evaluates a provided script in a given realm. For convenience a navigable can be provided in place of a realm, in which case the realm used is the realm of the browsing context’s active document.

The method returns the value of executing the provided script, unless it returns a promise and awaitPromise is true, in which case the resolved value of the promise is returned.

Command Type
script.Evaluate = (
  method: "script.evaluate",
  params: script.EvaluateParameters
)

script.EvaluateParameters = {
  expression: text,
  target: script.Target,
  awaitPromise: bool,
  ? resultOwnership: script.ResultOwnership,
  ? serializationOptions: script.SerializationOptions,
  ? userActivation: bool .default false,
}
Result Type
script.EvaluateResult

TODO: Add timeout argument. It’s not totally clear how this ought to work; in Chrome it seems like the timeout doesn’t apply to the promise resolve step, but that likely isn’t what clients want.

The remote end steps given session and command parameters are:

  1. Let realm be the result of trying to get a realm from a target given the value of the target field of command parameters.

  2. Let realm id be realm’s realm id.

  3. Let environment settings be the environment settings object whose realm execution context's Realm component is realm.

  4. Let source be the value of the expression field of command parameters.

  5. Let await promise be the value of the awaitPromise field of command parameters.

  6. Let serialization options be the value of the serializationOptions field of command parameters, if present, or otherwise a map matching the script.SerializationOptions production with the fields set to their default values.

  7. Let result ownership be the value of the resultOwnership field of command parameters, if present, or none otherwise.

  8. Let options be the default script fetch options.

  9. Let base URL be the API base URL of environment settings.

  10. Let script be the result of create a classic script with source, environment settings, base URL, and options.

  11. If command parameters["userActivation"] is true, run activation notification steps.

  12. Prepare to run script with environment settings.

  13. Set evaluation status to ScriptEvaluation(script’s record).

  14. If evaluation status.[[Type]] is normal, await promise is true, and IsPromise(evaluation status.[[Value]]):

    1. Set evaluation status to Await(evaluation status.[[Value]]).

  15. Clean up after running script with environment settings.

  16. If evaluation status.[[Type]] is throw:

    1. Let exception details be the result of get exception details with realm, evaluation status, result ownership and session.

    2. Return a new map matching the script.EvaluateResultException production, with the realm field set to realm id, and the exceptionDetails field set to exception details.

  17. Assert: evaluation status.[[Type]] is normal.

  18. Let result be the result of serialize as a remote value with evaluation status.[[Value]], serialization options, result ownership, a new map as serialization internal map, realm and session.

  19. Return a new map matching the script.EvaluateResultSuccess production, with the with the realm field set to realm id, and the result field set to result.

7.5.4.5. The script.getRealms Command

The script.getRealms command returns a list of all realms, optionally filtered to realms of a specific type, or to the realm associated with a navigable's active document.

Command Type
script.GetRealms = (
  method: "script.getRealms",
  params: script.GetRealmsParameters
)

script.GetRealmsParameters = {
  ? context: browsingContext.BrowsingContext,
  ? type: script.RealmType,
}
Result Type
script.GetRealmsResult = {
  realms: [*script.RealmInfo]
}
The remote end steps with session and command parameters are:
  1. Let environment settings be a list of all the environment settings objects that have their execution ready flag set.

  2. If command parameters contains context:

    1. Let navigable be the result of trying to get a navigable with command parameters["context"].

    2. Let document be navigable’s active document.

    3. Let navigable environment settings be a list.

    4. For each settings of environment settings:

      1. If any of the following conditions hold:

        Append settings to navigable environment settings.

    5. Set environment settings to navigable environment settings.

  3. Let realms be a list.

  4. For each settings of environment settings:

    1. Let realm info be the result of get the realm info given settings.

    2. If command parameters contains type and realm info["type"] is not equal to command parameters["type"] then continue.

    3. If realm info is not null, append realm info to realms.

  5. Let body be a map matching the script.GetRealmsResult production, with the realms field set to realms.

  6. Return success with data body.

Extend this to also allow realm parents e.g. for nested workers? Or get all ancestor workers.

We might want to have a more sophisticated filter system than just a literal match.

7.5.4.6. The script.removePreloadScript Command

The script.removePreloadScript command removes a preload script.

Command Type
script.RemovePreloadScript = (
  method: "script.removePreloadScript",
  params: script.RemovePreloadScriptParameters
)

script.RemovePreloadScriptParameters = {
  script: script.PreloadScript
}
Return Type
EmptyResult
The remote end steps given session and command parameters are:
  1. Let script be the value of the "script" field in command parameters.

  2. Let preload script map be session’s preload script map.

  3. If preload script map does not contain script, return error with error code no such script.

  4. Remove script from preload script map.

  5. Return null

7.5.5. Events

7.5.5.1. The script.message Event
Event Type
 script.Message = (
  method: "script.message",
  params: script.MessageParameters
 )

script.MessageParameters = {
  channel: script.Channel,
  data: script.RemoteValue,
  source: script.Source,
}
The remote end event trigger is the emit a script message steps, given session, realm, channel properties, and message:
  1. Let environment settings be the environment settings object whose realm execution context's Realm component is realm.

  2. Let related navigables be the result of get related navigables given environment settings.

  3. If event is enabled given session, "script.message" and related navigables:

    1. If channel properties contains "serializationOptions", let serialization options be the value of the serializationOptions field of channel properties. Otherwise let serialization options be a map matching the script.SerializationOptions production with the fields set to their default values.

    2. Let if channel properties contains "ownership", let ownership type be channel properties["ownership"]. Otherwise let ownership type be "none".

    3. Let data be the result of serialize as a remote value given message, serialization options, ownership type, a new map as serialization internal map and realm.

    4. Let source be the get the source with realm.

    5. Let params be a map matching the script.MessageParameters production, with the channel field set to channel properties["channel"], the data field set to data, and the source field set to source.

    6. Let body be a map matching the script.Message production, with the params field set to params.

    7. Emit an event with session and body.

7.5.5.2. The script.realmCreated Event
Event Type
script.RealmCreated = (
 method: "script.realmCreated",
 params: script.RealmInfo
)
The remote end event trigger is:

When any of the set up a window environment settings object, set up a worker environment settings object or set up a worklet environment settings object algorithms are invoked, immediately prior to returning the settings object:

  1. Let environment settings be the newly created environment settings object.

  2. Let realm info be the result of get the realm info given environment settings.

  3. If realm info is null, return.

  4. Let related navigables be the result of get related navigables given environment settings.

  5. Let body be a map matching the script.RealmCreated production, with the params field set to realm info.

  6. For each session in the set of sessions for which an event is enabled given "script.realmCreated" and related navigables:

    1. Emit an event with session and body.

The remote end subscribe steps with subscribe priority 2, given session, navigables and include global are:

  1. Let environment settings be a list of all the environment settings objects that have their execution ready flag set.

  2. For each settings of environment settings:

    1. Let related navigables be a new set.

    2. If the associated Document of settingsrelevant global object is a Document:

      1. Let navigable be settings’s relevant global object's associated Document’s node navigable.

      2. If navigable is null, continue.

      3. Let top-level traversible be navigable’s top-level traversable.

      4. If top-level traversible is not in navigables, continue.

      5. Append top-level traversible to related navigables.

      Otherwise, if include global is false, continue.

    3. Let realm info be the result of get the realm info given settings.

    4. If realm info is null, continue.

    5. Let body be a map matching the script.RealmCreated production, with the params field set to realm info.

    6. If event is enabled given session, "script.realmCreated" and related navigables:

      1. Emit an event with session and body.

Should the order here be better defined?

7.5.5.3. The script.realmDestroyed Event
Event Type
script.RealmDestroyed = (
  method: "script.realmDestroyed",
  params: script.RealmDestroyedParameters
)

script.RealmDestroyedParameters = {
  realm: script.Realm
}

The remote end event trigger is:
Define the following unloading document cleanup steps with document:
  1. Let related navigables be an empty set.

  2. Append document’s navigable to related navigables.

  3. For each worklet global scope in document’s worklet global scopes:

    1. Let realm be worklet global scope’s relevant Realm.

    2. Let realm id be the realm id for realm.

    3. Let params be a map matching the script.RealmDestroyedParameters production, with the realm field set of realm id.

    4. Let body be a map matching the script.RealmDestroyed production, with the params field set to params.

    5. For each session in the set of sessions for which an event is enabled given "script.realmDestroyed" and related navigables:

      1. Emit an event with session and body.

  4. Let environment settings be the environment settings object whose relevant global object's associated Document is document.

  5. Let realm be environment settingsrealm execution context's Realm component.

  6. Let realm id be the realm id for realm.

  7. Let params be a map matching the script.RealmDestroyedParameters production, with the realm field set to realm id.

  8. Let body be a map matching the script.RealmDestroyed production, with the params field set to params.

  9. For each session in the set of sessions for which an event is enabled given "script.realmDestroyed" and related navigables:

    1. Emit an event with session and body.

Whenever a worker event loop event loop is destroyed, either because the worker comes to the end of its lifecycle, or prematurely via the terminate a worker algorithm:

  1. Let environment settings be the environment settings object for which event loop is the responsible event loop.

  2. Let related navigables be the result of get related navigables given environment settings.

  3. Let realm be environment settings’s environment settings object’s Realm.

  4. Let realm id be the realm id for realm.

  5. Let params be a map matching the script.RealmDestroyedParameters production, with the realm field set of realm id.

  6. Let body be a map matching the script.RealmDestroyed production, with the params field set to params.

7.6. The storage Module

The storage module contains functionality and events related to storage.

A storage partition is a namespace within which the user agent may organize persistent data such as cookies and local storage.

A storage partition key is a map which uniquely identifies a storage partition.

7.6.1. Definition

Remote end definition

StorageCommand = (
  storage.DeleteCookies //
  storage.GetCookies //
  storage.SetCookie
)

Local end definition

StorageResult = (
  storage.DeleteCookiesResult /
  storage.GetCookiesResult /
  storage.SetCookieResult
)

7.6.2. Types

7.6.2.1. The storage.PartitionKey Type

Local end definition

storage.PartitionKey = {
  ? userContext: text,
  ? sourceOrigin: text,
  Extensible,
}

The storage.PartitionKey type represents a storage partition key.

The following table of standard storage partition key attributes enumerates attributes with well-known meanings which a remote end may choose to support. An implementation may define additional extension storage partition key attributes.

Attribute Definition
"userContext" A user context id
"sourceOrigin" The serialization of the origin of resources that can access the storage partition

Remote ends may support any number of extension storage partition key attributes. In order to avoid conflicts with other implementations, these attributes must begin with a unique identifier for the vendor and user-agent followed by U+003A (:).

A remote end has a map of default values for storage partition key attributes which contains zero or more entries. Each key must be a member of the table of standard storage partition key attributes where the storage partition key corresponds to a standard storage partition, or an extension storage partition key attribute where it does not, and the values represent the default value of that partition key that will be used when the user doesn’t provide an explicit value. The precise entries are implementation-defined and are determined by the storage partitioning adopted by the implementation.

A remote end has a list of required partition key attributes which contains zero or more entries. Each key must be a member of the table of standard storage partition key attributes where the storage partition key corresponds to a standard storage partition, or an extension storage partition key attribute where it does not. The precise entries are implementation-defined and are determined by the storage partitioning adopted by the implementation. This list includes only partition keys for which no default is available. As such the list must not share any entries with the keys of default values for storage partition key attributes.

To deserialize filter given filter:
  1. Let deserialized filter to be an empty map.

  2. For each namevalue in filter:

    1. Let deserialized name be the field name corresponding to the JSON key name in the table for cookie conversion.

    2. If name is "value", set deserialized value to deserialize protocol bytes with value, otherwise let deserialized value be value.

    3. Set deserialized filter[deserialized name] to deserialized value.

  3. Return deserialized filter.

To expand a storage partition spec given partition spec:
  1. If partition spec is null:

    1. Set partition spec to an empty map.

  2. Otherwise, if partition spec["type"] is "context":

    1. Let navigable be the result of trying to get a navigable given partition spec["context"].

    2. Let partition key be the key of navigable’s associated storage partition.

    3. Return success with data partition key.

  3. Let partition key be an empty map.

  4. For each namedefault value in the default values for storage partition key attributes:

    1. Let value be partition spec[name] if it exists or default value otherwise.

    2. Set partition key[name] to value.

  5. For each name in the remote end’s required partition key attributes:

    1. If partition spec[name] exists:

      1. Set partition key][name] to partition spec[name].

    2. Otherwise:

      1. Return error with error code underspecified storage partition.

  6. Return success with data partition key.

To get the cookie store given storage partition key:
  1. If storage partition key uniquely identifies an extant storage partition:

    1. Let store be the cookie store of that storage partition.

    2. Return success with data store.

  2. Return error with error code no such storage partition.

To match cookie given stored cookie and filter:
  1. For each namevalue in filter:

    1. If stored cookie[name] does not equal value:

      1. Return false.

  2. Return true.

To get matching cookies given cookie store and filter:
  1. Let cookies be a new list.

  2. Set deserialized filter to deserialize filter with filter.

  3. For each stored cookie in cookie store:

    1. If match cookie with stored cookie and deserialized filter is true:

      1. Append stored cookie to cookies.

  4. Return cookies.

7.6.3. Commands

7.6.3.1. The storage.getCookies Command

The storage.getCookies command retrieves zero or more cookies which match a set of provided parameters.

Command Type
storage.GetCookies = (
  method: "storage.getCookies",
  params: storage.GetCookiesParameters
)

storage.CookieFilter = {
  ? name: text,
  ? value: network.BytesValue,
  ? domain: text,
  ? path: text,
  ? size: js-uint,
  ? httpOnly: bool,
  ? secure: bool,
  ? sameSite: network.SameSite,
  ? expiry: js-uint,
  Extensible,
}

storage.BrowsingContextPartitionDescriptor = {
  type: "context",
  context: browsingContext.BrowsingContext
}

storage.StorageKeyPartitionDescriptor = {
  type: "storageKey",
  ? userContext: text,
  ? sourceOrigin: text,
  Extensible,
}

storage.PartitionDescriptor = (
  storage.BrowsingContextPartitionDescriptor /
  storage.StorageKeyPartitionDescriptor
)

storage.GetCookiesParameters = {
  ? filter: storage.CookieFilter,
  ? partition: storage.PartitionDescriptor,
}
Result Type
storage.GetCookiesResult = {
  cookies: [*network.Cookie],
  partitionKey: storage.PartitionKey,
}
The remote end steps with session and command parameters are:
  1. Let filter be the value of the filter field of command parameters if it is present or an empty map if it isn’t.

  2. Let partition spec be the value of the partition field of command parameters if it is present or null if it isn’t.

  3. Let partition key be the result of trying to expand a storage partition spec with partition spec.

  4. Let store be the result of trying to get the cookie store with partition key.

  5. Let cookies be the result of get matching cookies with store and filter.

  6. Let serialized cookies be a new list.

  7. For each cookie in cookies:

    1. Let serialized cookie be the result of serialize cookie given cookie.

    2. Append serialized cookie to serialized cookies.

  8. Let body be a map matching the storage.GetCookiesResult production, with the cookies field set to serialized cookies and the partitionKey field set to partition key.

  9. Return success with data body.

7.6.3.2. The storage.setCookie Command

The storage.setCookie command creates a new cookie in a cookie store, replacing any cookie in that store which matches according to [COOKIES].

Command Type
storage.SetCookie = (
  method: "storage.setCookie",
  params: storage.SetCookieParameters,
)

storage.PartialCookie = {
  name: text,
  value: network.BytesValue,
  domain: text,
  ? path: text,
  ? httpOnly: bool,
  ? secure: bool,
  ? sameSite: network.SameSite,
  ? expiry: js-uint,
  Extensible,
}

storage.SetCookieParameters = {
  cookie: storage.PartialCookie,
  ? partition: storage.PartitionDescriptor,
}
Result Type
storage.SetCookieResult = {
  partitionKey: storage.PartitionKey
}
The remote end steps with session and command parameters are:
  1. Let cookie spec be the value of the cookie field of command parameters.

  2. Let partition spec be the value of the partition field of command parameters if it is present or null if it isn’t.

  3. Let partition key be the result of trying to expand a storage partition spec with partition spec.

  4. Let store be the result of trying to get the cookie store with partition key.

  5. Let deserialized value be deserialize protocol bytes with cookie spec["value"].

  6. Create a cookie in store using cookie name cookie spec["name"], cookie value deserialized value, cookie domain cookie spec["domain"], and an attribute-value list of the following cookie concepts listed in the table for cookie conversion:

    Cookie path

    cookie spec["path"] if it exists, otherwise "/".

    Cookie secure only

    cookie spec["secure"] if it exists, otherwise false.

    Cookie HTTP only

    cookie spec["httpOnly"] if it exists, otherwise false.

    Cookie expiry time

    cookie spec["expiry"] if it exists, otherwise leave unset to indicate that this is a session cookie.

    Cookie same site

    cookie spec["sameSite"] if it exists, otherwise leave unset to indicate that no same site policy is defined.

    If this step is aborted without inserting a cookie into the cookie store, return error with error code unable to set cookie.

  7. Let body be a map matching the storage.SetCookieResult production, with the partitionKey field set to partition key.

  8. Return success with data body.

7.6.3.3. The storage.deleteCookies Command

The storage.deleteCookies command removes zero or more cookies which match a set of provided parameters.

Command Type
storage.DeleteCookies = (
  method: "storage.deleteCookies",
  params: storage.DeleteCookiesParameters,
)

storage.DeleteCookiesParameters = {
  ? filter: storage.CookieFilter,
  ? partition: storage.PartitionDescriptor,
}
Result Type
storage.DeleteCookiesResult = {
  partitionKey: storage.PartitionKey
}
The remote end steps with session and command parameters are:
  1. Let filter be the value of the filter field of command parameters if it is present or an empty map if it isn’t.

  2. Let partition spec be the value of the partition field of command parameters if it is present or null if it isn’t.

  3. Let partition key be the result of trying to expand a storage partition spec with partition spec.

  4. Let store be the result of trying to get the cookie store with partition key.

  5. Let cookies be the result of get matching cookies with store and filter.

  6. For each cookie in cookies:

    1. Remove cookie from store.

  7. Let body be a map matching the storage.DeleteCookiesResult production, with the partitionKey field set to partition key.

  8. Return success with data body.

7.7. The log Module

The log module contains functionality and events related to logging.

A BiDi Session has a log event buffer which is a map from navigable id to a list of log events for that context that have not been emitted. User agents may impose a maximum size on this buffer, subject to the condition that if events A and B happen in the same context with A occurring before B, and both are added to the buffer, the entry for B must not be removed before the entry for A.

To buffer a log event given session, navigables and event:

  1. Let buffer be session’s log event buffer.

  2. Let navigable ids be a new list.

  3. For each navigable of navigables:

    1. Append the navigable id for navigable to navigable ids.

  4. For each navigable id in navigable ids:

    1. Let other navigables be an empty list

    2. For each other id in navigable ids:

    3. If other id is not equal to navigable id, append other id to other navigables.

    4. If buffer does not contain navigable id, let buffer[navigable id] be a new list.

    5. Append (event, other navigables) to buffer[navigable id].

Note: we store the other navigables here so that each event is only emitted once. In practice this is only relevant for workers that can be associated with multiple navigables.

Do we want to key this on browsing context or top-level traversable? The difference is in what happens if an event occurs in a frame and that frame is then navigated before the local end subscribes to log events for the top level navigable.

7.7.1. Definition

Local end definition

LogEvent = (
  log.EntryAdded
)

7.7.2. Types

7.7.2.1. log.LogEntry

Local end definition

log.Level = "debug" / "info" / "warn" / "error"

log.Entry = (
  log.GenericLogEntry /
  log.ConsoleLogEntry /
  log.JavascriptLogEntry
)

log.BaseLogEntry = (
  level: log.Level,
  source: script.Source,
  text: text / null,
  timestamp: js-uint,
  ? stackTrace: script.StackTrace,
)

log.GenericLogEntry = {
  log.BaseLogEntry,
  type: text,
}

log.ConsoleLogEntry = {
  log.BaseLogEntry,
  type: "console",
  method: text,
  args: [*script.RemoteValue],
}

log.JavascriptLogEntry = {
  log.BaseLogEntry,
  type: "javascript",
}

Each log event is represented by a log.Entry object. This has a type property which represents the type of log entry added, a level property representing severity, a source property representing the origin of the log entry, a text property with the log message string itself, and a timestamp property corresponding to the time the log entry was generated. Specific variants of the log.Entry are used to represent logs from different sources, and provide additional fields specific to the entry type.

7.7.3. Events

7.7.3.1. The log.entryAdded Event
Event Type
log.EntryAdded = (
 method: "log.entryAdded",
 params: log.Entry,
)

The remote end event trigger is:

Define the following console steps with method, args, and options:

  1. For each session in active BiDI sessions:

    1. If method is "error" or "assert", let level be "error". If method is "debug" or "trace" let level be "debug". If method is "warn", let level be "warn". Otherwise let level be "info".

    2. Let timestamp be a time value representing the current date and time in UTC.

    3. Let text be an empty string.

    4. If Type(args[0]) is String, and args[0] contains a formatting specifier, let formatted args be Formatter(args). Otherwise let formatted args be args.

      Note: The formatter operation is underdefined in the console specification, formatting can be inconsistent between different implementations.

    5. For each arg in formatted args:

      1. If arg is not the first entry in args, append a U+0020 SPACE to text.

      2. If arg is a primitive ECMAScript value, append ToString(arg) to text. Otherwise append an implementation-defined string to text.

    6. Let realm be the realm id of the current Realm Record.

    7. Let serialized args be a new list.

    8. Let serialization options be a map matching the script.SerializationOptions production with the fields set to their default values.

    9. For each arg of args:

      1. Let serialized arg be the result of serialize as a remote value with arg as value, serialization options, none as ownership type, a new map as serialization internal map, realm and session.

      2. Add serialized arg to serialized args.

    10. Let source be the result of get the source given current Realm Record.

    11. If method is "assert", "error", "trace", or "warn", let stack be the current stack trace. Otherwise let stack be null.

    12. Let entry be a map matching the log.ConsoleLogEntry production, with the the level field set to level, the text field set to text, the timestamp field set to timestamp, the stackTrace field set to stack if stack is not null, or omitted otherwise, the method field set to method, the source field set to source and the args field set to serialized args.

    13. Let body be a map matching the log.EntryAdded production, with the params field set to entry.

    14. Let settings be the current settings object

    15. Let related navigables be the result of get related navigables given settings.

    16. If event is enabled with session, "log.entryAdded" and related navigables, emit an event with session and body.

      Otherwise, buffer a log event with session, related browsing contexts, and body.

Define the following error reporting steps with arguments script, line number, column number, message and handled:

  1. If handled is true return.

  2. Let settings be script’s settings object.

  3. Let stack be the stack trace for an exception with the exception corresponding to the error being reported.

  4. Let source be the result of get the source given current Realm Record.

  5. Let entry be a map matching the log.JavascriptLogEntry production, with level set to "error", text set to message, source set to source, and the timestamp field set to timestamp.

  6. Let related navigables be the result of get related navigables given settings.

  7. For each session in active BiDi sessions:

    1. If event is enabled with session, "log.entryAdded" and related navigables, emit an event with session and body.

      Otherwise, buffer a log event with session, related browsing contexts, and body.

Lots more things require logging. CDP has LogEntryAdded types xml, javascript, network, storage, appcache, rendering, security, deprecation, worker, violation, intervention, recommendation, other. These are in addition to the js exception and console API types that are represented by different methods.

Allow implementation-defined log types

The remote end subscribe steps, with subscribe priority 10, given session, navigables and include global are:

  1. For each navigable idevents in session’s log event buffer:

    1. Let maybe context be the result of getting a navigable given navigable id.

    2. If maybe context is an error, remove navigable id from log event buffer and continue.

    3. Let navigable be maybe context’s data

    4. Let top level navigable be navigable’s top-level traversable.

    5. If include global is true and top level navigable is not in navigables, or if include global is false and top level navigable is in navigables:

      1. For each (event, other navigables) in events:

        1. Emit an event with session and event.

        2. For each other context id in other navigables:

          1. If log event buffer contains other context id, remove event from log event buffer[other context id].

7.8. The input Module

The input module contains functionality for simulated user input.

7.8.1. Definition

remote end definition

InputCommand = (
  input.PerformActions //
  input.ReleaseActions //
  input.SetFiles
)

7.8.2. Types

7.8.2.1. input.ElementOrigin

The input.ElementOrigin type represents an Element that will be used as a coordinate origin.

input.ElementOrigin = {
  type: "element",
  element: script.SharedReference
}
The is input.ElementOrigin steps given object are:
  1. If object is a map matching the input.ElementOrigin production, return true.

  2. Return false.

To get Element from input.ElementOrigin steps given session:
  1. Return the following steps, given origin and navigable:

    1. Assert: origin matches input.ElementOrigin.

    2. Let document be navigable’s active document.

    3. Let reference be origin["element"]

    4. Let environment settings be the environment settings object whose relevant global object's associated Document is document.

    5. Let realm be environment settingsrealm execution context's Realm component.

    6. Let element be the result of trying to deserialize remote reference with reference, realm, and session.

    7. If element doesn’t implement Element return error with error code no such element.

    8. Return success with data element.

7.8.3. Commands

7.8.3.1. The input.performActions Command

The input.performActions command performs a specified sequence of user input actions.

Note: for a detailed description of the behavior of this command, see the actions section of [WEBDRIVER].

Command Type
input.PerformActions = (
  method: "input.performActions",
  params: input.PerformActionsParameters
)

input.PerformActionsParameters = {
  context: browsingContext.BrowsingContext,
  actions: [*input.SourceActions]
}

input.SourceActions = (
  input.NoneSourceActions /
  input.KeySourceActions /
  input.PointerSourceActions /
  input.WheelSourceActions
)

input.NoneSourceActions = {
  type: "none",
  id: text,
  actions: [*input.NoneSourceAction]
}

input.NoneSourceAction = input.PauseAction

input.KeySourceActions = {
  type: "key",
  id: text,
  actions: [*input.KeySourceAction]
}

input.KeySourceAction = (
  input.PauseAction /
  input.KeyDownAction /
  input.KeyUpAction
)

input.PointerSourceActions = {
  type: "pointer",
  id: text,
  ? parameters: input.PointerParameters,
  actions: [*input.PointerSourceAction]
}

input.PointerType = "mouse" / "pen" / "touch"

input.PointerParameters = {
  ? pointerType: input.PointerType .default "mouse"
}

input.PointerSourceAction = (
  input.PauseAction /
  input.PointerDownAction /
  input.PointerUpAction /
  input.PointerMoveAction
)

input.WheelSourceActions = {
  type: "wheel",
  id: text,
  actions: [*input.WheelSourceAction]
}

input.WheelSourceAction = (
  input.PauseAction /
  input.WheelScrollAction
)

input.PauseAction = {
  type: "pause",
  ? duration: js-uint
}

input.KeyDownAction = {
  type: "keyDown",
  value: text
}

input.KeyUpAction = {
  type: "keyUp",
  value: text
}

input.PointerUpAction = {
  type: "pointerUp",
  button: js-uint,
}

input.PointerDownAction = {
  type: "pointerDown",
  button: js-uint,
  input.PointerCommonProperties
}

input.PointerMoveAction = {
  type: "pointerMove",
  x: js-int,
  y: js-int,
  ? duration: js-uint,
  ? origin: input.Origin,
  input.PointerCommonProperties
}

input.WheelScrollAction = {
  type: "scroll",
  x: js-int,
  y: js-int,
  deltaX: js-int,
  deltaY: js-int,
  ? duration: js-uint,
  ? origin: input.Origin .default "viewport",
}

input.PointerCommonProperties = (
  ? width: js-uint .default 1,
  ? height: js-uint .default 1,
  ? pressure: float .default 0.0,
  ? tangentialPressure: float .default 0.0,
  ? twist: (0..359) .default 0,
  ; 0 .. Math.PI / 2
  ? altitudeAngle: (0.0..1.5707963267948966) .default 0.0,
  ; 0 .. 2 * Math.PI
  ? azimuthAngle: (0.0..6.283185307179586) .default 0.0,
)

input.Origin = "viewport" / "pointer" / input.ElementOrigin
Return Type
EmptyResult

The remote end steps with session and command parameters are:

  1. Let navigable id be the value of the context field of command parameters.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. Let input state be get the input state with session and navigable’s top-level traversable.

  4. Let actions options be a new actions options with the is element origin steps set to is input.ElementOrigin, and the get element origin steps set to the result of get Element from input.ElementOrigin steps given session.

  5. Let actions by tick be the result of trying to extract an action sequence with input state, command parameters, and actions options.

  6. Try to dispatch actions with input state, actions by tick, navigable, and actions options.

  7. Return success with data null.

7.8.3.2. The input.releaseActions Command

The input.releaseActions command resets the input state associated with the current session.

Command Type
input.ReleaseActions = (
  method: "input.releaseActions",
  params: input.ReleaseActionsParameters
)

input.ReleaseActionsParameters = {
  context: browsingContext.BrowsingContext,
}
Return Type
EmptyResult

The remote end steps given session, and command parameters are:

  1. Let navigable id be the value of the context field of command parameters.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. Let top-level traversable be navigable’s top-level traversable.

  4. Let input state be get the input state with session and top-level traversable.

  5. Let actions options be a new actions options with the is element origin steps set to is input.ElementOrigin, and the get element origin steps set to get Element from input.ElementOrigin steps given session.

  6. Let undo actions be input state’s input cancel list in reverse order.

  7. Try to dispatch tick actions with undo actions, 0, navigable, and actions options.

  8. Reset the input state with session and top-level traversable.

  9. Return success with data null.

7.8.3.3. The input.setFiles Command

The input.setFiles command sets the files property of a given input element with type file to a set of file paths.

Command Type
input.SetFiles = (
  method: "input.setFiles",
  params: input.SetFilesParameters
)

input.SetFilesParameters = {
  context: browsingContext.BrowsingContext,
  element: script.SharedReference,
  files: [*text]
}
Return Type
EmptyResult

The remote end steps given session and command parameters are:

  1. Let navigable id be the value of the command parameters["context"] field.

  2. Let navigable be the result of trying to get a navigable with navigable id.

  3. Let document be navigable’s active document.

  4. Let environment settings be the environment settings object whose relevant global object's associated Document is document.

  5. Let realm be environment settings’s realm execution context's Realm component.

  6. Let element be the result of trying to deserialize remote reference with command parameters["element"], realm, and session.

  7. If element doesn’t implement Element, return error with error code no such element.

  8. If element doesn’t implement HTMLInputElement, element’s type is not in the File Upload state, or element is disabled, return error with error code unable to set file input.

  9. If the size of files is greater than 1 and element’s multiple attribute is not set, return error with error code unable to set file input.

  10. Let files be the value of the command parameters["files"] field.

  11. Let selected files be element’s selected files.

  12. If the size of the intersection of files and selected files is equal to the size of selected files and equal to the size of files, queue an element task on the user interaction task source given element to fire an event named cancel at element, with the bubbles attribute initialized to true.

    Note: Cancellation in a browser is typically determined by changes in file selection. In other words, if there is no change, a "cancel" event is sent.

  13. Otherwise, update the file selection for element with files as the user’s selection.

  14. If, for any reason, the remote end is unable to set the selected files of element to the files with paths given in files, return error with error code unsupported operation.

    Note: For example remote ends might be unable to set selected files to files that do not currently exist on the filesystem.

  15. Return success with data null.

7.9. The webExtension Module

The webExtension module contains functionality for managing and interacting with web extensions.

7.9.1. Definition

remote end definition

WebExtensionCommand = (
  webExtension.Install //
  webExtension.Uninstall
)

local end definition

WebExtensionResult = (
  webExtension.InstallResult
)

7.9.2. Types

7.9.2.1. The webExtension.Extension Type
webExtension.Extension = text

The webExtension.Extension type represents a web extension id within a remote end.

7.9.3. Commands

7.9.3.1. The webExtension.install Command

The webExtension.install command installs a web extension in the remote end.

Command Type
webExtension.Install = (
   method: "webExtension.install",
   params: webExtension.InstallParameters
)

webExtension.InstallParameters = {
   extensionData: webExtension.ExtensionData,
}

webExtension.ExtensionData = (
   webExtension.ExtensionArchivePath /
   webExtension.ExtensionBase64Encoded /
   webExtension.ExtensionPath
)

webExtension.ExtensionPath = {
   type: "path",
   path: text,
}

webExtension.ExtensionArchivePath = {
   type: "archivePath",
   path: text,
}

webExtension.ExtensionBase64Encoded = {
   type: "base64",
   value: text,
}
Result Type
webExtension.InstallResult = {
  extension: webExtension.Extension
}
To extract a zip archive given bytes:
  1. Perform implementation defined steps to decode bytes using the zip compression algorithm. TODO: Find a better reference for zip decoding.

  2. If the previous step failed (e.g. because bytes did not represent valid zip-compressed data) then return error with error code invalid web extension. Otherwise let entry be a directory entry containing the extracted filesystem entries.

  3. Return entry.

To expand a web extension data spec given extension data spec:
  1. Let type be extension data spec["type"].

  2. If installing a web extension using type isn’t supported return error with error code unsupported operation.

  3. In the following list of conditions and associated steps, run the first set of steps for which the associated condition is true:

    type is the string "path"
    1. Let path be extension data spec["path"].

    2. Let locator be a directory locator with path path and root corresponding to the root of the file system.

    3. Let entry be locate an entry given locator.

    type is the string "archivePath"
    1. Let archive path be extension data spec["path"].

    2. Let locator be a file locator with path archive path and root corresponding to the root of the file system.

    3. Let archive entry be locate an entry given locator.

    4. If archive entry is null, return null.

    5. Let bytes be archive entry’s binary data.

    6. Let entry be the result of trying to extract a zip archive given bytes.

    type is the string "base64"
    1. Let bytes be forgiving-base64 decode extension data spec["path"].

    2. If bytes is failure, return null.

    3. Let entry be the result of trying to extract a zip archive given bytes.

  4. Return entry.

The remote end steps with command parameters are:
  1. If installing web extensions isn’t supported return error with error code unsupported operation.

  2. Let extension data spec be command parameters["extensionData"].

  3. Let extension directory entry be the result of trying to expand a web extension data spec with extension data spec.

  4. If extension directory entry is null, return error with error code invalid web extension.

  5. Perform implementation defined steps to install a web extension from extension directory entry. If this fails, return error with error code invalid web extension. Otherwise let extension id be the unique identifier of the newly installed web extension.

  6. Let result be a map matching the webExtension.InstallResult production with the extension field set to extension id.

  7. Return success with data result.

7.9.3.2. The webExtension.uninstall Command

The webExtension.uninstall command uninstalls a web extension for the remote end.

Command Type
webExtension.Uninstall = (
   method: "webExtension.uninstall",
   params: webExtension.UninstallParameters
)

webExtension.UninstallParameters = {
   extension: webExtension.Extension,
}
The remote end steps with command parameters are:
  1. Let extension be command parameters["extension"].

  2. If the remote end has no web extension with id equal to extension, return error with error code no such web extension.

  3. Perform any implementation-defined steps to remove the web extension from the remote end. If this fails, return error with error code unknown error.

  4. Return success with data null.

8. Patches to Other Specifications

This specification requires some changes to external specifications to provide the necessary integration points. It is assumed that these patches will be committed to the other specifications as part of the standards process.

8.1. HTML

The report an error algorithm is modified with an additional step at the end:

  1. Call any error reporting steps defined in external specifications with script, line, col, message, and true if the error is handled, or false otherwise.

8.2. Console

Other specifications can define console steps.

  1. At the point when the Printer operation is called with arguments name, printerArgs and options (which is undefined if the argument is not provided), call any console steps defined in external specification with arguments name, printerArgs, and options.

8.3. CSS

8.3.1. Determine the device pixel ratio

Insert the following steps at the start of the determine the device pixel ratio algorithm:

  1. If device pixel ratio overrides contains window’s navigable, return device pixel ratio overrides[window’s navigable].

9. Appendices

This section is non-normative.

9.1. External specifications

Note: the list is not exhaustive and might not be up to date.

The following external specifications define additional WebDriver BiDi modules:

  1. Permissions

  2. Web Bluetooth

Index

Terms defined by this specification

Terms defined by reference

References

Normative References

[CONSOLE]
Dominic Farolino; Robert Kowalski; Terin Stock. Console Standard. Living Standard. URL: https://console.spec.whatwg.org/
[COOKIES]
A. Barth. HTTP State Management Mechanism. April 2011. Proposed Standard. URL: https://httpwg.org/specs/rfc6265.html
[CSS-PAINT-API-1]
Ian Kilpatrick; Dean Jackson. CSS Painting API Level 1. URL: https://drafts.css-houdini.org/css-paint-api-1/
[CSS21]
Bert Bos; et al. Cascading Style Sheets Level 2 Revision 1 (CSS 2.1) Specification. URL: https://drafts.csswg.org/css2/
[CSS22]
Bert Bos. Cascading Style Sheets Level 2 Revision 2 (CSS 2.2) Specification. URL: https://drafts.csswg.org/css2/
[CSSOM-VIEW-1]
Simon Pieters. CSSOM View Module. URL: https://drafts.csswg.org/cssom-view/
[DOM]
Anne van Kesteren. DOM Standard. Living Standard. URL: https://dom.spec.whatwg.org/
[ECMAScript]
ECMAScript Language Specification. URL: https://tc39.es/ecma262/multipage/
[ENCODING]
Anne van Kesteren. Encoding Standard. Living Standard. URL: https://encoding.spec.whatwg.org/
[FETCH]
Anne van Kesteren. Fetch Standard. Living Standard. URL: https://fetch.spec.whatwg.org/
[FS]
Austin Sullivan. File System Standard. Living Standard. URL: https://fs.spec.whatwg.org/
[FULLSCREEN]
Philip Jägenstedt. Fullscreen API Standard. Living Standard. URL: https://fullscreen.spec.whatwg.org/
[GEOMETRY-1]
Simon Pieters; Chris Harrelson. Geometry Interfaces Module Level 1. URL: https://drafts.fxtf.org/geometry/
[HR-TIME-3]
Yoav Weiss. High Resolution Time. URL: https://w3c.github.io/hr-time/
[HTML]
Anne van Kesteren; et al. HTML Standard. Living Standard. URL: https://html.spec.whatwg.org/multipage/
[INFRA]
Anne van Kesteren; Domenic Denicola. Infra Standard. Living Standard. URL: https://infra.spec.whatwg.org/
[MEDIAQUERIES-5]
Dean Jackson; et al. Media Queries Level 5. URL: https://drafts.csswg.org/mediaqueries-5/
[MIMESNIFF]
Gordon P. Hemsley. MIME Sniffing Standard. Living Standard. URL: https://mimesniff.spec.whatwg.org/
[RESOURCE-TIMING]
Yoav Weiss; Noam Rosenthal. Resource Timing. URL: https://w3c.github.io/resource-timing/
[RFC4648]
S. Josefsson. The Base16, Base32, and Base64 Data Encodings. October 2006. Proposed Standard. URL: https://www.rfc-editor.org/rfc/rfc4648
[RFC6455]
I. Fette; A. Melnikov. The WebSocket Protocol. December 2011. Proposed Standard. URL: https://www.rfc-editor.org/rfc/rfc6455
[RFC8610]
H. Birkholz; C. Vigano; C. Bormann. Concise Data Definition Language (CDDL): A Notational Convention to Express Concise Binary Object Representation (CBOR) and JSON Data Structures. June 2019. Proposed Standard. URL: https://www.rfc-editor.org/rfc/rfc8610
[RFC9110]
R. Fielding, Ed.; M. Nottingham, Ed.; J. Reschke, Ed.. HTTP Semantics. June 2022. Internet Standard. URL: https://httpwg.org/specs/rfc9110.html
[RFC9562]
K. Davis; B. Peabody; P. Leach. Universally Unique IDentifiers (UUIDs). May 2024. Proposed Standard. URL: https://www.rfc-editor.org/rfc/rfc9562
[SELECTORS-4]
Elika Etemad; Tab Atkins Jr.. Selectors Level 4. URL: https://drafts.csswg.org/selectors/
[SERVICE-WORKERS]
Jake Archibald; Marijn Kruisselbrink. Service Workers. URL: https://w3c.github.io/ServiceWorker/
[UNICODE]
The Unicode Standard. URL: https://www.unicode.org/versions/latest/
[URL]
Anne van Kesteren. URL Standard. Living Standard. URL: https://url.spec.whatwg.org/
[WEBAUDIO-1.0]
Paul Adenot; Hongchan Choi. Web Audio API. URL: https://webaudio.github.io/web-audio-api/
[WEBDRIVER]
Simon Stewart; David Burns. WebDriver. URL: https://w3c.github.io/webdriver/
[WEBDRIVER2]
Simon Stewart; David Burns. WebDriver. URL: https://w3c.github.io/webdriver/
[WEBIDL]
Edgar Chen; Timothy Gu. Web IDL Standard. Living Standard. URL: https://webidl.spec.whatwg.org/

Informative References

[CSS-VALUES-3]
Tab Atkins Jr.; Elika Etemad. CSS Values and Units Module Level 3. URL: https://drafts.csswg.org/css-values-3/
[HTTP11]
R. Fielding, Ed.; M. Nottingham, Ed.; J. Reschke, Ed.. HTTP/1.1. June 2022. Internet Standard. URL: https://httpwg.org/specs/rfc9112.html
[JSON-RPC]
JSON-RPC Working Group. JSON-RPC 2.0 Specification. 4 January 2013. URL: https://www.jsonrpc.org/specification
[SAME-SITE-COOKIES]
Mike West; Mark Goodwin. Same-Site Cookies. URL: https://datatracker.ietf.org/doc/html/draft-ietf-httpbis-cookie-same-site
[URLPattern]
Ben Kelly; Jeremy Roman; 宍戸俊哉 (Shunya Shishido). URL Pattern Standard. Living Standard. URL: https://urlpattern.spec.whatwg.org/

Issues Index

Surely there’s a better mechanism for doing this "wait for an event" thing.
Should we have something like microtasks to ensure this runs before any other tasks on the event loop?
Should this be an appendix?
Do we support > 1 connection for a single session?
Nothing seems to define what status code is used for UTF-8 errors.
Need to hook in to the session ending to allow the UA to close the listener if it wants.
Define creation of sandbox realm. This is going to return a SandboxWindowProxy wrapping window.
Define in detail how SandboxProxy works
Define how this works.
Unclear that this is the best way to formally define the concept of a user context or the interaction with storage.
Should we specify that top-level traversables with a non-null opener have the same associated user context as their opener? Need to check if this is something existing implementations enforce.
this is rather imprecise language, but hopefully it’s clear that the intent is that we send the response to the command before starting shutdown of the connections.
This needs to be generalized to work with realms too.
This needs to be generalised to work with realms too.
property specify how the ignore cache flag works. This needs to consider whether only the first load of a resource bypasses the cache (i.e. whether this is like initially clearing the cache and proceeding like normal), or whether resources not directly loaded by the HTML parser (e.g. loads initiated by scripts or stylesheets) also bypass the cache.
Are we surfacing enough information about what failed and why with an error here? What error code do we want? Is there going to be a problem where local ends parse the implementation-defined strings to figure out what actually went wrong?
This ought to be integrated into the update rendering algorithm in some more explicit way.
There is an open discussion about the behavior when closing the last top-level traversable. We could expect to close the browser, close the session or leave this up to the implementation. [Issue #w3c/webdriver-bidi#170]
This ought to be integrated into the update rendering algorithm in some more explicit way.
There is a race condition in the algorithm as written because by the time we try to navigate the target session history entry might not exist. Once we support waiting for history to navigate we can handle this more robustly.
Do we want to expose a `browsingContext.pageShow event? In that case we’d need to call this whenever `pageshow` is going to be emitted, not just on bfcache restore, and also add the persisted status to the data.
It’s unclear if we ought to only fire this event for browsing contexts that have active documents; navigation can also cause contexts to become inaccessible but not yet get discarded because bfcache.
This doesn’t have a way to cancel the auth.
Should we include parameters other than realm?
as in Fetch it’s unclear if this is the right way to handle multiple headers, parsing issues, etc.
Consider including the `sharedId` of the document node that initiated the request in addition to the context.
consider a "request already sent" error.
Allow setting the precise kind of error [Issue #508]
This has the wrong error code
This doesn’t handle lone surrogates
handle "stale object reference" case.
Add WASM types?
Should WindowProxy get attributes in a similar style to Node?
handle String / Number / etc. wrapper objects specially?
reconsider mirror objects' lifecycle.
children and child nodes are different things. Either childNodeCount should reference to childNodes, or it should be renamed to childrenCount.
This has the wrong error code
This has the wrong error code
TODO: Add timeout argument as described in the script.evaluate.
Extend this to also allow realm parents e.g. for nested workers? Or get all ancestor workers.
We might want to have a more sophisticated filter system than just a literal match.
Should the order here be better defined?
Do we want to key this on browsing context or top-level traversable? The difference is in what happens if an event occurs in a frame and that frame is then navigated before the local end subscribes to log events for the top level navigable.
Lots more things require logging. CDP has LogEntryAdded types xml, javascript, network, storage, appcache, rendering, security, deprecation, worker, violation, intervention, recommendation, other. These are in addition to the js exception and console API types that are represented by different methods.
Allow implementation-defined log types