Server Timing

This specification enables a server to communicate performance metrics about the request-response cycle to the user agent. It also standardizes a JavaScript interface to enable applications to collect, process, and act on these metrics to optimize application delivery.

### The `Server-Timing` Header Field The Server-Timing header field is used to communicate one or more metrics and descriptions for the given request-response cycle. The ABNF (Augmented Backus-Naur Form) [[RFC5234]] syntax for the [=Server-Timing header field=] is as follows: ```ABNF Server-Timing = #server-timing-metric server-timing-metric = metric-name *( OWS ";" OWS server-timing-param ) metric-name = token server-timing-param = server-timing-param-name OWS "=" OWS server-timing-param-value server-timing-param-name = token server-timing-param-value = token / quoted-string ``` See [[RFC7230]] for definitions of `#`, `*`, `OWS`, `token`, and `quoted-string`. A response MAY have multiple server-timing-metric entries with the same metric-name, and the user agent MUST process and expose all such entries. The user agent MAY surface provided metrics in any order - i.e. the order of metrics in the HTTP header field is not significant. This header field is defined with an extensible syntax to allow for future parameters. A user agent that does not recognize particular server-timing-param-name in the Server-Timing header field of a response MUST ignore those tokens and continue processing instead of signaling an error. To avoid any possible ambiguity, individual `server-timing-param-name`s SHOULD NOT appear multiple times within a `server-timing-metric`. If any `server-timing-param-name` is specified more than once, only the first instance is to be considered, even if the `server-timing-param` is incomplete or invalid. All subsequent occurrences MUST be ignored without signaling an error or otherwise altering the processing of the `server-timing-metric`. This is the only case in which the ordering of parameters within a `server-timing-metric` is considered to be significant. User agents MUST ignore extraneous characters found after a `server-timing-param-value` but before the next `server-timing-param` and before the end of the current `server-timing-metric`. User agents MUST ignore extraneous characters found after a `metric-name` but before the first `server-timing-param` and before the next `server-timing-metric`.

This specification establishes the server-timing-params for server-timing-param-names "dur" for {{duration}} and "desc" for {{description}}, both optional.

- To minimize the HTTP overhead the provided names and descriptions should be kept as short as possible - e.g. use abbreviations and omit optional values where possible. - Because there can be no guarantee of clock synchronization between client, server, and intermediaries, it is impossible to map a meaningful `startTime` onto the clients timeline. For that reason, any `startTime` attribution is purposely omitted from this specification. If the developers want to establish a relationship between multiple entries, they can do so by communicating custom data via metric names and/or descriptions. - The server and/or any relevant intermediaries are in full control of which metrics are communicated to the user agent and when. For example, access to some metrics may be restricted due to privacy or security reasons - see section.

To parse a server-timing header field given a string |field|: 1. Let |position| be a [=string/position variable=], initially pointing at the start of |field|. 1. Let |name| be the result of [=collecting a sequence of code points=] from |field| that are not equal to U+003B (;), given |position|. 1. [=Strip leading and trailing ASCII whitespace=] from |name|. 1. If |name| is an empty string, return null. 1. Let |metric| be a new {{PerformanceServerTiming}} whose metric name is |name|. 1. Let |params| be an empty ordered map. 1. While |position| is not at the end of |field|: 1. Advance |position| by 1. 1. Let |paramName| be the result of [=collecting a sequence of code points=] from |field| that are not equal to U+003D (=), given |position|. 1. [=Strip leading and trailing ASCII whitespace=] from |paramName|. 1. If |paramName| is an empty string or |params|[|paramName|] [=map/exists=], [=iteration/continue=]. 1. Advance |position| by 1. 1. Let |paramValue| be an empty string. 1. [=Skip ASCII whitespace=] within |field| given |position|. 1. If the [=code point=] at |position| within |field| is U+0022 ("), then: 1. Set |paramValue| to the result of [=collecting an HTTP quoted string=] from |field| given |position| with the extract-value flag set. 1. [=Collect a sequence of code points=] from |field| that are not equal to U+003B (;), given |position|. The result is not used. 1. Otherwise: 1. Let |rawParamValue| be the result of [=collecting a sequence of code points=] from |field| that are not equal to U+003B (;), given |position|. 1. Let |paramValue| be the result of [=strip leading and trailing ASCII whitespace|stripping=] |rawParamValue|. 1. [=map/set|Set=] |metric|'s params to |params|. 1. Return |metric|.

## The PerformanceServerTiming Interface ``` webidl [Exposed=(Window,Worker)] interface PerformanceServerTiming { readonly attribute DOMString name; readonly attribute DOMHighResTimeStamp duration; readonly attribute DOMString description; [Default] object toJSON(); }; ``` When toJSON is called, run [[WEBIDL]]'s [=default toJSON steps=]. ### name attribute The name getter steps are to return this's metric name. ### duration attribute The duration getter steps are to do the following: 1. If this's params["dur"] does not [=map/exists|exist=], return 0. 1. Let |dur| be the result of parsing this's params["dur"] using the [=rules for parsing floating-point number values=]. 1. If |dur| is an error, return 0; Otherwise return |dur|.

Since duration is a {{DOMHighResTimeStamp}}, it usually represents a [=duration=] in milliseconds. Since this is not enforcable in practice, duration can represent any unit of time, and having it represent a [=duration=] in milliseconds is a recommendation.

### description attribute The description getter steps are to return this's params["desc"] if it [=map/exists=], otherwise the empty string. A {{PerformanceServerTiming}} has an associated string metric name, initially set to the empty string. A {{PerformanceServerTiming}} has an associated ordered map params, initially empty.

## Examples

    > GET /resource HTTP/1.1
    > Host: example.com
    

    < HTTP/1.1 200 OK
    < Server-Timing: miss, db;dur=53, app;dur=47.2
    < Server-Timing: customView, dc;desc=atl
    < Server-Timing: cache;desc="Cache Read";dur=23.2
    < Trailer: Server-Timing
    < (... snip response body ...)
    < Server-Timing: total;dur=123.4

Name	Duration	Description
miss
db	53
app	47.2
customView
dc		atl
cache	23.2	Cache Read
total	123.4

The above header fields communicate six distinct metrics that illustrate all the possible ways for the server to communicate data to the user agent: metric name only, metric with value, metric with value and description, and metric with description. For example, the above metrics may indicate that for `example.com/resource.jpg` fetch:

There was a cache miss.
The request was routed through the "atl" datacenter ("dc").
The database ("db") time was 53 ms.
A cache read took 23.2 ms.
The application server ("app") took 47.2ms to process "customView" template or function.
The total time for the request-response cycle on the server was 123.4ms, which is recorded at the end of the response and delivered via a trailer field.

The application can collect, process, and act on the provided metrics via the provided JavaScript interface:

    // serverTiming entries can live on 'navigation' and 'resource' entries
    for (const entryType of ['navigation', 'resource']) {
      for (const {name: url, serverTiming} of performance.getEntriesByType(entryType)) {
        // iterate over the serverTiming array
        for (const {name, duration, description} of serverTiming) {
          // we only care about "slow" ones
          if (duration > 200) {
            console.info('Slow server-timing entry =',
              JSON.stringify({url, entryType, name, duration, description}, null, 2))
          }
        }
      }
    }

## Use cases ### Server timing in developer tools Server processing time can be a significant fraction of the total request time. For example, a dynamic response may require one or more database queries, cache lookups, API calls, time to process relevant data and render the response, and so on. Similarly, even a static response can be delayed due to overloaded servers, slow caches, or other reasons. Today, the user agent developer tools are able to show when the request was initiated, and when the first and last bytes of the response were received. However, there is no visibility into where or how the time was spent on the server, which means that the developer is unable to quickly diagnose if there is a performance bottleneck on the server, and if so, in which component. Today, to answer this question, the developer is required to use different techniques: check the server logs, embed performance data within the response (if possible), use external tools, and so on. This makes identifying and diagnosing performance bottlenecks hard, and in many cases impractical. Server Timing defines a standard mechanism that enables the server to communicate relevant performance metrics to the client and allows the client to surface them directly in the developer tools - e.g. the requests can be annotated with server sent metrics to provide insight into where or how the time was spent while generating the response. ### Server timing for automated analytics In addition to surfacing server sent performance metrics in the developer tools, a standard JavaScript interface enables analytics tools to automatically collect, process, beacon, and aggregate these metrics for operational and performance analysis. ### Measuring request routing performance Server Timing enables origin servers to communicate performance metrics about where or how time is spent while processing the request. However, the same request and response may also be routed through one or more multiple proxies (e.g. cache servers, load balancers, and so on), each of which may introduce own delays and may want to provide performance metrics into where or how the time is spent. For example, a CDN edge node may want to report which data center was being used, if the resource was available in cache, and how long it took to retrieve the response from cache or from the origin server. Further, the same process may be repeated by other proxies, thus allowing full end-to-end visibility into how the request was routed and where the time was spent. Similarly, when a Service Worker is active, some or all of the navigation and resource requests may be routed through it. Effectively, an active Service Worker is a local proxy that is able to reroute requests, serve cached responses, synthesize responses, and more. As a result, Server Timing enables Service Worker to report custom performance metrics about how the request was processed: whether it was fetched from a server or served from local cache, duration of relevant the processing steps, and so on.