Skip to content

Latest commit

 

History

History
199 lines (151 loc) · 11.1 KB

README.md

File metadata and controls

199 lines (151 loc) · 11.1 KB

This proposal is an early design sketch by ChromeOS team to describe the problem below and solicit feedback on the proposed solution. It has not been approved to ship in Chrome.

Explainer: navigator.clipboard.contentsID()

Authors:

Participate

Table of Contents

Introduction

Why a new thing, aren’t other clipboard APIs enough?

In short, without this there's no efficient way to detect clipboard changes. To elaborate, let's consider a common use case: Virtual Desktop Infrastructure (VDI). While connecting to a remote desktop using a web browser, users expect the experience between computers to be seamless. Some native applications on the remote side react to clipboard changes before the user explicitly pastes the data, so users are surprised when they copy something locally and the remote side doesn't react (for example, native paste button on the remote side is greyed out or yields stale content). Many Clipboard API use cases within VDI environments center around synchronizing the local clipboard with a remote machine, so that:

  1. When a user copies something locally outside the VDI app and then switches to it, the new clipboard contents are seamlessly available in the remote session.
  2. When a user copies something on the remote machine and switches away from the VDI app, they can paste the copied content locally.

Without contentsID(), there are two primary ways to achieve the first scenario:

  • Upon refocusing the VDI app, automatically send the content from the local clipboard to the remote machine.
  • Upon refocusing the VDI app, read the clipboard contents, compare them with the last known state, and send to the remote machine only if they have changed.

Neither of these approaches is optimal (especially with large clipboard contents), and additional challenges related to sanitization and encoding make it difficult to directly compare the clipboard contents byte-by-byte with previously received data.

What is the optimal solution then?

Several platforms (ex. MacOS, Windows, X11 and Wayland, ChromeOS, Android and iOS) offer efficient ways to track clipboard content changes without directly reading the data. This is often achieved through clipboard sequence numbers or change notifications. The navigator.clipboard.contentsID() API aims to leverage these capabilities. It allows websites to request a numeric token (a 128-bit integer) representing the current clipboard state. If this token differs from a previously retrieved one, it indicates that the clipboard contents have changed between the two calls. Importantly, this operation has a constant time complexity (O(1)), independent of the clipboard's size. Therefore, even frequent checks (e.g., on window refocus) remain efficient, even when dealing with large amounts of copied data.

Goals

  • Provide a way to check if the clipboard changed between two points in time that is:
    • Easy to use
    • Efficient, no matter how big the clipboard contents are
    • Usable across multiple windows/tabs under one browser process
  • Improve potential current heuristics for clipboard synchronization…

Non-goals

  • …without providing a new fingerprinting surface.

Token stability across tabs or app windows

One of the goals of this API is to enable cross-app synchronization of clipboard - so this should be as close to the stability of the clipboard itself as possible. So, every site under the same browser process should get the same token from calling contentsID().

How to use it?

Frankly, quite straightforwardly. Signature of the method will look somewhat like this:

Promise < BigInt > contentsID();

So in the mentioned VDI case, the code could look somewhat like this:

var lastToken = null;

// Handler called on every window refocus.
// It checks if it's necessary to sync clipboard contents to remote.
window.addEventListener("focus", () => {
  navigator.clipboard.contentsID().then((token) => {
    if (token !== lastToken) {
      // Clipboard contents have changed!
      // Send to remote machine
    }
    lastToken = token;
  });
});

// Function that is called by the client app when user copies something on remote.
async function onRemoteClipboardChanged(remoteClipboardItems) {
  await navigator.clipboard.write(remoteClipboardItems);
  lastToken = await navigator.clipboard.contentsID();
}

Then, all that remains is to call onRemoteClipboardChanged every time the clipboard changes remotely - and provided that no changes occur locally while the window is in focus (which is usually the case, as clipboard changes mostly occur due to user actions - especially in case of local clipboard and VDI), clipboard synchronization will look seamless. In the unfortunate case of anticipated local changes to the clipboard done in the background, this can be improved in two ways:

  • Regular polling of the token and invoking a similar handler to the focus handler in the snippet above: this is generally not the best solution, but this API should be lightweight enough that it doesn’t create much overhead.
  • Integrating this with clipboardchange event in addition (or instead) or the focus event: this depends on whether clipboardchange event becomes a part of the web standard. This API's design - or the particular implementation - will need to be integrated with the clipboardchange design to ensure it isn't delivered between writing to the clipboard and updating the last-known token value.

Both however would require some synchronization of the handler and onRemoteClipboardChanged to prevent handlers getting between write and contentsID.

Note: In any case, this will be in some degree prone to inherent race conditions due to lack of clipboard atomic operations - which will show themselves mostly in case of user switching apps very rapidly. This API exists in order to enable heuristics to make this invisible in most cases, but will not fix it completely.

Security & Privacy considerations

This should be under the same restrictions as the navigator.clipboard.read():

  • It should require clipboard-read permissions and request them on call.
  • It should be available only while the tab has focus.

Thus, it doesn’t expose any new not-available-before security-sensitive information. The only potential attack vector would be correlating different sessions with the same user based on the token, which provides a more precise way of ensuring across sessions that those to clipboards are in fact the same user. In practice however, this could be done by just re-reading the clipboard contents and comparing them, especially across changes - which is possible already. Correlating users across sites by the origins that have clipboard permissions is already trivially easy and existence of this API does not change this state significantly.

Alternatives

Functionality itself

There is another proposed API for tracking clipboard changes - a clipboardchange event. However, even if implemented and standardized, it operates differently. Instead of determining if a change has occurred between two points in time, it provides real-time notifications for every change, without detailed information about the cause. Therefore, if your app also writes to the clipboard, it can be challenging to determine whether you or another source caused the change (especially with multiple windows/tabs of the same app open), potentially leading to unnecessary data transfers or having to implement comparison anyway. In case of contentsID(), you can save the new token just after writing - and it will be irrelevant for all active tabs/windows irrelevant what caused the change, only that this change is already in sync with the remote and no action is needed.

Format of the token

There are several ways in which the token could look like, including:

  1. Sequence number that would increase with each change (or with each call that detected a change)
  2. Timestamp of the last change (or call that detected it)
  3. Hash of the clipboard contents
  4. Random 128-bit number without any specified scheme or significance - other than “after something is written to the clipboard, contentsID() should yield a different value than it did before the write”

Preferred approach is 4, for the following reasons:

  • It doesn’t provide any information about the user’s action other than already available
  • Randomness of this degree is enough to ensure the lack of false positives, conforming with UUID standards
  • It’s implementationally and computationally the simplest
  • It’s the simplest solution that is sufficient for the provided use case
  • It’s trivial to compare and store

References & acknowledgements

Many thanks for valuable feedback and advice from: