Independent Submission W. Han Internet-Draft Individual Intended status: Informational 2 June 2026 Expires: 4 December 2026 AI Manifest: Site-Published Friction-Recovery Descriptors for AI Agents draft-han-ai-manifest-02 Abstract This document specifies the AI Manifest, a JSON-based format with which a website operator publishes an AI Friction-Recovery Manifest (AFRM): a declarative, advisory description of the known user- interface (UI) traps on a site, the framework hints that contextualize them, and the interaction shortcuts that bypass them. An autonomous AI agent that uses browser-automation tools discovers the manifest out-of-band and consults it as reference data when it encounters friction, instead of re-inferring site-specific behavior from the full Document Object Model (DOM) by trial and error. The manifest is advisory data published by the origin; it is not part of, and does not modify, any browser-automation tool schema. The specification defines five interoperable discovery methods, the AFRM three-part schema (frameworkHints, knownTraps, and shortcuts), and an OPTIONAL SHA-256 canonical-hash verification procedure via a trust registry, together with security mitigations against prompt- injection attacks. Empirical evaluation with a contemporary LLM browser agent on a proprietary-knowledge enterprise task -- an SAP S/4HANA-style table- maintenance screen whose valid values depend on a company-code registration scope that is not derivable from public or training knowledge -- shows that a site-published manifest, consumed directly by the agent with no external runtime, reduces interaction actions by approximately 54% and input tokens by approximately 45% relative to the no-manifest baseline, with both conditions completing the task. The manifest's value concentrates where the agent cannot succeed from training knowledge alone: site-specific dependencies, data, and hard mechanical blocks. This -02 revision supersedes the deterministic-runtime framing of draft-han-ai-manifest-01. The earlier revision reported a single- trial preliminary measurement and concluded that an embedded manifest is of negative value unless paired with a deterministic execution runtime (an "SDK"). Subsequent multi-condition measurement with a real LLM agent shows that a manifest is consumed and is useful on its own; an out-of-band discovery path (the Method A well-known URI, or Han Expires 4 December 2026 [Page 1] Internet-Draft AI Manifest June 2026 the user-curated Method E) avoids the prompt-injection-defense friction that the earlier embedded-only trial encountered. A deterministic runtime remains OPTIONAL -- useful for latency- or service-level-sensitive deployments -- but is no longer presented as REQUIRED. Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at https://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on 4 December 2026. Copyright Notice Copyright (c) 2026 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/ license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 3 2. Conventions and Definitions . . . . . . . . . . . . . . . . . 4 3. Protocol Overview . . . . . . . . . . . . . . . . . . . . . . 4 3.1. Embedding Methods . . . . . . . . . . . . . . . . . . . . 4 3.1.1. Method A: Well-Known URI . . . . . . . . . . . . . . 4 3.1.2. Method B: Hidden DOM Element . . . . . . . . . . . . 5 3.1.3. Method C: HTTP Response Header . . . . . . . . . . . 5 3.1.4. Method D: HTML Link Element . . . . . . . . . . . . . 5 3.1.5. Method E: AI-Side Curation . . . . . . . . . . . . . 5 3.2. Manifest Schema (AFRM Three-Part Model) . . . . . . . . . 6 3.3. Agent Detection Algorithm . . . . . . . . . . . . . . . . 6 3.4. Canonical Hash and Trust Verification . . . . . . . . . . 7 Han Expires 4 December 2026 [Page 2] Internet-Draft AI Manifest June 2026 3.5. Consulting the Manifest . . . . . . . . . . . . . . . . . 8 4. Central Trust Registry . . . . . . . . . . . . . . . . . . . 8 5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 9 5.1. Well-Known URI Registration . . . . . . . . . . . . . . . 9 5.2. AI Manifest Actions Registry (initial) . . . . . . . . . 9 5.3. AI Manifest Trap Categories Registry (initial) . . . . . 9 5.4. Link Relation Type Registration . . . . . . . . . . . . . 10 6. Security Considerations . . . . . . . . . . . . . . . . . . . 10 6.1. Prompt Injection Risk . . . . . . . . . . . . . . . . . . 10 6.2. Interaction with LLM Agent Prompt Injection Defenses . . 11 6.3. Integrity of the Manifest . . . . . . . . . . . . . . . . 12 6.4. Transport Security . . . . . . . . . . . . . . . . . . . 12 7. Privacy Considerations . . . . . . . . . . . . . . . . . . . 12 8. Implementation Status . . . . . . . . . . . . . . . . . . . . 12 9. Intellectual Property Rights Disclosure . . . . . . . . . . . 14 10. Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . 14 11. Normative References . . . . . . . . . . . . . . . . . . . . 14 Appendix A. Document History . . . . . . . . . . . . . . . . . . 15 Author's Address . . . . . . . . . . . . . . . . . . . . . . . . 15 1. Introduction Large Language Model (LLM)-based AI agents increasingly interact with web services via browser-automation protocols such as the Model Context Protocol (MCP), Playwright, Puppeteer, and Selenium WebDriver. Current agents typically parse entire DOM trees or screenshots on every page to infer UI structure, producing three well-known problems: 1. Substantial token consumption due to repeated analysis of large DOMs on every session. 2. High failure rates on complex multi-step transactional UIs such as enterprise resource planning (ERP) systems, academic manuscript submission portals, and government e-services. 3. Absence of a standardized mechanism for a website operator to declare an AI-agent-friendly ("AI-Ready") operational surface. Related prior work includes robots.txt, llms.txt, agents.txt, and ai- plugin.json. These address crawling permissions, LLM-friendly documentation, agent capability declarations, and API-level integration respectively. None provides step-by-step UI workflow instructions for multi-page transactional flows. AI Manifest fills this gap by specifying a JSON format that describes a site's known UI traps, framework hints, and interaction shortcuts. An AI agent discovers and, optionally, verifies the manifest, then Han Expires 4 December 2026 [Page 3] Internet-Draft AI Manifest June 2026 consults it as advisory reference data when it encounters friction, instead of rediscovering the same site-specific behavior by repeated DOM-based inference. 2. Conventions and Definitions The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here. AI Manifest (AI Friction-Recovery Manifest, AFRM) A structured data object, expressed in JSON, that a website operator serves at a well-known URI or otherwise publishes (see Section 3.1), containing at minimum a version, a publisher, and a knownTraps array; each trap entry carries a category, a CSS selector, and an escapeAction. The two terms are used interchangeably in this document. AI Agent A software process, typically driven by an LLM, that accesses a web page via a browser-automation tool and executes actions on behalf of a human user. Central Trust Registry A network service that stores SHA-256 hash values of manifests pre-registered by publishers, and responds to real-time trust lookups from AI agents with a status of white- listed, black-listed, or unknown. Canonical Form The representation of an AI Manifest obtained by lexicographically sorting all JSON object keys at every nesting level and serializing the result using the JSON encoding defined in [RFC8259] with UTF-8. 3. Protocol Overview 3.1. Embedding Methods A website MAY provide an AI Manifest via one or more of the following methods: 3.1.1. Method A: Well-Known URI The server SHOULD make the manifest retrievable at the following well-known URI [RFC8615]: /.well-known/ai-manifest.json Han Expires 4 December 2026 [Page 4] Internet-Draft AI Manifest June 2026 In addition, the HTML document SHOULD declare the entry point via an HTML meta element: 3.1.2. Method B: Hidden DOM Element The server MAY embed the manifest into the HTML response as a hidden element with the display:none style and aria-hidden="true" attribute: 3.1.3. Method C: HTTP Response Header The server MAY declare the manifest location and hash in a response header: X-AI-Manifest: url=/.well-known/ai-manifest.json; hash=sha256: Method C is RECOMMENDED in conjunction with Method A, so that an AI agent can discover the manifest URL and validate the hash in a single request-response round trip before fetching the body. 3.1.4. Method D: HTML Link Element The server MAY declare the manifest via an HTML link element in the document head: Method D allows AI agents that parse the DOM but do not inspect HTTP headers to discover the manifest location. The rel value ai-manifest is registered in Section 5.4. 3.1.5. Method E: AI-Side Curation A user MAY place a manifest at a vendor-conventional path in their own file system, for example ~/.claude/afrm/{domain}.json or ~/.codex/afrm/{domain}.json. The AI runtime loads the file when it visits a matching domain. Because the manifest originates from the user's authored context, it enters the agent's trust context without the prompt-injection screening applied to externally fetched data. Han Expires 4 December 2026 [Page 5] Internet-Draft AI Manifest June 2026 When both an AI-side curated manifest and a site-side published manifest exist for the same origin, the curated manifest SHOULD take precedence as the more explicit trust signal. Method E is a transitional mechanism intended for domains that have not yet published a site-side AFRM. Once the site publishes its own manifest, the AI-side copy SHOULD be retired or used only to override individual fields. 3.2. Manifest Schema (AFRM Three-Part Model) An AI Friction-Recovery Manifest (AFRM) is a JSON object [RFC8259] with the following top-level fields: version (string, REQUIRED) Specification version. This document defines version "1.0". publisher (string, REQUIRED) Canonical domain name of the publishing website. manifestId (string, REQUIRED) Stable identifier scoped to the publisher, used for registry lookup. registry_url (string, REQUIRED) HTTPS URL of the trust registry that the publisher has pre-registered this manifest with. frameworkHints (object, OPTIONAL) Metadata about the UI framework in use (e.g., framework name, version, rendering model). Allows the AI agent to apply framework-specific heuristics before encountering friction. knownTraps (array, REQUIRED) Primary content of the AFRM. An ordered array of trap descriptors, each describing a UI pattern that predictably blocks autonomous AI agents. Each element MUST include trapId (string), category (string from the IANA trap registry, see Section 5), selector (string), and escapeAction (string). Optional fields include description and condition. shortcuts (array, OPTIONAL) An array of direct navigation or action shortcuts that allow the AI agent to bypass multi-step UI flows when a direct API or deep-link equivalent exists. 3.3. Agent Detection Algorithm Upon loading a page, an AI agent implementing this specification SHOULD perform the following detection sequence before any full-DOM inference pass: Han Expires 4 December 2026 [Page 6] Internet-Draft AI Manifest June 2026 1. Check the agent's local file system for a user-curated manifest at the vendor-conventional path (e.g., ~/.claude/ afrm/{domain}.json) matching the current origin (Method E). If found, load it into the trust context and continue; the site-side steps below may still be consulted for supplementary data. 2. Inspect the HTTP response headers for X-AI-Manifest (Method C). 3. If absent, retrieve /.well-known/ai-manifest.json (Method A) or resolve the URI declared by the meta element. 4. If still absent, check the document head for a element and fetch the referenced URL (Method D). 5. If still absent, search the DOM for an element with id="ai- manifest" and read its data-manifest attribute (Method B). 6. If none is found, fall back to conventional DOM-based inference. 3.4. Canonical Hash and Trust Verification A trust registry is an OPTIONAL integrity layer. When a site- published manifest declares a registry_url, the AI agent SHOULD compute a SHA-256 hash over the canonical form (see Section 2) of the manifest and send a trust lookup request to that URI before relying on the manifest. Such a request MUST use HTTPS [RFC2818] and MUST carry the tuple {publisher, manifestId, hash} as a JSON body. A user-curated manifest obtained via Method E is trusted as part of the user's own authored context and does not require a registry lookup. Because the manifest is advisory and the agent retains control of every interaction (see Section 3.5), a deployment MAY operate without a registry, in which case the agent relies on its own validation of each suggested action. The registry response is a JSON object containing a status field with one of the following values: * "white" — the manifest is trusted; the agent MAY proceed to execution. * "black" — the manifest is explicitly distrusted; the agent MUST abort and SHOULD alert the human user. * "unknown" — the manifest is not registered; the agent SHOULD warn the user and MAY fall back to DOM-based inference. Han Expires 4 December 2026 [Page 7] Internet-Draft AI Manifest June 2026 Implementations MAY cache a non-expired registry response keyed by the manifest hash, to avoid repeated network round trips for an identical manifest. 3.5. Consulting the Manifest The AFRM is advisory reference data, not a mandatory execution script. An AI agent operates with its own capabilities and consults the manifest when it would otherwise expend effort rediscovering site-specific behavior. Two consultation patterns are typical: * Reactive (friction recovery): when the agent encounters a UI condition matching a knownTraps entry's category and selector, it applies the associated escapeAction instead of recovering by trial and error. This is the primary use. * Proactive (shortcut): when a shortcuts entry offers a direct navigation or action equivalent to a multi-step UI flow, the agent MAY take it directly. The agent retains full control of the interaction and MAY continue to use its own DOM- or vision-based inference at any time; the manifest supplements that inference and does not replace it. 4. Central Trust Registry A Central Trust Registry accepts manifest registrations from publishers and answers real-time hash lookups from AI agents. A conforming registry SHOULD: * Store the SHA-256 hash of the canonical form of each registered manifest, together with the publisher and manifestId fields. * Perform static analysis on the submitted manifest (its knownTraps and shortcuts entries) and reject or black-list manifests whose selectors or actions match a published pattern of prompt-injection risk (for example, selectors targeting iframe elements for cross- origin form submission, or actions outside the registered action set). * Expose a community-driven mechanism for reporting and black- listing malicious manifests. This document does not mandate a specific registry operator. Multiple interoperable registries MAY exist, and each manifest declares which registry is authoritative for it via registry_url. Han Expires 4 December 2026 [Page 8] Internet-Draft AI Manifest June 2026 5. IANA Considerations 5.1. Well-Known URI Registration This document requests IANA to register the following entry in the "Well-Known URIs" registry established by [RFC8615]: URI Suffix: ai-manifest.json Change Controller: Independent Submission Stream editor Reference: This document Status: provisional Related Information: None 5.2. AI Manifest Actions Registry (initial) This document requests IANA to create a new registry named "AI Manifest Actions", with the following initial registrations. Registration policy: Specification Required [RFC8126]. click Invoke a click event on the selected element. fill Type a value into a text input element. select Choose an option from a drop-down list element. upload Attach a file to a file input element. wait Pause for a condition or duration. navigate Change the current URL. assert Verify that a condition holds before proceeding. 5.3. AI Manifest Trap Categories Registry (initial) This document requests IANA to create a new registry named "AI Manifest Trap Categories", with the following initial registrations. Registration policy: Specification Required [RFC8126]. shadow-dom-trap A UI component rendered inside a Shadow DOM tree whose selectors are not accessible via standard document.querySelector calls. virtual-scroll-trap A list or grid that renders only a visible Han Expires 4 December 2026 [Page 9] Internet-Draft AI Manifest June 2026 viewport window, requiring programmatic scrolling before off- screen items can be targeted. iframe-context-trap An action target located inside a cross-origin or same-origin