YAML-in-HTML embeds structured, trust-scored knowledge into web pages that AI systems can retrieve and cite independently, turning any CMS into an AI memory engine without special plugins. It works with platforms like WordPress, providing fragment-level memory that AI can evaluate for trustworthiness.
The web was made for people, not machines. Humans interpret context. AI struggles to know what's true, who said it, and where it came from.
Schema.org and JSON-LD improved SEO, not AI understanding. They operate at the page level, lack provenance, and don’t support claim-level retrieval. AI systems are left guessing.
"Since 2011, websites have used Schema.org and JSON-LD to feed SEO bots. But these formats were never designed for AI memory." — David Bynon
YAML-in-HTML changes that. It embeds machine-readable, trust-scored memory inside standard HTML using inert tags. No JavaScript. No plugins.
<template
data-visibility-fragment
data-type="text/yaml"
data-sdt-class="DataFragment"
data-entity="plan:H5521-290-0"
data-digest="2025-cms-ma-mapd-plan"
data-glossary-scope="cms_landscape"
data-fragment-scope="semantic-digest">
Inside is YAML content—easy to read, easy to export, and fully retrievable by agents.
Defines what the fragment is, what it belongs to, and how to classify it.
data-sdt-class: DataFragment
entity: plan:H5521-290-0
digest: 2025-cms-ma-mapd-plan
glossary_scope: cms_landscape
fragment_scope: semantic-digest
Documents the source, license, and retrieval details.
ProvenanceMeta:
ID: 2025-cms-ma-landscape
Title: CMS MA Landscape File, 2025
Creator: Centers for Medicare & Medicaid Services (CMS)
License: Public Domain
Published: 2025-06-01
Retrieved: 2025-06-28
Digest: 2025-cms-ma-mapd-plan
Entity: plan:H5521-290-0
The smallest unit of AI-retrievable knowledge.
Fields:
•id: in_primary
defined_term: Primary Care Visit
value: "$0"
unit: usd
confidence: high
derived: false
glossary: term-in_primary
source: 2025-cms-pbp
provenance_ref: "#provenance-meta"
Different fragment types serve different AI needs.
Stores raw, structured facts (costs, stats, values).
Machine-readable glossary entries with provenance.
entity: term:zero_premium
Term:
term_id: zero_premium
name: Zero Premium Plan
definition: A Medicare Advantage plan that has no monthly premium beyond Part B.
Question-and-answer pairs for retrieval without hallucination.
FAQ:
question: Are zero-premium Medicare Advantage plans available in all counties?
answer: No. Availability varies by county.
Directories of entities (e.g., list of Medicare plans).
High-level metadata about entire datasets.
Just add the block and YAML into a post or page.
Embed YAML-in-HTML fragments as partials or template components.
Define custom fields or server-side generators that output valid YAML-in-HTML. Key requirement: preserve the and YAML formatting.
"If MCP is the USB-C socket, YAML-in-HTML is the micro thumb drive. It’s small, lightweight, and universally pluggable." — David Bynon
Traditional Web:
Memory-First Web:
YAML-in-HTML brings this future within reach—no new frameworks, no vendor lock-in. It runs on the web we already have.
By adding these machine-readable fragments alongside human-readable content, publishers can serve both audiences effectively—ensuring their expertise is accurately represented in both human research and AI-assisted information retrieval. Learn more in this USA Today story.
For an in-depth overview of this methodology, see the original announcement on Medium.com (https://medium.com/@trust_publishing/the-web-just-got-a-memory-introducing-yaml-in-html-1491e5d2c8fb).
To learn more about implementing YAML-in-HTML and the Semantic Digest Protocol in your own projects, check out David Bynon's documentation at SemanticDigest.org.