Sign in with Google
← back

The Open Knowledge Format (OKF)

The Open Knowledge Format, or OKF, is a new open standard from Google Cloud for packaging the knowledge an AI system needs so that any model or agent can read it. It takes the informal habit of keeping an "AI wiki" next to your work and turns it into a portable, vendor-neutral format. The announcement and the full specification are linked at the end.

The problem it solves

Foundation models are only as good as the context they are given, and in most organisations that context is scattered. Table schemas, metric definitions, runbooks, join paths, and the reasoning behind past decisions live in catalogs, wikis, shared drives, code comments, and the heads of a few senior people. Each tool stores this knowledge in its own shape behind its own API, so it does not travel. Every team that builds an agent re-solves the same job of gathering context, and every catalog vendor reinvents the same data model.

OKF answers this with a format that anyone can produce without an SDK, anyone can consume without an integration, and that survives being moved between systems. It lives in version control beside the code it describes, and the same file is readable by a person and parseable by an agent.

How it works

An OKF bundle is simply a directory of markdown files. The rules are deliberately small enough to fit on a single page.

  1. One concept per file. A concept is anything worth capturing: a table, a dataset, a metric, a playbook. The file's path is its identity, so tables/orders.md is the concept tables/orders.
  2. Markdown with YAML frontmatter. Each file opens with a small block of structured fields. The only required field is type. Recommended fields are title, description, resource (a link to the underlying asset), tags, and timestamp. You may add any other fields you like.
  3. Links make a graph. Concepts reference each other with ordinary markdown links, which turns the folder into a graph of relationships rather than a plain tree.
  4. Two reserved files. An index.md gives a directory listing so an agent can see what is available before opening files, and a log.md records changes over time, newest first.

Because it is just markdown and files, a bundle renders on GitHub, opens in any editor, ships as a tarball, and is indexed by any search tool. If you can read a file with cat, you can read OKF.

Three principles

  1. Minimally opinionated. Exactly one field is mandatory. The spec defines how systems interoperate, not what you must write.
  2. Producer and consumer are independent. A human can author a bundle that an agent consumes, or a pipeline can generate one that a person browses. The format is the contract, and the tools at each end are swappable.
  3. A format, not a platform. It is tied to no cloud, database, model, or agent framework, and never requires an account to read or write.

What comes with it

Google shipped working proofs alongside the spec: a reference agent that drafts a bundle from a BigQuery dataset and then enriches it by crawling authoritative documentation, a self-contained HTML visualiser that renders any bundle as an interactive graph, and three sample bundles built from public datasets.

Why it matters for AI visibility

This is the same idea that sits under AI visibility, seen from the supply side. If you want models to represent your work accurately, the knowledge they rely on has to be legible to them. OKF is a clean way to make that knowledge portable and machine-readable, so the systems that answer questions about your domain can ground themselves in what you actually said rather than guessing.

Read more

How the Open Knowledge Format can improve data sharing is the announcement. The specification and reference code live in GoogleCloudPlatform/knowledge-catalog.

Related concepts

Dan Petrovic · Jun 21, 08:28