Sign in with Google
← back

Primary Bias

Primary bias is what an AI model already believes about your brand before it searches for anything. It is the model's ungrounded confidence in an entity, formed during training and baked into its weights, and it fires the moment a question is asked, before a single source is retrieved. In AI search this turns out to be the single largest factor in whether your content gets selected.

The clearest illustration came from a model reasoning about us. Reading our name, “Dejan”, it pattern-matched to the Balkans and began generating test queries for Serbian and Slovenian cities, when we are in fact Australian. The judgement landed before any other context was processed: as we described in our account at the time, the model had already formed an opinion before the conversation began.

Primary versus secondary bias

It helps to split the influences on selection into two layers. Primary bias is the model's inherent relevance perception of an entity, its pre-retrieval instinct. Secondary bias is everything about how your content is formatted, structured, and presented once it has been retrieved. The distinction matters because they move on very different timescales: secondary bias is addressable now, at the margin, while primary bias is slow to shift because it depends on training data.

Why it is hard to move

Primary bias lives in the model's weights, so changing it means changing what future models learn. Pre-training data is now heavily curated and very unlikely to respond to ordinary SEO. The realistic lever is fine-tuning, which authority-building can influence over roughly three to six months, with major model releases arriving about once a year. Our research on Selection Rate found that a brand strong in the training corpus can earn a high selection rate even with mediocre content, while a weak or confused brand struggles even when its page is retrieved.

How it connects to Selection Rate

Primary bias is measured through its effect on the Selection Rate: how often the model picks your source out of the grounding candidates. We frame primary bias as the dominant input to that rate, and use a probability-path method (our “Tree Walker”) to surface a brand's weakest associations, the high-uncertainty spots where reinforcement helps most.

What you can do about it

  • Build a consistent, authoritative presence in the kinds of sources that feed training data.
  • Disambiguate your entity explicitly, so a name does not collapse into the wrong association.
  • Earn citations in authoritative contexts, across academic, industry, and media.
  • Persist over time, so the signal survives across training cycles.

The uncomfortable part

Primary bias is pre-judgement in the literal sense, and it inherits the patterns in its training data, human ones included. Names, locations, and demographic signals can override the actual context in front of the model. The question is not whether these systems carry bias; they do. For anyone trying to be visible in AI answers, it is a force to understand and work with, not one that can be edited away on the day.

Related concepts

Evidence and sources

Dan Petrovic · Jun 21, 11:41