Technical

How ChatGPT Selects Its Sources, and How to Become One

February 14, 2026 8 min read

The question every marketing leader should be asking in 2026 is no longer "Am I on the first page of Google?" but rather "Am I cited when someone asks an AI about my industry?"

The 4 selection criteria of LLMs

1. Technical accessibility: The crawler must be able to read your site. A robots.txt that blocks GPTBot, ClaudeBot, or MistralBot excludes your site from those models' indexes entirely. This is the baseline: if AI bots cannot crawl your content, you do not exist for them.

2. Information structure: AI models prefer content that directly answers a question. A factual paragraph will always be favored over a promotional one. H2 and H3 headings phrased as questions, bullet lists, and direct concise answers: all of these significantly increase your chances of being cited.

3. Structured data: Schema.org JSON-LD is the language AI reads before your actual content. Organization, FAQPage, Service, Person: these markup types tell models exactly what you do, who you are, and why you are a trustworthy source.

4. Authority and consistency: A site whose content is coherent, non-contradictory, and regularly updated inspires greater confidence in models. Content freshness and consistency are positive signals for all LLMs.

The key role of the llms.txt file

An emerging standard first proposed in 2024, the llms.txt file contains a structured Markdown summary of your business. It is your letter of introduction to LLMs. Placed at the root of your domain (e.g., https://your-website.com/llms.txt), it summarizes who you are, what you do, and how to reach you.

Twenty minutes of work for potential visibility to hundreds of millions of ChatGPT, Claude, Perplexity, and Mistral Le Chat users. Models that have adopted this standard prioritize sites that use it — notably Perplexity, which was an early adopter.

What you can do right now

Start with an audit of your site's current state: does your robots.txt allow AI crawlers? Do you have an llms.txt file? Is your Schema.org markup complete and up to date? These three questions cover the essentials.

Once you have this baseline assessment, the fixes are quick to implement, and their impact on your AI visibility can be measurable within weeks.


Discover your GEO score in 60 seconds

Free analysis: robots.txt, llms.txt, Schema.org, AI-ready content. Score out of 100 with recommendations.

Analyze my website for free →