Frequently Asked Questions
Everything you need to know about AI-readiness, llms.txt, and making your website work with AI agents.
AI-Readiness Basics
AI-readiness measures how well your website's content can be understood, extracted, and used by AI agents like ChatGPT, Claude, and Perplexity. As AI-powered tools become a major source of web traffic, sites that are AI-ready get cited more accurately, appear more often in AI-generated responses, and consume fewer tokens to process.
Unlike web browsers that render HTML visually, AI agents need to extract text content from your pages. They prefer clean, well-structured content over complex HTML with heavy styling. A well-structured page converted to Markdown uses 70-80% fewer tokens than raw HTML, making it cheaper and more efficient for AI providers.
The major AI crawlers include GPTBot (OpenAI/ChatGPT), ClaudeBot (Anthropic/Claude), PerplexityBot (Perplexity), Bytespider (ByteDance), and CCBot (Common Crawl), plus Google-Extended, the robots.txt token that controls whether Google may use your content for Gemini. New AI agents appear regularly as the ecosystem grows.
llms.txt
llms.txt is an emerging standard (defined at llmstxt.org) that helps AI agents understand your website's structure. Similar to how robots.txt guides search engine crawlers, llms.txt provides a Markdown-formatted overview of your site with links to key pages, making it easy for AI agents to navigate your content.
llms.txt is a concise index with a description and links to your site's main pages. llms-full.txt is an extended version that includes the actual content of those pages inline, giving AI agents everything in a single file without needing to follow links. Use llms.txt as a minimum, and llms-full.txt for comprehensive coverage.
Create a text file at your domain root (e.g., example.com/llms.txt) following the llmstxt.org spec. Start with a # heading (your site name), add a blockquote description, then list links organized in sections like ## Documentation and ## Main. AgentReady can generate a recommended llms.txt based on your page analysis.
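For example, a minimal llms.txt for a hypothetical site might look like this (the site name, URLs, and descriptions are placeholders):

```
# Example Site
> A short description of what the site offers and who it is for.

## Main
- [Home](https://example.com/): product overview
- [Pricing](https://example.com/pricing): plans and limits

## Documentation
- [Getting started](https://example.com/docs/getting-started): setup guide
- [API reference](https://example.com/docs/api): endpoints and parameters
```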
Markdown for AI
Markdown is the preferred format for AI agents because it preserves content structure (headings, lists, links, emphasis) while eliminating visual markup noise (CSS, JavaScript, layout divs). A Markdown version of your content uses significantly fewer tokens, making it faster and cheaper for AI systems to process.
Content negotiation allows your server to serve different formats of the same page based on the client's Accept header. When an AI agent sends Accept: text/markdown, your server can respond with a Markdown version instead of HTML. This is the most efficient way to serve AI-friendly content without creating separate URLs.
There are two main approaches: (1) Add server logic to detect Accept: text/markdown headers and return Markdown content; (2) Create .md files alongside your pages (e.g., /about.md for /about) and link to them from your llms.txt. AgentReady uses both approaches for its own pages.
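As a sketch of approach (1), here is a minimal Node.js/TypeScript server that inspects the Accept header and serves a pre-rendered .md file when the client asks for text/markdown. The pages/ directory and file layout are assumptions for illustration, not part of any standard:

```ts
import { createServer } from "node:http";
import { readFile } from "node:fs/promises";
import { join } from "node:path";

// Assumption: pre-rendered pages live side by side, e.g. pages/about.html and pages/about.md
const PAGES_DIR = "./pages";

const server = createServer(async (req, res) => {
  // Strip the query string and map "/" to "index" (sanitize paths properly in production).
  const path = (req.url ?? "/").split("?")[0] ?? "/";
  const name = path === "/" ? "index" : path.replace(/^\/+|\/+$/g, "");

  // Content negotiation: serve Markdown when the Accept header asks for it.
  const accept = req.headers.accept ?? "";
  const wantsMarkdown = accept.includes("text/markdown");
  const ext = wantsMarkdown ? "md" : "html";

  try {
    const body = await readFile(join(PAGES_DIR, `${name}.${ext}`), "utf8");
    res.writeHead(200, {
      "Content-Type": wantsMarkdown
        ? "text/markdown; charset=utf-8"
        : "text/html; charset=utf-8",
      // The response varies by Accept header, so caches must key on it.
      "Vary": "Accept",
    });
    res.end(body);
  } catch {
    res.writeHead(404, { "Content-Type": "text/plain" });
    res.end("Not found");
  }
});

server.listen(3000);
```

Sending Vary: Accept matters here: it tells caches and CDNs that the same URL can return different formats depending on the request, so an HTML response isn't cached and replayed to a client asking for Markdown.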
Structured Data & JSON-LD
JSON-LD (JavaScript Object Notation for Linked Data) is a way to embed structured data in your pages using Schema.org vocabulary. AI agents use this data to extract factual, machine-readable information like product details, article metadata, organization info, and more — without needing to parse your HTML.
Use the most specific type that matches your content: Article or BlogPosting for articles, Product for product pages, Organization for company pages, FAQPage for FAQ pages, LocalBusiness for local businesses, and WebApplication for web tools. Always include name, description, and relevant properties for your chosen type.
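For instance, Article markup embedded as JSON-LD might look like the following; all values are placeholders:

```html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "Article",
  "headline": "How to Make Your Site AI-Ready",
  "description": "A practical guide to llms.txt, Markdown negotiation, and structured data.",
  "author": { "@type": "Organization", "name": "Example Co" },
  "datePublished": "2025-01-15"
}
</script>
```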
Open Graph tags (og:title, og:description, og:image) provide standardized metadata that both social platforms and AI agents use to understand your page's title, description, and primary image. They're easy to implement and serve as a reliable fallback when other structured data is missing.
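A typical set of Open Graph tags in the page head looks like this (values are placeholders):

```html
<meta property="og:title" content="How to Make Your Site AI-Ready" />
<meta property="og:description" content="A practical guide to llms.txt, Markdown negotiation, and structured data." />
<meta property="og:image" content="https://example.com/images/og-cover.png" />
```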
robots.txt & AI Bots
robots.txt controls which bots can access your site and which pages they can crawl. AI crawlers like GPTBot and ClaudeBot respect robots.txt directives. If your robots.txt blocks these bots, they won't be able to index your content, which means your site won't appear in AI-generated responses.
To maximize visibility in AI-generated responses, allow at least: GPTBot (OpenAI), ClaudeBot and Claude-Web (Anthropic), PerplexityBot (Perplexity), and Google-Extended (Google Gemini). You can add specific Allow rules for these user agents while maintaining your existing rules for other bots.
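A sketch of robots.txt rules that explicitly allow these crawlers, kept alongside whatever rules you already have for other bots:

```
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Google-Extended
Allow: /
```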
Content-Signal is a newer directive, added to your robots.txt file, that tells AI agents how they may use your content. For example, Content-Signal: ai-train=yes, search=yes, ai-input=yes signals that your content may be used for AI training, search indexing, and as input for AI-generated responses. This emerging standard gives publishers explicit control over AI usage.
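Assuming the directive is placed inside a robots.txt group, a minimal sketch could look like this:

```
User-agent: *
Content-Signal: ai-train=yes, search=yes, ai-input=yes
Allow: /
```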
AgentReady Scoring
AgentReady fetches your page, extracts the content, and runs 21 individual checks across 5 weighted dimensions. Each check scores 0-100, and the dimensions are combined into an overall score from 0 to 100. You get a letter grade (A-F), detailed breakdown, and prioritized recommendations to improve your score.
The 5 dimensions are:
- Semantic HTML (20%): proper use of article, main, headings, and other semantic elements
- Content Efficiency (25%): token reduction ratio and content-to-noise ratio
- AI Discoverability (25%): llms.txt, robots.txt, sitemap, and Markdown negotiation
- Structured Data (15%): Schema.org, Open Graph, and meta tags
- Accessibility (15%): content available without JavaScript, page size, and content position
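The exact aggregation isn't documented here, but assuming a straightforward weighted average of the five dimension scores (each 0-100), the overall score could be computed like this sketch; the names and function are illustrative, not AgentReady's actual code:

```ts
type Dimension =
  | "semanticHtml"
  | "contentEfficiency"
  | "aiDiscoverability"
  | "structuredData"
  | "accessibility";

// Weights mirror the percentages listed above.
const WEIGHTS: Record<Dimension, number> = {
  semanticHtml: 0.20,
  contentEfficiency: 0.25,
  aiDiscoverability: 0.25,
  structuredData: 0.15,
  accessibility: 0.15,
};

// Each dimension score is on a 0-100 scale; the weighted result is also 0-100.
function overallScore(scores: Record<Dimension, number>): number {
  return (Object.keys(WEIGHTS) as Dimension[]).reduce(
    (sum, dim) => sum + scores[dim] * WEIGHTS[dim],
    0,
  );
}

// Example: 80*0.20 + 70*0.25 + 90*0.25 + 40*0.15 + 75*0.15 = 73.25
console.log(overallScore({
  semanticHtml: 80,
  contentEfficiency: 70,
  aiDiscoverability: 90,
  structuredData: 40,
  accessibility: 75,
}));
```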
Yes! Single-page analysis is completely free with no signup required. You get the full score, recommendations, Markdown conversion, and llms.txt preview. We're currently in beta with a limit of 5 analyses per hour. Full domain crawl and monitoring features are coming soon.
Useful Resources
- llmstxt.org — llms.txt specification
- schema.org — Schema.org vocabulary
- w3.org/TR/json-ld11 — W3C JSON-LD specification
- ogp.me — Open Graph Protocol
- robotstxt.org — robots.txt standard
- commonmark.org — CommonMark Markdown specification
- RFC 7231 — HTTP Content Negotiation