In March 2025, Google and Microsoft made something official: they actively use schema markup during AI response generation. At SMX Munich 2025, Microsoft's Principal Product Manager stated directly that "schema markup helps Microsoft's LLMs understand your content."
This wasn't a surprise to anyone paying attention. But the public confirmation changed the conversation. Schema markup isn't just a nice-to-have for rich snippets anymore. It's a direct input into whether AI engines understand your content well enough to cite it.
If you haven't updated your schema strategy since the pre-AI era, this post is for you.
In This Article
What Is Schema Markup, and Why Does It Matter for AI?
Schema markup is structured data you add to your HTML — typically as JSON-LD in the <head> — that provides machine-readable metadata about your content.
For traditional search, schema enabled rich results: star ratings, FAQ dropdowns, how-to steps, and recipe cards in the SERP. For AI search, the stakes are higher.
Here's the key difference: When a user asks ChatGPT or Perplexity a question, the AI doesn't just read your content the way a human does. It processes the page, extracts meaning, and decides whether it trusts the source enough to cite it. Schema markup compresses that process. It tells the engine:
- What type of content this is (article, FAQ, product, organization)
- Who wrote it and when
- What entity or business is behind it
- What the page is definitively about
Without schema, AI engines have to infer all of this from unstructured text — and they often infer wrong, or not at all.
Why JSON-LD Specifically
There are three formats for structured data: JSON-LD, Microdata, and RDFa. Google recommends JSON-LD. The AI ecosystem has aligned on it.
The reason is practical: JSON-LD lives in a separate <script> block in the <head>, completely independent of your HTML structure. You can add, update, or remove it without touching your visual content. It's maintainable, easy to validate, and doesn't interfere with CSS or layout.
Microdata requires embedding attributes directly in your HTML elements — messy, fragile, and hard to maintain at scale. For the purposes of AI optimization, JSON-LD is the format. Use it.
The Schema Types That Matter Most for AI Citations
1. Organization
What it tells AI engines: Who you are as a business — your name, what you do, where to find you, and how to verify your identity across the web.
Why it matters: AI models need to establish that your brand is a real, credible entity. Organization schema with sameAs links (pointing to your LinkedIn, Twitter/X, Crunchbase, Wikipedia, and other authoritative profiles) gives AI engines a web of cross-references to verify against.
Minimum viable implementation:
{
"@context": "https://schema.org",
"@type": "Organization",
"name": "Your Company Name",
"url": "https://yoursite.com",
"description": "One-sentence description of what you do.",
"logo": "https://yoursite.com/logo.png",
"sameAs": [
"https://linkedin.com/company/yourcompany",
"https://twitter.com/yourcompany"
],
"contactPoint": {
"@type": "ContactPoint",
"contactType": "customer service",
"email": "hello@yoursite.com"
}
}
Add this to every page of your site, particularly the homepage and About page.
2. FAQPage
What it tells AI engines: That this page contains question-answer pairs with direct, structured answers — exactly the format AI engines extract and cite.
Why it matters: FAQPage schema pages are 3.2× more likely to appear in Google AI Overviews — the highest citation multiplier of any schema type tested. This is the single highest-ROI schema type for GEO because AI engines are built to extract and present Q&A content. When you provide it in structured form, you eliminate ambiguity.
Best practice: Keep answers between 50–150 words. Write them to answer the question fully on their own — not as hooks to read the rest of the article. AI engines extract the answer text verbatim.
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "What is Generative Engine Optimization?",
"acceptedAnswer": {
"@type": "Answer",
"text": "GEO (Generative Engine Optimization) is the practice of
optimizing content so it gets cited inside AI-generated answers
from tools like ChatGPT, Perplexity, Google AI Overviews, and
Gemini. Unlike SEO, which targets rankings, GEO targets citations
— appearing inside the answer itself rather than below it."
}
}
]
}
Add FAQ schema to your key landing pages, service pages, and any blog posts that answer a well-defined question.
3. Article
What it tells AI engines: Who wrote this, when it was published, when it was last updated, and what it's about.
Why it matters: Freshness and authorship are primary trust signals for AI engines. Without Article schema, an AI crawler has to infer the publish date from visible text (unreliable) and the author from byline text (often inconsistent). Schema makes both machine-readable and authoritative.
Key fields:
headline: The article titleauthor: Link to a Person schemadatePublished: ISO 8601 format (e.g.,2026-03-18)dateModified: Update this whenever you update the contentpublisher: Link to your Organization schema
Pro tip: When you update old content, updating dateModified in your schema signals freshness to AI crawlers even if the URL doesn't change. This is one of the lowest-effort ways to maintain citation eligibility for evergreen content.
4. Person (Author Schema)
What it tells AI engines: That the content was written by a real, credentialed human being — not anonymously generated.
Why it matters: Perplexity rarely cites anonymous content. Google's E-E-A-T framework (which feeds directly into AI Overviews) treats author identity and demonstrated expertise as core trust signals. 96% of Google AI Overview citations come from sources with strong E-E-A-T.
Minimum viable implementation:
{
"@context": "https://schema.org",
"@type": "Person",
"name": "Author Name",
"jobTitle": "Their Role",
"url": "https://yoursite.com/authors/their-name",
"sameAs": [
"https://linkedin.com/in/theirprofile"
]
}
Create an author profile page for each contributor. Link their Article schema to their Person schema. AI engines follow these links to verify credibility.
5. HowTo
What it tells AI engines: This page contains a step-by-step process for accomplishing a specific goal.
Why it matters: How-to queries are among the highest-volume AI search interactions. "How to get cited by ChatGPT," "how to implement llms.txt," "how to fix Core Web Vitals" — all of these trigger AI engines to look for step-by-step content. HowTo schema makes your page the unambiguous answer.
Use it on any page that walks through a defined process. Include step objects with clear name and text for each step.
The Critical Warning: Generic Schema Hurts More Than It Helps
Research from a 2026 empirical study of 730 AI citations found something counterintuitive: generic, partially-filled schema produces an 18-percentage-point citation penalty compared to having no schema at all.
The reasoning: AI engines interpret incomplete schema as a mismatch between what you claim and what you deliver. A Product schema with only name filled in is a false signal. It looks like a bot scraped the markup, not that a real business is properly describing itself.
The rule: If you implement a schema type, implement it fully. Every required and recommended field should have a real value. If you don't have the data to fill it in properly, leave that schema type out until you do.
The Implementation Priority Order
If you're starting from scratch, tackle schema in this order:
- Organization — on every page. This is the foundation that establishes your entity identity.
- FAQPage — on your top 5 pages by traffic. Highest citation lift of any schema type.
- Article — on every blog post. Always include datePublished, dateModified, and author.
- Person — create author pages. Link them from Article schema for E-E-A-T signals.
- HowTo — on instructional content. High-value for tutorial-style queries.
Validate Before You Publish
Don't guess whether your schema is correct. Two tools:
- Google Rich Results Test (
search.google.com/test/rich-results) — validates your JSON-LD and shows how Google interprets it. - Schema.org Validator (
validator.schema.org) — checks for structural errors and required fields.
A schema error that returns a 500 for Google is worse than no schema at all. Validate every implementation.
Schema Is the Floor, Not the Ceiling
Schema markup tells AI engines what your content is. But AI engines also care about what your content says and whether anyone else confirms it.
A perfectly schema'd page with weak original content will still lose to a well-written, authoritative piece with basic schema. Schema raises your floor — it eliminates the technical barriers to being understood. The ceiling is set by content quality, brand authority, and the trust signals you build over time.
That's why the best GEO strategy combines technical excellence (schema, robots.txt, page speed, llms.txt) with content strategy (original research, answer-first structure, genuine expertise).
For a complete technical checklist covering schema and nine other AI visibility factors, see our GEO Audit Checklist. For the business strategy behind getting cited, see How to Get Your Business Mentioned by ChatGPT.
Find Out Where Your Schema Gaps Are
Our free GEO audit checks your schema implementation, identifies missing markup types, and scores your overall AI citation readiness — alongside the other technical and content factors that determine whether AI engines trust your site.
Run Free GEO Audit →GEORaiser researches AI search engine behavior and GEO optimization. Sources: Google/Microsoft schema announcements (March 2025), Princeton/Georgia Tech GEO research (KDD 2024), Google E-E-A-T documentation, Schema.org specification.