Our audit scores your site across six dimensions derived from the Princeton/Georgia Tech KDD 2024 framework, verified against first-party guidance from Google, Anthropic, and Microsoft.
1. AI Crawler Access (Weight: 25%) — We verify that GPTBot, OAI-SearchBot, PerplexityBot, ClaudeBot, and Google-Extended are explicitly permitted in robots.txt. Blocking any one of these eliminates all citations from that platform. This dimension has the highest weight because it is a binary gate — if crawlers cannot access your content, no other optimization matters. Otterly.AI’s 2026 study confirmed that 73% of websites inadvertently block at least one major AI crawler.
2. llms.txt Implementation (Weight: 10%) — We check for a well-formed llms.txt at your domain root, endorsed by Anthropic in November 2024, that guides AI systems to your most important content. We evaluate file presence, Markdown formatting, content completeness, and freshness of listed pages. Sites with llms.txt show faster AI indexing and more consistent citation patterns.
3. Schema Markup Depth (Weight: 20%) — We audit JSON-LD coverage across six schema types: Organization, FAQPage, Article, Product, BreadcrumbList, and LocalBusiness. FAQPage schema alone delivers a measured 3.2× citation lift for Google AI Overviews (CXL, 2024). Incomplete schema — such as FAQPage with missing Answer fields — creates an 18-point citation penalty versus having no schema at all.
4. Content Citability (Weight: 20%) — We score every content block by length (75–150 words is the AI citation sweet spot identified in KDD 2024 research), answer-first structure, and presence of proprietary data with named sources. The most common failing: 88% of sites we audit have no content blocks in the optimal word-count range. Blocks with specific statistics from named sources score highest.
5. E-E-A-T Signals (Weight: 15%) — We assess byline authority, author schema markup (Person JSON-LD with jobTitle and affiliation), expert quotations, and external source citations. Google’s documentation confirms that 96% of AI Overview citations come from sources with strong E-E-A-T signals. Sites without visible author attribution are systematically deprioritized by AI engines.
6. Technical SEO Alignment (Weight: 10%) — We check canonical tags, sitemap accuracy, Core Web Vitals, and mobile usability — since 71.7% of ChatGPT citations still come from pages with established organic search presence (Surfer SEO, 2025). We evaluate sitemap submission status, page speed (FCP under 0.4 seconds correlates with 3× more citations), mobile responsiveness, and HTTPS implementation.