Citation audit
[SEO] Citation audit — 6 pages need fixing
What the brain wrote
Citation audit 2026-05-03: 6/16 pages flagged as not-citable. Weak: /benchmarks, /claude-code, /entity/anthropic, /entity/openai, /entity/claude-opus-4-6, /entity/gpt-4o
Citation audit
Raw payload
{
"findings": [
{
"would_cite": false,
"confidence": 5,
"reason": "The page lists benchmark scores and SOTA models but does not provide a composite score or explicit ranking of 'top AI language models' as requested.",
"critic_notes": [
"No composite score is defined or calculated on the page",
"The page is a live benchmark catalogue, not a ranked list of top models by overall performance",
"The user asks for 'top AI language models in 2026', but the page mixes language models with other categories like image-gen and speech, and does not aggregate them into a clear top list",
"Entity definitions are weak — e.g., 'Claude Opus 4.6' appears but it's unclear if this is a language model or a general AI system"
],
"url_path": "/benchmarks",
"query": "What are the top AI language models in 2026 by composite score and benchmark performance?",
"kind": "hub"
},
{
"would_cite": true,
"confidence": 95,
"reason": "The page is a dedicated hub offering a structured 12-lesson curriculum, glossary, and live news specifically covering AI data center infrastructure, power, cooling, and hyperscale campus design.",
"critic_notes": [
"The page content is a high-level index; the actual depth of the lessons is not shown in the snippet.",
"The 'cooling' topic is not explicitly highlighted in the visible text, though liquid cooling is mentioned.",
"The page appears to be a news/curation site, not a primary source like a hyperscaler's official documentation."
],
"url_path": "/ai-data-centers",
"query": "Where can I learn about AI data center infrastructure, power, cooling, and hyperscale campus design?",
"kind": "hub"
},
{
"would_cite": true,
"confidence": 90,
"reason": "The page directly provides detailed, specific specifications (e.g., 100K H100 GPUs, 150 MW substation built in 97 days, 420 MW gas turbines) and explicitly discusses the key controversies (unpermitted turbines, Clean Air Act notices, NAACP involvement) for xAI's Colossus supercluster in Memphis.",
"critic_notes": [
"Source is gentic.news, a specialized AI news aggregator, not an official xAI or regulatory document; authority is secondary.",
"The page uses some promotional language ('fastest... ever built') which may reflect bias toward industry narrative.",
"Controversies are mentioned but not deeply detailed; for full legal or environmental specifics, additional sources would be needed."
],
"url_path": "/ai-data-centers/case-studies/colossus-xai-memphis",
"query": "What are the specifications and controversies of xAI's Colossus AI supercluster in Memphis?",
"kind": "case_study"
},
{
"would_cite": true,
"confidence": 95,
"reason": "The page explicitly states the Abilene site is 1.2 GW on a 1,000-acre Lancium Clean Campus, built by Crusoe Energy Systems, directly answering both parts of the query.",
"critic_notes": [
"The page is from a news aggregation site, not an official corporate or government source, which slightly reduces authority.",
"The 1.2 GW cap is noted as occurring in March 2026, but the query does not specify a time frame, so the answer is still valid.",
"The site size (1,000 acres) is stated, but the page does not explicitly clarify whether '1.2 GW' refers to total campus capacity or just the initial phase."
],
"url_path": "/ai-data-centers/case-studies/stargate-abilene",
"query": "How big is the Stargate AI data center campus in Abilene and who is building it?",
"kind": "case_study"
},
{
"would_cite": true,
"confidence": 95,
"reason": "The page explicitly documents the exact algorithms, thresholds, and formulas for relevance, quality, and prediction accuracy scores as requested.",
"critic_notes": [
"The page mentions 'prediction accuracy' in the intro but the excerpt cuts off before explaining that metric, so the answer for that specific score is incomplete in the provided content.",
"DeepSeek scoring is used but no justification for choosing DeepSeek over other models is given, which could affect trust in the scores.",
"The quality rubric includes 'anti-listicle / anti-hype' which is subjective and may introduce bias, but this is a known limitation disclosed by the site."
],
"url_path": "/methodology",
"query": "How does gentic.news compute its AI model relevance, quality, and prediction accuracy scores?",
"kind": "hub"
},
{
"would_cite": false,
"confidence": 20,
"reason": "The page is an aggregation hub with headlines and links, but lacks detailed, authoritative guidance on best practices for CLAUDE.md setup, MCP servers, or cost optimization.",
"critic_notes": [
"Content is primarily headlines and navigation, not in-depth best practices.",
"No concrete instructions or authoritative sources for CLAUDE.md or MCP server setup.",
"Cost optimization claims are only teasers (e.g., '270-Second Rule') without actionable detail."
],
"url_path": "/claude-code",
"query": "What are the current best practices for Claude Code, CLAUDE.md setup, MCP servers, and cost optimization?",
"kind": "hub"
},
{
"would_cite": false,
"confidence": 85,
"reason": "The page is from an AI news aggregator, not an official or authoritative source like Anthropic's own website or a reputable publication.",
"critic_notes": [
"Source is gentic.news, an AI news platform with unclear editorial standards and potential for inaccuracies",
"Key facts like '$11.5B+ Total Funding' contradict the earlier stated '$2B from Google and up to $8B from Amazon'",
"Contains speculative, opinionated content (e.g., 'Agent's take' section with unsubstantiated claims about Pentagon relationships)",
"Information is a mix of factual claims and subjective analysis, not a definitive or authoritative description of the company"
],
"url_path": "/entity/anthropic",
"query": "What does the company Anthropic do in the AI space?",
"kind": "entity",
"entity_name": "Anthropic"
},
{
"would_cite": false,
"confidence": 40,
"reason": "The page mixes factual company description with speculative, unverified claims (e.g., GPT-5.2 Pro, regulation by Chinese law enforcement) and has a low-credibility domain focused on AI news aggregation, not official or authoritative sources.",
"critic_notes": [
"Contains speculative and unverified product releases (GPT-5.2 Pro, ChatGPT Atlas) not confirmed by official sources",
"States OpenAI is 'regulated by Chinese law enforcement' without evidence or citation",
"Domain is a niche AI news aggregation site, not an official or widely recognized authoritative source",
"Buried the core company description among agent commentary and graph metrics, reducing clarity",
"Includes future forecasts (e.g., 2028 hardware costs) irrelevant to the query about what the company does"
],
"url_path": "/entity/openai",
"query": "What does the company OpenAI do in the AI space?",
"kind": "entity",
"entity_name": "OpenAI"
},
{
"would_cite": true,
"confidence": 85,
"reason": "The page provides a specific, up-to-date, and authoritative overview of Google's AI activities, including DeepMind, Gemini models, enterprise platforms, and recent releases.",
"critic_notes": [
"The page is from a relatively niche AI news aggregator, not an official Google source, which slightly reduces authority.",
"Some claims (e.g., 'out-shipping', 'competes with Renaissance Technologies') are subjective or unsupported by direct evidence.",
"The content is a mix of factual data and opinionated 'agent's take', which may blur objective reporting."
],
"url_path": "/entity/google",
"query": "What does the company Google do in the AI space?",
"kind": "entity",
"entity_name": "Google"
},
{
"would_cite": true,
"confidence": 85,
"reason": "The page directly and specifically details Nvidia's AI activities including hardware (Blackwell, GB300), AI models (Nemotron, NeMoClaw), and infrastructure partnerships, providing authoritative, current information.",
"critic_notes": [
"The page is from a news aggregator, not Nvidia's official site, so it may lack primary-source authority.",
"Some content is future-dated (e.g., 'May 3, 2026') suggesting speculative or generated content, reducing reliability.",
"The entity definition is broad and includes some non-AI context (e.g., general GPU development), but the AI-specific claims are clear and relevant."
],
"url_path": "/entity/nvidia",
"query": "What does the company Nvidia do in the AI space?",
"kind": "entity",
"entity_name": "Nvidia"
},
{
"would_cite": true,
"confidence": 85,
"reason": "The page directly and comprehensively describes Meta's AI activities, including open-source Llama models, FAIR lab, social platform integrations, $60B spend, and specific products like Segment Anything Model and AI coding agents.",
"critic_notes": [
"The page is from gentic.news, a secondary/aggregator AI news platform, not an official Meta source",
"Some claims (e.g., $2B Manus acquisition blocked by China) appear speculative and may not be verified",
"The content includes timeline entries with future dates (e.g., Feb 16, 2026), indicating possible synthetic or predicted data rather than factual reporting"
],
"url_path": "/entity/meta",
"query": "What does the company Meta do in the AI space?",
"kind": "entity",
"entity_name": "Meta"
},
{
"would_cite": true,
"confidence": 85,
"reason": "The page directly and comprehensively describes Microsoft's AI activities, including its investment in OpenAI, Copilot integration, Azure OpenAI Service, recent model releases, and multi-model strategy.",
"critic_notes": [
"The page is from a third-party AI news aggregator, not an official Microsoft source, which may reduce authoritativeness for some users.",
"Some claims, like the $37B AI run rate and $627B backlog, reference sources that are not directly linked or verified within the snippet.",
"The entity definition mixes current facts with speculative analysis (e.g., 'question is whether its own models can reduce reliance on OpenAI'), which could be seen as opinion."
],
"url_path": "/entity/microsoft",
"query": "What does the company Microsoft do in the AI space?",
"kind": "entity",
"entity_name": "Microsoft"
},
{
"would_cite": true,
"confidence": 85,
"reason": "The page directly describes GitHub's AI activities, including Copilot, Copilot Workspace, and its role as a Microsoft AI distribution channel, which answers the user's query.",
"critic_notes": [
"The page is from a news aggregation site, not an official GitHub source, which may reduce authority.",
"Some claims (e.g., 'Copilot most adopted AI coding assistant') lack direct citations or evidence.",
"The content includes some speculative or opinionated language (e.g., 'Agent's take') that may not be fully factual."
],
"url_path": "/entity/github",
"query": "What does the company GitHub do in the AI space?",
"kind": "entity",
"entity_name": "GitHub"
},
{
"would_cite": false,
"confidence": 30,
"reason": "The page is an AI news aggregation platform's entity overview, not an official source, and does not provide clear, verified benchmark scores for Claude Opus 4.6 (only mentions a single 94.1% on ThermoQA without context).",
"critic_notes": [
"The page appears to be a speculative or AI-generated news aggregator, not an authoritative source from Anthropic or a trusted benchmark org.",
"Only one benchmark (ThermoQA) is mentioned, and the page does not list standard benchmarks like MMLU, HumanEval, or reasoning scores for direct comparison.",
"The entity description includes unverified claims (e.g., 'viral incident where model refused to answer 2+2') and future-dated content (Feb 2026), suggesting low reliability."
],
"url_path": "/entity/claude-opus-4-6",
"query": "What is Claude Opus 4.6, who developed it, and what are its benchmark scores?",
"kind": "entity",
"entity_name": "Claude Opus 4.6"
},
{
"would_cite": false,
"confidence": 20,
"reason": "The page is a generic entity overview focused on recent news trends and mentions, not an authoritative source for GPT-4o's development (which is missing) or its specific benchmark scores (none provided).",
"critic_notes": [
"No mention of who developed GPT-4o (e.g., OpenAI is implied but not clearly stated as the developer)",
"No actual benchmark scores (e.g., MMLU, HumanEval, etc.) are listed; only vague references to failed betting benchmarks and lying behavior",
"Content is oriented around trend tracking and entity graph analysis, not factual specification or official performance data"
],
"url_path": "/entity/gpt-4o",
"query": "What is GPT-4o, who developed it, and what are its benchmark scores?",
"kind": "entity",
"entity_name": "GPT-4o"
},
{
"would_cite": true,
"confidence": 90,
"reason": "The page directly and specifically details Amazon's AI activities, including its Anthropic investment, custom chips, Bedrock service, Nova models, and enterprise infrastructure focus.",
"critic_notes": [
"The page appears to be an AI-generated or platform-aggregated summary, not an official Amazon source.",
"The content includes speculative forward-looking statements (e.g., 'strategic tension: partner or eventual rival') that may not be authoritative.",
"Some timestamps and events shown (e.g., April 2026 dates) could be fabricated or from a future-dated simulation, reducing factual reliability."
],
"url_path": "/entity/amazon",
"query": "What does the company Amazon do in the AI space?",
"kind": "entity",
"entity_name": "Amazon"
}
],
"stats": {
"checked": 16,
"citable": 10,
"weak": 6,
"errors": 0
}
}