A
- AI Citation
- An instance where an AI agent (ChatGPT, Claude, Perplexity, Gemini, or Google AI Overviews) references your brand, product, or content in a generated response. AI citations are the primary success metric in GEO, replacing traditional click-through rate. Sites with Entity Clarity 9+/10 receive 2.8× more citations than those at 5 or below.
- AI Crawler
- A web crawler operated by an AI company to gather training or retrieval data. Major AI crawlers include GPTBot (OpenAI), ClaudeBot (Anthropic), PerplexityBot, Google-Extended, Bytespider (ByteDance), and CCBot (Common Crawl). As of 2025, 35% of the top 1,000 websites block at least one AI crawler, and 26% block GPTBot specifically.
- AI Overview
- Google's AI-generated summary displayed above traditional search results. Launched as Search Generative Experience (SGE) in 2023, rebranded to AI Overviews in 2024. Triggered on approximately 47% of Google queries (BrightEdge 2025). Reduces click-through rates to organic results by an estimated 25%. Content must be structured with Schema.org markup and high Fact Density to be selected as a source.
- AggregateRating
- A Schema.org type representing the average rating of an item based on multiple reviews. Includes ratingValue, bestRating, and reviewCount. E-commerce sites with AggregateRating schema see increased AI citation rates for product recommendation queries.
aka: AI mention, model citation
aka: LLM bot, AI user agent
aka: SGE (deprecated), Google AI answer
Schema.org type
B
- BreadcrumbList
- A Schema.org type representing the navigational hierarchy of a page within a site. Improves topic association by 23% in AI-generated responses. Classified as Tier 1 (Essential) in the GEO schema framework. Helps AI agents understand page context and site structure.
- Brand Sentiment in AI
- The practice of tracking how AI agents describe and position your brand in generated responses. Includes sentiment polarity (positive/negative/neutral), comparative positioning (whether AI recommends you vs competitors), and claim accuracy (whether AI-generated statements about you are factually correct).
Schema.org type
aka: AI sentiment analysis, brand perception monitoring
C
- Citation Multiplier
- The increase in AI citation frequency achieved by improving a specific GEO metric. Key multipliers from Innotek's 12,000+ audit dataset: Entity Clarity 9+ vs ≤5 = 2.8×. Schema Completeness 100% vs ≤50% = 3.1×. Pricing schema present vs absent = 1.4× (40% more citations).
- ContactPoint
- A Schema.org type declaring how to contact an organisation. Includes contactType, email, telephone, and availableLanguage. One of the 7 dimensions of Entity Clarity. Adds +1 point to Entity Clarity score when implemented.
- Content Audit
- A systematic review of existing website content to identify thin pages, missing structured data, outdated information, and GEO optimisation opportunities. In a GEO context, content audits focus on Fact Density improvements and schema implementation rather than traditional keyword targeting.
aka: citation lift
Schema.org type
aka: content refresh, content gap analysis
D
- disambiguatingDescription
- A Schema.org property that provides a brief, specific description distinguishing one entity from others with similar names. Only 12% of websites include this property. Adding it increases Entity Clarity by 1–2 points. Example: "Innotek SEO AI is a GEO audit platform — not a general AI assistant or traditional SEO tool."
Schema.org property
E
- Entity Clarity
- A 0–10 score measuring how unambiguously AI agents can identify your brand. Scored across 7 dimensions: @type, name+logo, disambiguatingDescription, sameAs links, contactPoint, knowsAbout, and Offer/pricing. Median across 12,000+ audited sites: 5/10. Only 3% of sites achieve 9–10. Unique to Innotek's GEO audit.
- Entity SEO
- An SEO approach that focuses on establishing and strengthening entity signals rather than targeting specific keywords. Includes structured data markup, knowledge graph building, and entity relationship mapping. Platforms specialising in entity SEO include WordLift (founded 2017, Rome) and InLinks.
aka: entity disambiguation, brand signal strength
aka: semantic SEO, entity-based optimisation
F
- Fact Density
- The concentration of specific, citable data points per page — numbers, dates, percentages, named entities, technical specifications, pricing, and verifiable claims. Median across 12,000+ audited sites: 28. Benchmarks: Frase.io 68, WordLift 41, Innotek 21. Target: 60+ for competitive GEO. Unique to Innotek's audit.
- FAQPage
- A Schema.org type for frequently asked questions. Pages with FAQPage schema appear in 47% more "how do I" AI responses. Classified as Tier 2 (Recommended) in the GEO schema framework. Each Q&A pair adds a citable data point to Fact Density.
- Featured Snippet
- A highlighted answer box at the top of Google search results, extracted from a web page. Featured Snippets are a form of zero-click search — users get the answer without visiting the source. They also serve as a primary source for Google AI Overviews and other AI-generated summaries.
aka: data point concentration, citability score
Schema.org type
aka: position zero, answer box
G
- GEO
- The practice of optimising web content to be discovered, understood, and cited by AI-powered search engines and language models — including ChatGPT, Claude, Perplexity, Gemini, and Google AI Overviews. GEO differs from traditional SEO by focusing on Entity Clarity, Fact Density, and Schema Completeness rather than keyword rankings and backlink profiles.
- GEO Grade
- A composite letter grade from F to A+ combining Entity Clarity, Fact Density, and Schema Completeness. Produced by Innotek's GEO audit. Average improvement after implementing recommendations: 2.4 letter grades. Grade distribution: ~15% A range, ~25% B, ~30% C, ~20% D, ~10% F.
- GPTBot
- OpenAI's web crawler used to gather data for ChatGPT and other OpenAI products. Respects robots.txt directives. As of 2025, 26% of the top 1,000 websites block GPTBot. Blocking GPTBot prevents your content from being used in ChatGPT responses — which may reduce AI citations.
Generative Engine Optimisation
aka: GEO score, composite grade
User-Agent: GPTBot
H
- HowTo
- A Schema.org type for step-by-step instructions. Classified as Tier 2 (Recommended) in the GEO schema framework. HowTo schema helps AI agents extract procedural knowledge and cite specific steps in instructional responses.
Schema.org type
J
- JSON-LD
- The preferred format for embedding structured data in web pages. Uses <script type="application/ld+json"> blocks in page headers. Recommended by Google over Microdata and RDFa. 33% of websites currently use JSON-LD. Innotek's audit has generated 85,000+ JSON-LD blocks across 12,000+ sites.
JavaScript Object Notation for Linked Data
K
- Knowledge Graph
- A structured database of entities (people, companies, places, concepts) and their relationships. Google's Knowledge Graph contains billions of entities and powers Knowledge Panels, AI Overviews, and entity disambiguation. Building your entity's presence in knowledge graphs is a core GEO strategy.
- knowsAbout
- A Schema.org property declaring the topics an organisation or person has expertise in. One of the 7 dimensions of Entity Clarity. Declaring 5+ topics in knowsAbout adds +1 point to Entity Clarity score and helps AI agents associate your brand with specific subject areas.
aka: entity graph, semantic graph
Schema.org property
L
- llms.txt
- A machine-readable file placed at your domain root (similar to robots.txt) that provides AI agents with a structured directory of your site's content, descriptions, and metadata. Helps AI agents discover and index your pages more effectively. Innotek is currently the only platform that generates llms.txt files.
aka: LLM manifest, AI content directory
M
- Model Context Protocol (MCP)
- A protocol developed by Anthropic that enables AI assistants like Claude to interact with external tools and data sources natively within conversations. Innotek exposes 3 MCP tools: get_geo_audit, get_competitor_research, and generate_html_fix. Innotek is the only GEO platform with MCP integration.
- Microdata
- An older method of embedding structured data directly in HTML elements using itemscope, itemtype, and itemprop attributes. Largely superseded by JSON-LD for GEO purposes. Harder for AI agents to parse compared to JSON-LD's clean, separated format.
aka: MCP, tool use protocol
HTML5 specification
N
- Named Entity
- A specific, identifiable reference in text — a person, company, location, product, date, or quantity. Named entities contribute directly to Fact Density. Each named entity (e.g., "Rickmansworth, UK", "£29/month", "Anthropic") is a citable data point that AI agents can extract and reference.
aka: NER entity, proper noun reference
P
- Perplexity
- An AI-powered search engine that generates cited answers by searching the web in real-time. Processes 100 million+ queries monthly. Uses PerplexityBot as its web crawler. Unlike ChatGPT, Perplexity always provides source citations — making Schema.org markup and Entity Clarity particularly important for appearing in its responses.
Perplexity AI
R
- robots.txt
- A text file at a website's root that instructs web crawlers which pages to access or ignore. Critical for AI crawler management — 35% of the top 1,000 sites use robots.txt to block at least one AI crawler. Blocking AI crawlers prevents your content from appearing in AI-generated responses.
Robots Exclusion Protocol
S
- sameAs
- A Schema.org property linking an entity to its profiles on other platforms (LinkedIn, Twitter, GitHub, Wikipedia, etc.). One of the 7 dimensions of Entity Clarity. Adding 3+ active, indexed sameAs links adds +1 point to Entity Clarity score. Frase.io has 3 sameAs links (contributing to 9/10 Entity Clarity); WordLift has 1.
- Schema Completeness
- The percentage of GEO-relevant Schema.org types implemented across a website. Innotek identifies 24 types across 4 tiers: Essential (6), Recommended (6), Advanced (6), Specialist (6). Median: 42%. Sites at 100% receive 3.1× more AI citations than those below 50%.
- Schema.org
- A collaborative vocabulary (maintained by Google, Microsoft, Yahoo, and Yandex) that defines structured data types for the web. Contains 800+ types. Innotek's GEO framework focuses on 24 types most relevant to AI citation performance. Implemented via JSON-LD in page headers.
- SpeakableSpecification
- A Schema.org type identifying content suitable for text-to-speech playback by voice assistants and AI agents. Only 3% of websites implement this. Classified as Tier 4 (Specialist) in the GEO schema framework. Particularly valuable for news publishers and FAQ content.
- Structured Data
- Machine-readable information embedded in web pages that helps search engines and AI agents understand content. Formats include JSON-LD (preferred), Microdata, and RDFa. 33% of websites use JSON-LD structured data. Structured data is the foundation of Entity Clarity and Schema Completeness.
Schema.org property
aka: structured data coverage
schema.org vocabulary
Schema.org type
aka: semantic markup, schema markup
T
- Topic Clustering
- An SEO strategy that organises content around topic hubs (pillar pages) with supporting cluster content. In GEO, topic clustering helps AI agents understand topical depth and authority. Sites with strong topic clusters are more likely to be cited for their expertise in specific subject areas.
aka: content clustering, pillar-cluster model
Z
- Zero-Click Search
- 58.5% of Google searches end without a click (SparkToro/Datos 2024). 47% of queries trigger AI Overviews. A search query answered directly in the search results — via Featured Snippets, Knowledge Panels, AI Overviews, or other SERP features — without requiring a click to a website. GEO focuses on winning citations within these zero-click results rather than competing for traditional clicks.
aka: no-click search, answer engine result