Technical Methodology

How GenSight.AI
Measures AI Visibility.

A complete guide to the deterministic scoring engine that maps how AI models like ChatGPT, Gemini, and Perplexity perceive, rank, and recommend your brand.

1. Why Traditional SEO Isn't Enough Anymore

Search engines crawl documents for keywords and backlinks. Large Language Models (LLMs) work differently - they plot tokens in a multi-dimensional latent space and retrieve answers based on semantic proximity, not page rank.

When someone asks ChatGPT or Perplexity to recommend a product in your category, the AI searches for your brand's Semantic Anchor - the mathematical position your entity occupies in the model's internal representation of the world. If your brand relies on generic industry terms, or if your intellectual property is locked inside video and social media platforms (what we call the Walled Garden Effect), the AI can't find you. It recommends your competitors instead.

Generative Engine Optimization (GEO) is the practice of structuring your brand's digital presence so that AI models can discover, understand, and cite you accurately. GenSight.AI is the diagnostic tool that measures how well your brand is currently optimized for this new reality. Have questions about GEO? See our FAQ →

The Zero-Trust Baseline

GenSight assumes your brand is completely invisible to AI until mathematical proof of your semantic presence is established.

  • Unverified Entity (0 – 30%)
  • Verified Foundational Marker (50%)
  • Dominant Semantic Vector (85%+)

2. The 31-Node Deterministic Scoring Matrix

Most AI audit tools simply ask an LLM to subjectively "score" a website, which produces unreliable, hallucinated results. GenSight takes a fundamentally different approach. We constrain an LLM to act strictly as a Boolean Data Extractor - it infers 31 specific technical signals about your brand (true or false), while our backend forcefully overwrites its assumptions with live, mathematical network verifications. The final score is calculated entirely by our deterministic algorithms, not the AI.

Entity Identity & Verification

Measures how explicitly your brand binds its URL to a machine-readable identity across the open web. This includes structured data (JSON-LD Organization or Person schema), Wikidata entity presence, canonical URL configuration, and whether your brand name is semantically unambiguous.

hasOrgSchema / hasPersonSchemaT/F
hasWikidataT/F
isUnambiguousEntityT/F
hasCanonicalUrlT/F

RAG Readiness & Scrapability

Evaluates whether AI crawlers can actually access and parse your content. This covers robots.txt AI-bot permissions, the presence of an llms.txt file, server-side rendering, semantic HTML structure, structured data tables, and whether your content uses formats that Retrieval-Augmented Generation systems can chunk effectively.

robotsTxtAllowsAiT/F
hasLlmsTxtT/F
isServerSideRenderedT/F
hasSemanticHtmlArchitectureT/F

Competitive Authority & Citations

Calculates your relative citation strength against competitors. This uses live Google Search grounding to verify whether your brand currently appears in the top 5 results for your niche, and whether you displace your nearest competitor in AI-generated recommendations.

top5InGemini (Live Search)T/F
displacesNearestCompetitorT/F
hasHighDrPrCitationsT/F
isIncludedInComparisonListersT/F

How verification works

Of the 31 signals, up to 17 are verified deterministically in Enterprise mode (14 in Influencer mode) - we crawl your actual HTML, parse your robots.txt, query the Wikidata API, check for JSON-LD schema types (including Organization, Person, Product/Service, FAQ, and C-Suite schemas), and detect social media link presence and breadcrumb structures. These verified signals forcefully overwrite whatever the LLM infers. The remaining signals are extracted by the LLM acting as a constrained data extractor, not a grader. The final score is a weighted sum calculated by our math-first scoring algorithm, with a confidence zone of approximately ±3 points to account for the probabilistic signals.

Mode 1 Enterprise Engine
Mode 2 Creator Engine
// Separate boolean sets per mode
if (mode === 'enterprise') {
  eval(hasCrunchbase);
  eval(hasCSuiteSchema);
  eval(hasProprietaryFrameworks);
} else { // influencer
  eval(hasPersonSchema);
  eval(hasCreatorModeLinkedIn);
  eval(hasPublishedBookOrNewsletter);
}

3. Dual-Pipeline Processing

A corporate SaaS product and a personal brand influencer have fundamentally different AI footprints. GenSight runs two entirely separate 31-boolean evaluation matrices depending on entity type.

The Enterprise Pipeline audits technical infrastructure such as llms.txt files, Crunchbase presence, C-suite schema markup, and server-side rendering. The Influencer Pipeline evaluates personal schema, creator-mode LinkedIn profiles, published books or newsletters, podcast interview transcripts, and social handle uniformity.

For influencers who only submit a social media handle, the engine deploys a Pre-Flight Canonical Resolver - it live-searches the web to discover their actual owned domain. If they truly only exist on walled platforms (Instagram, TikTok, YouTube), the engine applies a Walled Garden Penalty that mathematically reduces their technical scores until they syndicate their content into open-web, machine-readable text formats.

Competitive Intelligence

4. Competitive Displacement Analysis

Your AI visibility score only matters relative to your competitors. GenSight identifies and ranks the top 5 entities that currently dominate your niche in LLM-generated citations.

For each competitor, the engine calculates an estimated gap score - how many points ahead or behind they are in the semantic space. It identifies the specific content artifacts and authority markers each competitor uses to win citation share, providing you with a precise displacement strategy rather than vague advice. In Semantic Bridge mode, this analysis is contextualised to the intersection of your brand and target niche, so competitors are relevant to your specific expansion goal.

The Market Visibility Ranking shows your position among these competitors. When your brand ranks below all five analyzed competitors, the dashboard clearly indicates this is a sample-based ranking - there may be additional competitors not shown.

What the displacement analysis reveals

Market Visibility Ranking - where your brand sits among the top 5 competitors in LLM citation space.

Gap Scores - the exact percentage distance between you and each competitor.

Displacement Triggers - the specific information gain or content artifact each rival uses to outrank you.

Latent Space Neighbors - the entities AI mathematically clusters with your brand based on shared semantic vectors.

Net Perception Score
+82
Positive Bias
Dynamic Confidence Vectors
"Technical Accuracy"92%
"Integration Ease"65%
"Pricing Complaints"15%
Perception Engine

5. LLM Bias & Semantic Perception

AI models don't just return links - they synthesize opinions. When a user asks ChatGPT about your product, the response carries a positive, negative, or neutral framing drawn from the model's training data.

GenSight extracts this bias mathematically. Rather than asking the AI "what do you think?", the engine applies Dynamic Topic Modeling to identify the specific subjects the AI associates with your brand (such as "Platform Scalability" or "Pricing Complaints"), assigns confidence scores to each, and computes a net perception score across positive, neutral, and negative distributions.

For entities with insufficient training data, the engine returns "Insufficient Semantic Data" rather than fabricating sentiment - a deliberate anti-hallucination safeguard.

Content Gap Matrix
API Scalability Heavy
Dominated by: Stripe Syndicate to: techcrunch.com
Workflow Automation Moderate
Dominated by: Zapier Syndicate to: g2.com
Information Gain

6. Content Gap Mapping with Syndication Targeting

LLMs don't reward generic marketing copy - they prioritize Information Gain, meaning content that provides data, analysis, or structured knowledge the model can't easily find elsewhere.

The engine scans the semantic footprint of your top competitors to identify specific topics where they dominate AI citations but you don't. For each gap, it identifies the dominant competitor who currently owns that topic, the specific content artifact you need to create (whitepaper, FAQ page, comparison guide), and a content brief describing exactly what to produce.

Most uniquely, each gap is cross-referenced with the AI Influence Graph to recommend a syndication target - the specific high-authority platform where publishing your content will have the most impact on AI citations. This connection between "what to create" and "where to publish it" is what transforms a diagnostic report into an actionable displacement strategy.

Citation Engineering

7. AI Influence Graph

When ChatGPT or Perplexity recommends products in your category, those recommendations aren't pulled from random web pages. They're anchored to specific high-authority domains that the model learned to trust during training and through real-time retrieval.

The AI Influence Graph uses live Google Search grounding to identify the exact platforms, review aggregators, publications, and forums that AI models currently rely on for your specific niche. Each citation node includes the domain type (Review Aggregator, Major Publication, Forum), the competitors already present there, a specific action plan for how to establish your presence, and an effort level estimate.

Think of it as a verified hit-list for your PR and content team - the five places where establishing a footprint will most directly influence what AI says about your brand.

Target Niche: "Enterprise SaaS"
g2.com
G2 Software Reviews
Review Aggregator
reddit.com
r/SaaS Community
Forum
techcrunch.com
Startup News & Funding
Publication
BRAND
TARGET
VECTOR GAP: 47 POINTS
Premium Capability

8. Semantic Bridging

Your baseline score measures visibility in your current niche. But what if you want to expand into a new market? A brand might score 85% for "CRM software" but only 12% for "AI-powered sales automation."

Semantic Bridging recalculates the entire 31-node matrix through the lens of a target concept. It measures the vector distance between your current footprint and the desired cluster, then generates the exact architectural steps needed to make the AI associate your brand with the new niche. The competitive displacement and influence graph analyses within a bridge are contextualised to the intersection of your brand and the target niche - not just the pure target market leaders.

Each bridge search is timestamped and tied to the entity, so you can track whether your bridging efforts are working over time. Bridges older than 30 days are flagged as potentially stale, prompting a re-evaluation.

Premium Capability

9. Score Timeline & Multi-Entity Portfolio

GEO is not a one-time fix - it's an ongoing process. The GenSight dashboard provides a visual timeline showing how your deterministic score has changed over each audit period, with delta indicators showing the exact point change between runs.

Premium subscribers can click on any historical audit to view a complete snapshot of that moment in time - including which semantic bridges were active at that point. This makes it easy to correlate specific actions (such as publishing a whitepaper or securing a G2 listing) with measurable score improvements.

The Multi-Entity Portfolio lets you manage multiple brands or domains from a single dashboard. Each entity maintains its own audit history, bridge campaigns, and timeline - useful for agencies managing several clients or companies with multiple product lines.

Premium Dashboard Features

Score Timeline - last 10 audit periods with delta tracking and confidence zones.

Score Breakdown Panel - transparent view of all 31 signals grouped by pillar, with pass/fail indicators and verified vs. AI-inferred labels.

Enhanced Roadmap - each recommendation tagged with an owner (Technical, Content, or PR), linked scoring signals, estimated score impact, and expandable implementation steps.

Historical Snapshots - click any past audit to view that exact moment's dashboard, including time-scoped bridges.

Entity Portfolio - add and switch between brands, each with independent audit histories.

Bridge Freshness - age indicators on semantic bridge campaigns so you know when to re-evaluate.

Run your baseline evaluation.

See exactly how the scoring matrix evaluates your brand today.

Initialize Audit