LIVE|GOOGLE-EXTENDED|v.1

AI Training Posture

Control Whether Gemini Trains On Your Site

Always-on monitoring of Google's separately controllable AI-training token. Google-Extended governs Gemini Apps and Vertex AI training and grounding. Per Google: it does not impact Search inclusion or ranking. Sentry catches directive drift between intent and configuration. Cortex handles the fix.

sentry.google-extended.live● 3 min ago
03:30:00GET https://capconvert.com/robots.txt
03:30:00200 OK · text/plain · 1.2 KB
03:30:01Parsing 6 Google-Extended rules...
03:30:02 PASS policy_decision_made (explicit Allow)
03:30:03 PASS googlebot_unaffected
03:30:04 PASS meta_robots_consistent
03:30:05 PASS cloudflare_rule_consistent
03:30:06 WARN documented_intent (policy page missing)
03:30:07 PASS gemini_visibility_check
03:30:08 Score 5/6 · Grade A · optimized
Gemini Training Governance

Continuous AI-Training Policy Monitoring

Continuous audits of your Google-Extended posture against the 6 things that matter for Gemini training and grounding inclusion. Per Google's docs: 'Google-Extended does not impact a site's inclusion in Google Search nor is it used as a ranking signal.' This is a pure policy decision. Sentry catches drift between intent and configuration. Cortex fixes it.

RULE · 1

policy_decision_made

Explicit Allow or Disallow

robots.txt contains an explicit Allow or Disallow for User-agent: Google-Extended, not implicit. Per Google's docs: Google-Extended has no separate HTTP user-agent string; it is a robots.txt token only.

RULE · 2

googlebot_unaffected

Googlebot directive unchanged

User-agent: Googlebot still allows crawl. Verbatim from Google's docs: 'Google-Extended does not impact a site's inclusion in Google Search nor is it used as a ranking signal.' Blocking it is purely a Gemini-training decision.

RULE · 3

meta_robots_consistent

Meta-robots aligned

Page-level `nosnippet` and `max-snippet` directives align with the Google-Extended choice. AI Overviews are suppressed when nosnippet is set, regardless of Google-Extended posture; mixed signals confuse the grounding pipeline.

RULE · 4

cloudflare_rule_consistent

Cloudflare rule consistent

If a Cloudflare 'Block AI Training' managed rule is active, it does not contradict the robots.txt Google-Extended directive. A WAF Allow alongside a robots.txt Disallow (or vice versa) creates a confusing posture for both Google and downstream auditors.

RULE · 5

documented_intent

Public AI-content policy

A public page on the site documents the AI-training stance. Increases transparency for users and downstream platforms parsing the policy. Not required by Google, but recommended for any GEO-conscious brand.

RULE · 6

gemini_visibility_check

Live Gemini test confirms intent

A scripted Gemini test query reflects the chosen posture: cited if Allow, absent if Disallow. Catches infrastructure misconfiguration that contradicts the robots.txt declaration.

AI Training Posture

Free Google-Extended Checker

Paste your homepage URL. Sentry verifies the Google-Extended directive, confirms Googlebot remains unaffected, cross-checks Cloudflare and meta-robots posture, and pings Gemini for live behavior, then ships a per-rule report. No signup, instant results, always free.

Comprehensive auditInstant resultsCompletely free
Instant

Audit in under a minute

Sentry fetches your site, runs every GOOGLE-EXTENDED rule, and renders the full result page before your next sip of coffee.

Actionable

Every failure gets a fix

Each failed rule ships with a prescription paragraph. Hand it to engineering and the gap is closed before lunch.

Ongoing

Locked in for the long haul

Add your site to the daily Sentry sweep with one click. New regressions get caught the next morning.

6 rules in the GOOGLE-EXTENDED Sentry. Daily 3:30 AM ET sweep.

Govern AI Training Access

Stop Guessing. Start Seeing. Get Cortex.

One brain. Thirty-six pairs of eyes. Sentries monitor every visibility signal that decides whether search engines, AI engines, and ad platforms show you. Cortex reads what they see, weighs it against a unified corpus of platform documentation, and acts. Every move follows a defined decision protocol: action stated, reason given, impact named.

250
Ranking signals
30
Sentries
60
Platforms
Daily
Always-on
llms.txtai-citationsai-crawlersbrand-pulsetitle-tagsmeta-descstructuredsitemapcore-vitalspage-speedaccessibilitydomain-agehttpsaboutauthorsbacklinksmentionsreviewsinternalgbpnapyelptrusthelpfultopicalfirst-handcronavboostfreshnesshreflangimage-seopage-qualitycanniballlm-outputtrackingssl-tlsCORTEXdecision engine