top of page

Metehan: Google's AI Search architecture

  • Autorenbild: th3s3rp4nt
    th3s3rp4nt
  • 8. Dez. 2025
  • 2 Min. Lesezeit

Key Takeaways:

  • Metehan discovered in Google's Discovery Engine 7 ranking signals that likely power AI Mode (Vertex AI search)

    1. Base Ranking → core algorithm output

    2. Gecko Score → embedding similarity

    3. Jetstream → cross-attention model (handles negation better than embeddings)

    4. BM25 → yes, keyword matching still matters

    5. PCTR → 3-tier engagement system

    6. Freshness → time-sensitive query adjustment

    7. Boost/Bury → manual business rules

  • His key takeaways:

    • Ancestor headings travel WITH each chunk  

    • Prediction is here. PCTR unlocks in tiers: Popularity → PCTR → Personalized PCTR

    • Personalized ranking only activates after 100K+ queries

  • Query Suggestions Model Options:

    • Automatic: System decides the best source

    • Document: Suggestions derived from indexed document content

    • Search History: Suggestions based on past searches

    • User Events: Suggestions driven by user interaction data

    • Completable Fields: Suggestions from schema-defined completable attributes

  • Actionable insights:

    1. Optimize for grounded synthesis. The LLM is constrained to ground answers in retrieved content. Your content needs clear, extractable statements.

    2. Avoid adversarial patterns. Content that triggers adversarial detection gets filtered. Write straightforwardly without manipulation patterns.

    3. Maintain high relevance density. Low relevance content gets filtered. Every section should contribute to query relevance.

    4. Structure for multi-turn. AI Mode supports follow-ups. Content that answers related questions in the same topic cluster may have advantages.


    1. Seven explicit signals, not a black box

    The Signal Viewer exposes seven distinct ranking signals. This isn’t a single neural network making opaque decisions. It’s a multi-signal fusion system with interpretable components.

    2. Gecko + Jetstream = Semantic layer

    Google uses both embedding similarity (Gecko) and cross-attention (Jetstream) for semantic understanding. Jetstream’s negation handling suggests nuanced query understanding beyond simple similarity.

    3. PCTR is a three-tier system, not a single signal

    Engagement signals operate in tiers: Popularity → PCTR → Personalized PCTR. Each tier has quality thresholds and unlocks with more user interaction data. Personalized ranking only activates after 100,000+ queries. (And of course, this is about the Discovery Engine)

    4. 500-token chunk limit with optional heading context

    The retrieval unit has a ~500 token maximum, with an option to include ancestor headings. Structure your content accordingly.

    5. Three product tiers from one pipeline

    Traditional search, AI Overviews, and AI Mode are all served from the same pipeline with different configurations. The architecture is unified.

    6. Adversarial and relevance gates

    Final safety filters can block content even after it ranks well. Write naturally and maintain high relevance throughout.




Sources:

© 2026 David Epding.            Erstellt mit Wix.com.

david epding logo

David Epding ist GEO & SEO, Data Analytics und Automation Manager mit über 10 Jahren Erfahrung in Technischem SEO mit breiter Expertise für LLMs und langjähriger Erfahrung in der Daten-Analyse.

bottom of page