May 22, 2026 · 2 min read
New paper: GEO-SFE — How content structure shapes AI citation behavior
Our latest research introduces GEO-SFE, a structural feature engineering framework for generative engine optimization. Across six mainstream AI engines, structural rewrites lift citation rate by 17.3% and subjective quality by 18.5%.
We’re excited to share our latest paper, “Structural Feature Engineering for Generative Engine Optimization: How Content Structure Shapes Citation Behavior”, now available on arXiv. The paper formalizes a question we keep running into with customers: when an AI answer engine decides which pages to cite, how much of that decision is driven by structure rather than meaning?
→ Read the full paper (PDF) · arXiv abstract
The gap we’re closing
Most published GEO work focuses on semantic rewrites — what the content says. Our team kept seeing the same pattern in production: two pages with near-identical claims, one cited reliably by ChatGPT / Perplexity / Gemini and one ignored. The difference wasn’t the message — it was the scaffolding around it. GEO-SFE is our attempt to make that scaffolding measurable and controllable.
What GEO-SFE proposes
The framework decomposes page structure into three levels and models how each shifts citation probability across different generative engine architectures:
- Macro-structure — overall document architecture (sectioning, heading hierarchy, narrative order).
- Meso-structure — information chunking (paragraph density, list use, fact placement, callouts).
- Micro-structure — visual emphasis (bolding, inline highlighting, typographic cues retained in the rendered DOM).
On top of those levels we build architecture-aware optimization strategies and predictive models that preserve semantic integrity — the goal is to make structural rewrites that the engine notices, without changing what the page actually means.
Results
We evaluated GEO-SFE across six mainstream generative engines and saw consistent gains:
- +17.3% citation rate
- +18.5% subjective answer quality
The improvements held across engine families, which is the result we cared about most — it suggests structural effects generalize rather than overfitting to one model’s quirks.
Authors
- Junwei Yu
- Mufeng Yang
- Yepeng Ding
- Hiroyuki Sato
The work was carried out as a research collaboration tied to the VeReach GEO program. If you want to discuss applying GEO-SFE to your own content surface, get in touch with the team.
What this means for the product
GEO-SFE isn’t a standalone tool — it slots into the rewrite-and-retest loop already inside VeReach GEO. Expect structural diagnostics and rewrite hints to start surfacing as first-class signals alongside the semantic ones, with the same PIRR / PRP / CCD scoring driving the loop.
For the full methodology, the architecture-aware optimization strategies, and the per-engine breakdown, please read the paper: arxiv.org/abs/2603.29979.