Internal Linking — Complete Guide
Last updated: June 1, 2026
What this guide is and is not. This is the framework reference for internal linking — the crawl graph, hub-and-spoke and pillar-cluster patterns, anchor-text discipline, orphan detection. The Phase 1A cluster itself is used as a worked case study throughout. If you want a task-focused workflow for improving internal links on an existing page, see How to Improve Internal Linking.
1. The crawl graph
Every site is, structurally, a graph: pages are nodes; internal links are edges. Crawlers (search-engine and AI alike) discover pages by following edges from a known starting point (the homepage, the sitemap, or an externally-linked URL). A page that is not reachable through the edge graph is invisible to crawl discovery.
Two practical consequences. First, depth from root matters: a page four or more clicks from the homepage is harder for crawlers to prioritize. Second, the sitemap supplements but does not replace the crawl graph — Google's documented behavior is to use the sitemap for discovery hints while still relying on the link graph for understanding relationships.
A practical rule: any page worth indexing should be reachable through the link graph within three clicks from the homepage or a major hub.
2. Authority flow
Search engines distribute ranking signal across a site partly through the link graph. The original conceptual model is PageRank — pages with more incoming links from authoritative pages tend to rank higher. Google has documented that PageRank-style link analysis continues to inform ranking, though the precise production formula is not public.
For practical site planning, the framing that holds up regardless of algorithmic detail:
- Authority flows through links. A page that receives links from your homepage, your hub pages, and your high-traffic articles is more likely to be considered important than an isolated page.
- The homepage is your authority root. Links from the homepage carry more weight than links from a deep article (Google has discussed this in general terms in Search Central content; precise weights are not documented).
- Hub pages distribute authority horizontally. A hub linking to ten related sub-pages distributes the hub's authority across them.
Avoid claims about specific authority percentages or PageRank values — Google does not publish them.
3. Hub-and-spoke architecture
Hub-and-spoke is the foundational internal-linking pattern. A hub page covers a topic at high level and links to spoke pages that cover sub-topics in depth. Spokes link back to the hub.
The pattern's benefits:
- Crawl efficiency: a crawler reaching the hub discovers all spokes immediately.
- Editorial clarity: the hub answers "what is this topic at a glance"; spokes answer specific questions.
- Reader navigation: a reader who lands on a spoke can find related spokes by clicking back to the hub.
- Authority distribution: hub authority is shared across spokes via the back-links.
Common implementation: the hub is a directory or summary page (e.g., /technical-seo/) with a card grid linking to each spoke. Each spoke includes a "Hub" link in its related-resources block.
4. Pillar-cluster pattern
The pillar-cluster pattern extends hub-and-spoke with horizontal links between spokes. Each spoke links not only to its hub but to other spokes covering related sub-topics. The result is a more densely interconnected sub-graph.
When the pillar-cluster pattern helps:
- The topic has natural cross-references between sub-pages (the structured-data guide naturally references the canonical guide; the AI-crawlers reference naturally cross-references the per-platform visibility guides).
- Readers who land on one spoke would benefit from a clear path to a related spoke.
- The hub alone is not the natural connection between sub-topics.
The Phase 1A cluster on helperg.com is a pillar-cluster implementation. See §7 for the worked example.
5. Anchor text discipline
Anchor text is the visible text of a link. Google documents that descriptive anchor text helps users and search engines understand what a linked page is about.
What to do
- Use descriptive anchors. "The robots.txt complete guide" beats "click here." The descriptive anchor tells readers and parsers what the linked page is about.
- Vary anchor text naturally. The same page can be linked with several different descriptive anchors across the site. Variation is more natural than exact-match repetition.
- Match the anchor to the linked content. The anchor should accurately describe what is on the other end. Misleading anchors are confusing.
What not to do
- Avoid "click here," "read more," and bare URLs as anchors. They provide no semantic value.
- Avoid exact-match keyword anchors on every internal link. Over-optimization is detectable. Use exact-match where natural, descriptive variants elsewhere.
- Avoid generic descriptive phrases that do not name the topic. "Our guide" without naming the topic is barely better than "click here."
6. Orphan pages — detection and fix
An orphan is a page that no other page on the site links to. Orphans typically come from one of three sources: a page published without adding it to navigation; a page whose hub or referrer was deleted; a sitemap entry for a page that exists but is no longer surfaced.
Detection
Compare your sitemap URL list to the URLs reachable from the homepage by following all internal links. URLs in the sitemap but not in the reachable set are orphans. For sites of any size this is a scripted task, but the underlying check is straightforward.
Fix
For each orphan, decide:
- Does it deserve to be indexed? If yes, link to it from a relevant hub or article.
- Should it not be indexed? Add a
noindexmeta tag, and remove from sitemap. - Is it obsolete? 301-redirect to the most relevant alternative, and remove from sitemap.
Do not leave orphans in the sitemap unaddressed. They confuse Search Console coverage reports and represent unindexed-but-public surface area.
7. Case study — the Phase 1A cluster on helperg.com
The Phase 1A AI Search and Technical SEO content layer on this site implements pillar-cluster internal linking. The structure:
- Two hubs: /ai-search/ (AI Search hub) and /technical-seo/ (Technical SEO hub).
- Twelve spokes: six guides under each hub.
- Existing primer + how-to siblings: the older 800-word concept and how-to pages remain; each one carries a "Complete guide" callout linking to its Phase 1A counterpart, and each Phase 1A page links back to its siblings in the related-resources block.
- Cross-spoke links: each Phase 1A page links to at least three other Phase 1A pages where the content naturally cross-references (the structured-data guide links to the canonical guide and the technical SEO checklist; the AI-search-visibility synthesis links to every per-crawler page).
The pattern is documented in the Phase 1A spec at §5A.3 (Cross-link contract) and verified at implementation time by counting inbound and outbound links per page against minimum thresholds. The result is a densely interconnected sub-graph where a reader who lands on any spoke can reach any other related spoke in two clicks or fewer.
8. AI-visibility considerations
AI providers (OpenAI, Anthropic, Perplexity, Google's generative products) have not published documented internal-linking ranking signals. The expectation that clean link structure helps AI parsers is reasonable but not provider-stated.
What is reasonable to assume: descriptive anchor text, clear hub-to-spoke structure, and absence of orphan pages help any parser — classic crawler or AI — understand site structure. What is not reasonable: claims that specific link patterns "rank in Claude" or "boost ChatGPT visibility." Neither provider publishes such signals. See the AI Search Visibility — Complete Guide for the broader honest framing.
9. Common mistakes
- Orphan pages left in the sitemap. URLs in sitemap without internal links from the rest of the site.
- Pages buried too deep. Important content four or more clicks from the homepage is hard for crawlers to prioritize and hard for readers to discover.
- Generic anchor text everywhere. "Click here," "read more," "this article" — provide no semantic value.
- Exact-match keyword anchor on every internal link to a page. Over-optimization signal; vary naturally instead.
- Hub pages with no back-links from spokes. Spokes that do not link back to their hub fail to share authority and confuse navigation.
- No cross-spoke links. When content on Spoke A naturally references Spoke B, not linking is missed opportunity for both readers and crawlers.
- Linking to noindex'd or robots-blocked pages from prominent positions. Wastes link equity.
- Treating internal linking as a one-time setup. Site structure evolves; cross-link maintenance is ongoing.
10. Checklist
- Every page worth indexing is reachable from the homepage within three clicks.
- Every hub has a card grid or equivalent linking to all its spokes.
- Every spoke links back to its hub in the related-resources block.
- Cross-spoke links exist where content naturally cross-references.
- Anchor text is descriptive — no "click here" or "read more" on any internal link.
- Anchor text varies naturally across multiple links to the same page.
- Orphan pages are identified by comparing sitemap to reachable-from-homepage set.
- Orphans are either linked into the graph, noindex'd, or 301-redirected.
- Internal links target canonical URLs (not parameter variants).
- Internal links use HTTPS where applicable.
- The site's hub structure is documented for editorial review.
- New pages are linked into the graph at publish time, not later.
- Search Console coverage reports are reviewed for unindexed-but-public URLs.
- An owner is identified for ongoing internal-link health.
- The hub structure is reviewed when major content sections are added or removed.
11. FAQ
Why does internal linking matter?
Internal links determine which pages are discoverable from the site root, how readily crawlers reach them, and how authority is distributed across a site. They also shape the navigation experience for human readers and for AI systems parsing the content.
Is PageRank still a thing?
Google has documented that PageRank-style link-graph signals continue to inform ranking, though the precise production formula is not public. Practical guidance: treat internal links as a real signal that influences discoverability and authority distribution, without claiming a specific algorithmic weight.
What is anchor text and does it matter?
Anchor text is the visible text of a link. Google documents that descriptive anchor text helps users and search engines understand what a linked page is about. Generic anchors like "click here" provide no semantic value; descriptive anchors do.
Should I worry about exact-match anchor text?
For internal links, exact-match anchor text on every link is unnatural and over-optimization. Use descriptive anchors that vary naturally; the page being linked is identified by URL plus surrounding context, not anchor text alone.
What is an orphan page?
An orphan page is one that has no internal links pointing to it from the rest of the site. Orphans are typically discoverable only through sitemap or direct URL, not through crawl from the homepage. Most orphans are accidents — pages someone published without adding them to navigation or content hubs.
How many internal links should a page have?
There is no documented universal number. Practical guidance: enough that the page contributes to discoverability for related content, not so many that the page reads as a link directory. For long-form reference pages on this site we typically place 5–10 substantive internal links.
Does the hub-and-spoke pattern still work?
Yes. Hub-and-spoke (a hub page linking to detailed sub-pages, sub-pages linking back to the hub) is a long-established pattern that benefits both crawl efficiency and reader navigation. The pillar-cluster variant adds horizontal links between sub-pages.
Do internal links affect AI visibility?
AI providers have not published documented internal-linking ranking signals. The general expectation is that AI parsers benefit from clean, descriptive link structure for the same reasons classic crawlers do. Treat internal linking as classic-SEO hygiene that may help AI systems, without claiming a specific AI ranking benefit.
12. Sources
- Google Search Central — SEO starter guide (internal links section) — captured 2026-06
- Google Search Central — Get started with Search — captured 2026-06
- Google Search Central — Title link (anchor text) — captured 2026-06
- Google Search Central — Sitemap overview (for orphan-detection workflow) — captured 2026-06
- Google Search Central — Large site crawl budget — captured 2026-06
- sitemaps.org — Sitemap protocol — captured 2026-06