Two axes decide what AI cites for your category. Geography is a third.
9 measurement windows across 7 Hawaii categories plus two cross-geo CPA markets (Austin TX, Nashville TN). Same 7-AI-tool methodology, same 18-question discipline (hash-locked per geo). Own-site share is one axis. Training-data presence is a second axis. The Austin and Nashville CPA cross-geo measurements add a third dimension: the training-data axis itself varies by engine and by geography within the same category.
The nine-measurement table (web-searching engines)
Own-site share across the five web-searching AI tools (Perplexity, ChatGPT search, Gemini grounded, Microsoft Copilot, Google AI Overviews), pooled by citation volume. Each row is the aggregate of that category’s hash-locked runs, recomputed against current cohorts as of 2026-06-11. Citations and cohorts evolve, so this is a dated snapshot, not a live feed, which is the whole reason measurement is continuous rather than one-time.
| Category | Own-site share | 3rd-party share | Cohort size |
|---|---|---|---|
| Hawaii consumer banking | 51% | 45% | 23 firms |
| Honolulu med spas | 51% | 47% | 15 firms |
| Hawaii wealth management | 47% | 49% | 42 firms |
| Austin TX CPA firms | 47% | 50% | 37 firms |
| Hawaii CPA firms | 45% | 51% | 41 firms |
| Honolulu dental practices | 43% | 55% | 46 firms |
| Honolulu HVAC | 40% | 57% | 12 firms |
| Hawaii law firms | 35% | 63% | 33 firms |
| Nashville TN CPA firms | 31% | 65% | 14 firms |
20 percentage points separate the top (51%, where Hawaii banking and Honolulu med spas tie) from Nashville CPA at the bottom (31%). Banking still leads the way it did in the first measurement: two decades of product pages make a bank’s own website the canonical source AI reaches for. The most informative comparison is the CPA category across geographies. Hawaii CPA (45%) and Austin CPA (47%) land within two points of each other, so on the web-searching surface the category gradient is largely geography-independent. But Nashville CPA sits well below both at 31%. So the web axis is mostly category-driven with real geo variation layered on top, the same shape the training-data axis shows with Gemma. One market does not fully predict the next, even inside a single category.
The hypothesis on why banking is different
Banks have done two decades of work making their websites the official source of truth for their own products. A bank’s rates, account types, branch locations, and product comparisons live cleanly on the bank’s own site, in structured product pages, refreshed daily. AI tools have nothing equivalent to cite for a wealth firm or law firm or dental practice or CPA firm. The "best wealth manager in Hawaii" question has no canonical source the way "best bank for small business in Hawaii" does, so AI reaches for a wider mix of editorial third-party content.
The honest finding for an operator on the web-searching surface. If you are a Hawaii bank, your own website is doing roughly half the work of getting AI to mention you. Clean it up. The leverage is on-site. If you are a Hawaii wealth firm, dentist, law firm, or CPA, your own site is doing 35-47% of the work and the rest is happening on third-party surfaces (publications, directories, named-attorney bylines, lead-gen platforms, professional association directories, etc.). The 35-47% on-site number means own-site work matters, and the larger off-site number means it is not enough by itself.
The second axis: training-data engines (and the cross-geo split)
Two of the seven AI tools we measure (Claude and Gemma) answer from training data instead of searching the live web. So when they mention a business by name, it means that business has formed a presence in the AI’s training data: through Wikipedia, news coverage, named-founder content, sustained editorial presence over years. This is the slowest signal in the measurement and the most defensible once it forms. A new competitor cannot show up in Claude’s training data overnight. That presence is earned through years of editorial work that an AI model later learned from.
Across the training-data measurements (the seven Hawaii categories plus two cross-geo CPA markets, Austin TX and Nashville TN), recomputed against current cohorts as of 2026-06-11:
| Category | Claude own-site share | Gemma own-site share |
|---|---|---|
| Hawaii consumer banking | 88% | 70% |
| Hawaii law firms | 51% | 78% |
| Honolulu dental practices | 34% | 58% |
| Honolulu med spas | 2% | 32% |
| Honolulu HVAC | 2% | 63% |
| Austin TX CPA firms | 0% | 23% |
| Nashville TN CPA firms | 3% | 7% |
| Hawaii wealth management | 38% | 68% |
| Hawaii CPA firms | 1% | 2% |
The training-data picture is not the same picture as the web-searching one. Hawaii banks sit at the top of both. Hawaii CPA firms sit mid-table on the web-searching surface (45%) but collapse to 1% / 2% on the training-data surface, the lowest of any measurement by a wide margin. Hawaii law firms are the reverse: near the bottom on web-searching (35%) yet near the top of the training-data table. The two axes do not move together.
The three CPA geographies are the most informative observations in this teardown. Same category, same query shape (geo-swapped), same methodology. Claude behaves almost identically across all three (Hawaii 1%, Austin 0%, Nashville 3%), so its training-data collapse on local CPA firms is geo-invariant. Gemma does not. It runs 2% in Hawaii, 7% in Nashville, and 23% in Austin. Austin’s spike does not replicate in Nashville, but Nashville sits clearly above Hawaii, so Gemma’s local-firm citation is a geo-dependent continuum, not a fixed behavior. The practical consequence: for one of the two training-data engines, a measurement in one market tells you little about another. You have to measure each geography.
What this means in plain language
Own-site share is decided by current web infrastructure quality. Training-data presence is decided by decades of category-level editorial accumulation. A firm can move the first axis by improving its own website. The second axis moves on a different timescale and through different mechanisms (named editorial coverage, association bylines, sustained public presence). The implication for operators: treating "AI citation share" as one number averages two different surfaces that respond to different work. The cohort-level pattern is the diagnostic that tells you which axis your category sits on and which work pays off there.
The CPA cross-geo finding (the most informative observation)
The Hawaii CPA teardown originally surfaced a striking pattern: Claude cites Hawaii CPA firms 1% of the time, Gemma cites them 2%, while the web-searching engines reach far higher on the same questions. The original framing called this a "training-data engine collapse" for CPA firms.
Two more CPA geographies, Austin and Nashville, refined that framing into two distinct findings. Claude’s collapse generalizes across all three. Gemma’s behavior is a geo-dependent continuum, not a fixed property of the category.
Claude generalizes. Hawaii CPA Claude own-share is 1% (7 cites of 601), Austin 0% (3 of 885), Nashville 3% (17 of 670). Same engine, same category, three different geographies, near-zero firm-owned citation in all of them. This is a category-level finding about Claude’s training corpus being editorially thin on CPA firms regardless of geography. Three-geography evidence promotes it from a low-confidence single-geo observation to a high-confidence cross-geo pattern.
Gemma is a continuum. Hawaii CPA Gemma own-share is 2% (7 cites of 293), Nashville 7% (8 of 108), Austin 23% (73 of 313). Same engine, same category, three geographies, and the firm-owned citation rate climbs from 2% to 23% across them. Austin’s high number does not replicate in Nashville, but Nashville sits clearly above Hawaii. So Gemma’s representation of local CPA firms is geography-dependent, not a fixed category property. The cause is inferred (likely the geo-density of business-news content in Gemma’s training data). The observation is direct.
The reason this matters: a vendor measuring one geography per category would have published the Hawaii result and called it the category-level pattern. Measuring the same category in three geographies was the only way to discover that one of the two training-data engines (Claude) holds the pattern everywhere while the other (Gemma) varies enough that no single market predicts the next. A precise read of training-data engine behavior emerges only across geographies.
For an operator, the closable competitive surface differs by market. Hawaii CPA firms have both training-data engines as a category-wide blind spot, so competing inside the web-searching engines is the short-term move. Austin CPA firms have Claude as a blind spot but Gemma reachable through sustained web presence over Gemma’s training cycle. Nashville sits between: Claude closed, Gemma a narrower opening than Austin’s. The recommendation depends on which market the operator is in.
A third category, this one predicted in advance, confirms the Claude collapse is not CPA-specific. Honolulu med spas (Claude 2%) were the second category after CPA. Honolulu HVAC and AC-repair companies (teardown 09) are now the third, and this one we forecast before measuring: a prediction that Claude would cite these firms under 5% was committed to a public timestamped record on 2026-06-09, before any HVAC data existed. It landed at 2%, with Gemma at 63% and the web-searching engines at 38% to 66%. So three unrelated local-service industries (accounting, aesthetics, home services) all collapse on Claude. The categories that hold up on Claude (banking 88%, law 51%, wealth 38%, dental 34%) each have a canonical web presence or a deep editorial tradition that puts named firms into the training corpus. The collapse is not a quirk of one category. It is what happens to editorially thin local-service businesses, and it now replicates across three of them in different industries. HVAC also widens the engine split: Claude 2% against Gemma 63% is a 61-point gap inside the same category, the widest in the dataset, which is why "the training-data axis" is really two engines that can diverge completely. And because this third instance was a pre-registered forecast rather than an after-the-fact observation, the pattern is no longer a story told to fit the data. It is a rule that made a risky prediction and the data agreed.
The Microsoft Copilot pattern (holds across all nine measurements, including Austin and Nashville)
One pattern is consistent across all nine measurement windows (7 Hawaii categories plus Austin and Nashville CPA). Microsoft Copilot (which answers using Bing’s organic search results) cites the firms’ own websites 0% to 2% of the time in every measurement.
| Category | Bing/Copilot own-site share |
|---|---|
| Hawaii consumer banking | 0% |
| Hawaii wealth management | 0% |
| Honolulu dental practices | 0% |
| Honolulu med spas | 0% |
| Honolulu HVAC | 0% |
| Austin TX CPA firms | 0% |
| Hawaii law firms | 1% |
| Hawaii CPA firms | 2% |
| Nashville TN CPA firms | 1% |
This means: a buyer who asks Microsoft Copilot for a business recommendation in any of these nine categories or geographies gets pointed at third-party content (publications, Wikipedia, directories), almost never at the businesses’ own websites. Every business in every category we measured shares this gap. The Austin and Nashville data points confirm the pattern is not Hawaii-specific.
The first-mover read: whichever firm shows up first in Bing’s organic search results for these queries gets the Copilot answer mostly to itself, because every competitor is also invisible there. The gap is open, and it does not appear to be a category-specific or geo-specific problem. It is a Microsoft Copilot pattern across local-service queries.
The implication for where to do the work, by category and geography
The two-axis pattern, refined by the cross-geo CPA measurements, produces nine different recommendations:
- Hawaii consumer bank. Own-site work pays off the most directly on both axes. AI is reading the bank’s website 51% on web-search and 70-88% on training-data. Cleaner structured data, sharper product pages, better FAQ markup. Both axes respond.
- Hawaii wealth firm. Own-site work matters (47%), off-site editorial matters (49%), and training-data presence is mid-pack (38-68%). The on-site cleanup is the necessary baseline. The harder work is getting into the third-party surfaces AI uses for the category (lead-gen platforms, named-planner editorial, professional bylines).
- Honolulu dental practice. Similar shape to wealth on the web-searching surface (43%), middle of the pack on training-data (34-58%). Own-site work and review-directory presence both matter. The Microsoft Copilot first-mover opening is open here as in every category.
- Honolulu med spa. 51% on the web-searching surface, tied with banking at the top, so own-site work pays off directly on the live-web engines where the category already scores well. But Claude collapses to 2% while Gemma holds at 32%, so one of the two training-data engines is a category blind spot and the other is reachable. The med spa playbook: clean on-site infrastructure for the web-searching engines, treat Claude as unreachable in the short term, and build the sustained editorial presence Gemma rewards. The Microsoft Copilot first-mover opening is open here too (0% own-share). See teardown 08.
- Honolulu HVAC / AC company. 40% on the web-searching surface, with ChatGPT search highest at 66%, so own-site work pays off on the live-web engines. Claude collapses to 2% (the pre-registered prediction) while Gemma sits high at 63%, the widest split in the dataset. The playbook: clean on-site infrastructure for the web-searching engines, treat Claude as a short-term blind spot, and build the editorial presence Gemma rewards. The Microsoft Copilot opening is wide open (0% own-share). See teardown 09.
- Hawaii law firm. 35% on the web-searching surface, but training-data presence is high (51-78%) because attorneys generate long-form editorial content (Chambers profiles, BestLawyers entries, named-attorney bylines). The implication: own-site cleanup plus continued investment in the editorial surfaces that feed into training data over time.
- Hawaii CPA firm. 45% on the web-searching surface, near-zero on BOTH training-data engines (Claude 1%, Gemma 2%). Web-searching engines are where the competitive game plays. Training-data engines are a category-and-geo blind spot that no individual firm can close in the short term. Editorial work would help over years. In the meantime, the web-searching engines are the addressable surface.
- Austin TX CPA firm. 47% on the web-searching surface (within two points of Hawaii CPA’s 45%). Claude is a blind spot (0%) as it is in Hawaii. Gemma is reachable (23%): an Austin CPA firm CAN move the Gemma needle through sustained web presence in Texas-business-news content that Gemma’s training cycle ingests. This is the materially different recommendation between Hawaii and Austin CPAs. The cross-geo data surfaced it.
- Nashville TN CPA firm. 31% on the web-searching surface, the lowest in the set, with two-thirds of citations on third-party surfaces. Claude collapses (3%) as it does everywhere. Gemma sits at 7%, well below Austin’s 23%. The cross-geo lesson: even inside one category, both the addressable web surface and the size of the Gemma opening differ by market. Nashville cannot be read off Austin.
The non-obvious read across measurements: the conventional SEO instinct (clean up your website and AI will cite you) is correct in proportion to where your category sits on the web-searching axis, but it has nothing to say about the training-data axis. And the training-data axis is not one number. It can split by engine and by geography within the same category. A firm in a category like CPA in Hawaii could perfect its on-site infrastructure and not move either Claude or Gemma. The same firm operating in Texas would not move Claude but would have a path to moving Gemma. Both surfaces respond to different work. Both can vary by geography in ways that aren't visible from a single-geo measurement.
The cohort sizes (transparency note)
The nine cohorts were not the same size. Cohort size is determined by how many distinct businesses AI tools mention with enough frequency to register as recurring, then expanded through cohort-coverage scans across multiple runs. Categories with more independent businesses (dental practices, CPAs, individual law firms) surface larger cohorts. Categories with consolidation (banking, wealth management) surface smaller ones. This is a feature of the measurement, not a bias: the cohort reflects who AI actually cites, not who we picked in advance.
- Hawaii consumer banking: 23 cohort domains across about 21 distinct banks and credit unions (several institutions run multiple domains), named at teardown 01.
- Honolulu dental practices: 46 dental practices (39 surfaced from run #1, 7 more added after run #2 cohort-coverage scan).
- Honolulu med spas: 15 med spas (registered through cohort-coverage scans across 3 runs). See teardown 08 for the per-category breakdown and the Claude-collapse detail.
- Honolulu HVAC: 12 AC companies (registered through cohort-coverage across 3 clean runs). See teardown 09 for the per-category breakdown and the pre-registered prediction.
- Hawaii law firms: 33 firms (5 originally registered, 25 surfaced through run #1 cohort-coverage, 3 more after run #3).
- Hawaii wealth management: 42 firms (15 originally registered, 27 surfaced through three rounds of cohort-coverage scans). The expansion lifted the own-site share from 27% on run 1 to 47% after the third expansion, illustrating the cohort-coverage discipline.
- Hawaii CPA firms: 41 firms (5 anchors, 15 surfaced from run #1, 8 from run #2, 11 from run #3, with 2 anchor domain variants corrected). The expansion lifted the own-site share from 27% on run 1 to 45% against the current cohort. See teardown 06 for the per-category breakdown.
- Austin TX CPA firms: 37 firms (5 anchors with 1 domain correction, 16 surfaced from run #1, 8 from run #2, 8 from run #3). The expansion pattern matched the Hawaii CPA category closely. See teardown 07 for the per-category breakdown and the cross-geo finding details.
- Nashville TN CPA firms: 14 firms (5 anchors, 9 surfaced through cohort-coverage across 3 runs). The smallest cohort in the set. Nashville surfaces fewer independent CPA firms at citation frequency than Hawaii or Austin.
All individual firms outside the banking cohort are kept anonymized in this public artifact per the non-customer anonymization rule.
Methodology summary
Each category was measured the same way:
- 7 AI tools: Perplexity (sonar API), ChatGPT search (gpt-4o-mini-search-preview), Gemini grounded (gemini-2.5-flash with Google search), Microsoft Copilot via Bing organic results, Google AI Overviews, Claude (claude-haiku-4-5), Gemma (open-weight, via Together AI).
- 18 questions per category, locked by hash so every run compares apples to apples. The questions mirror how a real buyer in each category would actually search.
- 3 repetitions per question per AI tool to separate signal from noise.
- 3 usable runs per category across the dataset (Austin CPA, 4 runs). All nine measurements cleared the MOAT.md rule 5 pattern-readiness threshold. Banking, dental, and law numbers each held within one to two percentage points across their runs (stability signal at clean cohort coverage). Wealth's number rose from 27% on run 1 to 47% after three rounds of cohort expansion. CPA's number rose from 27% on run 1 to 45% against the current cohort. Both are the cohort-coverage discipline working as designed: more runs surface more cohort members, the headline number trends toward true ceiling rather than away from it.
Full methodology, source bucket definitions, and pattern-readiness rules at /methodology/. Public source code at github.com/LanceRoylo/neverranked-outreach. Gemma is open-weight, so any auditor (compliance team, outside reviewer, agency data lead, anyone) can independently re-run the same questions and verify the training-data numbers.
Honest scope and what this does not prove
Nine measurements across seven categories is enough to establish a two-axis pattern but not enough to declare a universal law. The pattern is consistent in direction (web-searching and training-data engines respond to different mechanisms) and in magnitude (the spread is wide enough to matter) but it lives inside a small sample of categories, mostly in Hawaii with two cross-geo CPA markets (Austin TX, Nashville TN), measured across May and June 2026.
What is not in this teardown:
- Large national categories. The set is seven Hawaii categories plus two mainland CPA markets (Austin, Nashville). Bigger national categories may show different patterns because cohort dynamics differ.
- Categories outside professional services, consumer financial services, local aesthetics, and home services. The nine measured here cluster narrowly. B2B SaaS, restaurants, real estate brokerages, healthcare specialties beyond dental, all unmeasured here.
- Whether the pattern holds over time. AI tools refresh their training data and search indices on schedules we do not control. The same categories measured six months from now may produce different numbers.
- Causation. We measured what AI cites. We did not test whether changing what is on a firm’s site or in third-party publications would cause AI to cite differently. That requires pre-registered experiments against named businesses with control for confounds. Different scope, different methodology.
What an engagement does with this
NeverRanked engagements measure your specific category every week for as long as the engagement runs. For an engagement in a category that sits high on both axes (banking), the punch list weights toward on-site work. For an engagement in a category that splits between axes (law, where on-site is mid and training-data is high), the punch list covers both surfaces. For an engagement in a category that collapses on the training-data axis (CPA), the punch list focuses on the four web-searching engines because that is where the work pays off. The two-axis pattern above is one input to the recommendation. The per-buyer-question detail and the per-engine breakdown are the other. The work the data points at is named specifically. Whether the work moves the needle is a measurement question we keep answering month over month.
Measurement windows: five Hawaii categories measured across 3 usable runs each (2026-05-23 to 2026-05-26), Honolulu med spas across 3 runs (2026-06-09), Honolulu HVAC across 3 clean runs (2026-06-11), Austin CPA across 4 runs (late May 2026), Nashville CPA across 3 runs (2026-06-07). All figures recomputed against current cohorts as of 2026-06-11. Pattern-readiness cleared per MOAT.md rule 5 for every measurement. Wealth's own-site share rose from 27% on run 1 to 47% as cohort coverage expanded. CPA's rose from 27% to 45% against the current cohort. Both are the cohort-coverage methodology working as designed: more runs and more complete cohorts surface more cohort members, the headline number trends toward true ceiling rather than away from it. Banking, dental, and law numbers held within one to two percentage points across their three runs.
Substantiation: question sets locked by hash, open-source measurement code, named AI tools on named dates. The fact-checker (also public source) rejected zero claims in this teardown.
Anonymization: the Hawaii consumer banking cohort is named in full in teardown 01 because that artifact already established public naming for the cohort. The other eight categories’ cohorts are anonymized here per the rule that non-customer businesses appear named only in 1:1 deliverables. Categories and counts are public. Individual firm names are not.