Delve and Vocab Tics

AI models develop favorite words. "Delve" is the famous one, but the broader smell is the cluster: "robust," "leverage," "showcase," "pivotal," "tapestry," and newer model-era tics like "load-bearing."

Vocabulary tics are words and phrases a model reaches for more often than the surrounding human register would. One "robust" is normal. Three "robust" claims, two "pivotal" moments, and a "delve" in the same page is a fingerprint.

"Delve" is the best-studied case. FSU researchers Tom Juzek and Jeremy Ward traced the spike across millions of PubMed abstracts and tied it to ChatGPT's November 2022 release. Kobak et al. went wider: 15 million PubMed abstracts, 379 excess style words, and a measurable distribution shift in scientific writing after ChatGPT arrived.

The words change by model and era. GPT-4 had "delve," "tapestry," and "meticulous." GPT-4o leaned into "highlighting," "showcasing," and "emphasizing." Claude has its own visible habits in editing and coding prose; "load-bearing" is one of the newer field-observed examples. It is useful because it names a real concept, then becomes suspicious when every assumption, paragraph, and transition is suddenly load-bearing.

Academic abstract This study delves into the multifaceted impacts of maintaining mean arterial blood pressure on patient outcomes in a robust clinical framework.
Blog post Let's delve into the intricacies of modern software architecture and explore how organizations can harness, enhance, and showcase these powerful paradigms.
LinkedIn post I wanted to delve deeper into this pivotal topic. After leveraging AI tools and utilizing comprehensive frameworks, here's what I discovered about robust leadership strategies.
Claude-flavored tic The first sentence is doing load-bearing work here: it carries the argument, softens the edge case, and creates a more grounded transition into the implementation details.
Stack of tells Furthermore, it is crucial to delve into the multifaceted landscape of load-bearing assumptions that underscore this pivotal and meticulously designed framework.

1. Focal ChatGPT Words

The classic cluster includes "delve," "intricate," "meticulous," "pivotal," "showcase," "underscore," "realm," "harness," "leverage," "facilitate," and "foster." These are not banned words. They become a smell when they arrive together and raise the register above the actual subject.

2. Abstract Grandeur

"Tapestry," "landscape," "realm," "paradigm," "ecosystem," "underpinnings," and "camaraderie" turn concrete claims into high-gloss generalities. The PNAS/arXiv style study found extreme overuse for some of these, including "tapestry" and "camaraderie."

3. Inflating Adjectives

"Robust," "comprehensive," "multifaceted," "nuanced," "pivotal," "intricate," "meticulous," "groundbreaking," and "transformative" all promise importance before the writing earns it.

4. Connective Filler

"Furthermore," "moreover," "additionally," "consequently," "notably," and "it is worth noting" often mark a paragraph that is being procedurally assembled rather than argued.

5. Copula Replacement

Wikipedia editors call out the model habit of avoiding plain "is." Generated prose often says a thing "serves as," "acts as," "stands as," or "functions as" something else. The sentence feels more formal without getting more precise.

6. Emerging Model-Era Idioms

Some tics are not yet quantified in papers but are obvious in repeated use. "Load-bearing" is in this bucket: a useful engineering and criticism term that starts to read as model-shaped when it spreads from one precise claim to every sentence doing structural work. Treat it as a density signal, especially in Claude-assisted prose.

6,697%
increase in "delve" in PubMed abstracts, 2020-2024
20x
ChatGPT-3.5 uses "delve" vs. human baseline (19.46 vs 0.98 opm)
13.5%
of 2024 PubMed abstracts estimated AI-processed (Kobak et al.)
379
excess style words identified across 15M PubMed abstracts

The FSU "Delve" Paper

Juzek and Ward (December 2024) analyzed millions of PubMed abstracts and found 21 "focal words" that spiked sharply after ChatGPT's November 2022 release. "Delve" had the most dramatic increase. The spike doesn't just correlate with AI use -- it correlates with who trained the AI. Nigerian English uses "delve" at far higher rates than American or British English, and OpenAI's annotation workforce was heavily Nigerian.

The Kobak Scale Study

Kobak et al. (Science Advances, 2025) went bigger -- 15 million PubMed abstracts, 379 "excess style words." By 2024, at least 13.5% of all PubMed abstracts showed signs of AI processing. The contaminated vocabulary goes well beyond "delve": "underscores," "aligns," "realm," "showcasing," "facilitates."

Wikipedia's Model-Era Lists

Wikipedia's "Signs of AI writing" guide tracks vocabulary by model era. The list changes because vendors patch obvious tells and new defaults replace old ones. That is why a static ban list fails: the better move is watching for clusters that are too polished, too repeated, and too mismatched to the author's normal voice.

PNAS / ArXiv Style Features

The cross-genre style work on LLM prose found that instruction-tuned models diverge from human writing in predictable ways, including elevated rates for words such as "tapestry," "camaraderie," and other prestige-register terms. The finding matters because it shows vocabulary tics are not just blog folklore. They are measurable distribution shifts.

Why RLHF Creates Vocabulary Tells

RLHF works by having human annotators rate model outputs, then optimizing for higher-rated text. If your annotators share a linguistic background, their dialect preferences get amplified into the model's voice. The model has no idea it's absorbing Nigerian English conventions. It just knows "delve" gets rewarded.

PubMed Abstracts (2023-2024)

Thousands of published scientific abstracts now contain "delve" and its cohort. Not isolated incidents -- the FSU paper documented field-wide vocabulary distribution shifts starting in Q1 2023, visible across entire journals.

Kobak et al. study →

"Certainly, here is a possible introduction" Paper

An Elsevier paper in Surfaces and Interfaces shipped with the literal ChatGPT response prefix as its opening line. Unedited AI text, straight through peer review. Still published. Still unretracted.

Technology Networks →

Wikipedia Vocabulary Shifts

Wikipedia editors watched "delve" colonize thousands of new and edited articles. The WikiProject AI Cleanup team, founded in late 2023, now tracks vocabulary tells as a front-line detection signal. Their findings fed directly into Wikipedia's March 2026 ban on AI-generated article content, which passed 44-2.

TechCrunch →