Brand voice for AI content: your writer's style guide
AI blogs sound generic without a real guide. What belongs in a brand voice AI content style guide, and how to keep it current as your product evolves.
AI blogs sound generic without a real guide. What belongs in a brand voice AI content style guide, and how to keep it current as your product evolves.

Most founders who resist AI-written blogs do not actually object to the writing. They object to the sameness: the hedged sentences, the rule-of-three lists, the register that could belong to any company writing about any product. That fear is now backed by data, not just a vibe. In a survey of 132 marketers already using AI in their workflow, generic-sounding content was the top content-quality complaint: 87 named it, ahead of outdated information at 51 and content that failed to reflect real expertise at 43 (Brafton, State of AI Adoption in Marketing Teams). The fix is not writing less with AI. It is giving the model an explicit style guide instead of a mood board.
Ask a model to "write in a friendly, professional tone" and it will comply, technically. The result reads like every other blog that got the same instruction, because "friendly" and "professional" describe a feeling, not a rule a model can check its output against. Vague direction produces vague, converged output. This is not a one-off failure of a bad prompt. It is the default behavior of the underlying models, and it is measurable.
A NeurIPS 2025 Best Paper study from the University of Washington, Stanford, Microsoft, Carnegie Mellon, and the Allen Institute for AI tested more than 70 large language models against a 26,000-query open-ended benchmark called INFINITY-CHAT. It found what the researchers call an "artificial hivemind" effect: individual models repeat themselves across separate runs on the same prompt, and different model families converge on strikingly similar answers to each other (UW Allen School, "Artificial Hivemind"). A separate February 2026 study in iScience, which analyzed Portuguese-language news writing, measured this directly in output shape: GPT-4o and Mistral both produce text that clusters tightly around a statistical average length and structure, while human-written text spreads across a much wider, flatter distribution (iScience, PMC). Two different research teams, two different methods, the same finding: left to their own defaults, models do not just sound similar to each other within one company's product. They sound similar across the entire industry.
That's the mechanism behind the complaint. If every blog prompting a model with the same three adjectives gets output from the same narrow band of the distribution, every one of those blogs will read like a slightly different font on the same paragraph.
"Professional, friendly, bold" is not a style guide. It is a wish. A model has no way to check whether a sentence is "bold" enough, so it defaults to whatever bold usually looks like in its training data, which is the same thing every other model defaults to. Compare that with an instruction like "spell out negatives; only contract them for occasional rhythm" or "always use the Oxford comma" or "close paragraphs with one short sentence, 3 to 6 words." Those are testable. A model, or a human editor, can look at a sentence and say yes or no. Adjectives cannot be checked. Rules can.
The Brafton survey has a quiet second data point worth noting here: off-brand messaging ranked lower as a concern than in past years, which the report ties to marketers getting more deliberate about prompting and workflow rather than winging it with a one-line tone instruction. Teams that got specific stopped getting burned on voice. The ones still fighting generic output are, on average, the ones still describing a mood instead of a rule.
The cost is not aesthetic. It is trust, and trust is the whole point of a blog. Klaviyo's AI consumer trends data puts a number on it: only 13% of consumers say they completely trust AI generally, and 32% say noticing AI-generated marketing content specifically makes them trust the brand less (Klaviyo, consumer trust in AI). That second figure is the one that matters here: readers are not rejecting AI writing on principle. They are penalizing the flat, interchangeable version of it, the version a vague-adjective prompt reliably produces.
The same flatness hurts you with AI answer engines, for a related reason: a model summarizing search results has no way to attribute a distinctive, well-sourced voice to a page that reads like every other page. We cover the fact-checking half of that trust problem in how AI content fact-checking actually works; voice is the other half. A post can be fully accurate and still get skipped because it has nothing distinct for a reader, or a model, to hold onto.
A usable style guide is a checklist and a set of examples, not a paragraph of adjectives. It should let a writer, human or model, look at a draft sentence and know immediately whether it fits.
The rules that actually constrain a model's output are mechanical, not emotional. Point of view: do you write "you," "we," or the third person for the product itself? Contraction policy: do negatives get spelled out ("does not," "cannot") most of the time, with the occasional natural contraction for rhythm, or is the default the other way around? Oxford comma, yes or no. Typical sentence and paragraph length, and where you allow a short, punchy sentence to land for emphasis. Heading case and whether headings are phrased as real questions. A list of 10 to 20 words and phrases you actually use, so the model reaches for your vocabulary instead of the generic one. None of that is a feeling. All of it is checkable.
Ann Handley, writing about brand voice, quotes Kevin Lynch, Creative Director at Oatly, on a version of this same idea from the human side of the desk: "We like to be 'consistently inconsistent.' And we try to make sure the brand acts 'like a human, not a company'" (Ann Handley, on brand voice). Oatly gets there with taste and a small team who has internalized the voice. A model has no equivalent instinct to fall back on, and that's the gap a written guide has to close.
Rules describe the shape. Examples show what the shape looks like in practice, and they are the fastest way to close the gap between a rule and a model's output. Pull 3 to 5 verbatim paragraphs from posts you have already published and put them in the guide as the reference a new draft should match. Where you can, pair one with a rewritten "off-brand" version, so the contrast is explicit rather than implied. A model matching against your own shipped sentences drifts far less than one matching against a description of them.
A style guide that lives in a Google Doc nobody opens might as well not exist. It has to sit where the writing actually happens, and it has to be something the writer consults on every single post, not something referenced once at onboarding and then forgotten. This is the same point automated content creation without the slop makes about the parts of a pipeline that should stay under human judgment: voice is one of them, and judgment only holds if the rule is actually in front of the writer when the draft gets made.
This is a real, current feature of how Lyra works, not a hypothetical: when you connect your repo, she reads your existing posts and generates a CLAUDE.md style guide directly in that repo, including a measured voice profile like the one above, point of view, contraction ratios, Oxford comma usage, sentence rhythm, signature vocabulary, pulled from your actual published posts. It is a plain file you can open, edit, and correct, sitting next to the content it governs, and it is exactly the file that produced the paragraph you are reading now. That's the difference between a voice guide and a voice guideline: one is an editable artifact a writer consults every time, the other is a memory of a conversation from three months ago.
A style guide is not a one-time deliverable. It has to change at roughly the same pace as the blog it governs, or it quietly stops matching the thing it describes.
The guide you write at onboarding reflects your voice at that moment: your product then, your positioning then, the ten posts you had published then. Six months in, you have shipped a new feature, repositioned against a competitor, hired a writer whose habits are quietly shifting the house style, or simply gotten better at saying what you mean. None of that updates the original document unless someone deliberately goes back and edits it. A style guide that nobody revisits is not a living reference. It's a snapshot slowly drifting away from the blog it is supposed to describe.
The fix is structural: the guide has to be as editable as the content it governs, checked into the same repo, reviewed the same way a code change is reviewed. When it lives as a plain file next to your posts instead of a document in a separate tool, updating it is a one-line edit, not a project. Lyra supports re-scoping on demand for exactly this reason: when your voice or product shifts enough that the original guide stops matching, you trigger a fresh scan and she rebuilds the profile from your latest posts, not the ones you had when you first connected the repo.
Two triggers are worth watching for, one scheduled and one event-based. On a schedule, skim the guide every quarter, even if nothing dramatic changed, because small drifts compound the same way a slowly changing average does. On an event basis, re-check it after anything that plausibly moves your voice: a repositioning, a new hire whose writing sets the house style, a product pivot, or simply a stretch where the last several posts read noticeably different from the guide's own examples. If a reviewer or a reader starts flagging that recent posts feel "off," that is the guide telling you it is out of date before you have consciously noticed.
Generic AI content is a solved problem, not an unavoidable one. The research says the model's default is sameness; the fix is giving it something more specific than an adjective to work from, and keeping that thing current as your voice actually changes. Why we built Lyra covers the part of that story that led here: we wrote one post a week, then two, by hand for about a year before automating any of it, and the automation only worked once it could learn to sound like we did.
A style guide with nothing but adjectives cannot stop a model from sounding like everyone else. Lyra generates a real one from your own posts at scoping, keeps it editable in your repo, and re-scopes it as your voice evolves.
FAQ
It is a document a model can execute against: point of view, contraction policy, sentence rhythm, heading conventions, signature vocabulary, and worked on-brand and off-brand examples pulled from your real posts. It is not a list of adjectives like professional, friendly, or bold. Those describe a feeling; a style guide gives instructions a model can follow line by line.
Because a prompt like 'write in a friendly, professional tone' gives a model nothing concrete to act on, so it falls back to the same safe, hedged register every model converges toward. Research on 70-plus LLMs found this convergence happens both within one model across runs and across different model families, which is why unguided AI writing from different tools reads so similarly.
Specific enough to settle an argument. State whether you use the Oxford comma, whether negatives get spelled out or contracted, your typical sentence and paragraph length, your heading case, and 10 to 20 words or phrases you actually use. Then show 3 to 5 verbatim paragraphs from posts you have already published as the reference the model matches against.
Re-check it whenever your product, market, or team changes materially, and skim it every quarter even if nothing obvious changed. A guide written once at onboarding drifts out of date the moment you ship a new feature, rename something, or a new writer's habits start showing up in the posts it was supposedly modeled on.
Built by the tool you're reading about
Lyra finds the topics worth ranking for, writes them in your repo's voice, fact-checks every claim, and opens a pull request scored and ready to merge. You review and hit merge. Want to see what she'd write for you? Tell us about your blog and the founder will walk through it with you.
Keep reading

Pricing page SEO: how to structure prices so AI Overviews and ChatGPT quote them right, plus the verification habit that stops a cited price going stale.

A topic cluster strategy for 2026: pick 3 to 7 deep clusters, map each post to a real buyer question, and sequence 30+ posts so authority compounds.

The exact content structure that gets pages cited by AI: the answer block, self-contained chunks, definition callouts, and the byline, run as a checklist.