Risk 12 of 13 · AI Risk Series

When AI sounds right but is wrong

The risky moment can look like a polished answer.

AI output can be confident and wrong at the same time. Polished drafts may carry invented citations, stale data, or details from another client.

Where it comes from Staff treat fluent, useful-looking AI output as verified and stop checking the load-bearing facts.
What the business loses Trust with clients, customers, or regulators when wrong AI material appears in deliverables and the firm cannot say what was checked.
What ends it Verification proportional to consequence, plus a reviewer who can name the claim, the source that proves it, and what happens if it is wrong.
← Series introduction Article 12 of 13

The risky moment can look like a polished answer.

The draft reads cleanly, uses the right tone, and includes numbers and citations that look reasonable enough for a capable person to have written.

That polish is why the mistake survives.

AI output can be wrong without looking wrong.

It can invent sources, misstate dates, mix one client's details into another client's draft, summarize away the clause that matters, or present stale information as current. If staff treat the output as an answer instead of a draft, the business can carry that error into client work, sales material, legal review, financial planning, customer support, or management decisions.

What the risk is

This risk is business reliance on AI output that appears correct and is later found wrong.

The output may include:

  • Invented facts, numbers, sources, or citations.
  • Real-looking report titles, authors, dates, or quotations that do not exist.
  • Stale information presented as current.
  • A summary that misses the one detail that changes the answer.
  • One client's wording, context, or confidential detail appearing in another client's draft.
  • Customer-facing AI answers that are confident but wrong.
  • AI-generated content used commercially without clear review of provenance, rights, or originality.

The defining issue is that AI can sound confident without being correct. Large language models generate plausible text, and the facts still need to be proven elsewhere. They may produce a citation-shaped sentence, a source-shaped title, or a number-shaped answer because that is the pattern the prompt asks for.

The safer business habit is to treat AI output as a draft that still needs verification.

This is separate from Prompt injection. Prompt injection is an attack that manipulates the AI. This article is about ordinary output failure where no attacker is involved.

It is also separate from Insider misuse. If an employee intentionally uses AI to fabricate, impersonate, or cover wrongdoing, that is insider misuse. Here, the staff member is trying to do legitimate work but trusts the output too far.

It is separate from confidential data entering AI, which covers the data entered into AI. This article covers what comes back out and how the business uses it.

For customer-facing AI, Vendor AI features covers the deployment decision and vendor posture. This article covers whether the AI's answer is true, current, complete, and appropriate to rely on.

How it happens in a normal SMB

A small Canadian marketing agency is preparing a creative brief for a regional retail client. The client is launching a new product line and wants the agency to summarize the market, customer attitudes, competitor positioning, and recommended messaging.

The account manager uses the firm's sanctioned AI assistant to speed up the first draft. She asks for a concise market overview, recent Canadian consumer trends, competitor share estimates, and source citations that the client can review.

The AI produces a polished brief with three market-size figures, two competitor share percentages, a paragraph about changing consumer attitudes, and citations that look credible: industry research reports, a trade association publication, and a recent Statistics Canada survey.

The account manager reads the brief for tone and usefulness. It looks professional, so she edits the tone, adds client-specific context, and sends it to the creative team and the client.

Nobody checks the sources.

The client uses the brief in an internal strategy presentation. During review, one of the client's finance leaders asks for the underlying report behind a market-size figure because it does not match the client's own planning assumptions.

The account manager goes back to the AI-generated citations. Two of the industry reports do not exist. One real trade association exists, but the report title is invented. There is no Statistics Canada survey with the title the AI provided; the AI created a plausible title and attached it to a trusted institution.

The brief was wrong in exactly the way that is easy to miss: polished and useful-looking.

The client asks for a written explanation. They also ask the agency to review other AI-assisted work delivered over the past year and to identify which deliverables included AI-generated research, statistics, or citations. The agency now has to investigate old work that was never labeled, source-checked, or stored with verification notes.

The failure path

The failure path looks like this:

Case file Sequence 12 · Wrong output
  1. A staff member uses AI to draft, summarize, research, analyze, or prepare a deliverable.

  2. The AI produces output that is fluent, specific, and confident.

  3. The output includes facts, dates, names, numbers, citations, recommendations, or summaries that appear credible.

  4. The staff member reviews for tone and usefulness while the load-bearing claims remain unchecked.

  5. The output is sent externally, used in a decision, added to a customer-facing answer, or passed downstream to another team.

  6. A client, customer, regulator, business owner, or later workflow relies on the output.

  7. The error is discovered after it has already shaped work, decisions, commitments, or customer expectations.

  8. The business has to explain how the output was created, what was checked, who relied on it, and whether similar errors exist elsewhere.

The dangerous step is treating AI output as verified because it sounds competent.

A competent-sounding answer still needs source checking before it leaves the business.

Business consequence

The first consequence is trust loss.

In the marketing-agency example, the client can understand the failure without understanding language models. The agency put invented research into a client deliverable. The client used that material in internal planning. Now the client has to ask whether the agency's other work was checked.

Other consequences depend on the output:

  • Client embarrassment when wrong AI-generated material appears in a board deck, sales document, proposal, report, or customer communication.
  • Operational mistakes when staff act on stale policy, pricing, tax, regulatory, or market information.
  • Contract or legal risk when a clause summary misses the caveat that changes the conclusion.
  • Professional liability where regulated work relies on AI output without appropriate review.
  • Cross-client trust damage when one client's wording, detail, or strategy appears in another client's draft.
  • Customer support harm when a chatbot or automated response gives a confident answer that the business cannot honour.
  • Management decisions based on numbers, comparisons, or trends that were never source-checked.

There is also an intellectual-property and copyright exposure. AI-generated text, images, code, slide content, campaign concepts, and product copy can create commercial risk if the business cannot explain provenance, rights, or review. Treat externally used AI output like other commercial material: someone owns the decision to use it, someone checks it, and the business keeps enough record to defend the choice if challenged.

The evidence gap matters. If the business did not label AI-assisted work, preserve source links, or keep verification notes, it may not be able to answer a client's basic questions later: what was AI-generated, what was checked, which sources were real, and which deliverables might contain the same failure.

Controls that interrupt the failure path

The first control is verification proportional to consequence.

Review should match consequence. A rough internal brainstorm needs a lighter process than a client report, legal interpretation, financial recommendation, customer-facing answer, or regulated deliverable.

Start here

  • Treat AI output as a draft that still needs verification.
  • Verify any specific claim before it leaves the business or informs a material decision.
  • Check citations, source names, links, quotations, numbers, dates, dollar figures, laws, policies, product claims, and named references at the source.
  • Use a current authoritative source for current questions. AI can point toward sources, but the authoritative source itself has to carry the claim.
  • Require domain review for client-facing, regulated, financial, legal, technical, or high-consequence work.
  • Keep separate AI sessions, projects, or workspaces for separate clients to reduce cross-client confusion.
  • Disable memory or persistent context for AI tools used across multiple clients unless there is a reviewed reason to keep it on.
  • Label AI-assisted drafts where it helps reviewers understand that source-checking is required.

Add where needed

  • Keep source links or verification notes with client deliverables that include AI-assisted research, statistics, citations, or claims.
  • Build a simple checklist for external work: claims checked, citations opened, dates current, client context verified, no cross-client content, reviewer named.
  • For customer-facing AI, use approved knowledge sources, escalation to a person, answer limits, and clear labeling that the user is interacting with AI.
  • Require human checking before AI-generated citations, legal references, market statistics, case studies, customer claims, or technical instructions pass review.
  • Require stronger review before using AI-generated content commercially where IP, copyright, brand, or customer reliance matters.

The standard should scale with the consequence of being wrong. A low-stakes internal summary may need only light review, while client-facing recommendations need subject-matter review and regulated or contractual deliverables need verification against the actual source material.

The reviewer should be able to answer three questions:

  • What claims in this output matter?
  • Which source proves each claim?
  • What happens if this claim is wrong?

If nobody can answer those questions, hold the output inside the business until the claims are checked.

Policy rule this creates

Rule 12 of 13

AI output must be verified by a person qualified to assess it before it is used in client work, sent externally, published, or relied on for material business decisions. The staff member using AI remains accountable for the output. Citations, sources, numbers, dates, dollar figures, legal or regulatory references, named facts, product claims, and customer-facing answers must be checked against authoritative sources before use. AI-assisted work for different clients must be kept in separate sessions, projects, or workspaces where practical.

One of 13 rules for your AI usage policy

The rule above is one of 13 that make up a working AI Usage Policy. The SMB AI Policy Builder walks you through the full set of decisions and produces the policy, working documents, and a 90-day implementation plan.

Launching soon. Join the waitlist to be notified.

Get practical insights like this in your inbox

Occasional articles and updates on technology, risk, operations, and support.