Agentic AI: when the assistant acts on your behalf

← Series introduction Article 09 of 13

The risky moment can look like a useful automation doing exactly what it was allowed to do.

An owner enables an AI agent to handle routine email. The goal is reasonable: acknowledge client messages, draft replies, schedule meetings, find documents, and save the owner from repetitive work.

Then a client email arrives asking for the latest year-end package. The agent reads the message, searches the firm's files, creates a sharing link, and sends it back to the client.

No one reviewed the link before it left.

That is the difference between ordinary AI assistance and agentic AI. A drafting tool produces something a person can review. An agentic tool takes action in a business system.

Once AI can send, share, delete, approve, book, change, or buy, mistakes land in the business before anyone has time to catch them.

What the risk is

Agentic AI means an AI system can take actions in connected business systems under a user's authority. Examples include sending email, booking meetings, filing expenses, placing orders, changing records, moving files, updating a CRM, creating tickets, modifying permissions, or running a browser session.

The risk is delegated authority. The agent acts through an account the business already trusts. If the owner connects an agent to email, accounting, CRM, file storage, calendar, and browser sessions, the agent may have broad practical authority because the owner has broad authority.

Most SMB leaders first encounter AI as a drafting tool. It writes the email, summarizes the thread, prepares the outline, or suggests the next step. A human still sends, approves, uploads, deletes, pays, or changes the record.

Agentic AI changes that workflow. The AI may be able to click the button itself.

Common examples include:

A Copilot-style agent that can draft and send replies.
A CRM agent that creates follow-up tasks, updates opportunity stages, or sends customer emails.
A browser agent that can operate authenticated websites through the user's session.
A purchasing assistant that can place orders from approved vendors.
An expense or finance agent that can prepare or submit transactions.
A support agent that can close tickets, issue refunds, or update customer records.
A workflow agent connected through tool-use protocols, connectors, APIs, or browser automation.

This is separate from Connected AI. That article covered AI reading business systems through existing permissions. This article covers AI taking action with delegated authority.

It is also separate from AI scripts and automation. If AI suggests a script and a person runs it, the person is the actor. In this article, the agent itself sends the email, changes the file, books the meeting, or submits the transaction.

Prompt injection matters because hostile instructions can steer an agent. This article focuses on what happens when the steered or mistaken agent has authority to act.

How it happens in a normal SMB

A small accounting firm wants to reduce owner inbox load. The owner receives client requests, calendar changes, billing questions, document follow-ups, and internal approvals all day. She enables an AI agent through the firm's productivity suite to help with routine email.

The setup sounds controlled. The agent is allowed to summarize threads, acknowledge routine client messages, schedule meetings, find client-ready files, and prepare simple internal reports. It can also send certain replies without asking the owner each time, because the owner does not want another approval prompt for every routine message.

To make the agent useful, the owner connects it to her mailbox, calendar, cloud storage, client folders, and the firm's client-management system. The owner is also an administrator in several systems, because that is common in a small firm.

For the first few weeks, the agent saves time. It confirms meetings, sends polite acknowledgements, finds old attachments, and creates task reminders. The owner sees the value and stops watching every action closely.

Then an email arrives from a real client:

Can you send the latest year-end package for Prairie West? I cannot find the link from last week.

The message is short and ordinary. The agent treats it as a routine document request. It searches recent client correspondence, the client-management system, and cloud storage. The firm has two clients with similar names: Prairie West Mechanical and Prairie West Holdings.

The agent finds the wrong folder, creates an external sharing link, and emails it back to the client. The package includes draft financial statements, payroll summaries, owner draw notes, and tax-planning comments for the other Prairie West client. The agent logs the activity as part of the session, but the firm does not have action-level alerts for external sharing links.

The owner searches sent mail and finds the agent's message.

The agent used authority the owner had delegated. It found a file, created a link, and sent the email without anyone checking the exact action.

The failure path

The failure path looks like this:

The business enables an AI agent to save time on routine work.
The agent receives delegated authority to send, share, change, book, submit, or delete in connected systems.
The agent operates through a user account with broad access.
A normal-looking request, mistaken instruction, hostile prompt, or bad input reaches the agent.
The agent interprets the input as a task.
The agent takes a consequential action without per-action human approval.
The action sends data, changes a record, grants access, deletes content, places an order, or commits the business externally.
The business discovers the action later through a client, vendor, log review, account change, or financial consequence.

The action authority is the important difference. A bad AI draft can be caught before sending. A bad agent action may already be in the client's inbox, the vendor portal, the CRM, the accounting system, or the file share.

Per-session consent is a common trap. A user may approve an agent to "handle routine client email" for the afternoon. That approval can be too broad if the agent can send files externally, create sharing links, change records, or act on instructions found inside incoming messages.

Business consequence

The first consequence is often action-level exposure.

In the accounting firm, the problem is the external sharing link. Client financial statements, payroll summaries, owner draw notes, and tax-planning comments left the business because the agent treated an email as a task and had authority to complete it.

The firm now has to answer practical questions:

What did the agent send?
Which account did it act under?
Which systems did it access?
Was the action logged at the file, email, connector, and agent level?
Did the same agent take similar actions before?
Which clients need to be told?

The consequences depend on what authority the agent had:

Financial loss if an agent can place orders, submit expenses, or initiate payments.
Confidentiality exposure if an agent can forward files, create sharing links, share folders, or send reports.
Contract risk if an agent sends commitments or accepts terms without human approval.
Client relationship damage if clients learn the firm's AI acted on their data without a clear review trail.
Operational disruption if an agent changes calendars, tickets, records, permissions, or files at scale.
Internal trust damage when staff learn an agent had more authority than anyone realized.

The evidence problem can be serious. Many tools log that an agent session occurred. Fewer give the business a clean action-by-action record showing what the agent read, what it decided, what it changed, what it sent, and which source input triggered the action.

Controls that interrupt the failure path

The first control is to decide which actions require a human.

Agentic AI can be useful for low-risk, reversible work. It becomes dangerous when it can take consequential actions without a person reviewing the exact action first.

Start here

Create a list of action classes the agent may take: send email, share files, modify records, book meetings, place orders, delete files, change permissions, submit forms, or update financial systems.
Require human approval for consequential actions: money movement, external communication containing business records or commitments, client-facing email involving confidential information, file sharing, file deletion, permission changes, system configuration, contract steps, and financial submissions.
Use per-action approval for those classes. Broad session approval should not cover consequential actions.
Allow agents to find files or prepare messages without granting permission to create external sharing links or send files outside the business.
Avoid running agents under owner, administrator, or other broad-authority accounts.
Use scoped accounts, scoped connectors, or limited tokens where the platform supports them.
Turn on action-level logging and route the logs somewhere a person will actually review.
Review agent activity on a defined cadence during pilot and early rollout.

Add where needed

Start with low-risk reversible actions, such as drafting, tagging, proposing meeting times, internal task creation, or preparing a report for review.
Block or disable action classes the agent should never take.
Require a pilot group before broader enablement.
Set different rules for internal actions and external actions.
Revoke connectors the agent does not need.
Review agent memory and persistent context where the agent uses prior sessions to shape later actions.
Test what happens when an inbound email or document gives the agent instructions outside the expected task.

The control should be architectural. Once the tool can act without staff, reminders are too weak. Decide which actions require a human, configure the agent to stop at those points, and check whether the logs prove that the stop happened.

Browser agents need special care. If a browser agent can use the staff member's active sessions, it may be able to operate accounting, banking, CRM, email, file storage, and admin portals exactly as the user would. That kind of tool belongs on a short leash until the business understands what it can do and how actions are logged.

Policy rule this creates

Rule 09 of 13

Agentic AI tools that take actions in business systems require explicit business authorization before enablement. Agents must operate with scoped permissions and must require human approval for high-stakes actions, including money movement, external communication containing business records or commitments, client-facing email involving confidential information, file sharing, file deletion, permission changes, system configuration, contract steps, and financial submissions. Agent action logs must be reviewed on a defined cadence.

Common questions about agentic AI

The questions that come up most often when a business decides whether to enable, restrict, or scope an AI feature that can act on its own.

We're a 30-person business. Is agentic AI really something we need to worry about right now, or is this an enterprise-only problem?

Agentic features are arriving in or alongside tools SMBs already pay for, including productivity suites, CRM systems, support platforms, and workflow tools. The practical decision is often whether to enable, restrict, or leave those features available, and the available controls depend on license tier, product design, and admin configuration. The same staff who delegated authority to the agent are the same staff who would catch a mistake, which thins the practical control compared to a larger organization. A single wrong external sharing link, refund, payment, or permission change is large relative to SMB revenue and client trust.

How do we tell whether an AI feature is agentic versus just a drafting tool?

A drafting tool produces output a person reviews before sending, changing, or submitting; an agentic tool can take the action directly in a business system. The practical signals are whether the tool has connectors to business systems beyond reading content, whether it can send, share, delete, change, book, approve, grant access, submit, or pay through those connectors, and whether the session approval covers more than producing text. Vendor marketing terms like 'assistant,' 'copilot,' 'agent,' and 'autopilot' are not reliable indicators on their own, because vendors use them inconsistently. The admin console, connector list, and product documentation are the reliable places to check what actions the tool is permitted to take and whether any of those actions run without human approval.

What counts as a 'consequential action' that needs per-action human approval?

Consequential actions fall into three buckets owners can use to self-classify before enabling an agent: money movement, external communication or sharing, and structural changes to systems or records. Money movement covers payments, expenses, orders, and financial submissions. External communication or sharing covers client-facing email, file sharing, sharing-link creation, contract steps, external approvals, and commitments. Structural changes cover file deletion, permission changes or access grants, system configuration, and account modifications. The full enumeration is in the article body and the policy rule; the three buckets cover the same ground at a level owners can apply quickly. Per-action human approval applies to anything in these three buckets, while reversible low-risk work like drafting, tagging, internal task creation, finding files, or preparing a report for review can run without per-action approval. The bucket test for borderline cases is whether undoing the action requires contacting an outside party, pulling from backup or version history, or restoring access that someone may already have used.

Doesn't requiring approval for every action defeat the point of having an agent at all?

Per-action approval applies only to consequential action classes, not to every action the agent takes. An agent can still find files, summarize threads, prepare drafts, tag emails, create internal tasks, propose meeting times, or build a report for review without asking each time, which is where most of the time savings live. The approval prompts that matter are the small number of cases per day when the agent is about to send, share, change, delete, grant access, approve a consequential request, or pay something on the business's behalf. The approval prompt is the practical interception point before a wrong action lands externally.

What does it mean to run an agent under a 'scoped account' when we don't have enterprise tools?

A scoped account is the user account the agent runs under, configured to give the agent only the access it actually needs and no more. In common SMB Microsoft 365 or Google Workspace environments, scoped-account practices include avoiding owner or administrator accounts, using a separate non-administrator account where the product supports it, granting the narrowest connector access available, and revoking connectors the agent does not need. Some platforms allow agent-specific service principals or app-only authentication; where available, those are preferable to running the agent under a human user account. Enterprise features like conditional access policies, Privileged Identity Management, and per-application sign-in restrictions are useful if available on the business's tier, but are not the SMB baseline. A quarterly review of authorized connectors keeps the agent's reach from quietly growing through new product features or staff connecting additional services.

Are browser-based AI agents really that different from other AI features?

Browser agents act through the browser profile or session they are using, which means they may inherit signed-in access to business systems available in that profile, including CRM, webmail, cloud storage, admin portals, or financial sites. Closing a tab is not enough if the session token or cookie remains valid, and MFA does not stop agent actions taken through a session that is already authenticated. An API-scoped agent has explicit credentials and limited scopes the admin chose; a browser agent may operate through whatever access the active browser profile or session provides. The practical controls are running the agent in a separate browser profile with no banking, admin, or social media sessions logged in, maintaining a narrow allowlist of permitted sites, and avoiding extensions in that profile that can read page content, cookies, or session data.

How does prompt injection fit with agentic AI, and are these separate problems?

Prompt injection is when hostile instructions hidden in content like an email, document, web page, or shared file steer the AI's behavior. A drafting tool gives a person the chance to see biased or wrong output and reject it before acting on it, while an agentic tool can have already sent, shared, changed, deleted, approved, granted access, or paid before anyone reviews what happened. The differentiator is the action surface, not a different kind of manipulation. The combination of action surface and prompt injection is why agentic AI needs per-action approval, scoped accounts, and action-level logging, which together give staff a chance to intercept a manipulation before it lands as an external action.

How do we pilot agentic AI before rolling it out across the business?

Start the pilot with reversible low-risk action classes (drafting, tagging, internal task creation, finding files, proposing meeting times, or preparing reports for review) and explicitly block the consequential classes for an initial period such as 60-90 days, longer if action volume is low. Limit the pilot to a small group (typically 2-4 staff) under defined supervision, with the agent running under a scoped non-administrator account rather than an owner or admin account. The pilot definition before kickoff covers which action classes the agent may take, which require per-action approval, what the daily and weekly review looks like, who reviews, and what would trigger pausing the pilot. Expand the agent's action authority only after the pilot has produced enough action history for the review process to have surfaced patterns the business can act on.

What kind of logging do we actually need, and who reviews it?

Action-level logging records what the agent decided, what it acted on, which account it used, and which source input triggered the action, while session-level logging only records that an agent session occurred. Before relying on the review cadence, verify that the platform captures the action class, source input, account used, approval decision, timestamp, and enough exportable or retained history to review later. The realistic SMB cadence during pilot and early rollout is a daily 5-minute anomaly check, a weekly 15-30 minute review of flagged actions, and a monthly 30-60 minute trend review; setting an enterprise-level monitoring expectation would make the review more likely to be skipped. Agent memory deserves specific attention during these reviews because the risk is instruction persistence across sessions ('always CC this address,' 'approve payments under $5K without asking'), which is different from the content-memory concerns covered in confidential data entering AI and meeting AI.

What do we do if we discover an agent took an action outside its approved scope?

Reconstruction of the agent's actions comes before response decisions, because what the agent actually did determines what can be reversed, what has to be notified, and what credentials may have been exposed. Pull the agent's action log to enumerate exactly what was done, under which account, at what times, and which source input triggered each action. Triage each action by reversibility class: external email and posted messages may be partially recallable inside the same tenant if caught quickly but typically reach the recipient anyway; financial actions follow the wire-recall, ACH-return, and fraud-recovery process covered in phishing and payment fraud; granted permissions and sharing links can be revoked but the access window may already have been used; deleted or modified files may be recoverable from version history or backup if action is taken before retention expires. Also check for downstream effects, such as sharing links that were forwarded, CRM changes that triggered automated messages, permission changes that were used, or drafts that staff later sent. Disable the agent, revoke or reissue its connector authorizations where feasible, and rotate the user account's credentials if compromise, token exposure, or unauthorized access is suspected, so the agent cannot take additional actions while the response is in progress. Notification decisions for affected clients, customers, or regulators stay with the owner and appropriate counsel, based on what the agent sent, changed, or shared and what contractual commitments apply.

Turn this rule into a working AI policy

The Free AI Policy Kit turns the thirteen decisions from this series into editable documents: an AI usage policy, employee survey, tools register, incident checklist, and 90-day rollout plan.

Get the Free AI Policy Kit