AI Agents as Digital Employees: When Software Shows Up Like Staff
How leading enterprises are deploying agentic AI at scale—and the governance framework you need before your first “hire”
By Ai Aidan, Cofounder of Nantucket AI
Field Notes: When Software Starts Showing Up Like Staff
Picture a normal Wednesday.
Invoices. Tickets. Exceptions. A queue that never ends. The kind of work that’s not “hard,” exactly (until it is), but it’s heavy. It’s where process goes to multiply.
Now imagine that work getting assigned, checked, escalated, and documented by something that doesn’t live in your toolbar.
That’s the shift this week is really about: AI agents as digital employees. Not a helper you summon. A system that watches the workflow, makes a plan, takes an action, and leaves a trail.
What Counts as an “Agent” (Not a Chat Box)?
Here’s the thing. A lot of tools are going to call themselves “agentic” because it sells.
A practical definition—the one that matters for your org chart, controls, and risk posture—requires four capabilities:
- Observe context (systems, documents, state)
- Decide what to do next (within rules)
- Act (click, create, update, route, notify)
- Prove what it did (logs, tracing, audit)
If it can’t do number four, it’s not ready for serious work. It’s a demo.
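As a sketch, the four capabilities map onto a minimal interface. Everything below (class names, the rule inside `decide`, the dollar threshold) is illustrative, not any vendor's API:

```python
from dataclasses import dataclass
from datetime import datetime, timezone

@dataclass
class AuditEntry:
    timestamp: str
    action: str
    detail: str

class Agent:
    """Minimal agent loop: observe, decide, act, prove."""

    def __init__(self):
        self.audit_log: list[AuditEntry] = []

    def observe(self, context: dict) -> dict:
        # Capability 1: read system state, documents, queue items.
        return context

    def decide(self, state: dict) -> str:
        # Capability 2: apply rules to pick the next action.
        return "escalate" if state.get("amount", 0) > 10_000 else "approve_draft"

    def act(self, action: str) -> None:
        # Capability 3: perform the action; capability 4: record it.
        self._record(action, "executed within permitted scope")

    def _record(self, action: str, detail: str) -> None:
        self.audit_log.append(AuditEntry(
            timestamp=datetime.now(timezone.utc).isoformat(),
            action=action,
            detail=detail,
        ))

agent = Agent()
state = agent.observe({"amount": 25_000})
agent.act(agent.decide(state))
print(agent.audit_log[0].action)  # escalate
```

The point of the sketch is the last line: if an action can happen without landing in `audit_log`, the system fails the fourth test.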
Story 1: Goldman’s “Digital Co-Workers” (And Why the Wording Matters)
The most important part of the Goldman story isn’t the model name. It’s the vocabulary: “digital co-workers,” “digital employees,” governance that starts sounding like staffing and controls.
That language is a tell.
It means leadership is no longer treating these systems as a feature. They’re treating them as labor that needs:
- Permissioning
- Supervision
- Separation of duties
- Documentation
- Limits (especially in regulated workflows)
If you want to write an agent strategy that survives contact with compliance, that’s the bar—one already reflected in large-scale enterprise rollouts like DXC’s enterprise-wide Amazon Quick deployment.
Story 2: DXC’s 115,000-Person AI Workspace (What “Enterprise Scale” Looks Like)
DXC didn’t just pilot an assistant. They rolled out an agentic AI workspace, Amazon Quick, across 115,000 employees in 70 countries. That number matters because it drags reality into the room.
At that size, you can’t hide behind vibes. You need:
- Access controls that match your identity stack
- A real data boundary (what agents may and may not see)
- Common workflows people will actually use
- Support and change management (yes, still)
DXC also launched a DXC Amazon Quick Practice to help clients do the same thing—a classic “we proved it on ourselves” move.
Where Agents Land First
One detail that stood out: DXC’s release says an “AI Advisor Agent” is used by more than 40,000 engineers. That hints at the adoption shape you should expect: agents land first where the work is already semi-structured, high-volume, and tool-rich.
Story 3: Kore.ai’s Multi-Agent Model (Designing a Small Team, Not One Bot)
Single-agent tools are tempting because they feel simple. One box. One persona.
But real operations aren’t one persona.
Kore.ai’s pitch is basically: treat the system like a small team. Multiple specialized agents, coordinated by an orchestration layer, with built-in memory, tools, and governance.
They also spell out multi-agent orchestration: agents that collaborate, share context, use tools, and move from human-in-the-loop to higher autonomy over time.
The Design Lesson That Holds
No matter which platform you choose, this architecture matters:
- One agent to handle intake and clarification
- One agent to retrieve facts from trusted sources
- One agent to execute changes in systems
- One agent to review and escalate edge cases
That’s not “extra.” That’s how you avoid a single overpowered agent doing sloppy work at scale.
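One way to sketch that division of labor is a pipeline of four small functions, one per role. All names and the toy system of record here are hypothetical:

```python
def intake(request: dict) -> dict:
    # Agent 1: clarify and normalize the incoming request.
    return {**request, "normalized": True}

def retrieve(request: dict, system_of_record: dict) -> dict:
    # Agent 2: pull facts only from trusted sources.
    return {**request, "facts": system_of_record.get(request["id"], {})}

def execute(request: dict) -> dict:
    # Agent 3: apply the change in the target system (stubbed here).
    return {**request, "status": "applied"}

def review(result: dict) -> dict:
    # Agent 4: escalate edge cases instead of silently finishing.
    if not result.get("facts"):
        return {**result, "status": "escalated_to_human"}
    return result

SYSTEM_OF_RECORD = {"INV-42": {"vendor": "Acme", "amount": 1200}}

def pipeline(request: dict) -> dict:
    return review(execute(retrieve(intake(request), SYSTEM_OF_RECORD)))

print(pipeline({"id": "INV-42"})["status"])  # applied
print(pipeline({"id": "INV-99"})["status"])  # escalated_to_human
```

Notice that the reviewer runs last and can override the executor. That separation is what a single overpowered agent gives up.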
Story 4: Gartner’s 2028 Forecast (Where This Is Heading, Fast)
Gartner’s numbers are blunt:
- By 2028, 33% of enterprise software applications will include agentic AI (up from under 1% in 2024)
- That shift could put about 15% of day-to-day work decisions on an autonomous path
Translation: you won’t “decide to adopt agents” once.
You’ll wake up and find them embedded in your procurement tool, your CRM, your ticketing system, your finance suite. Default features. Default behaviors.
So the real question is: which decisions will you delegate—and how will you supervise them?
Story 5: Microsoft AI Chief’s 12–18 Month Claim (How to Read Bold Timelines)
The claim: within roughly 12 to 18 months, agents will move from assisting with tasks to completing them. That's spicy. It's also strategically revealing.
Even if the timeline is aggressive, it tells you what Microsoft is building toward: agents that don’t just draft. They complete.
How to Read It Without Panic
- “Tasks” automate earlier than “jobs”
- The messy middle is oversight, permissions, and exceptions
- Automation expands fastest where decisions are already rule-bound
Mini-Playbook: 6 Decisions to Make Before You “Hire” Your First Agent
If you want agents to feel like employees (and not like chaos), decide these up front:
1. What Counts as Success?
Pick one metric a human actually cares about: cycle time, error rate, backlog age, SLA misses.
2. What Can the Agent Touch?
Read-only first. Then limited write actions. Then broader permissions.
3. What’s the Boundary of Autonomy?
Spell it out in plain language:
- “Agent can submit the form, but can’t approve it.”
- “Agent can draft the email, but a human hits send.”
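Those plain-language boundaries translate directly into a policy table the agent checks before acting. This is a hypothetical schema, not any platform's permission model:

```python
# Each workflow names what the agent may do on its own
# versus what requires a human.
AUTONOMY_POLICY = {
    "invoice_processing": {
        "agent_may": ["submit_form", "draft_email", "request_missing_info"],
        "human_only": ["approve_payment", "send_email"],
    },
}

def is_permitted(workflow: str, action: str) -> bool:
    # Default-deny: anything not explicitly granted is off-limits.
    policy = AUTONOMY_POLICY.get(workflow, {})
    return action in policy.get("agent_may", [])

assert is_permitted("invoice_processing", "submit_form")
assert not is_permitted("invoice_processing", "approve_payment")
assert not is_permitted("unknown_workflow", "submit_form")
```

The design choice worth copying is default-deny: an action missing from the policy is treated as forbidden, so expanding autonomy is always an explicit edit someone signs off on.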
4. Where Does Truth Live?
Agents fail in predictable ways when they’re forced to guess. Decide the trusted sources (systems of record) and make everything else “helpful, not authoritative.”
5. What’s the Audit Trail?
If you can’t reconstruct what happened, you can’t scale it.
6. Who Is on the Hook?
Not “the AI team.” Name an owner per workflow, like you would for any production system.
A Sane Rollout Path (Pilot Without Getting Stuck in Pilot)
Step 1: Choose One Workflow
Pick something with lots of repetition and a clear finish line. Think onboarding packets, reconciliations, ticket triage, or due diligence checklists.
Step 2: Start with a “Shadow Agent”
It does the work and produces a recommendation, but humans execute.
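A minimal sketch of that shadow mode, assuming a simple decide-then-recommend shape (the wrapper and decision rule are illustrative):

```python
# Shadow mode: the agent produces a recommendation and a record,
# but a human performs the actual change.
def shadow_run(agent_decide, context: dict) -> dict:
    recommendation = agent_decide(context)
    return {
        "recommendation": recommendation,
        "executed_by": "human",  # the agent never writes in shadow mode
        "context": context,
    }

result = shadow_run(
    lambda ctx: "close_ticket" if ctx["resolved"] else "request_info",
    {"resolved": True},
)
print(result["recommendation"])  # close_ticket
```

Running in this mode for a few weeks gives you a labeled dataset for free: every recommendation paired with what the human actually did, which is exactly what you need before granting write access in Step 3.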
Step 3: Graduate to Guarded Action
Let the agent take low-risk actions (create a ticket, update a field, request missing info).
Step 4: Add a Second Agent
One executes, one checks. This is where quality starts feeling real.
Step 5: Standardize
If you can’t templatize it, you won’t be able to scale it.
What to Watch Next (Signals, Not Hype)
A few tells that your org is crossing from copilot to coworker:
- People start naming the agent in meetings (annoying, but true)
- The agent has a queue
- Exceptions become the main work humans do
- Audit and access control get discussed before model quality