AI Agent vs Workflow: When You Need an Agent
You need an AI agent only when the sequence of steps cannot be known until runtime, because each step's result decides the next. If you can list the steps before the trigger fires, build a workflow and keep the model inside individual steps. A workflow is cheaper, testable, and replayable; an agent is none of those by default, which is why most agent projects stall.
You need an AI agent only when the sequence of steps cannot be known until the moment the task runs, because each step's result decides what the next step should be. If you can sit down and list the steps in order before the trigger fires, you need a workflow, even if the task is complex and leans on AI inside several of those steps. The popular framing, agents for complex work and workflows for simple work, is the wrong axis. The axis that matters is whether the path is knowable in advance. Most jobs that get pitched as agents are workflows with a model living inside a few of the steps, and building them as agents is how you end up in the pile of canceled projects.
This matters in 2026 because nearly everything is being relabeled as an agent. Gartner calls it "agent washing," the rebranding of assistants, chatbots, and RPA scripts as agentic without the underlying capability, and estimates only about 130 of the thousands of self-described agentic vendors are real. The word has been stretched until it means nothing. So before you buy or build one, it helps to have a test that tells you which kind of system the job actually needs.
An agent and a workflow are different machines
Start with the distinction the field's own builders use. Anthropic, in its guidance on building effective agents, defines a workflow as a system where models and tools are "orchestrated through predefined code paths," and an agent as a system where models "dynamically direct their own processes and tool usage, maintaining control over how they accomplish tasks."
Read that twice, because the difference is precise. In a workflow, you wrote the path. The model might read a document, classify a ticket, or draft a paragraph, but the order of operations is fixed by your code or your no-code canvas. In an agent, the model decides the path. It looks at the current state, picks the next action, runs it, looks again, and keeps going until it judges the task complete. Nobody wrote that sequence down ahead of time, because it depends on what the model finds along the way.
That single property, who decides the next step, is the whole decision. Everything else follows from it.
The test: can you draw the flowchart before the event fires?
Here is the test we run before building anything. Take the task and try to draw its flowchart on paper, in full, before any real event happens. Every box, every branch, every decision point.
If you can draw it, it is a workflow. The branches can be many, the logic can be intricate, some boxes can be model calls. None of that matters. If the shape is fixed at design time, code should own the control flow and the model should only do work inside the boxes you drew.
If you genuinely cannot draw it, because the second step depends on what the first step returns in a way you cannot anticipate, and the third depends on the second, and the number of loops is open, then you have an agent. The flowchart can only be drawn after the fact, by reading the log of what it actually did.
Notice what this test ignores. It does not ask whether the task is hard, or whether the data is messy, or whether it "feels like AI." A task can be very complex and still have a knowable path. Reconciling a month of invoices against a bank feed is complex and full of edge cases, but the steps are the same every time: pull the records, match by amount and date, flag the mismatches, route them. That is a workflow. Researching a prospect by deciding what to look up next based on what each search turns up is a simpler-sounding task with an unknowable path. That is an agent.
A model inside a step does not make it an agent
This is the point operators miss most often, and it is the one that quietly inflates budgets. People assume that the moment AI enters the picture, they are building an agent. They are not.
A workflow can call a model to do real cognitive work and remain a workflow. It can read an inbound email and classify the intent. It can extract the line items from a PDF invoice. It can draft a reply in your voice. It can score a lead from free-text notes. Each of those is a model doing something a rule cannot, sitting inside a step whose place in the sequence you decided in advance. The model is an employee following a process, not a manager redesigning it.
Deciding which individual steps even need a model, versus a plain rule, is its own call, and we wrote the per-step version of this decision in when an automation should use AI instead of a rule. The agent-versus-workflow question sits one level up: not "should this step be a model," but "should the model own the order of the steps at all." For most business automations the answer to that higher question is no, and you get most of the value of AI by putting it inside a workflow you control. That is the bulk of what we build in workflow automation: fixed pipelines with a model doing the judgment-heavy work in specific, named spots.
What an agent actually costs you
The reason to default to a workflow is not philosophical. An agent gives up four things a workflow gives you for free, and you pay for the difference every day it runs.
It is not replayable. In a workflow, a given input produces the same path every time, so you can re-run a failed case and watch exactly what happened. An agent chooses its own path, and the path can differ between runs of the same input, so reproducing a bug is its own investigation.
It is hard to test. You cannot write a clean test for a system whose steps you do not know in advance. A workflow you can grade case by case against a frozen set of examples, the way we describe in how to test an AI automation before production. An open-ended agent has no fixed surface to pin a test to.
It costs more and runs slower. Anthropic is blunt that "agentic systems often trade latency and cost for better task performance." Every decision the agent makes is another model call, and a task that a workflow finishes in one or two calls can cost an agent ten or twenty as it reasons its way forward. For a high-volume operator that gap is the whole monthly bill.
It fails more than the demos suggest. Carnegie Mellon's TheAgentCompany benchmark put leading models against real office tasks inside a simulated company. The best performer, Gemini 2.5 Pro, completed 30.3 percent of tasks autonomously. The rest were lower. Autonomous, open-ended execution on consequential work is still a coin flip at best, which is why Gartner expects most agentic projects to be scrapped by 2027 on cost and unclear value. None of that means agents are useless. It means the bar for needing one should be high, because the failure rate is real.
Which of these is an agent?
Here is how common operator tasks sort out under the test. The column that matters is whether the path is knowable before the run.
| Task | Path known in advance? | Build it as |
|---|---|---|
| Lead comes in: enrich, score, route to a rep, post to Slack | Yes, same steps every time | Workflow |
| Read an inbound email, classify intent, draft a reply, send for approval | Yes, model works inside fixed steps | Workflow |
| Generate a tailored proposal from deal data and a template | Yes, even though the output is "creative" | Workflow |
| Support bot that answers from docs and books a meeting via one known tool | Mostly yes, bounded tool use per turn | Workflow with tool use |
| Reconcile invoices against the bank feed, flag mismatches | Yes, complex but fixed | Workflow |
| Research a prospect, deciding each next search from the last result | No, path depends on findings | Agent |
| Diagnose a failing system: pull logs, decide next probe from contents, repeat | No, loop length and order unknown | Agent |
The rows that come out as agents share a signature: the model has to look at what it just learned and choose what to do next, and you cannot pre-write that chain. The rows that come out as workflows share the opposite one: you could hand the step list to a new hire and they would follow it the same way every time.
If a task lands in the agent rows and you do build one, the next question is what tools it can reach and how it picks among them at runtime, which is the MCP server decision. Tool plumbing only becomes a real question once you have established that the model genuinely owns the control flow. If it does not, you have skipped a layer of cost and risk you never needed.
How to decide this week
Before you commit to building or buying an agent, do one thing: draw the flowchart for the task, in full, by hand. If you can complete it, you have a workflow, and you should build it as one with the model placed only in the steps that need judgment. You will ship faster, your runs will be testable and replayable, and your monthly cost will be a fraction of the agentic version. If you genuinely cannot complete the flowchart because each step depends on the last in a way you cannot predict, then an agent is the honest answer, and you should go in knowing the failure rate and budgeting for the extra calls.
Most of the "we need an AI agent" requests we get turn into workflows once we run that test together, and the client gets the result they wanted for less money and less risk. Building the agent only when the path is truly unknowable is most of what separates a system that ships from one that gets canceled. If you are weighing whether your project needs an agent or a workflow, tell us what the task is and we will draw the flowchart with you and recommend the simpler build that actually works.
Frequently Asked Questions
SOURCES & CITATIONS
- Building Effective AI Agents — Anthropichttps://www.anthropic.com/research/building-effective-agents
- Gartner Predicts Over 40% of Agentic AI Projects Will Be Canceled by End of 2027 — Gartnerhttps://www.gartner.com/en/newsroom/press-releases/2025-06-25-gartner-predicts-over-40-percent-of-agentic-ai-projects-will-be-canceled-by-end-of-2027
- TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks — Carnegie Mellon Universityhttps://arxiv.org/abs/2412.14161
About Alexey Yushkin
Alexey is the founder of GENERAL INFORMATICS LLC. He designs and ships AI and automation systems for businesses and operators across the US.
Related reading
Want this kind of system in your business?
We build practical AI and automation systems for operators. Send us your current workflow and we will show you what to automate first.
Request a Workflow Review