how to

How to Train an AI Employee

A week-one playbook for training an AI employee and moving toward an agi employee feel: pick one channel, write the rules, review replies, then auto-send.

By Kelvin Tang5 min read

How to train an ai employee is mostly a week-one operations problem, not a model problem. Give it one job, one channel, one set of rules, then keep a human in the loop until the replies stop drifting. That is how a WhatsApp responder, a Slack digest, or a Gmail triage system becomes useful without turning into a liability.

If you want the plain definition first, read What is an AI Employee?. If you want the buying decision, AI Employee vs. AI Assistant is the cleaner companion. And if you want a live support example, Perla for Customer Support shows the same pattern in practice.

1. Pick one job and one channel

Do not train the whole business at once. Pick one job that repeats, and one place where that job already happens. For many teams that means WhatsApp for customer replies, Slack for internal updates, or Gmail for inbox triage. One lane gives you cleaner rules and faster feedback.

Write the job in one sentence. Example: "Reply to shipping questions, basic product questions, and order status in WhatsApp, then hand off refunds and complaints to a human." That sentence becomes the boundary. It also keeps the first version honest.

If you need a reference for the channel itself, Meta's WhatsApp Business Platform docs and Slack's Workflow Builder guide are useful starting points.

2. Write the instructions people actually use

A good system does not need a giant manual. It needs the facts people ask for every day. Start with pricing, opening hours, shipping cutoffs, refund rules, brand voice examples, and a short list of approved phrases. Put that in Notion, Google Docs, or wherever your team already keeps living notes.

Use real examples, not theory. Copy five replies that sounded right. Copy five replies that sounded off. That contrast teaches faster than a polished style guide full of adjectives. The goal is not to sound clever. The goal is to sound like your team.

This is also where the difference between an ai employee and a chatbot becomes obvious. A chatbot answers a question. A trained system can answer, file, route, and remember. That is the real jump.

3. Set the handoff rules before anything goes live

This is the part people skip. Then they regret it.

Write the cases that must go to a human. Start with refunds, complaints, broken items, angry tone, legal claims, payment disputes, and anything the system is unsure about. Then decide who owns each handoff and how fast they need to respond.

Make the transfer obvious. "I'm sending this to a person now" is enough. Customers do not mind automation if they can see the handoff. They do mind getting trapped in loops.

The safest setup is usually shared-channel first, account-access later. That means WhatsApp, Slack, or a shared inbox before deeper access to systems of record. If you want the privacy and access framing, Capabilities and the product page explain how we think about it.

4. Run review-only for the first week

Do not turn on auto-send on day one. Let the system draft replies, then have a human review them. That one week will show you where the rules are thin, where the voice is off, and where the system is guessing instead of reasoning.

Track three things:

  • Was the answer correct?
  • Was the tone on brand?
  • Did it need a human at all?

Use those notes to tighten the setup. Most teams find the same small mistakes over and over. Fix those first. Do not rewrite everything because of one bad reply.

A good review loop is simple: check yesterday's edge cases in the morning, add new examples from real chats at midday, and mark safe categories for later automation at the end of the day.

5. Turn on auto-send in narrow lanes

Only automate the easy cases first. Shipping status. Opening hours. Basic product questions. Order lookup if the data is clean. Keep the messy stuff in review-only until it is boring.

A good first live setup looks like this:

  • WhatsApp, Slack, or Gmail as the active lane
  • 20 to 30 short examples in the knowledge base
  • 5 to 10 escalation rules
  • One human owner per shift
  • A daily summary to the founder

That is enough to do real work. It is also enough to learn fast. The system gets better because the work is narrow, not because the model is magical.

6. Measure the right signs

Ignore vanity metrics. Watch the ones that tell you whether the system is helping.

Look at:

  • First response time
  • Percent of messages handled without a human
  • Number of edits a human made
  • Number of escalations
  • Repeated questions that should be added to the rules

If response time drops but edits rise, the rules are too loose. If edits drop but every other message escalates, the rules are too tight. The middle is what you want: fast on the easy stuff, careful on the hard stuff.

This is also where the category starts to feel broad enough to be an agi employee. The same memory, tone, and escalation rules carry across channels. WhatsApp, Slack, Gmail, and Google Workspace stop feeling like separate tools and start feeling like one operating surface.

What to do if it breaks

It will break somewhere. Plan for that.

If the system sounds confident but gets the answer wrong, tighten the knowledge base and push more cases back into review-only. If the handoff fails, assign one owner and one visible channel for exceptions. If the voice drifts, feed it better examples from real chats and remove the bland ones.

The goal is not perfect automation. The goal is a system that stays useful when volume spikes and still feels human when the hard cases show up.

If you want the strategic framing, start with What is an AI Employee?. If you want the buying lens, AI Employee vs. AI Assistant is the sharper read. And if you want to see the support pattern in practice, Perla for Customer Support is the closest match.

If this is the shape of your problem, hire Perla and read more on Capabilities.

Frequently asked questions

What should I train first?
Start with one job that repeats every day. Customer replies, inbox triage, meeting prep, and daily reporting are strong first bets because the rules are visible and the outcome is easy to check. Pick the one that costs the team the most time, not the one that sounds most impressive.
How much should I teach on day one?
Less than you think. A short voice guide, a few policy notes, and five to ten real examples are enough to begin. If you try to teach everything at once, the system will look smart in the demo and drift in real work.
When should I switch from review-only to auto-send?
Only after the same kind of reply has worked several times with no edits. If a category still needs human cleanup, keep it in review-only. Safe speed matters more than early confidence.
What tools usually show up in the first setup?
WhatsApp, Slack, Gmail, Google Workspace, Notion, and Zendesk are common starting points. The exact stack matters less than whether the job has a clear trigger, a clear output, and a clear human handoff.
Is this the same thing as an agi employee?
Not exactly, but the line can blur once memory and rules carry across channels. A setup starts to feel like an agi employee when the same instructions work in WhatsApp, Slack, and Gmail without fresh prompting each time.

Hire your first AI employee

Perla handles your Google Workspace, WhatsApp, Slack, email, and more — so you don't have to.

See what Perla does