Help Center
AI & knowledgeAdmin only12 min read

Teach the AI with feedback

Your AI agent only gets sharper when you tell it where it went wrong — and Exabloom turns that into a real loop. Flag a reply in the Inbox, triage it in the AI Feedback queue, then bundle the lessons into a rewrite of the agent’s own instructions. No fine-tuning, no data science. Here’s the whole thing, end to end.
Two surfaces, one loop
Flagging lives in the Inbox and is open to anyone working conversations. Reviewing and training live in the admin AI Feedback area. Together they form one teaching loop.
Start here

The 60-second version

  • Flag an AI reply in the chat. For a thumbs-down, describe what went wrong — AI assist drafts a clean correction you confirm.
  • Triage it in the AI Feedback queue with the full conversation and model context: edit, reject, or set it aside.
  • Train the agent on a batch of flagged items — the AI rewrites its system prompt, you review the diff and accept.
  • Lock wins in as test cases so future changes never undo what already works.
Tip
If you only do one thing: flag generously. The queue can’t teach the agent anything you never marked.
The mental model

How the loop actually works

Three stages, two surfaces. Frontline staff capture signal in the Inbox. An admin triages it in one place. And when enough has built up, training turns that pile of corrections into a sharper set of instructions for the agent.

Capture
Inbox · anyone
Flag a reply
Triage
AI Feedback · admin
Sort & decide
Train
Refinement run
Rewrite the prompt
The teaching loop: capture in the Inbox, triage in the AI Feedback queue, train the agent from a batch.

The crucial thing to understand up front: training doesn’t fine-tune a model. It rewrites the agent’s system prompt — the plain-English instructions that govern how it behaves. That’s why it’s fast, transparent, and completely reversible: you’re always reviewing a diff of text before anything changes.

Step 1

Capture in the Inbox

On any message sent by an AI agent, a verdict control lets anyone working the Inbox react: Mark as good when it nailed the answer, or Flag when it didn’t.

Sales Triage · AI agent

Our P1 Maths trial is on Saturday at 10am — shall I book you in?

4:12 PM Mark as good Flag
The verdict control sits under an AI agent’s reply. A thumbs-up saves a positive example; a thumbs-down opens the guided flag drawer.

A thumbs-up is one tap — optionally jot why it’s a good example. A thumbs-down opens a guided drawer. You don’t fill in a rigid form — you just describe what went wrong in plain language. AI assist reads it (and may ask a single follow-up if it’s unsure), then drafts a structured correction:

You write
We don’t run a Saturday P1 class — trials are Tuesday 5pm. It invented a slot.
Plain language. AI may ask one follow-up.
AI assist drafted
Issue
Hallucination
What it should have said
P1 Maths trials run Tuesdays at 5pm — would that suit you?
Edit anything, then submit.
You describe the problem; AI assist shapes it into an issue category and a concrete “what it should have said”. Edit anything before you submit.

The category is one of Hallucination, Wrong tone, Missed context, or Other. The most valuable field is what it should have said — that’s the line the rewrite leans on hardest, so make it concrete.

Good to know
Each person gets one feedback row per message. You can reopen and edit yours anytime; teammates’ feedback on the same reply shows in an “existing feedback” panel so you’re not flagging blind.
Step 2

Triage the queue

Everything captured collects in the admin AI Feedback queue, grouped into three tabs by status: Pending, Trained, and Rejected. New items land in Pending. You can filter by category and search the corrections and rationales.

AI Feedback
PendingTrainedRejected
3 ready-to-train items for Sales Triage
Train Sales Triage on 3 items
HallucinationInvented a Saturday trial slot
Wei Ling · flagged by Ravi · 2h ago
Wrong toneToo pushy on the follow-up
Marcus Tan · flagged by Ravi · 5h ago
Handled the refund question perfectly
Priya · marked by Sara · yesterday
The AI Feedback queue. Pending items are grouped per agent with a “Train” button; each row shows verdict, category, the correction, and who flagged it.

Open any item to judge it on its merits. The detail view has three tabs:

  • Feedback — the flagged message, the issue category, the suggested correction, and the reviewer’s rationale.
  • Conversation context — the messages on either side of the flagged turn, so you can see what the customer actually asked.
  • LLM call — the model used, token usage, and the exact prompt and raw response behind the reply. (Shown when it was captured.)

From the Feedback tab, you have four moves on a single item:

Edit
Sharpen the category, the “what it should have said”, or the rationale before it’s used — a precise correction teaches far more than a vague one.
Reject
Set it aside with a reason — Duplicate, Not actionable, Not the AI’s fault, Off-topic, or Other — plus an optional note. It moves to the Rejected tab, kept on record but out of training.
Promote to test case
Turn this exact situation into a regression test on the agent, so future changes are automatically checked against it. Works for good replies too.
Move back to Pending
Reopen a rejected item if you change your mind — it returns to the queue and its rejection reason is cleared.
Good to know
Triage is about getting the Pending tab to reflect only what you actually want to teach. Reject the noise, edit the keepers — then training works on a clean batch.
Step 3

Train the agent

This is where feedback becomes change. On the Pending tab, items are grouped by agent with a Train [agent] on N items button. Click it and the AI reads the agent’s current system prompt plus every flagged correction in the batch, then proposes a single rewritten prompt that addresses them all — surgical edits over wholesale rewrites.

You don’t accept blind. You get a side-by-side diff of the current prompt versus the refined one, with the changes highlighted:

Refine Sales Triage system prompt
Bundling 3 feedback items
+4 −1
Current
You are a tuition centre assistant.
Be friendly and helpful.
Offer trial slots when asked.
Refined draft
You are a tuition centre assistant.
Be friendly and helpful.
Only offer trial slots from the booking
tool — never invent days or times.
Ask once; don’t push if they decline.
Create regression tests· 2 of 3
RevertAccept & update draft
The refinement diff. Removed lines in red, added instructions in green — and a checklist of which flagged items to also lock in as regression tests.

Read it, tick which items are worth keeping as regression tests, and hit Accept & update draft. Three things then happen:

  • The refined prompt is written to the agent’s draft — not the live agent. You publish it from the agent itself once you’ve had a final look.
  • Every item in the batch moves from Pending to Trained, tagged with the refinement run it belonged to.
  • The items you ticked become test cases on the agent (see Step 4).
Watch out
If the agent already has unsaved draft edits, accepting will ask whether to Overwrite draft — it shows you the start of the existing draft first so you don’t clobber work in progress by accident.
Step 4

Lock wins in as test cases

A great answer is fragile — the next prompt edit could quietly undo it. Promoting a piece of feedback to a test case freezes that situation into a check the agent must keep passing.

The test uses the conversation up to the flagged turn as the input and your “what it should have said” as the expected response. From then on, an AI judge checks the agent against it whenever you change the prompt — so a fix for one problem can’t silently break another.

Tip
Promote works on good replies too — that’s the cleanest way to bank the behaviours you never want to lose. Find your test cases on the agent under its Tests tab.
Recipes

Setups to copy

Stamp out a recurring mistake
  1. 1Each time the agent makes the same slip, Flag it and let AI assist draft the fix.
  2. 2When 3–4 have piled up for that agent, hit Train [agent].
  3. 3Read the diff, tick the clearest ones as tests, then Accept & update draft.
  4. 4Open the agent and publish the draft when you’re happy.
Protect what already works
  1. 1When the agent nails a tricky reply, Mark as good.
  2. 2Open it in the queue and Promote to test case.
  3. 3Now every future refinement is checked against it — wins don’t quietly regress.
Clear the queue honestly
  1. 1Flagged but actually fine? Reject as Not the AI’s fault with a one-line note.
  2. 2Same issue twice? Reject the second as Duplicate.
  3. 3Keep Pending to only what you genuinely intend to train on.

Good to know & pitfalls

  • “Train” rewrites the prompt — it doesn’t fine-tune a model. There’s no data-science step and no model weights change. The AI rereads your agent’s instructions and proposes a sharper version. That’s why it’s fast, reversible, and yours to edit.
  • Training updates a draft, not the live agent. Accepting writes the refined prompt to the agent’s draft. Go to the agent to review and publish it. If the agent already has unsaved draft edits, you’ll be asked to Overwrite draft or cancel.
  • Only flagged (bad) items train. A refinement run bundles Pending items with a thumbs-down verdict, for a single agent. Good replies don’t feed the rewrite — their job is to become regression tests.
  • A correction teaches more than a flag. The “what it should have said” line is what the AI leans on most. The more concrete it is, the better the rewrite — and the more likely it’s worth a test.
  • Every item carries its context. Open one and you get three tabs: Feedback, the full Conversation context around the flagged turn, and the LLM call — the model, token usage, and the exact prompt the agent saw. Judge it fairly before you decide.
  • Reject keeps a record; Delete doesn’t. Prefer Reject — it stays searchable with its reason. Delete is permanent and only offered while an item is untouched (never trained or promoted).
  • Only AI-agent replies can be flagged. Messages sent by an AI agent in the Inbox carry the verdict control. AI compute nodes inside a workflow never send a message of their own, so there’s nothing to flag there.
Who can do what
Anyone working the Inbox can flag replies. Reviewing the queue and training agents live in the admin area — keep that in mind when deciding who owns the loop.

Need a hand?

Our Singapore-based team is one message away — happy to help you get set up.