Question 1

How do I run my first voice agent evaluation?

Accepted Answer

Three steps: define your scenarios, configure caller profiles, and select metrics. Evalgent runs the full scenario × profile × runs matrix automatically. The result is a scenario success rate per cell plus evidence for every failure — a baseline you can iterate against from the first evaluation onwards.

Question 2

What is a voice agent evaluation campaign?

Accepted Answer

A voice agent evaluation campaign is a structured run of scenarios against caller profiles, scored on defined metrics. Each campaign produces a scenario success rate, per-cell breakdowns, and evidence for failures. Campaigns gate releases — pass the threshold and ship, fail and iterate. This is what makes AI voice agent QA a release discipline.

Question 3

Can I compare two voice agent versions?

Accepted Answer

Use the same evaluation configuration across versions — same scenarios, same caller profiles, same metrics — and run each version through it. Evalgent produces per-scenario deltas showing where reliability improved or regressed. Common comparison flows: prompt change A vs. B, GPT-4o vs. Claude, ElevenLabs vs. Deepgram.

Question 4

What's the difference between an evaluation and a regression test?

Accepted Answer

A voice agent evaluation runs the full scenario × profile matrix to measure reliability. A regression test re-runs the same matrix after a change to confirm nothing got worse. In Evalgent, both use the same scenarios and metrics — what changes is what triggers them. Evaluations baseline. Regressions catch drift.

Run structured evaluations on your voice agent

Support agent v2.3 — regression test

What is a voice agent evaluation?

How to set-up a campaign for voice agent evaluation?

Define your test matrix

Set success criteria

Review & launch

See exactly where your agent stands

Results matrix

Evidences

Recommendations

The difference structured evaluations make

Manual testing today

Structured & automated

Frequently asked questions

Explore more

Reviews

Scenarios

Profiles

Know if your voice agent is ready for production