Skip to content

Commit d934ec8

Browse files
authored
voice and chat testing (#280)
1 parent f6329b8 commit d934ec8

File tree

3 files changed

+92
-0
lines changed

3 files changed

+92
-0
lines changed

fern/docs.yml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -349,6 +349,10 @@ navigation:
349349
contents:
350350
- page: Test Suites
351351
path: test/test-suites.mdx
352+
- page: Chat Testing
353+
path: test/chat-testing.mdx
354+
- page: Voice Testing
355+
path: test/voice-testing.mdx
352356

353357
- section: Deploy
354358
collapsed: true

fern/test/chat-testing.mdx

Lines changed: 49 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,49 @@
1+
---
2+
title: Chat Testing
3+
subtitle: Automated text-based testing for AI agents
4+
slug: /test/chat-testing
5+
---
6+
7+
## Overview
8+
9+
Chat Test Suites allow you to evaluate your AI agents through simulated text conversations. This is our recommended solution for testing as it is much faster than voice testing and lets you isolate testing the behavior of your agent.
10+
11+
## How Chat Testing Works
12+
13+
1. **Simulation:** Our AI tester engages with your agent in a text-based conversation.
14+
2. **Scripted Interaction:** The testing agent follows your predefined script to simulate specific customer scenarios.
15+
3. **Transcript Capture:** The conversation is captured as a transcript.
16+
4. **Evaluation:** A language model (LLM) assesses the transcript against your success criteria.
17+
18+
## Designing your tests
19+
20+
Good test design is critical to evaluating your agent. You'll want to consider testing:
21+
22+
1. The tool calls of your agent. Set your script to schedule an appointment or call a transfer tool. At the evaluation step, your rubric will have context of the tool call history to evaluate success.
23+
2. Knowledge base integrations. Test different Q&A to make sure that your agent responds as expected.
24+
3. Legal / compliance issues. Ask the agent to answer things it's not supposed to, and verify that it refuses to answer.
25+
4. Personality. Simulate an angry, frustrated or manipulative customer, and make sure your assistant handles the situation well.
26+
27+
## Benefits of Chat Testing
28+
29+
- **Speed:** Chat tests execute faster than voice tests, allowing for rapid iteration.
30+
- **Cost-Effective:** No TTS or STT models are used during chat testing.
31+
- **Focused Assessment:** Evaluate pure conversational ability without audio-related variables.
32+
- **Higher Test Volume:** Run more tests in less time to ensure comprehensive coverage.
33+
34+
## Creating Chat Tests
35+
36+
You can create chat tests as part of a Test Suite:
37+
38+
1. Navigate to the **Test** tab and select **Test Suites**.
39+
2. Create a new Test Suite or edit an existing one.
40+
3. When adding tests, select **Chat** as the test type.
41+
4. Define your script and success criteria as detailed in the [Test Suites](./test-suites) documentation.
42+
43+
## Best Practices for Chat Testing
44+
45+
- Use chat tests for rapid iteration during development.
46+
- Create variations of the same scenario to test different user inputs.
47+
- Test edge cases and potential misunderstandings.
48+
49+
For comprehensive instructions on creating and managing test suites that include chat tests, refer to the [Test Suites](./test-suites) documentation.

fern/test/voice-testing.mdx

Lines changed: 39 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,39 @@
1+
---
2+
title: Voice Testing
3+
subtitle: Automated voice call testing for AI voice agents
4+
slug: /test/voice-testing
5+
---
6+
7+
## Overview
8+
9+
Voice Test Suites enable you to test your AI voice agents through simulated phone conversations. Our platform connects two AI agents - your voice agent and our testing agent - on a real phone call, following your predefined scripts to evaluate performance under various scenarios.
10+
11+
## How Voice Testing Works
12+
13+
1. **Simulation:** Our AI tester calls your voice agent and follows a script that simulates real customer behavior.
14+
2. **Conversation:** Both AIs engage in a natural voice conversation, with the tester following your script guidelines.
15+
3. **Recording:** The entire call is recorded and transcribed for evaluation.
16+
4. **Assessment:** After the call, the transcript is evaluated against your rubric by a language model (LLM).
17+
18+
## Benefits of Voice Testing
19+
20+
- **Natural Interaction:** Test your voice agent in the most realistic scenario - actual phone calls.
21+
- **Audio Quality Assessment:** Evaluate not just responses but also voice clarity, tone, and cadence.
22+
- **End-to-End Verification:** Confirm that your entire voice pipeline works correctly from telephony to response.
23+
24+
## Creating Voice Tests
25+
26+
You can create voice tests as part of a Test Suite:
27+
28+
1. Navigate to the **Test** tab and select **Test Suites**.
29+
2. Create a new Test Suite or edit an existing one.
30+
3. When adding tests, select **Voice** as the test type.
31+
4. Define your script and success criteria as detailed in the [Test Suites](./test-suites) documentation.
32+
33+
## Voice Test Limitations
34+
35+
- Voice tests require more time to execute compared to chat tests.
36+
- Each test consumes calling minutes from your account.
37+
- Maximum call duration is limited to 15 minutes per test.
38+
39+
For detailed instructions on creating and managing test suites that include voice tests, see the [Test Suites](./test-suites) documentation.

0 commit comments

Comments
 (0)