You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+28-5Lines changed: 28 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -25,20 +25,24 @@ This starter app is compatible with any [custom web/mobile frontend](https://doc
25
25
26
26
This project includes built-in support for:
27
27
28
-
-**Dual-channel audio recording** via LiveKit Egress (agent on one channel, user on the other)
28
+
-**Audio recording** via LiveKit Egress (all participants mixed, or dual-channel with agent on one channel and user on the other)
29
29
-**Real-time transcript capture** from STT output, saved as JSON
30
30
31
+
> **Note:** Audio recording via Egress only works in `dev` or `start` mode (connected to LiveKit Cloud). The `console` mode uses a mock room for local testing and cannot record audio. Transcripts are saved in all modes.
32
+
31
33
### S3 Output Structure
32
34
33
-
Recordings and transcripts are saved to S3:
35
+
Recordings and transcripts are saved to S3 with matching session IDs for easy correlation:
The `{session_id}` is a timestamp (`YYYYMMDD-HHMMSS`) generated when the session starts, making it easy to match audio recordings with their corresponding transcripts.
45
+
42
46
### AWS Configuration
43
47
44
48
Add these environment variables to your `.env.local`:
To enable dual-channel recording (agent audio on left channel, user audio on right channel), edit `src/egress_manager.py` and add the `audio_mixing` parameter:
66
+
67
+
```python
68
+
info =awaitself.livekit_api.egress.start_room_composite_egress(
69
+
egress_proto.RoomCompositeEgressRequest(
70
+
room_name=room_name,
71
+
audio_only=True,
72
+
audio_mixing=egress_proto.AudioMixing.DUAL_CHANNEL_AGENT, # Add this line
73
+
file_outputs=[file_output],
74
+
)
75
+
)
76
+
```
77
+
59
78
## Coding agents and MCP
60
79
61
80
This project is designed to work with coding agents like [Cursor](https://www.cursor.com/) and [Claude Code](https://www.anthropic.com/claude-code).
@@ -123,12 +142,16 @@ Next, run this command to speak to your agent directly in your terminal:
123
142
uv run python src/agent.py console
124
143
```
125
144
145
+
> **Note:** Console mode is for local testing only. Audio recording is disabled (transcripts still work).
146
+
126
147
To run the agent for use with a frontend or telephony, use the `dev` command:
127
148
128
149
```console
129
150
uv run python src/agent.py dev
130
151
```
131
152
153
+
> This mode connects to LiveKit Cloud and enables full audio recording to S3.
0 commit comments