Improving Pause Detection in Voice AI Interviews

This project is a small browser-based prototype that shows how pause detection in voice interviews can be improved.

In many voice AI interviews, the system interrupts users too early because it treats short silences as the end of an answer. This prototype tries to fix that by understanding how humans actually pause while thinking.

What problem does it solve?

Prevents AI from cutting off candidates mid-answer
Distinguishes thinking pauses from completed answers
Makes voice interviews feel more natural and less stressful

How does it work (simple explanation)?

The browser microphone captures audio
Audio energy is calculated in real time
Initial silence is used to estimate background noise
If the user was recently speaking, the system allows a longer pause
A state machine decides when it is safe for the AI to respond
A debug panel shows what the system is thinking

No speech-to-text is used — responses are simulated to keep the focus on pause handling.

Folder Structure

voice-interview-pause-handler/
├── audio/          # microphone + VAD logic
├── conversation/   # pause handling & state machine
├── ui/             # debug visualization
├── config/         # thresholds and constants
├── demo/           # before/after samples
├── index.html
├── app.js
└── docs/summary.pdf

How to run the project

Open the project in a local server (VS Code Live Server or similar)
Open index.html in the browser
Click Start Interview
Allow microphone access
Speak naturally and observe the debug panel

Demo scenarios to try

Pause for 1–2 seconds while thinking mid-answer
Speak continuously for a long answer
Stop speaking completely and wait

You will notice fewer interruptions compared to a fixed silence timeout.

Why this approach?

This prototype focuses on impact and clarity rather than heavy AI models. It shows how small, well-thought-out changes in pause handling can significantly improve user experience in voice-based interviews.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Improving Pause Detection in Voice AI Interviews

What problem does it solve?

How does it work (simple explanation)?

Folder Structure

How to run the project

Demo scenarios to try

Why this approach?

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
audio		audio
config		config
conversation		conversation
demo		demo
ui		ui
LICENSE		LICENSE
README.md		README.md
app.js		app.js
index.html		index.html
styles.css		styles.css
summary.pdf		summary.pdf

Folders and files

Latest commit

History

Repository files navigation

Improving Pause Detection in Voice AI Interviews

What problem does it solve?

How does it work (simple explanation)?

Folder Structure

How to run the project

Demo scenarios to try

Why this approach?

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages