modernize: dont write audio to tmp file by JarbasAl · Pull Request #45 · OpenVoiceOS/ovos-stt-http-server

JarbasAl · 2026-01-09T05:19:07Z

- Don't write audio to a tmp file.
- Drop the dependency on the speech_recognition package.

Summary by CodeRabbit

New Features
- STT endpoint accepts sample_rate and sample_width query parameters for flexible audio handling.
- Transcribe function now accepts optional sample_rate (default 16000) and sample_width (default 2).
Chores
- Server implementation updated to FastAPI.
- Dependency constraints for plugin manager updated.
- Deprecated Python version classifiers removed.
Deprecations
- Gradio interface initialization now emits a deprecation warning.
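
As an illustration of the deprecation notice mentioned above, the following is a minimal sketch of how an initialization function can emit a `DeprecationWarning`. The function name `init_gradio_interface` and the message text are hypothetical; the PR's actual function and wording are not shown in this summary.

```python
import warnings


def init_gradio_interface():
    """Hypothetical stand-in for the Gradio setup function (name assumed)."""
    warnings.warn(
        "The Gradio interface is deprecated and may be removed in a future release.",
        DeprecationWarning,
        stacklevel=2,  # attribute the warning to the caller, not this function
    )
    # ... a real implementation would go on to build and mount the Gradio app ...
    return None


# DeprecationWarning is ignored by default outside __main__, so capture it explicitly.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    init_gradio_interface()

deprecation_messages = [
    str(w.message) for w in caught if issubclass(w.category, DeprecationWarning)
]
```

Using `stacklevel=2` is a common choice because it points users at their own call site rather than at the library internals.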

coderabbitai · 2026-01-09T05:19:16Z

📥 Commits

Reviewing files that changed from the base of the PR and between d1adb73 and 2e933b5.

📒 Files selected for processing (2)

ovos_stt_http_server/__init__.py
ovos_stt_http_server/gradio_app.py

📝 Walkthrough

Refactors audio handling to construct AudioData directly from request payloads and replaces internal bytes2audiodata usage with AudioData imports; updates transcribe signature and Gradio deprecation warning; relaxes ovos-plugin-manager version constraint; and updates package metadata to reference FastAPI and remove old classifiers.
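
To make the "construct audio directly from the request payload" idea concrete, here is a rough sketch that wraps raw PCM bytes in memory instead of writing them to a temporary file. `RawAudio` and `audio_from_request_body` are illustrative stand-ins invented for this example; they are not the real `AudioData` class from `ovos_plugin_manager.utils.audio`, whose exact constructor is not shown here. The sketch also assumes mono audio.

```python
from dataclasses import dataclass


@dataclass
class RawAudio:
    """Illustrative stand-in, NOT the real ovos_plugin_manager AudioData class."""
    frame_data: bytes
    sample_rate: int = 16000   # default taken from the PR summary
    sample_width: int = 2      # bytes per sample (2 = 16-bit PCM)

    @property
    def duration_seconds(self) -> float:
        # Assumes mono audio: one sample per frame.
        return len(self.frame_data) / (self.sample_rate * self.sample_width)


def audio_from_request_body(body: bytes, sample_rate: int = 16000,
                            sample_width: int = 2) -> RawAudio:
    """Wrap the request payload in memory; nothing is written to disk."""
    if len(body) % sample_width != 0:
        raise ValueError("payload length is not a multiple of sample_width")
    return RawAudio(body, sample_rate, sample_width)


# One second of 16-bit mono silence at 16 kHz is 32000 bytes.
one_second = audio_from_request_body(b"\x00" * 32000)
```

The sample rate and width matter because raw PCM bytes carry no header; without them the same payload could be interpreted as a different duration or pitch.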

Changes

| Cohort / File(s) | Summary |
|---|---|
| Audio data handling & endpoints: `ovos_stt_http_server/__init__.py`, `ovos_stt_http_server/gradio_app.py` | Removed internal `bytes2audiodata`; use `AudioData` from `ovos_plugin_manager.utils.audio` to build audio from raw request bytes. `/stt` now reads the `sample_rate`/`sample_width` query parameters (as `sr`/`sw`) and constructs `AudioData`; `transcribe` signature extended with `sample_rate` and `sample_width`. Gradio binding emits a deprecation warning. |
| Dependency constraint: `requirements/requirements.txt` | Bumped `ovos-plugin-manager` constraint from `>=2.1.0,<2.2.0` to `>=2.1.1,<3.0.0`. |
| Package metadata: `setup.py` | Updated project description to reference FastAPI; removed obsolete Python version classifiers and the linguistic topic classifier. |

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |
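
For readers who want to reproduce a figure like this locally, the sketch below estimates docstring coverage with the standard `ast` module. How CodeRabbit actually computes its percentage is not stated here, so the counting rule (functions and classes only, module docstring ignored) is an assumption, and the `sample` source is made up for the example.

```python
import ast


def docstring_coverage(source: str) -> float:
    """Percentage of functions and classes in source that have a docstring."""
    tree = ast.parse(source)
    nodes = [
        node for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef))
    ]
    if not nodes:
        return 100.0  # nothing to document
    documented = sum(1 for node in nodes if ast.get_docstring(node) is not None)
    return 100.0 * documented / len(nodes)


# Made-up sample: one undocumented function, one documented function.
sample = '''
def transcribe(audio, sample_rate=16000, sample_width=2):
    return ""

def documented():
    """Has a docstring."""
'''
coverage = docstring_coverage(sample)
```

Running the function over each changed file and comparing the result against the 80% threshold gives a quick local approximation of the check.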

✅ Passed checks (2 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title directly matches the primary change: removing the bytes2audiodata function that wrote audio to temporary files and replacing it with direct AudioData construction. |

coderabbitai

Actionable comments posted: 5

🤖 Fix all issues with AI agents

In @ovos_stt_http_server/__init__.py:
- Around line 125-126: The query params `sample_rate` and `sample_width` are
being read as strings; convert them to integers before creating `AudioData` by
wrapping the `request.query_params.get("sample_rate", 16000)` and
`request.query_params.get("sample_width", 2)` calls with `int(...)` (or parse
and fall back to the defaults if parsing fails) so `sr` and `sw` are ints when
passed to `AudioData`.
- Around line 128-131: The code constructs an AudioData object named audio but
then calls model.detect_language with raw audio_bytes, losing sample rate/width
info; change calls to model.detect_language(audio) (and the second occurrence
later near the other call) so detect_language receives the AudioData instance
consistent with model.process_audio which expects AudioData.
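
The integer-conversion suggestion above could look roughly like the following. This is a sketch, not the PR's code: the helper name `int_param` is hypothetical, a plain dict stands in for `request.query_params`, and rejecting non-positive values is an extra assumption beyond what the review comment asks for.

```python
from typing import Mapping


def int_param(params: Mapping[str, str], name: str, default: int) -> int:
    """Return params[name] as a positive int, or default if missing or invalid."""
    raw = params.get(name)
    if raw is None:
        return default
    try:
        value = int(raw)
    except ValueError:
        return default  # fall back when the value does not parse
    # Assumption: zero or negative rates/widths are treated as invalid.
    return value if value > 0 else default


# A plain dict stands in for request.query_params here.
query = {"sample_rate": "44100", "sample_width": "two"}
sr = int_param(query, "sample_rate", 16000)
sw = int_param(query, "sample_width", 2)
```

Silently falling back keeps the endpoint permissive; a stricter service might instead answer with an HTTP 400 when a parameter is present but malformed.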

In @requirements/requirements.txt:
- Line 4: Update the pinned Gradio dependency in requirements.txt from
gradio~=3.28 to a patched release (e.g., gradio==5.39.0 or at minimum
gradio>=4.26.0) to remediate multiple CVEs; change the line with "gradio~=3.28"
to the chosen safe version, then regenerate any lockfiles or dependency pins
(pip-compile/Pipfile.lock/poetry.lock) and run tests/build to ensure
compatibility.
- Line 1: Update the package constraint for ovos-plugin-manager in
requirements.txt to require at least v2.1.1 and allow the full 2.x series by
using ">=2.1.1,<3.0.0"; also scan usages of the AudioData class (e.g., type
hints in process_audio methods and any instantiation like AudioData(audio_bytes,
sample_rate, sample_width)) to ensure the call signature and attributes match
the v2.1.1 API and adjust argument order or names if the AudioData constructor
or typing changed.
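
To see what the widened constraint admits compared with the previous `>=2.1.0,<2.2.0` range, here is a simplified sketch of a range check. It handles only plain `X.Y.Z` strings and is not a substitute for a PEP 440-aware resolver; the candidate versions are illustrative and do not claim to be actual releases.

```python
def parse_version(text: str) -> tuple:
    """Parse a plain 'X.Y.Z' release string; pre-releases are not handled."""
    return tuple(int(part) for part in text.split("."))


def in_range(version: str, lower: str, upper: str) -> bool:
    """True if lower <= version < upper, mirroring '>=lower,<upper'."""
    return parse_version(lower) <= parse_version(version) < parse_version(upper)


# Illustrative version strings, not a list of real releases.
candidates = ["2.1.0", "2.1.1", "2.2.0", "2.9.3", "3.0.0"]
old_ok = [v for v in candidates if in_range(v, "2.1.0", "2.2.0")]
new_ok = [v for v in candidates if in_range(v, "2.1.1", "3.0.0")]
```

The new range drops 2.1.0 and accepts the rest of the 2.x series while still excluding 3.0.0.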

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between ff0ff80 and 29c66b3.

📒 Files selected for processing (4)

ovos_stt_http_server/__init__.py
ovos_stt_http_server/gradio_app.py
requirements/requirements.txt
setup.py

🧰 Additional context used

🧬 Code graph analysis (1)

ovos_stt_http_server/gradio_app.py (1)

ovos_stt_http_server/__init__.py (3)

ModelContainer (30-54)

process_audio (53-54)

process_audio (94-96)

🪛 GitHub Actions: Run Unit Tests

setup.py

[error] 1-1: Command failed or potential issue detected: 'python build_test/setup.py bdist_wheel sdist'. CI emitted an error annotation: Are there relative paths in setup.py?

🪛 OSV Scanner (2.3.1)

requirements/requirements.txt