Open
Description
Enable users of Sunet Drive (Nextcloud-based) to initiate transcriptions via the built-in Whisper integration. This requires the backend to expose an API that mimics the Whisper API structure defined by OpenAI, as supported by e.g. Nextcloud Talk and Text.
Expected Behavior
- Backend exposes `/v1/audio/transcriptions` or an equivalent endpoint
- Accepts audio file + model spec + language
- Returns JSON containing the transcript, or the full transcript as plain text
- Adheres to Whisper-compatible request/response schema
- Respects user authentication and storage isolation (e.g. Nextcloud tokens)
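To make the expected request shape concrete, here is a minimal client-side sketch in Python (standard library only). It assumes the backend follows the OpenAI convention of a multipart form with `file`, `model`, `language` and `response_format` fields and a JSON response of the form `{"text": "..."}`. The base URL, the bearer-token header, the default model name `whisper-1` and both function names are illustrative assumptions, not part of this backend's confirmed interface.

```python
import json
import uuid
import urllib.request


def build_transcription_request(base_url, token, audio_bytes, filename,
                                model="whisper-1", language=None,
                                response_format="json"):
    """Build a multipart/form-data POST for a Whisper-style endpoint.

    Hypothetical helper: the bearer token stands in for whatever
    Nextcloud-issued credential the backend ends up accepting.
    """
    boundary = uuid.uuid4().hex
    fields = {"model": model, "response_format": response_format}
    if language:
        fields["language"] = language

    parts = []
    # Plain text form fields (model, response_format, optional language).
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode()
        )
    # The audio file itself, sent as an opaque binary part.
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f'Content-Type: application/octet-stream\r\n\r\n'.encode()
        + audio_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode())

    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/transcriptions",
        data=b"".join(parts),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",  # assumed auth scheme
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def extract_transcript(raw_body, response_format="json"):
    """Return the transcript from a JSON ({"text": ...}) or plain-text reply."""
    if response_format == "json":
        return json.loads(raw_body)["text"]
    return raw_body.decode("utf-8")
```

A caller would send the request with `urllib.request.urlopen(req)` and pass the response body to `extract_transcript`. Keeping request construction separate from the network call makes the schema easy to check without a running backend.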
Notes
- See: Whisper API schema
- Allows direct transcription from inside Nextcloud (e.g. with Nextcloud Text or OCR plugins)
- Enables single sign-on, so Drive users can transcribe without a separate login flow
Priority
Medium: strongly enhances usability for Drive users and aligns with ecosystem standards. Not needed for the MVP, but should be prioritised after the initial launch.