Feature Request: Native Nextcloud integration using Whisper-compatible API #10

@erklu

Description

Enable users of Sunet Drive (Nextcloud-based) to start transcriptions through Nextcloud's built-in Whisper integration. This requires the backend to expose an API that mirrors the structure of OpenAI's Whisper API, as supported by e.g. Nextcloud Talk and Text.


Expected Behavior

  • Backend exposes /v1/audio/transcriptions or an equivalent endpoint
  • Accepts an audio file plus a model specification and a language
  • Returns JSON containing the transcript, or the full transcript as plain text
  • Adheres to Whisper-compatible request/response schema
  • Respects user authentication and storage isolation (e.g. Nextcloud tokens)
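
To make the expected request/response shape concrete, here is a minimal client-side sketch in Python of what a Whisper-compatible call could look like. The form fields (`file`, `model`, `language`, `response_format`), the Bearer header and the `{"text": ...}` response follow OpenAI's public transcription endpoint. The base URL, the token, and the helper names `build_transcription_request` and `extract_text` are hypothetical, and this backend's actual authentication scheme may differ.

```python
import io
import json
import urllib.request
import uuid


def build_transcription_request(base_url, token, filename, audio_bytes,
                                model="whisper-1", language=None,
                                response_format="json"):
    """Build (but do not send) a multipart POST to /v1/audio/transcriptions.

    Assumption: the backend accepts the same form fields as OpenAI's
    transcription endpoint and a Bearer token for authentication.
    """
    boundary = uuid.uuid4().hex
    fields = {"model": model, "response_format": response_format}
    if language:
        fields["language"] = language

    body = io.BytesIO()
    # Plain text form fields.
    for name, value in fields.items():
        body.write(f"--{boundary}\r\n".encode())
        body.write(
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'.encode()
        )
        body.write(f"{value}\r\n".encode())
    # The audio file part.
    body.write(f"--{boundary}\r\n".encode())
    body.write(
        f'Content-Disposition: form-data; name="file"; '
        f'filename="{filename}"\r\n'.encode()
    )
    body.write(b"Content-Type: application/octet-stream\r\n\r\n")
    body.write(audio_bytes)
    body.write(f"\r\n--{boundary}--\r\n".encode())

    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/transcriptions",
        data=body.getvalue(),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def extract_text(raw_json):
    """Return the transcript from a response of the form {"text": "..."}."""
    return json.loads(raw_json)["text"]
```

Passing the returned request to `urllib.request.urlopen` and feeding the response body to `extract_text` would complete the round trip. The request is only built here, not sent, so the sketch can be checked without a running backend.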

Notes

  • See: Whisper API schema
  • Allows direct transcription from inside Nextcloud (e.g. with Nextcloud Text or OCR plugins)
  • Enables single sign-on/flowless use from Drive

Priority

Medium: strongly enhances usability for Drive users and aligns with ecosystem standards. Not needed for the MVP, but should be prioritised after the initial launch.
