Group Meeting Transcription Platform

This application is developed as part of my research into improved supervision of group projects. You can read more about my research by following my ResearchGate profile (link at bottom of page).

This full-stack application helps manage groups working together on a project. Currently the application allows you to manage:

  • multiple groups
  • members within each group
  • group meetings with attendance records (including guests)
  • uploaded audio recordings of meetings
  • meeting transcriptions, either uploaded directly or generated from the audio using diarisation tools

The aim is that the quality of generated transcription files will improve as the AI learns the voices of the group members. All data is stored locally so that it remains private.

The next stage of the application development will be to add interaction with an LLM to:

  • query the generated transcripts
  • summarise the meetings for a supervisor

Additionally, the application aims to allow a group to link a code repository, from which development statistics will be gathered to complement the meeting transcription data.


Tech Stack

  • Backend: FastAPI, SQLAlchemy, Alembic
  • Database: PostgreSQL + pgAdmin
  • Frontend: Next.js (React + TypeScript)
  • Containerisation: Docker + Docker Compose
  • Environment Management: python-dotenv

Getting Started

1. Clone the repository

git clone https://github.com/alunkingusw/meeting_diarisation.git
cd meeting_diarisation

2. Create a .env.docker file in the project root (./)

# .env.docker
POSTGRES_USER=your_user
POSTGRES_PASSWORD=your_password
POSTGRES_DB=diarisation_db
POSTGRES_PORT=5432
POSTGRES_HOST=db

PGADMIN_DEFAULT_EMAIL=admin@example.com
PGADMIN_DEFAULT_PASSWORD=admin

HUGGING_FACE_TOKEN=<see step 3>
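As a sketch of how these variables come together (the exact code in backend/db.py may differ), the backend can assemble its SQLAlchemy connection URL like this:

```python
import os

def database_url() -> str:
    """Build a PostgreSQL connection URL from the env vars above.

    The variable names match the .env.docker file; how backend/db.py
    actually assembles the URL is an assumption.
    """
    user = os.getenv("POSTGRES_USER", "your_user")
    password = os.getenv("POSTGRES_PASSWORD", "your_password")
    host = os.getenv("POSTGRES_HOST", "db")
    port = os.getenv("POSTGRES_PORT", "5432")
    db = os.getenv("POSTGRES_DB", "diarisation_db")
    return f"postgresql://{user}:{password}@{host}:{port}/{db}"
```

Note that POSTGRES_HOST is `db` (the Compose service name), not `localhost`, because the backend connects to PostgreSQL over the Docker network.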

3. Generate your HuggingFace token

Accept the terms and conditions for these gated models:

  • https://huggingface.co/pyannote/segmentation
  • https://huggingface.co/pyannote/speaker-diarization
  • https://huggingface.co/pyannote/embedding

Then create an access token at https://huggingface.co/settings/tokens and set it as HUGGING_FACE_TOKEN in your .env.docker file.
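The token is what lets the diarisation pipeline download the gated pyannote models. A minimal sketch of how it is used (requires `pip install pyannote.audio`; the helper name is illustrative, and depending on your pyannote.audio version the keyword may be `token` rather than `use_auth_token`):

```python
def load_diarisation_pipeline(hf_token: str):
    """Load the gated pyannote speaker-diarisation pipeline.

    Sketch only: requires pyannote.audio installed and a Hugging Face
    token that has accepted the model terms above.
    """
    from pyannote.audio import Pipeline  # imported lazily; heavy dependency
    return Pipeline.from_pretrained(
        "pyannote/speaker-diarization", use_auth_token=hf_token
    )

# Usage (downloads models on first run):
#   pipeline = load_diarisation_pipeline(os.environ["HUGGING_FACE_TOKEN"])
#   diarization = pipeline("meeting.wav")
```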

4. Start the project with Docker Compose

docker-compose up --build

This will:

✔️ Build the FastAPI backend (api)

✔️ Run PostgreSQL (db)

✔️ Run pgAdmin (pgadmin)

✔️ Run frontend (frontend) if enabled

✔️ Automatically apply Alembic migrations on startup

API Usage

Once the server is running, access:

FastAPI root: http://localhost:8000

Docs: http://localhost:8000/docs

pgAdmin: http://localhost:5050

Next.js frontend: http://localhost:3000


Folder Structure

.
├── backend/            # FastAPI backend
│   ├── db.py
│   ├── models.py
│   ├── main.py
│   └── routes/
├── alembic/            # Alembic migrations
├── frontend/           # Next.js app
├── docker-compose.yml
├── Dockerfile
└── .env.docker         # <- the env file you need to create

Troubleshooting

If the database connection fails, confirm that your env file is loaded and Docker volumes are clean.

  • Use docker-compose down -v to reset volume data.

  • Use print(os.getenv(...)) to debug env variables in db.py.
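A minimal sketch of such a check (the helper name is hypothetical) that you could drop into db.py:

```python
import os

def report_missing_env(required=("POSTGRES_USER", "POSTGRES_PASSWORD",
                                 "POSTGRES_DB", "POSTGRES_HOST",
                                 "POSTGRES_PORT")):
    """Print each expected variable and return the names that are unset."""
    missing = []
    for name in required:
        value = os.getenv(name)
        print(f"{name} = {value if value is not None else '<missing>'}")
        if value is None:
            missing.append(name)
    return missing
```

If any variable prints `<missing>` inside the container, check that docker-compose.yml actually passes your env file to the `api` service.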

Building Custom PyTorch/TorchAudio Wheels for GPU

If you need to build PyTorch or TorchAudio with support for your specific GPU, you can create a custom wheel (.whl) file. This allows you to install PyTorch locally or in Docker without downloading prebuilt binaries.

Overview

  1. Determine your GPU architecture (compute capability)

    • Each NVIDIA GPU has a compute capability (e.g., sm_61 for the GTX 1070). Newer GPUs are supported out of the box by the latest Torch releases, but you may need to build a custom wheel if you have an older card (like me!)
    • You can find the list of compute capabilities in NVIDIA's official documentation (CUDA GPUs).
  2. Install prerequisites

    • You will need a working Python environment, compiler toolchain, and CUDA toolkit compatible with your GPU.
    • For Linux, this usually means installing cmake, ninja-build, gcc, and Python development headers.
    • For Windows, Visual Studio Build Tools are required along with the CUDA toolkit.
    • See PyTorch’s official build guide: https://github.com/pytorch/pytorch#from-source
  3. Clone the repositories

    • Clone PyTorch and TorchAudio (or any other libraries you need) from their official GitHub repositories with --recursive to include submodules.
  4. Set GPU architecture flags

    • Before building, set the TORCH_CUDA_ARCH_LIST environment variable to your card's compute capability (e.g. export TORCH_CUDA_ARCH_LIST="6.1" for sm_61) so that only the architectures you need are compiled.

  5. Build the wheel

    • Use Python’s standard build system to create a .whl file.
    • Once built, you can install it locally or use it in Docker.
  6. Install locally or in Docker

    • Install the wheel using pip install <wheel-file> on your local machine.
    • In Docker, you can copy the wheel into your image and install it during the build process to avoid downloading from the internet.
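On Linux, the steps above might look like this (a sketch, assuming the CUDA toolkit and compiler toolchain are already installed; the version in the wheel filename will vary, and you should adjust TORCH_CUDA_ARCH_LIST for your card):

```shell
# Target only your GPU's architecture to keep build time and wheel size down
export TORCH_CUDA_ARCH_LIST="6.1"   # e.g. GTX 1070 (sm_61)

git clone --recursive https://github.com/pytorch/pytorch
cd pytorch
pip install -r requirements.txt
python setup.py bdist_wheel          # wheel appears in dist/
pip install dist/torch-*.whl         # install locally, or COPY into your Docker image
```

TorchAudio is built the same way from its own repository after the matching torch wheel is installed.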

Notes

Building PyTorch from source can take a long time depending on your system. Make sure to include only the GPU architectures you need to reduce build time and wheel size.

License

MIT License — feel free to use, modify and distribute.

Contributors

Alun King - ResearchGate Profile
