DocProcAI-Service

This service is designed to process and manage uploaded lecture material (video recordings, documents, slides) to facilitate some advanced features in the MEITREX platform.

Features

Splitting of lecture videos into sections based on detected slide changes via computer vision
OCR of lecture video on screen text
Transcript & Closed Captions generation for lecture videos
Generating of text embeddings on a per-section-basis for videos and per-page-basis for documents
Semantic search/fetching of semantically similar sections of lecture material
Automatic generation of section titles for the video sections generated

For a deeper dive into the features and considerations made during development, check out our paper on DocProcAI.

Configuration

The service uses the config.yaml file located in the root directory for configuration. For further information about configuration check out this file, all configuration properties are explained using in-file comments.

Resource Requirements, Additional Information & Design Rationale

For additional information on the design and implementation of this service, check out the accompanying paper.

Training Repository

Scripts used for training live in the training repository.

Name		Name	Last commit message	Last commit date
Latest commit History 287 Commits
.github/workflows		.github/workflows
.vs		.vs
client		client
components		components
config		config
controller		controller
dto		dto
events		events
fileextractlib		fileextractlib
persistence		persistence
pg-init-scripts		pg-init-scripts
schema		schema
service		service
utils		utils
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
CODEOWNERS		CODEOWNERS
Dockerfile		Dockerfile
README.md		README.md
__init__.py		__init__.py
app.py		app.py
config.yaml		config.yaml
docker-compose.yml		docker-compose.yml
docker-compose.yml.backup		docker-compose.yml.backup
paper.pdf		paper.pdf
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DocProcAI-Service

Features

Configuration

Resource Requirements, Additional Information & Design Rationale

Training Repository

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DocProcAI-Service

Features

Configuration

Resource Requirements, Additional Information & Design Rationale

Training Repository

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages