
Brazilian Sign Language Recognition Project

This is a collaborative data science project under the Omdena São Paulo Chapter focused on Brazilian Sign Language (Libras) recognition. The project aims to develop machine learning models that classify sign language videos into their corresponding Portuguese words.

Project Overview

See STRUCTURE.md for detailed project organization.

Setup Instructions

Prerequisites

  1. Python 3.11 or higher
  2. uv package manager (installation via pip is recommended: pip install uv)

Environment Setup

  1. Clone the repository:

    git clone https://github.com/OmdenaAI/SaoPauloBrazilChapter_BrazilianSignLanguage.git
    cd SaoPauloBrazilChapter_BrazilianSignLanguage
  2. Install core dependencies:

    uv sync

    To install an optional dependency group as well:

    uv sync --extra <group>  # Example: uv sync --extra data

    See pyproject.toml for the available dependency groups (data, model, app). A quick import check, sketched after this list, can confirm that a group was installed correctly.

  3. Using the environment:

    # Create the virtual environment (first time only), then activate it
    uv venv
    source .venv/bin/activate   # On Windows: .venv\Scripts\activate
    
    # Run your code
    python your_script.py
    jupyter notebook

    Or run commands directly without activation:

    uv run python your_script.py
    uv run jupyter notebook
  4. Adding new dependencies:

    uv add <package>           # Add to core dependencies
    uv add --optional data <pkg>  # Add to the optional "data" group (data processing tools)
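
A quick way to confirm that an optional group installed correctly is to try importing a few of the packages it provides. This is only a sketch: the package names below (cv2, mediapipe, pandas) are assumptions about what the data group might contain, so adjust them to match pyproject.toml.

    # check_extras.py - the package names here are assumptions; see pyproject.toml
    import importlib

    for name in ("cv2", "mediapipe", "pandas"):
        try:
            importlib.import_module(name)
            print(f"{name}: OK")
        except ImportError:
            print(f"{name}: missing (run: uv sync --extra data)")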

Project Structure

SaoPauloBrazilChapter_BrazilianSignLanguage/
├── data/                  # Data files
│   ├── raw/              # Original data
│   │   ├── INES/        # INES dataset
│   │   │   └── videos/  # Video files (stored on Google Drive)
│   │   ├── SignBank/    # SignBank dataset
│   │   │   └── videos/  # Video files (stored on Google Drive)
│   │   ├── UFV/         # UFV dataset
│   │   │   └── videos/  # Video files (stored on Google Drive)
│   │   └── V-Librasil/  # V-Librasil dataset
│   │       └── videos/  # Video files (stored on Google Drive)
│   ├── interim/          # Intermediate processing
│   │   ├── Analysis/    # Analysis info of the data
│   │   ├── Debug/       # Debug files for inspecting preprocessing steps
│   │   ├── RawCleanVideos/ # The raw video files after some initial cleaning before the preprocessing pipeline
│   │   ├── RawMotionMeasurements/ # Raw motion measurements
│   │   └── RawPoseLandmarks/ # Raw pose landmarks
│   ├── preprocessed/        # Final datasets
│   │   ├── metadata_v*.csv # Metadata for each preprocessing version
│   │   ├── videos/      # Preprocessed videos
│   │   │   └── v*/      # Version-specific processed videos (e.g., v1, v2, v3...)
│   │   └── landmarks/   # Processed landmark data
│   │       └── v*/      # Version-specific processed landmarks (e.g., v1, v2, v3...)
├── code/                 # Source code
│   ├── data/            # Data processing
│   ├── models/          # Model implementations
├── modelling/            # Model training and evaluation
│   ├── logs/            # Logs
│   ├── model_files/     # Model files
└── tests/               # Unit tests

See STRUCTURE.md for complete structure details.

Data Management

Video Files

  • Large video files are stored on Google Drive
  • Video directories in the repository structure are placeholders
  • Download videos to your local videos/ directories as needed
  • The preprocessing pipeline creates versioned outputs (v1, v2, v3, ...) of processed videos and landmarks
  • Each version has dataset-level metadata as well as per-file metadata (see the sketch after this list)
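
A minimal sketch for checking which preprocessing versions exist locally and how many videos each contains. It assumes the repository root is the working directory and that videos are stored as .mp4 files; adjust the path and extension to match your download.

    # list_versions.py - layout follows the structure above; .mp4 is an assumption
    from pathlib import Path

    preprocessed_videos = Path("data/preprocessed/videos")

    for version_dir in sorted(preprocessed_videos.glob("v*")):
        n_videos = len(list(version_dir.glob("*.mp4")))
        print(f"{version_dir.name}: {n_videos} videos downloaded")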

Data Files

  • Small files like CSV files, labels, and metadata are tracked in Git
  • Store processed data (features, embeddings) in preprocessed/
  • Document data formats in their respective directories
  • Metadata files track the preprocessing steps and configurations (a loading example follows this list)
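
The metadata for a preprocessing version can be inspected with pandas. The column names are not documented here, so this sketch only prints the shape and column list; the file name follows the metadata_v*.csv convention shown in the project structure.

    # inspect_metadata.py - file naming follows the metadata_v*.csv convention above
    from pathlib import Path

    import pandas as pd

    version = "v1"  # choose the preprocessing version you are working with
    metadata_path = Path("data/preprocessed") / f"metadata_{version}.csv"

    df = pd.read_csv(metadata_path)
    print(df.shape)
    print(df.columns.tolist())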
