Feat/nllb support by hspedro · Pull Request #12 · hspedro/babeltron

hspedro · 2025-03-13T16:57:06Z

NLLB Model Support and Architecture Improvements

Overview

This PR introduces support for Facebook's No Language Left Behind (NLLB) translation model alongside the existing M2M100 model. The changes include comprehensive architecture improvements, enhanced documentation, and better model management capabilities.

Key Features

1. NLLB Model Integration

Added support for NLLB-200 model with multiple size options:
- Small (600M parameters, ~1.2GB)
- Large (3.3B parameters, ~6.6GB)
Implemented automatic language code conversion between ISO and FLORES-200 formats
Added model-specific optimizations and configurations

2. Model Management Improvements

Enhanced model download script with support for multiple model types
Added size-specific model configurations
Improved model initialization and loading process
Added backward compatibility for existing model paths

3. API Enhancements

Added model type selection in translation requests
Enhanced response format with model information
Improved error handling for model-specific cases
Added validation for language code formats

4. Documentation Updates

Added comprehensive architecture documentation
Updated API documentation with NLLB-specific details
Added language code conversion guidelines
Enhanced deployment documentation

5. Testing Improvements

Added tests for NLLB model integration
Enhanced model download tests
Added language code conversion tests
Improved test coverage for model-specific features

Technical Details

Model Configuration

MODEL_CONFIGS = {
    "m2m100": {
        "sizes": ["418M", "1.2B", "12B"],
        "default_size": "418M",
        "model_class": "M2M100ForConditionalGeneration",
        "tokenizer_class": "M2M100Tokenizer",
    },
    "nllb": {
        "sizes": ["600M", "3.3B"],
        "default_size": "600M",
        "model_class": "AutoModelForSeq2SeqLM",
        "tokenizer_class": "AutoTokenizer",
    }
}

API Changes

Added optional model_type parameter to translation requests
Enhanced response format with model architecture information
Added support for FLORES-200 language codes

Environment Variables

Added BABELTRON_MODEL_TYPE for default model selection
Enhanced model path configuration options
Added model size configuration

Breaking Changes

None. The changes are backward compatible with existing M2M100 usage.

Migration Guide

Update environment variables if needed:

BABELTRON_MODEL_TYPE=nllb  # Optional, defaults to m2m100

Download NLLB models if desired:

make download-model-nllb-small
# or
make download-model-nllb-large

Update language codes when using NLLB:
- Use FLORES-200 format (e.g., "eng_Latn" instead of "en")
- ISO codes are automatically converted when possible

Testing

All tests passing
Added comprehensive test coverage for new features
Verified backward compatibility

Documentation

Updated architecture documentation with new diagrams
Enhanced API documentation
Added model-specific guidelines
Updated deployment instructions

Future Considerations

Additional model size options
Model fine-tuning capabilities
Enhanced language code handling
Performance optimizations

Related Issues

Closes #XXX (Add NLLB model support)
Addresses #YYY (Improve model management)
Resolves #ZZZ (Enhance documentation)

Checklist

Default is still 8000, but one can send as a parameter

The NLLB-200 model supports 200 languages, transformer-based and has better performance than M2M100 on FLORES benchmark. To use it, one can download the model and specify it using the BABELTRON_MODEL_TYPE=nllb either in docker or running locally directly.

hspedro added 6 commits March 13, 2025 11:57

chore: parameterize server port

ea55dcf

Default is still 8000, but one can send as a parameter

feat(scripts): support to download NLLB

46fbbe2

chore: bump to v0.4.0

ef6d0b5

docs: add architecture overview

528eb80

test: fix model fixtures

5bf5727

hspedro force-pushed the feat/nllb-support branch from 6fc7b13 to bedb3e8 Compare March 13, 2025 19:07

hspedro added 2 commits March 13, 2025 16:07

feat(auth): re-enable auth middleware

d7f5655

test: fix NLLB model fixtures

d679376

hspedro force-pushed the feat/nllb-support branch from bedb3e8 to d679376 Compare March 13, 2025 19:07

hspedro merged commit bca34e5 into main Mar 13, 2025
2 checks passed

hspedro deleted the feat/nllb-support branch March 13, 2025 19:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/nllb support#12

Feat/nllb support#12
hspedro merged 8 commits intomainfrom
feat/nllb-support

hspedro commented Mar 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

hspedro commented Mar 13, 2025

NLLB Model Support and Architecture Improvements

Overview

Key Features

1. NLLB Model Integration

2. Model Management Improvements

3. API Enhancements

4. Documentation Updates

5. Testing Improvements

Technical Details

Model Configuration

API Changes

Environment Variables

Breaking Changes

Migration Guide

Testing

Documentation

Future Considerations

Related Issues

Checklist

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant