
Mobile LLM integration #95

Open
RanaZay wants to merge 23 commits into EvolvingLMMs-Lab:main from RanaZay:mobile-llm-integration

Conversation


@RanaZay RanaZay commented Jan 15, 2026

No description provided.

RanaZay and others added 23 commits December 24, 2025 16:22
…sues

- Updated alignment.sh: Added conda activation, set CUDA paths to 12.1, configured for GPU 7 only
- Modified stage_1_alignment_llava_ov_4b.sh: Changed TP from 2 to 1, updated checkpoint path to tp1_pp1
- Fixed Makefile: Added check to skip recompilation if helpers_cpp .so files already exist
- Updated utils.py: Added pre-compilation check to avoid unnecessary builds
- Added print statements for debugging in dataloader_provider.py
- Added print statements in qwen2vl_task_encoder.py for tracing preprocessing
- Added debugging prints in llavaov_1_5_provider.py
- Added print statements in train.py, megatron_trainer.py, and other training files
- These changes were made while exploring and mapping out the codebase
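
The Makefile/utils.py pre-compilation check described above can be sketched as follows (a minimal illustration; the helpers_cpp artifact name and the build invocation are assumptions, not the PR's exact code):

```python
import glob
import os
import subprocess

def build_helpers_if_needed(src_dir: str) -> bool:
    """Compile the C++ helpers only when no .so artifacts exist yet.

    Returns True if a build was triggered, False if it was skipped.
    """
    # Skip recompilation when shared objects are already present.
    if glob.glob(os.path.join(src_dir, "*.so")):
        return False
    # Hypothetical build invocation; the real Makefile target may differ.
    subprocess.run(["make", "-C", src_dir], check=True)
    return True
```
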
- Add FastViT model implementation (mobileclip_l_384) in aiak_training_llm/models/fastvit/
- Update LlavaOnevision1_5 model to use FastViT encoder
- Add FastViT preprocessing in qwen2vl_task_encoder.py
- Add --use-fastvit and related command-line arguments
- Add checkpoint conversion scripts for FastVLM
- Update training configs for 2-GPU setup (TP=2)
- Add .gitignore entries for checkpoints and training outputs
- Add debug prints in FastViT forward pass (mci.py, mobileclip_encoder.py, fastvit_vision_model.py)
- Update GPU configuration to use 2 GPUs (TP=2) in alignment.sh
- Add comprehensive code documentation and comments
- Update training configuration for FastViT image processing
- Add FastViT preprocessing path in qwen2vl_task_encoder.py
- Update megatron_core SOURCES.txt
- Update stage 1 alignment script configuration
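
The FastViT preprocessing path added to qwen2vl_task_encoder.py roughly amounts to resizing and normalizing images to the encoder's fixed input size; a minimal sketch (the 384x384 size matches mobileclip_l_384, but the resampling method and normalization constants here are illustrative, not the encoder's exact ones):

```python
import numpy as np

def preprocess_for_fastvit(image: np.ndarray, size: int = 384) -> np.ndarray:
    """Resize (nearest-neighbor) an HWC uint8 image and scale to CHW float32
    in [0, 1], matching a 384x384 FastViT/MobileCLIP input."""
    h, w, _ = image.shape
    rows = np.arange(size) * h // size        # source row for each output row
    cols = np.arange(size) * w // size        # source column for each output column
    resized = image[rows][:, cols]            # nearest-neighbor resize
    scaled = resized.astype(np.float32) / 255.0
    return scaled.transpose(2, 0, 1)          # HWC -> CHW
```
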
…zations

- Add MobileLLM 140M model architecture and configuration
- Integrate FastViT vision encoder with configurable image sizes
- Optimize training for low-memory GPUs (1 GPU, reduced batch size, increased recomputation)
- Add inference script for FastVLM testing
- Configure training for 5 sample test runs to validate setup
- Update .gitignore to exclude large checkpoint files
- Add HuggingFace checkpoint (model.safetensors, tokenizer.json)
- Add Megatron checkpoint (TP=2 format)
- Configure Git LFS for large model files
- Add MobileLLM config and layer specs
- Update model provider and layer specs to support MobileLLM backbone
- Add stage 1 alignment script for MobileLLM-140M
- Add comprehensive integration documentation
- Update alignment.sh to support MobileLLM training option
- Implement MobileLLM-R1-140M (140M params) with GQA, SwiGLU, RMSNorm
- Fix QK LayerNorm configuration (was disabled, now enabled)
- Add FastViT/MobileCLIP vision encoder support
- Add local _is_te_min_version() implementation to fix import error
- Update training scripts for MobileLLM experiments
- Verify model architecture matches official config.json
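
Two of the MobileLLM building blocks named above, RMSNorm and SwiGLU, can be sketched in plain NumPy (an illustration only; the PR's actual implementation lives in the Megatron layer specs):

```python
import numpy as np

def rms_norm(x: np.ndarray, weight: np.ndarray, eps: float = 1e-6) -> np.ndarray:
    """Root-mean-square LayerNorm: scale by 1/RMS, no mean subtraction."""
    rms = np.sqrt(np.mean(x * x, axis=-1, keepdims=True) + eps)
    return x / rms * weight

def swiglu(x: np.ndarray, w_gate: np.ndarray, w_up: np.ndarray) -> np.ndarray:
    """SwiGLU gate: SiLU(x @ w_gate) elementwise-multiplied with (x @ w_up)."""
    gate = x @ w_gate
    silu = gate / (1.0 + np.exp(-gate))  # SiLU / swish activation
    return silu * (x @ w_up)
```
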
- Add convert_fastvit_hf_to_mcore.sh: Convert FastViT from HF to Megatron format
- Add convert_mobilellm_hf_to_mcore.sh: Convert MobileLLM-R1-140M to Megatron format
- Add merge_mobilellm_fastvit.sh: Merge language and vision checkpoints
- Update stage_1_alignment_mobilellm_140m.sh: Training config with merged checkpoint
- Add test_inference_mobilellm.py: Inference script for FastVLM
- Fix nvcc path resolution for multiple CUDA installations
- Update training_utils.py: FastViT checkpoint handling

Successfully tested:
- Checkpoint conversion: 629 vision + 152 language tensors
- Merged checkpoint: 781 keys, 774MB
- Training verified: Loss 11.87, 36.6 tokens/sec/GPU on A100-40GB
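
The vision/language checkpoint merge verified above (629 + 152 tensors, 781 merged keys) amounts to combining two state dicts under distinct namespaces; a minimal sketch (the prefix strings are assumptions, not the merge script's actual key layout):

```python
def merge_checkpoints(vision: dict, language: dict,
                      vision_prefix: str = "vision_model.",
                      language_prefix: str = "language_model.") -> dict:
    """Merge two state dicts into one, namespacing each under its own prefix."""
    merged = {vision_prefix + k: v for k, v in vision.items()}
    for k, v in language.items():
        key = language_prefix + k
        if key in merged:
            raise KeyError(f"duplicate key after prefixing: {key}")
        merged[key] = v
    return merged
```
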
- Load FastViT config from mobileclip_l.json to get correct architecture values
- Print full config objects for language, vision, and adapter
- Add local_files_only parameter for HF tokenizer when loading from filesystem
- Fixes vision config showing incorrect language model values
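
Reading the vision architecture from mobileclip_l.json instead of reusing language-model values, as the fix above describes, can be sketched like this (the JSON field names here are hypothetical; the real mobileclip_l.json schema may differ):

```python
import json

def load_fastvit_config(path: str) -> dict:
    """Read FastViT/MobileCLIP architecture values from its own JSON file,
    so the vision config is never populated with language-model defaults."""
    with open(path) as f:
        cfg = json.load(f)
    # Hypothetical keys with fallback defaults for mobileclip_l_384.
    return {
        "image_size": cfg.get("image_cfg", {}).get("image_size", 384),
        "embed_dim": cfg.get("embed_dim", 768),
    }
```
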
