
🗺️ Roadmap

Vision

Build the most comprehensive open-source library of AI research skills, enabling AI agents to autonomously conduct experiments from hypothesis to deployment.

Target: 86 comprehensive skills — achieved ✅

Progress Overview

| Metric | Current | Target |
|---|---|---|
| Skills | 86 (high-quality, standardized YAML) | 86 ✅ |
| Avg Lines/Skill | 420 lines (focused + progressive disclosure) | 200-500 lines |
| Documentation | ~130,000 lines total (SKILL.md + references) | 100,000+ lines |
| Gold Standard Skills | 65 with comprehensive references | 50+ ✅ |
| Coverage | Autoresearch, Ideation, Paper Writing, Architecture, Tokenization, Fine-Tuning, Data Processing, Post-Training, Safety, Distributed, Infrastructure, Optimization, Evaluation, Inference, Agents, RAG, Multimodal, MLOps, Observability, Prompt Engineering, Emerging Techniques | Full Lifecycle ✅ |

Development Phases

✅ Phase 1: Model Architecture (COMPLETE - 5 skills)

Status: Core model architectures covered

Completed Skills:

  • Megatron-Core - NVIDIA's framework for training 2B-462B param models
  • LitGPT - Lightning AI's 20+ clean LLM implementations
  • Mamba - State-space models with O(n) complexity
  • RWKV - RNN+Transformer hybrid, infinite context
  • NanoGPT - Educational GPT in ~300 lines by Karpathy

✅ Phase 2: Tokenization (COMPLETE - 2 skills)

Status: Essential tokenization frameworks covered

Completed Skills:

  • HuggingFace Tokenizers - Rust-based, BPE/WordPiece/Unigram (see the sketch after this list)
  • SentencePiece - Language-independent tokenization
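
A minimal sketch of training a BPE tokenizer with HuggingFace Tokenizers; the corpus file, vocab size, and special tokens are placeholder choices:

```python
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.pre_tokenizers import Whitespace
from tokenizers.trainers import BpeTrainer

# Build an empty BPE tokenizer and train it on a local corpus
tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
tokenizer.pre_tokenizer = Whitespace()
trainer = BpeTrainer(vocab_size=32000, special_tokens=["[UNK]", "[PAD]"])
tokenizer.train(files=["corpus.txt"], trainer=trainer)  # corpus.txt is a placeholder
tokenizer.save("tokenizer.json")

print(tokenizer.encode("Hello world").tokens)
```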

✅ Phase 3: Fine-Tuning (COMPLETE - 4 skills)

Status: Core fine-tuning frameworks covered

Completed Skills:

  • Axolotl - YAML-based fine-tuning with 100+ models
  • LLaMA-Factory - No-code fine-tuning via a web UI
  • Unsloth - 2x faster QLoRA fine-tuning
  • PEFT - Parameter-efficient fine-tuning with LoRA, QLoRA, DoRA, 25+ methods (see the sketch after this list)
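
A minimal sketch of LoRA adaptation with PEFT; the base model and hyperparameters are placeholders:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Wrap a base model with low-rank adapters; only the adapter
# weights (typically <1% of parameters) receive gradients.
base = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")  # placeholder model
config = LoraConfig(
    r=16,                                  # rank of the low-rank update matrices
    lora_alpha=32,                         # scaling factor
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()
```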

✅ Phase 4: Data Processing (COMPLETE - 2 skills)

Status: Distributed data processing covered

Completed Skills:

  • Ray Data - Distributed ML data processing
  • NeMo Curator - GPU-accelerated data curation

✅ Phase 5: Post-Training (COMPLETE - 4 skills)

Status: RLHF and alignment techniques covered

Completed Skills:

  • TRL Fine-Tuning - Transformer Reinforcement Learning (see the sketch after this list)
  • GRPO-RL-Training - Group Relative Policy Optimization (gold standard)
  • OpenRLHF - Full RLHF pipeline with Ray + vLLM
  • SimPO - Simple Preference Optimization
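
TRL's API has shifted across versions; this sketch follows the SFTTrainer/SFTConfig style of recent releases, with a placeholder model and dataset, as the supervised step that typically precedes DPO/GRPO:

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("trl-lib/Capybara", split="train")  # placeholder dataset
trainer = SFTTrainer(
    model="Qwen/Qwen2-0.5B",  # placeholder base model
    train_dataset=dataset,
    args=SFTConfig(output_dir="sft-out", max_steps=100),
)
trainer.train()
```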

✅ Phase 6: Safety & Alignment (COMPLETE - 4 skills)

Status: Core safety frameworks covered

Completed Skills:

  • Constitutional AI - AI-driven self-improvement via principles
  • LlamaGuard - Safety classifier for LLM inputs/outputs
  • NeMo Guardrails - Programmable guardrails with Colang (see the sketch after this list)
  • Prompt Guard - Meta's 86M-parameter prompt-injection and jailbreak detector
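
A minimal sketch of wrapping a chat with NeMo Guardrails, assuming the standard `RailsConfig`/`LLMRails` entry points; the config directory (Colang flows + YAML) is a placeholder:

```python
from nemoguardrails import LLMRails, RailsConfig

# Load a rails configuration from a local directory
config = RailsConfig.from_path("./guardrails_config")  # placeholder path
rails = LLMRails(config)

response = rails.generate(messages=[
    {"role": "user", "content": "How do I make a dangerous chemical?"}
])
print(response["content"])  # refusal or rail-approved answer
```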

✅ Phase 7: Distributed Training (COMPLETE - 5 skills)

Status: Major distributed training frameworks covered

Completed Skills:

  • DeepSpeed - Microsoft's ZeRO optimization
  • PyTorch FSDP - Fully Sharded Data Parallel
  • Accelerate - HuggingFace's distributed training API (see the sketch after this list)
  • PyTorch Lightning - High-level training framework
  • Ray Train - Multi-node orchestration
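
A minimal sketch of the Accelerate training loop; the toy model, data, and hyperparameters are placeholders, and the same script scales from one GPU to multi-node when started with `accelerate launch`:

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

accelerator = Accelerator()  # device placement, DDP, mixed precision from `accelerate config`
model = torch.nn.Linear(128, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
data = TensorDataset(torch.randn(1024, 128), torch.randint(0, 2, (1024,)))
loader = DataLoader(data, batch_size=32)

# prepare() wraps everything for the current distributed setup
model, optimizer, loader = accelerator.prepare(model, optimizer, loader)
loss_fn = torch.nn.CrossEntropyLoss()
for x, y in loader:
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    accelerator.backward(loss)  # replaces loss.backward()
    optimizer.step()
```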

✅ Phase 8: Optimization (COMPLETE - 6 skills)

Status: Core optimization techniques covered

Completed Skills:

  • Flash Attention - 2-4x faster attention with memory efficiency
  • bitsandbytes - 8-bit/4-bit quantization (see the sketch after this list)
  • GPTQ - 4-bit post-training quantization
  • AWQ - Activation-aware weight quantization
  • HQQ - Half-Quadratic Quantization without calibration data
  • GGUF - llama.cpp quantization format for CPU/Metal inference
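
A minimal sketch of 4-bit loading with bitsandbytes via transformers; the model name is a placeholder:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Store weights in 4-bit NF4; matmuls still compute in bf16
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-1.3b",  # placeholder model
    quantization_config=bnb_config,
    device_map="auto",
)
```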

✅ Phase 9: Evaluation (COMPLETE - 1 skill)

Status: Standard benchmarking framework available

Completed Skills:

  • lm-evaluation-harness - EleutherAI's standard for benchmarking LLMs (see the sketch below)
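
A minimal sketch, assuming lm-evaluation-harness v0.4+ where `lm_eval.simple_evaluate` is the Python entry point; the model and tasks are placeholders:

```python
import lm_eval

# Evaluate a HuggingFace model on standard benchmarks
results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=facebook/opt-1.3b",  # placeholder model
    tasks=["hellaswag", "arc_easy"],
    batch_size=8,
)
print(results["results"])  # per-task accuracy and stderr
```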

✅ Phase 10: Inference & Serving (COMPLETE - 4 skills)

Status: Production inference frameworks covered

Completed Skills:

  • vLLM - High-throughput LLM serving with PagedAttention (see the sketch after this list)
  • TensorRT-LLM - NVIDIA's fastest inference
  • llama.cpp - CPU/Apple Silicon inference
  • SGLang - Structured generation with RadixAttention
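
A minimal sketch of offline batch inference with vLLM; the model name is a placeholder:

```python
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-1.3b")  # placeholder model
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# generate() batches prompts and schedules them with PagedAttention
outputs = llm.generate(["Explain PagedAttention in one sentence."], params)
print(outputs[0].outputs[0].text)
```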

✅ Phase 10.5: Infrastructure (COMPLETE - 3 skills)

Status: Cloud infrastructure and orchestration covered

Completed Skills:

  • Modal - Serverless GPU cloud with Python-native API, T4-H200 on-demand (see the sketch after this list)
  • SkyPilot - Multi-cloud orchestration across 20+ providers with spot recovery
  • Lambda Labs - Reserved/on-demand GPU cloud with H100/A100, persistent filesystems
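
A minimal sketch of running a function on a Modal GPU, assuming a recent SDK where `modal.App` replaced `modal.Stub`; the GPU type and function names are illustrative:

```python
import modal

app = modal.App("gpu-smoke-test")
image = modal.Image.debian_slim().pip_install("torch")

@app.function(gpu="A10G", image=image)
def gpu_name() -> str:
    import torch
    return torch.cuda.get_device_name(0)

@app.local_entrypoint()
def main():
    print(gpu_name.remote())  # runs on a cloud GPU, result returns locally
```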

✅ Phase 11: Agents (COMPLETE - 4 skills)

Status: Major agent frameworks covered

Completed Skills:

  • LangChain - Most popular agent framework, 500+ integrations
  • LlamaIndex - Data framework for LLM apps, 300+ connectors
  • CrewAI - Multi-agent orchestration with role-based collaboration
  • AutoGPT - Autonomous AI agent platform with visual workflow builder

✅ Phase 12: RAG (COMPLETE - 5 skills)

Status: Core RAG and vector database skills covered

Completed Skills:

  • Chroma - Open-source embedding database
  • FAISS - Facebook's similarity search, billion-scale
  • Sentence Transformers - 5000+ embedding models (paired with FAISS in the sketch after this list)
  • Pinecone - Managed vector database
  • Qdrant - High-performance Rust vector search with hybrid filtering
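
A minimal sketch of the embed-and-search loop behind most RAG stacks, here with Sentence Transformers and FAISS; the model and documents are placeholders:

```python
import faiss
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
docs = [
    "FAISS performs billion-scale similarity search.",
    "Chroma is an open-source embedding database.",
]
emb = model.encode(docs, normalize_embeddings=True)

# Inner product on unit vectors == cosine similarity
index = faiss.IndexFlatIP(emb.shape[1])
index.add(emb)

query = model.encode(["library for vector similarity search"], normalize_embeddings=True)
scores, ids = index.search(query, k=1)
print(docs[ids[0][0]], float(scores[0][0]))
```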

✅ Phase 13: Multimodal (COMPLETE - 7 skills)

Status: Comprehensive multimodal frameworks covered

Completed Skills:

  • CLIP - OpenAI's vision-language model (see the sketch after this list)
  • Whisper - Robust speech recognition, 99 languages
  • LLaVA - Vision-language assistant, GPT-4V level
  • Stable Diffusion - Text-to-image generation via HuggingFace Diffusers
  • Segment Anything (SAM) - Meta's zero-shot image segmentation with points/boxes/masks
  • BLIP-2 - Vision-language pretraining with Q-Former, image captioning, VQA
  • AudioCraft - Meta's MusicGen/AudioGen for text-to-music and text-to-sound
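
A minimal sketch of zero-shot image classification with CLIP via transformers; the image URL and captions are placeholders:

```python
import requests
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"  # example image
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(text=["a photo of a cat", "a photo of a dog"],
                   images=image, return_tensors="pt", padding=True)
probs = model(**inputs).logits_per_image.softmax(dim=1)
print(probs)  # similarity of the image to each caption
```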

✅ Phase 14: Advanced Optimization (COMPLETE)

Status: Advanced optimization techniques covered (merged into Phase 8)

Note: HQQ and GGUF skills have been completed and merged into Phase 8: Optimization.

✅ Phase 15: MLOps & Observability (COMPLETE - 5 skills)

Status: Core MLOps and LLM observability covered

Completed Skills:

  • MLflow - Open-source MLOps platform for tracking experiments (see the sketch after this list)
  • TensorBoard - Visualization and experiment tracking
  • Weights & Biases - Experiment tracking and collaboration
  • LangSmith - LLM observability, tracing, evaluation
  • Phoenix - Open-source AI observability with OpenTelemetry tracing
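
A minimal sketch of experiment tracking with MLflow; the experiment name, params, and loss values are placeholders:

```python
import mlflow

mlflow.set_experiment("lora-sweep")  # placeholder experiment name
with mlflow.start_run():
    mlflow.log_params({"rank": 16, "lr": 2e-4})
    for step, loss in enumerate([0.9, 0.6, 0.45]):
        mlflow.log_metric("train_loss", loss, step=step)
```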

✅ Phase 16: Prompt Engineering & Advanced Applications (COMPLETE - 6 skills)

Status: Core prompt engineering and multi-agent tools covered

Completed Skills:

  • DSPy - Declarative prompt optimization and LM programming
  • Guidance - Constrained generation and structured prompting
  • Instructor - Structured output with Pydantic models (see the sketch after this list)
  • Outlines - Structured text generation with regex and grammars
  • CrewAI - Multi-agent orchestration (completed in Phase 11)
  • AutoGPT - Autonomous agents (completed in Phase 11)
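
A minimal sketch of structured extraction with Instructor; the model name is a placeholder and an OpenAI API key is assumed:

```python
import instructor
from openai import OpenAI
from pydantic import BaseModel

class Paper(BaseModel):
    title: str
    year: int

# Patch the OpenAI client so responses are validated against the Pydantic model
client = instructor.from_openai(OpenAI())
paper = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    response_model=Paper,
    messages=[{"role": "user", "content": "Extract: 'Attention Is All You Need' (2017)"}],
)
print(paper.title, paper.year)
```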

✅ Phase 17: Extended Multimodal (COMPLETE)

Status: All extended multimodal skills complete, merged into Phase 13

Note: BLIP-2, SAM, and AudioCraft have been completed and merged into Phase 13: Multimodal.

✅ Phase 18: Emerging Techniques (COMPLETE - 6 skills)

Status: Core emerging techniques covered

Completed Skills:

  • MoE Training - Mixture of Experts with DeepSpeed/HuggingFace
  • Model Merging - mergekit, SLERP, and model composition
  • Long Context - RoPE extensions, ALiBi, and context scaling
  • Speculative Decoding - Medusa, Lookahead, and draft models for faster inference (see the sketch after this list)
  • Knowledge Distillation - MiniLLM, reverse KLD, teacher-student training
  • Model Pruning - Wanda, SparseGPT, and structured pruning
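
The simplest way to try draft-model speculative decoding is HuggingFace assisted generation; the target and draft models below are placeholders (Medusa and Lookahead need their own setups):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assisted generation: a small draft model proposes tokens that the
# large target model verifies in parallel, without changing outputs.
tok = AutoTokenizer.from_pretrained("facebook/opt-1.3b")
target = AutoModelForCausalLM.from_pretrained("facebook/opt-1.3b")   # placeholder target
draft = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")    # placeholder draft

inputs = tok("The key idea behind speculative decoding is", return_tensors="pt")
out = target.generate(**inputs, assistant_model=draft, max_new_tokens=40)
print(tok.decode(out[0], skip_special_tokens=True))
```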

Contributing to the Roadmap

Want to help us achieve these goals?

  1. Pick a skill from the roadmap - Comment on GitHub Discussions to claim it
  2. Follow the contribution guide - Use our template and quality standards
  3. Submit your PR - We review within 48 hours

🎉 Roadmap Complete!

All 86 skills have been completed! The library now covers the full AI research lifecycle:

  1. Phase 1-10: Core ML infrastructure (Architecture, Tokenization, Fine-Tuning, Data Processing, Post-Training, Safety, Distributed Training, Optimization, Evaluation, Inference)
  2. Phase 10.5: Infrastructure (Modal, SkyPilot, Lambda Labs)
  3. Phase 11-12: Applications (Agents, RAG)
  4. Phase 13: Multimodal (CLIP, Whisper, LLaVA, Stable Diffusion, SAM, BLIP-2, AudioCraft)
  5. Phase 14-16: Advanced (Optimization, MLOps & Observability, Prompt Engineering)
  6. Phase 17-18: Extended (Extended Multimodal, Emerging Techniques)

Future Directions

While the 86-skill roadmap is complete, the library will continue to evolve with:

  • Updates: Keeping existing skills current with latest versions
  • Community contributions: Additional skills from contributors
  • Emerging tools: New frameworks and techniques as they mature

Philosophy

Quality over Quantity: Each skill must provide real value with comprehensive guidance, not just links to docs. We aim for 300+ lines of expert-level content per skill, with real code examples, troubleshooting guides, and production-ready workflows.