MakazhanAlpamys
diff --git a/CLAUDE.md b/CLAUDE.md
Lines changed: 12 additions & 1 deletion
diff --git a/README.md b/README.md
Lines changed: 46 additions & 2 deletions
diff --git a/pyproject.toml b/pyproject.toml
Lines changed: 2 additions & 2 deletions
diff --git a/soup_cli/__init__.py b/soup_cli/__init__.py
Lines changed: 1 addition & 1 deletion
diff --git a/soup_cli/__main__.py b/soup_cli/__main__.py
Lines changed: 2 additions & 2 deletions
diff --git a/soup_cli/cli.py b/soup_cli/cli.py
Lines changed: 44 additions & 2 deletions
diff --git a/soup_cli/commands/doctor.py b/soup_cli/commands/doctor.py
Lines changed: 174 additions & 0 deletions
@@ -71,6 +71,14 @@ soup train --config soup.yaml
 
 **DeepSpeed:** `utils/deepspeed.py` provides ZeRO Stage 2/3 config templates. `commands/train.py` supports `--deepspeed zero2|zero3|zero2_offload|<path>`. Trainers (SFT/DPO) pass `deepspeed` to HF TrainingArguments. Requires `pip install 'soup-cli[deepspeed]'`.
 
+**Error handling:** `utils/errors.py` maps known exceptions (CUDA OOM, missing deps, connection errors, validation errors) to friendly 2-3 line messages with fix suggestions. `cli.py` wraps all commands in a try/except and shows full tracebacks only when the `--verbose` flag is set.
+
+**Doctor:** `commands/doctor.py` checks system info, GPU availability, and all dependency versions. Reports missing/outdated packages with fix suggestions.
+
+**Quickstart:** `commands/quickstart.py` runs a complete demo: it creates a 20-example Alpaca dataset and a TinyLlama config, then trains a LoRA adapter. Supports `--dry-run` to create the files without training.
+
+**Confirmation prompts:** `commands/train.py` and `commands/sweep.py` ask for confirmation before starting. Skip with `--yes` / `-y`.
+
 ## Code Conventions
 
 - **Line length:** 100 chars (ruff enforced)
@@ -99,7 +107,7 @@ soup train --config soup.yaml
 
 ## Tests
 
-Test suite (~281 tests) lives in `tests/`:
+Test suite (~321 tests) lives in `tests/`:
 
 | File | Covers |
 |---|---|
@@ -128,3 +136,6 @@ Test suite (~281 tests) lives in `tests/`:
 | `test_sweep.py` | Sweep params parsing, combinations, nested config |
 | `test_diff.py` | Diff prompts collection, metrics, CLI |
 | `test_deepspeed.py` | DeepSpeed configs, multi-GPU detection, trainer integration |
+| `test_errors.py` | Friendly error messages, --verbose flag, error mapping |
+| `test_doctor.py` | `soup doctor` command, version checking, dependency table |
+| `test_quickstart.py` | `soup quickstart` demo, data/config creation, --dry-run |
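
The error-handling note in this hunk describes `utils/errors.py` only in outline, and that file is not part of this excerpt. A minimal sketch of how such a mapping could work, assuming a hypothetical `friendly_message` helper and illustrative message wording (neither is taken from the PR):

```python
"""Sketch: map known exceptions to short messages with a fix suggestion.

`friendly_message` and the message text are hypothetical; the real
`utils/errors.py` is not shown in this diff.
"""
import traceback

# Each rule is (predicate, message, suggested fix); the first match wins.
_RULES = [
    (
        lambda exc: "out of memory" in str(exc).lower(),
        "GPU ran out of memory.",
        "Try a smaller batch size.",
    ),
    (
        lambda exc: isinstance(exc, ImportError),
        "A required package is missing.",
        "Run `soup doctor` to see what to install.",
    ),
    (
        lambda exc: isinstance(exc, ConnectionError),
        "Could not reach the remote server.",
        "Check your network connection and retry.",
    ),
]


def friendly_message(exc: BaseException, verbose: bool = False) -> str:
    """Return a short two-line message, or the full traceback when verbose."""
    if verbose:
        return "".join(
            traceback.format_exception(type(exc), exc, exc.__traceback__)
        )
    for matches, message, fix in _RULES:
        if matches(exc):
            return f"Error: {message}\nFix: {fix}"
    # Unknown errors still get a short message plus a pointer to --verbose.
    return f"Error: {exc}\nFix: re-run with --verbose for the full traceback."
```

Keeping the rules in a plain list makes the mapping easy to extend and to unit-test without raising real CUDA or network errors.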
@@ -21,7 +21,7 @@
   <a href="https://pypi.org/project/soup-cli/"><img src="https://img.shields.io/pypi/v/soup-cli?color=blue" alt="PyPI"></a>
   <img src="https://img.shields.io/badge/python-3.9%2B-blue" alt="Python 3.9+">
   <img src="https://img.shields.io/badge/license-MIT-green" alt="MIT License">
-  <img src="https://img.shields.io/badge/tests-281%20passed-brightgreen" alt="Tests">
+  <img src="https://img.shields.io/badge/tests-321%20passed-brightgreen" alt="Tests">
   <a href="https://github.com/MakazhanAlpamys/Soup/actions"><img src="https://github.com/MakazhanAlpamys/Soup/actions/workflows/ci.yml/badge.svg" alt="CI"></a>
 </p>
 
@@ -324,6 +324,43 @@ soup train --config soup.yaml --deepspeed zero2_offload
 soup train --config soup.yaml --deepspeed ./my_ds_config.json
 ```
 
+## Quickstart Demo
+
+Run a complete demo in one command. It creates sample data and a config, then trains a tiny model:
+
+```bash
+# Full demo (creates data + config + trains TinyLlama)
+soup quickstart
+
+# Just create files without training
+soup quickstart --dry-run
+
+# Skip confirmation
+soup quickstart --yes
+```
+
+## Health Check
+
+Check your environment for compatibility issues:
+
+```bash
+soup doctor
+```
+
+Shows: Python version, GPU availability, all dependency versions, and fix suggestions.
+
+## Error Handling
+
+Soup shows friendly error messages by default (2-3 lines with a fix suggestion). For full tracebacks:
+
+```bash
+# Any command with --verbose
+soup train --config soup.yaml --verbose
+
+# Global flag works with all commands
+soup --verbose eval --model ./output --benchmarks mmlu
+```
+
 ## Data Formats
 
 Soup supports these formats (auto-detected):
@@ -429,6 +466,10 @@ soup eval --model ./output --benchmarks mmlu --run-id run_20260223_143052_a1b2
 | Hyperparameter sweep (grid/random) | ✅ |
 | Model comparison (diff) | ✅ |
 | Multi-GPU / DeepSpeed | ✅ |
+| Friendly error messages | ✅ |
+| Health check (soup doctor) | ✅ |
+| Quickstart demo | ✅ |
+| Confirmation prompts | ✅ |
 | Web dashboard | 🔜 |
 | Cloud mode (BYOG) | 🔜 |
 
@@ -459,7 +500,10 @@ soup sweep --config soup.yaml --param lr=...  Hyperparameter search
 soup diff --model-a ./a --model-b ./b         Compare two models
 soup data generate --prompt "..." --count 100 Generate synthetic data
 soup train --deepspeed zero2                  Multi-GPU with DeepSpeed
+soup doctor                                   Check environment & dependencies
+soup quickstart [--dry-run]                   Full demo: create data + config + train
 soup version                                  Show version
+soup --verbose <command>                      Show full traceback on errors
 ```
 
 ## Requirements
@@ -478,7 +522,7 @@ pip install -e ".[dev]"
 # Lint
 ruff check soup_cli/ tests/
 
-# Run unit tests (fast, no GPU needed — 281 tests)
+# Run unit tests (fast, no GPU needed — 321 tests)
 pytest tests/ -v
 
 # Run smoke tests (downloads tiny model, runs real training)
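
The README changes mention confirmation prompts and a `--yes` flag, but the excerpt does not show how the prompt is implemented. A minimal sketch of such a gate, assuming a hypothetical `confirm_or_skip` helper; the prompt wording and the default answer of "no" are assumptions:

```python
"""Sketch of a confirmation gate with a `--yes` style bypass.

`confirm_or_skip` is a hypothetical helper; the prompt wording and the
default of "no" are assumptions, not taken from the PR.
"""
from typing import Callable


def confirm_or_skip(
    question: str,
    assume_yes: bool = False,
    ask: Callable[[str], str] = input,
) -> bool:
    """Return True if the user agrees, or immediately when assume_yes is set."""
    if assume_yes:
        return True  # corresponds to passing --yes / -y
    answer = ask(f"{question} [y/N]: ").strip().lower()
    return answer in ("y", "yes")
```

Passing the `ask` callable in as a parameter lets tests replace `input` with a stub, so the prompt logic can be checked without a terminal.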
 
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "soup-cli"
-version = "0.3.0"
+version = "0.3.1"
 description = "Fine-tune LLMs in one command. No SSH, no config hell."
 readme = "README.md"
 license = "MIT"
@@ -48,7 +48,7 @@ generate = ["httpx>=0.24.0"]
 deepspeed = ["deepspeed>=0.12.0"]
 
 [project.scripts]
-soup = "soup_cli.cli:app"
+soup = "soup_cli.cli:run"
 
 [project.urls]
 Homepage = "https://github.com/MakazhanAlpamys/Soup"
 
@@ -1,3 +1,3 @@
 """Soup CLI — Fine-tune LLMs in one command."""
 
-__version__ = "0.3.0"
+__version__ = "0.3.1"
@@ -1,5 +1,5 @@
 """Allow running as `python -m soup_cli`."""
 
-from soup_cli.cli import app
+from soup_cli.cli import run
 
-app()
+run()
@@ -1,5 +1,7 @@
 """Main CLI entry point — all commands registered here."""
 
+import sys
+
 import typer
 from rich.console import Console
 
@@ -19,9 +21,14 @@
     sweep,
     train,
 )
+from soup_cli.commands import doctor as doctor_cmd
+from soup_cli.commands import quickstart as quickstart_cmd
 
 console = Console()
 
+# Global verbose flag — set via callback, read by error handler
+_verbose = False
+
 app = typer.Typer(
     name="soup",
     help="Fine-tune LLMs in one command. No SSH, no config hell.",
@@ -45,6 +52,8 @@
 app.command()(serve.serve)
 app.command()(sweep.sweep)
 app.command(name="diff")(diff.diff)
+app.command()(doctor_cmd.doctor)
+app.command()(quickstart_cmd.quickstart)
 
 # Register data generate as a subcommand of data
 data.app.command(name="generate")(generate.generate)
@@ -57,6 +66,39 @@ def version():
 
 
 @app.callback(invoke_without_command=True)
-def main(ctx: typer.Context):
+def main(
+    ctx: typer.Context,
+    verbose: bool = typer.Option(
+        False,
+        "--verbose",
+        "-V",
+        help="Show full traceback on errors",
+    ),
+):
     """Soup — fine-tune LLMs in one command."""
-    pass
+    global _verbose
+    _verbose = verbose
+
+
+def run():
+    """Entry point with friendly error handling."""
+    try:
+        app()
+    except SystemExit:
+        raise
+    except typer.Exit:
+        raise
+    except KeyboardInterrupt:
+        console.print("\n[yellow]Interrupted.[/]")
+        sys.exit(130)
+    except Exception as exc:
+        from soup_cli.utils.errors import format_friendly_error
+
+        format_friendly_error(exc, verbose=_verbose)
+        sys.exit(1)
+
+
+# When invoked via `soup` entry point, use run() for error handling.
+# When invoked via `python -m soup_cli`, __main__.py calls run() directly.
+if __name__ == "__main__":
+    run()
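
The `run()` wrapper in this hunk follows a common entry-point pattern: exit requests pass through untouched, Ctrl+C becomes exit code 130, and any other exception becomes a short message with exit code 1. A stdlib-only sketch of that control flow, assuming a hypothetical `run_guarded` helper that returns the exit code instead of calling `sys.exit`, which makes the behaviour easy to assert in tests:

```python
"""Sketch of the entry-point pattern used by `run()`; stdlib only.

`run_guarded` is a hypothetical name. It returns an exit code rather than
calling sys.exit so the behaviour can be asserted in tests.
"""
import sys
import traceback
from typing import Callable


def run_guarded(main: Callable[[], None], verbose: bool = False) -> int:
    """Call `main` and translate failures into process exit codes."""
    try:
        main()
    except SystemExit:
        # Explicit exits (including --help) are not errors; pass them on.
        raise
    except KeyboardInterrupt:
        print("Interrupted.", file=sys.stderr)
        return 130  # 128 + SIGINT (2), the shell convention for Ctrl+C
    except Exception as exc:
        if verbose:
            traceback.print_exc()
        else:
            print(f"Error: {exc}", file=sys.stderr)
        return 1
    return 0
```

One point worth verifying against the installed versions: Click, which Typer builds on, converts Ctrl+C into an abort and exits on its own when a command runs in standalone mode, so the `KeyboardInterrupt` branch of a wrapper like this may rarely trigger in practice.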
@@ -0,0 +1,174 @@
+"""soup doctor — check dependency compatibility and system health."""
+
+import platform
+import sys
+
+from rich.console import Console
+from rich.panel import Panel
+from rich.table import Table
+
+console = Console()
+
+# Dependencies to check: (import_name, package_name, min_version, required)
+DEPS = [
+    ("torch", "torch", "2.0.0", True),
+    ("transformers", "transformers", "4.36.0", True),
+    ("peft", "peft", "0.7.0", True),
+    ("trl", "trl", "0.7.0", True),
+    ("datasets", "datasets", "2.14.0", True),
+    ("bitsandbytes", "bitsandbytes", "0.41.0", True),
+    ("accelerate", "accelerate", "0.25.0", True),
+    ("pydantic", "pydantic", "2.0.0", True),
+    ("typer", "typer", "0.9.0", True),
+    ("rich", "rich", "13.0.0", True),
+    ("yaml", "pyyaml", "6.0", True),
+    ("plotext", "plotext", "5.2.0", True),
+    # Optional
+    ("fastapi", "fastapi", "0.104.0", False),
+    ("uvicorn", "uvicorn", "0.24.0", False),
+    ("datasketch", "datasketch", "1.6.0", False),
+    ("lm_eval", "lm-eval", "0.4.0", False),
+    ("wandb", "wandb", "0.15.0", False),
+    ("deepspeed", "deepspeed", "0.12.0", False),
+    ("httpx", "httpx", "0.24.0", False),
+]
+
+
+def doctor():
+    """Check system dependencies, GPU, and compatibility."""
+    console.print("[bold]Soup Doctor[/] — checking your environment...\n")
+
+    # System info
+    console.print(
+        Panel(
+            f"Python:   [bold]{sys.version.split()[0]}[/]\n"
+            f"Platform: [bold]{platform.system()} {platform.release()}[/]\n"
+            f"Arch:     [bold]{platform.machine()}[/]",
+            title="System",
+        )
+    )
+
+    # GPU check
+    _check_gpu()
+
+    # Dependencies table
+    table = Table(title="Dependencies")
+    table.add_column("Package", style="bold")
+    table.add_column("Required", justify="center")
+    table.add_column("Installed", justify="center")
+    table.add_column("Min Version")
+    table.add_column("Status")
+
+    issues = []
+
+    for import_name, pkg_name, min_ver, required in DEPS:
+        try:
+            mod = __import__(import_name)
+            version = getattr(mod, "__version__", getattr(mod, "VERSION", "?"))
+            version_str = str(version)
+
+            if _version_ok(version_str, min_ver):
+                status = "[green]OK[/]"
+            else:
+                status = f"[yellow]outdated (need >={min_ver})[/]"
+                issues.append(f"Upgrade {pkg_name}: pip install '{pkg_name}>={min_ver}'")
+
+            table.add_row(
+                pkg_name,
+                "yes" if required else "optional",
+                version_str,
+                f">={min_ver}",
+                status,
+            )
+        except ImportError:
+            if required:
+                status = "[red]MISSING[/]"
+                issues.append(f"Install {pkg_name}: pip install '{pkg_name}>={min_ver}'")
+            else:
+                status = "[dim]not installed[/]"
+
+            table.add_row(
+                pkg_name,
+                "yes" if required else "optional",
+                "—",
+                f">={min_ver}",
+                status,
+            )
+
+    console.print(table)
+
+    # Summary
+    if issues:
+        console.print(f"\n[yellow]Found {len(issues)} issue(s):[/]")
+        for issue in issues:
+            console.print(f"  [red]>[/] {issue}")
+        console.print("\n[dim]Fix all: pip install -U " + " ".join(
+            f"'{pkg_name}>={min_ver}'"
+            for _, pkg_name, min_ver, required in DEPS
+            if required
+        ) + "[/]")
+    else:
+        console.print("\n[bold green]All checks passed![/] Your environment is ready.")
+
+
+def _check_gpu():
+    """Check GPU availability and display info."""
+    try:
+        import torch
+
+        if torch.cuda.is_available():
+            gpu_count = torch.cuda.device_count()
+            gpus = []
+            for idx in range(gpu_count):
+                name = torch.cuda.get_device_name(idx)
+                props = torch.cuda.get_device_properties(idx)
+                total_bytes = getattr(props, "total_memory", getattr(props, "total_mem", 0))
+                total_gb = total_bytes / (1024 ** 3)
+                gpus.append(f"  GPU {idx}: [bold]{name}[/] ({total_gb:.1f} GB)")
+            gpu_info = "\n".join(gpus)
+            cuda_ver = torch.version.cuda or "N/A"
+            console.print(
+                Panel(
+                    f"CUDA:     [bold green]available[/] (v{cuda_ver})\n"
+                    f"GPUs:     [bold]{gpu_count}[/]\n{gpu_info}",
+                    title="GPU",
+                )
+            )
+        elif hasattr(torch.backends, "mps") and torch.backends.mps.is_available():
+            console.print(
+                Panel(
+                    "Backend:  [bold green]MPS (Apple Silicon)[/]\n"
+                    "Status:   [bold green]available[/]",
+                    title="GPU",
+                )
+            )
+        else:
+            console.print(
+                Panel(
+                    "Backend:  [bold yellow]CPU only[/]\n"
+                    "Warning:  Training will be slow without GPU.",
+                    title="GPU",
+                )
+            )
+    except ImportError:
+        console.print(
+            Panel(
+                "Backend:  [red]unknown (torch not installed)[/]",
+                title="GPU",
+            )
+        )
+
+
+def _version_ok(installed: str, minimum: str) -> bool:
+    """Check if installed version meets minimum requirement."""
+    try:
+        inst_parts = [int(x) for x in installed.split(".")[:3]]
+        min_parts = [int(x) for x in minimum.split(".")[:3]]
+        # Pad to same length
+        while len(inst_parts) < 3:
+            inst_parts.append(0)
+        while len(min_parts) < 3:
+            min_parts.append(0)
+        return inst_parts >= min_parts
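+    except (ValueError, AttributeError):
+        # Non-numeric components such as "0+cu118" or "0rc1" land here,
+        # so those versions are never compared against the minimum.
+        return True  # Can't parse, assume OK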