Merge pull request #12 from hebbihebb/claude/explore-flan-t5-integration-011CUdvVQuiX39GMz9qpR4Xu

hebbihebb · web-flow · commit 8dcf1707c8e9 · 2025-10-30T21:41:31.000Z
Integrate FLAN-T5 Grammar Correction (GPU-accelerated) – Confirmed Working, Formatting Guardrails Needed
diff --git a/CUDA_FIX_README.md b/CUDA_FIX_README.md
@@ -0,0 +1,122 @@
+# GPU Not Detected - Quick Fix Guide
+
+## Problem
+
+You're seeing this in the log:
+```
+CUDA not available, using CPU (this will be slow)
+```
+
+But you have a **GTX 2070 (8GB VRAM)** that should work perfectly!
+
+## Why This Happens
+
+When you run:
+```bash
+pip install torch
+```
+
+...it installs the **CPU-only version** by default. PyTorch requires a special installation command to enable GPU support.
+
+## Impact
+
+**Without GPU (current state)**:
+- ❌ 5-30 seconds per sentence
+- ❌ 100% CPU usage
+- ❌ Very slow for large documents
+
+**With GPU (after fix)**:
+- ✅ 0.5-2 seconds per sentence (**10-50x faster!**)
+- ✅ GPU accelerated
+- ✅ Fast enough for production use
+
+Your GTX 2070 has 8GB VRAM which is perfect for this model (needs ~6-8GB).
+
+## Quick Fix (Automated)
+
+```bash
+# 1. Diagnose the issue
+python check_cuda.py
+
+# 2. Fix it (installs GPU-enabled PyTorch)
+./fix_cuda.sh
+
+# 3. Test it works
+python test_t5_integration.py
+```
+
+The `fix_cuda.sh` script will:
+1. Detect your CUDA version from `nvidia-smi`
+2. Uninstall CPU-only PyTorch
+3. Install GPU-enabled PyTorch (~2-3 GB download)
+4. Verify CUDA is working
+
+## Manual Fix
+
+If you prefer to do it manually:
+
+```bash
+# Check your GPU and CUDA version
+nvidia-smi
+
+# Uninstall CPU-only PyTorch
+pip uninstall torch torchvision torchaudio
+
+# Install with CUDA support (choose based on your CUDA version)
+# For CUDA 11.8:
+pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
+
+# For CUDA 12.1+:
+pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
+
+# Verify it works
+python -c "import torch; print('CUDA available:', torch.cuda.is_available())"
+```
+
+## After the Fix
+
+Once CUDA is detected, the log will show:
+```
+Using CUDA for T5 inference
+✓ Model loaded successfully!
+```
+
+And the model will run **10-50x faster** on your GPU!
+
+## Files Added
+
+Three new diagnostic/fix tools:
+- `check_cuda.py` - Diagnose CUDA detection issues
+- `fix_cuda.sh` - Automated fix script
+- `CUDA_FIX_README.md` - This guide
+
+## Note About the Download
+
+You mentioned the model is still downloading:
+```
+model.safetensors:  15%|█▏      | 472M/3.13G
+```
+
+This is the model weights (3.13 GB), separate from PyTorch. Let that finish downloading, then run the CUDA fix, and you'll be all set!
+
+## Expected Final Behavior
+
+After both downloads complete and CUDA is fixed:
+
+```bash
+$ python test_t5_integration.py
+
+1. Loading T5 model...
+Using CUDA for T5 inference  # ✅ GPU detected
+✓ Model loaded successfully!
+
+2. Testing grammar/spelling corrections:
+
+Test 1:
+  Original:  Thiss sentnce have many speling errrors.
+  Corrected: This sentence has many spelling errors.  # ✅ Fast (0.5-2s)
+```
+
+## Questions?
+
+See the full troubleshooting guide in `T5_INTEGRATION_GUIDE.md` (line 247+).
diff --git a/T5_INTEGRATION_GUIDE.md b/T5_INTEGRATION_GUIDE.md
@@ -246,6 +246,51 @@ filter = T5GrammarFilter(model_name="vennify/t5-base-grammar-correction")
 
 ## Troubleshooting
 
+### Issue: "CUDA not available, using CPU" (but you have a GPU)
+
+**Symptoms**:
+- Log shows "CUDA not available, using CPU (this will be slow)"
+- You have an NVIDIA GPU (e.g., GTX 2070)
+- Very slow inference (5-30 seconds per sentence)
+
+**Cause**: You likely installed the **CPU-only version** of PyTorch
+
+**Solution**:
+```bash
+# 1. Check if CUDA is detected
+python check_cuda.py
+
+# 2. If CUDA is not detected, run the fix script
+./fix_cuda.sh
+
+# This will:
+# - Uninstall CPU-only PyTorch
+# - Install GPU-enabled PyTorch
+# - Verify CUDA works
+```
+
+**Manual fix**:
+```bash
+# Check your CUDA version
+nvidia-smi
+
+# Uninstall CPU-only PyTorch
+pip uninstall torch torchvision torchaudio
+
+# Install with CUDA 11.8 support
+pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
+
+# OR with CUDA 12.1 support
+pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
+
+# Verify
+python -c "import torch; print('CUDA available:', torch.cuda.is_available())"
+```
+
+**Performance difference**:
+- CPU: 5-30 seconds per sentence
+- GPU (GTX 2070): 0.5-2 seconds per sentence (**10-50x faster!**)
+
 ### Issue: "CUDA out of memory"
 
 **Solution**: Reduce max_length or use CPU
diff --git a/check_cuda.py b/check_cuda.py
@@ -0,0 +1,89 @@
+#!/usr/bin/env python3
+"""
+CUDA Diagnostic Script - Check PyTorch and GPU availability
+"""
+
+import sys
+
+print("=" * 60)
+print("CUDA Availability Diagnostic")
+print("=" * 60)
+
+# Check Python version
+print(f"\nPython version: {sys.version}")
+
+# Check if torch is installed
+try:
+    import torch
+    print(f"\n✓ PyTorch installed: version {torch.__version__}")
+except ImportError:
+    print("\n✗ PyTorch not installed!")
+    print("  Install with: pip install torch torchvision torchaudio")
+    sys.exit(1)
+
+# Check CUDA availability in PyTorch
+print(f"\nCUDA available in PyTorch: {torch.cuda.is_available()}")
+
+if torch.cuda.is_available():
+    print(f"✓ CUDA detected!")
+    print(f"  CUDA version: {torch.version.cuda}")
+    print(f"  Number of GPUs: {torch.cuda.device_count()}")
+
+    for i in range(torch.cuda.device_count()):
+        print(f"\n  GPU {i}: {torch.cuda.get_device_name(i)}")
+        props = torch.cuda.get_device_properties(i)
+        print(f"    Total memory: {props.total_memory / 1024**3:.2f} GB")
+        print(f"    Compute capability: {props.major}.{props.minor}")
+else:
+    print("✗ CUDA NOT detected by PyTorch")
+    print("\nPossible reasons:")
+    print("  1. CPU-only PyTorch installed (most likely)")
+    print("  2. NVIDIA drivers not installed")
+    print("  3. CUDA toolkit not installed")
+    print("  4. PyTorch CUDA version doesn't match system CUDA")
+
+# Check PyTorch build info
+print(f"\nPyTorch build info:")
+print(f"  Built with CUDA: {torch.version.cuda is not None}")
+if torch.version.cuda:
+    print(f"  CUDA version: {torch.version.cuda}")
+else:
+    print(f"  ⚠ This is a CPU-only build!")
+
+# Check if NVIDIA GPU exists at system level
+print("\n" + "=" * 60)
+print("System GPU Check")
+print("=" * 60)
+
+try:
+    import subprocess
+    result = subprocess.run(['nvidia-smi'], capture_output=True, text=True, timeout=5)
+    if result.returncode == 0:
+        print("\n✓ nvidia-smi output:")
+        print(result.stdout)
+    else:
+        print("\n✗ nvidia-smi failed (NVIDIA drivers may not be installed)")
+except FileNotFoundError:
+    print("\n✗ nvidia-smi not found (NVIDIA drivers not installed)")
+except Exception as e:
+    print(f"\n⚠ Error running nvidia-smi: {e}")
+
+# Recommendations
+print("\n" + "=" * 60)
+print("Recommendations")
+print("=" * 60)
+
+if not torch.cuda.is_available():
+    print("\nTo enable GPU support:")
+    print("\n1. Check NVIDIA drivers are installed:")
+    print("   nvidia-smi")
+    print("\n2. Reinstall PyTorch with CUDA support:")
+    print("   pip uninstall torch torchvision torchaudio")
+    print("   pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118")
+    print("\n   (cu118 = CUDA 11.8, use cu121 for CUDA 12.1, etc.)")
+    print("\n3. Verify CUDA is detected:")
+    print("   python check_cuda.py")
+else:
+    print("\n✓ Your GPU is ready to use!")
+    print("  The T5 model will automatically use your GPU for inference.")
+    print("  Expected speed: 0.5-2 seconds per sentence (vs 5-30s on CPU)")
diff --git a/fix_cuda.sh b/fix_cuda.sh
@@ -0,0 +1,110 @@
+#!/bin/bash
+# Fix CUDA support for PyTorch - Install GPU-enabled version
+
+set -e
+
+echo "=========================================="
+echo "PyTorch CUDA Fix Script"
+echo "=========================================="
+
+# Check if nvidia-smi exists
+if ! command -v nvidia-smi &> /dev/null; then
+    echo ""
+    echo "✗ nvidia-smi not found!"
+    echo "  NVIDIA drivers are not installed or not in PATH."
+    echo "  Please install NVIDIA drivers first."
+    exit 1
+fi
+
+# Get CUDA version from nvidia-smi
+echo ""
+echo "1. Checking system CUDA version..."
+CUDA_VERSION=$(nvidia-smi | grep "CUDA Version" | awk '{print $9}' | cut -d. -f1,2)
+echo "   Detected CUDA: $CUDA_VERSION"
+
+# Determine PyTorch CUDA version
+if [[ $(echo "$CUDA_VERSION >= 12.1" | bc -l) -eq 1 ]]; then
+    TORCH_CUDA="cu121"
+    echo "   Will install PyTorch with CUDA 12.1 support"
+elif [[ $(echo "$CUDA_VERSION >= 11.8" | bc -l) -eq 1 ]]; then
+    TORCH_CUDA="cu118"
+    echo "   Will install PyTorch with CUDA 11.8 support"
+else
+    TORCH_CUDA="cu118"
+    echo "   ⚠ CUDA version is older, will try CUDA 11.8 build"
+fi
+
+# Show GPU info
+echo ""
+echo "2. GPU Information:"
+nvidia-smi --query-gpu=name,memory.total --format=csv,noheader
+
+# Confirm with user
+echo ""
+echo "=========================================="
+echo "This will:"
+echo "  1. Uninstall current PyTorch (CPU-only)"
+echo "  2. Install PyTorch with CUDA support ($TORCH_CUDA)"
+echo "  3. Download ~2-3 GB of packages"
+echo "=========================================="
+echo ""
+read -p "Continue? (y/n) " -n 1 -r
+echo
+if [[ ! $REPLY =~ ^[Yy]$ ]]; then
+    echo "Cancelled."
+    exit 0
+fi
+
+# Activate virtual environment if it exists
+if [ -d "venv" ]; then
+    echo ""
+    echo "3. Activating virtual environment..."
+    source venv/bin/activate
+    echo "   ✓ Using: $VIRTUAL_ENV"
+elif [ -n "$VIRTUAL_ENV" ]; then
+    echo ""
+    echo "3. Using existing virtual environment: $VIRTUAL_ENV"
+else
+    echo ""
+    echo "⚠ No virtual environment detected!"
+    echo "  Installing to system Python (not recommended)"
+    read -p "Continue anyway? (y/n) " -n 1 -r
+    echo
+    if [[ ! $REPLY =~ ^[Yy]$ ]]; then
+        echo "Cancelled. Create a venv first: python -m venv venv"
+        exit 0
+    fi
+fi
+
+# Uninstall existing PyTorch
+echo ""
+echo "4. Removing CPU-only PyTorch..."
+pip uninstall -y torch torchvision torchaudio 2>/dev/null || true
+echo "   ✓ Uninstalled"
+
+# Install CUDA-enabled PyTorch
+echo ""
+echo "5. Installing PyTorch with CUDA support..."
+echo "   This will download ~2-3 GB..."
+pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/$TORCH_CUDA
+
+echo ""
+echo "=========================================="
+echo "6. Verifying CUDA support..."
+echo "=========================================="
+python check_cuda.py
+
+echo ""
+echo "=========================================="
+echo "✓ Setup Complete!"
+echo "=========================================="
+echo ""
+echo "Your GPU is now ready for T5 inference!"
+echo ""
+echo "Test it:"
+echo "  python test_t5_integration.py"
+echo ""
+echo "Expected performance:"
+echo "  - CPU: 5-30 seconds per sentence"
+echo "  - GPU: 0.5-2 seconds per sentence (10-50x faster!)"
+echo ""
diff --git a/run_t5_test.py b/run_t5_test.py
diff --git a/setup_t5_env.sh b/setup_t5_env.sh