tylerbessire
diff --git a/‎README.md‎
Lines changed: 251 additions & 0 deletions b/‎README.md‎
Lines changed: 251 additions & 0 deletions
diff --git a/‎arc_solver/dsl.py‎
Lines changed: 114 additions & 0 deletions b/‎arc_solver/dsl.py‎
Lines changed: 114 additions & 0 deletions
@@ -0,0 +1,251 @@
+# Enhanced ARC-AGI-2 Solver
+
+This repository contains an advanced solver for the **ARC Prize 2025** competition (ARC‑AGI‑2), implementing the complete blueprint from neuroscience-inspired research. It combines symbolic reasoning with neural guidance, episodic retrieval, program sketches, and test-time training to achieve superior performance on abstract reasoning tasks.
+
+## Key Features
+
+### 🧠 Neuroscience-Inspired Architecture
+- **Neural guidance**: Predicts relevant DSL operations using task features
+- **Episodic retrieval**: Maintains database of solved tasks for analogical reasoning  
+- **Program sketches**: Mines common operation sequences as macro-operators
+- **Test-time training**: Adapts scoring functions to each specific task
+- **Multi-demand network analog**: Prioritizes candidate programs using learned heuristics
+
+### 🔧 Enhanced Capabilities
+- **Object-centric parsing** with connected component analysis
+- **Compact DSL** with composable primitives (rotate, flip, translate, recolor, etc.)
+- **Two-attempt diversity** as required by ARC Prize 2025 rules
+- **Fallback resilience** with graceful degradation to baseline methods
+- **Performance monitoring** with detailed statistics and benchmarking
+
+## Directory Structure
+
+```
+arc_solver_project/
+│
+├── arc_solver/                # Core solver package
+│   ├── grid.py                # Grid operations and utilities
+│   ├── objects.py             # Connected component extraction
+│   ├── dsl.py                 # Domain-specific language primitives
+│   ├── heuristics.py          # Heuristic rule inference
+│   ├── search.py              # Basic brute-force search
+│   ├── solver.py              # Main solver interface (enhanced)
+│   ├── enhanced_solver.py     # Enhanced solver with neural components
+│   ├── enhanced_search.py     # Neural-guided program synthesis
+│   ├── io_utils.py            # JSON loading and submission helpers
+│   └── neural/                # Neural guidance components
+│       ├── features.py        # Task feature extraction
+│       ├── guidance.py        # Neural operation prediction
+│       ├── sketches.py        # Program sketch mining
+│       ├── episodic.py        # Episodic retrieval system
+│       └── ttt.py             # Test-time training
+│
+├── arc_submit.py              # Command-line submission script
+├── train_neural_guidance.py   # Training script for neural components
+├── benchmark.py               # Benchmarking and evaluation tools
+└── README.md                  # This file
+```
+
+## Quick Start
+
+### 1. Basic Usage (Kaggle-ready)
+
+```bash
+# Generate submission file (uses enhanced solver by default)
+python arc_submit.py
+
+# Use baseline solver only (if needed)
+ARC_USE_BASELINE=1 python arc_submit.py
+```
+
+### 2. Training Neural Components
+
+```bash
+# Train neural guidance (requires training data)
+python train_neural_guidance.py
+
+# Or setup environment with defaults
+python benchmark.py
+```
+
+### 3. Python API
+
+```python
+from arc_solver.enhanced_solver import solve_task_enhanced
+
+# Solve a single task with full enhancements
+result = solve_task_enhanced(task)
+
+# Configure solver behavior
+from arc_solver.enhanced_solver import ARCSolver
+solver = ARCSolver(use_enhancements=True)
+result = solver.solve_task(task)
+```
+
+## How It Works
+
+### Enhanced Pipeline
+
+1. **Feature Extraction**: Extract task-level features (colors, objects, transformations)
+2. **Neural Guidance**: Predict which DSL operations are likely relevant
+3. **Episodic Retrieval**: Query database for similar previously solved tasks
+4. **Sketch-Based Search**: Use mined program templates with parameter filling
+5. **Test-Time Adaptation**: Fine-tune scoring function using task demonstrations
+6. **Program Selection**: Rank and select top 2 diverse candidate programs
+
+### Fallback Strategy
+
+If enhanced components fail, the solver gracefully falls back to:
+- Heuristic single-step transformations
+- Brute-force enumeration of 2-step programs
+- Identity transformation as last resort
+
+## Configuration
+
+The solver supports extensive configuration through environment variables and config files:
+
+### Environment Variables
+- `ARC_USE_BASELINE=1`: Force baseline solver only
+- `ARC_DISABLE_ENHANCEMENTS=1`: Disable enhanced features
+
+### Configuration File
+```json
+{
+  "use_neural_guidance": true,
+  "use_episodic_retrieval": true,
+  "use_program_sketches": true,
+  "use_test_time_training": true,
+  "max_programs": 256,
+  "timeout_per_task": 30.0
+}
+```
+
+## Neural Components
+
+### Neural Guidance
+- **Purpose**: Predict which DSL operations are relevant for a given task
+- **Architecture**: Simple MLP with task-level features
+- **Training**: Uses extracted features from training demonstrations
+- **Output**: Operation relevance scores to guide search
+
+### Episodic Retrieval
+- **Purpose**: Reuse solutions from similar previously solved tasks
+- **Method**: Task signature matching with feature-based similarity
+- **Storage**: JSON-based database of solved programs with metadata
+- **Retrieval**: Cosine similarity on numerical features + boolean feature matching
+
+### Program Sketches
+- **Purpose**: Capture common operation sequences as reusable templates
+- **Mining**: Extract frequent 1-step and 2-step operation patterns
+- **Usage**: Instantiate sketches with different parameter combinations
+- **Adaptation**: Learn from successful programs during solving
+
+### Test-Time Training
+- **Purpose**: Adapt scoring function to each specific task
+- **Method**: Fine-tune lightweight scorer on task demonstrations
+- **Features**: Program length, operation types, success rate, complexity
+- **Augmentation**: Generate synthetic training examples via transformations
+
+## Performance and Evaluation
+
+### Benchmarking
+```python
+from benchmark import Benchmark, SolverConfig
+
+config = SolverConfig()
+benchmark = Benchmark(config)
+results = benchmark.run_benchmark("test_data.json")
+print(f"Success rate: {results['performance_stats']['success_rate']:.3f}")
+```
+
+### Monitoring
+The solver tracks detailed statistics:
+- Success rates for enhanced vs baseline methods
+- Component usage (episodic hits, neural guidance, TTT adaptation)
+- Timing breakdown per component
+- Failure mode analysis
+
+## Implementation Notes
+
+### Kaggle Compatibility
+- **Offline execution**: No internet access required
+- **Dependency-light**: Uses only NumPy for core operations
+- **Compute budget**: Optimized for ~$0.42 per task limit
+- **Output format**: Exactly 2 attempts per test input as required
+
+### Code Quality
+- **Type hints**: Full typing support for better maintainability
+- **Documentation**: Comprehensive docstrings and comments
+- **Error handling**: Robust fallback mechanisms
+- **Testing**: Validation and benchmarking utilities
+
+## Extending the Solver
+
+### Adding New DSL Operations
+1. Define operation function in `dsl.py`
+2. Add parameter generation in `sketches.py`
+3. Update feature extraction in `features.py`
+4. Retrain neural guidance if needed
+
+### Improving Neural Components
+1. **Better features**: Add domain-specific feature extractors
+2. **Advanced models**: Replace MLP with transformer/GNN
+3. **Meta-learning**: Implement few-shot adaptation algorithms
+4. **Hybrid methods**: Combine symbolic and neural reasoning
+
+### Advanced Techniques
+- **Probabilistic programming**: Sample programs from learned distributions
+- **Curriculum learning**: Train on tasks of increasing difficulty
+- **Multi-agent reasoning**: Ensemble of specialized solvers
+- **Causal reasoning**: Incorporate causal structure learning
+
+## Research Foundation
+
+This implementation is based on the research blueprint "ARC Prize 2025 & Human Fluid Intelligence" which draws from cognitive neuroscience findings about:
+
+- **Multiple-demand (MD) network**: Neural guidance mimics executive control
+- **Basal ganglia gating**: Operation selection and working memory control  
+- **Hippocampal-mPFC loop**: Episodic retrieval and schema integration
+- **Test-time adaptation**: Rapid task-specific learning from few examples
+
+The solver architecture directly maps these biological systems to computational components.
+
+## Competition Strategy
+
+### Short-term (Immediate)
+- ✅ Strong symbolic baseline with neural enhancements
+- ✅ Episodic retrieval for common patterns
+- ✅ Test-time adaptation for task specialization
+- ✅ Kaggle-ready submission format
+
+### Medium-term (During Contest)
+- Train neural guidance on public training data
+- Mine program sketches from successful solutions  
+- Analyze semi-private feedback for failure modes
+- Expand DSL based on discovered patterns
+
+### Long-term (Advanced Research)
+- Probabilistic program synthesis
+- Hybrid symbolic-neural architecture
+- Broader cognitive priors and meta-learning
+- Integration with large language models
+
+## License
+
+This code is designed to be open-sourced under an appropriate license as required by ARC Prize 2025 rules.
+
+## Citation
+
+If you use this solver or build upon its ideas, please cite the research blueprint and this implementation.
+
+## Contributing
+
+Contributions are welcome! Focus areas:
+- Neural architecture improvements
+- New DSL operations based on failure analysis
+- Advanced meta-learning techniques
+- Performance optimizations for Kaggle constraints
+
+---
+
+**Ready to win ARC Prize 2025!** 🏆
@@ -0,0 +1,114 @@
+"""
+Domain-specific language (DSL) primitives for ARC program synthesis.
+
+This module defines a small set of composable operations that act on grids.
+Each operation is represented by an `Op` object with a name, a function, and
+metadata about its parameters. Programs are sequences of these operations.
+"""
+
+from __future__ import annotations
+
+import numpy as np
+from typing import Any, Callable, Dict, List, Tuple
+
+from .grid import Array, rotate90, flip, transpose, translate, color_map, crop, pad_to, bg_color
+
+
+class Op:
+    """Represents a primitive transformation on a grid.
+
+    Attributes
+    ----------
+    name : str
+        Human-readable name of the operation.
+    fn : Callable
+        Function implementing the operation.
+    arity : int
+        Number of input grids (arity=1 for single-grid ops).
+    param_names : List[str]
+        Names of parameters accepted by the operation.
+    """
+
+    def __init__(self, name: str, fn: Callable[..., Array], arity: int, param_names: List[str]):
+        self.name = name
+        self.fn = fn
+        self.arity = arity
+        self.param_names = param_names
+
+    def __call__(self, *args, **kwargs) -> Array:
+        return self.fn(*args, **kwargs)
+
+
+# Primitive operations (single-grid)
+def op_identity(a: Array) -> Array:
+    return a
+
+
+def op_rotate(a: Array, k: int) -> Array:
+    return rotate90(a, k)
+
+
+def op_flip(a: Array, axis: int) -> Array:
+    return flip(a, axis)
+
+
+def op_transpose(a: Array) -> Array:
+    return transpose(a)
+
+
+def op_translate(a: Array, dy: int, dx: int) -> Array:
+    return translate(a, dy, dx, fill=bg_color(a))
+
+
+def op_recolor(a: Array, mapping: Dict[int, int]) -> Array:
+    return color_map(a, mapping)
+
+
+def op_crop_bbox(a: Array, top: int, left: int, height: int, width: int) -> Array:
+    # ensure cropping stays inside bounds
+    h, w = a.shape
+    top = max(0, min(top, h - 1))
+    left = max(0, min(left, w - 1))
+    height = max(1, min(height, h - top))
+    width = max(1, min(width, w - left))
+    return crop(a, top, left, height, width)
+
+
+def op_pad(a: Array, out_h: int, out_w: int) -> Array:
+    return pad_to(a, (out_h, out_w), fill=bg_color(a))
+
+
+# Register operations in a dictionary for easy lookup
+OPS: Dict[str, Op] = {
+    "identity": Op("identity", op_identity, 1, []),
+    "rotate": Op("rotate", op_rotate, 1, ["k"]),
+    "flip": Op("flip", op_flip, 1, ["axis"]),
+    "transpose": Op("transpose", op_transpose, 1, []),
+    "translate": Op("translate", op_translate, 1, ["dy", "dx"]),
+    "recolor": Op("recolor", op_recolor, 1, ["mapping"]),
+    "crop": Op("crop", op_crop_bbox, 1, ["top", "left", "height", "width"]),
+    "pad": Op("pad", op_pad, 1, ["out_h", "out_w"]),
+}
+
+
+def apply_program(a: Array, program: List[Tuple[str, Dict[str, Any]]]) -> Array:
+    """Apply a sequence of operations (program) to the input array.
+
+    Parameters
+    ----------
+    a : Array
+        Input grid.
+    program : List of (op_name, params)
+        Sequence of operations with parameters. The operations are looked up in
+        OPS.
+
+    Returns
+    -------
+    Array
+        Resulting grid after applying the program.
+    """
+    out = a
+    for name, params in program:
+        op = OPS[name]
+        out = op(out, **params)
+    return out