-
Notifications
You must be signed in to change notification settings - Fork 0
Fix array comparison and add solver logging #11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Train heuristic classifier on combined training and evaluation tasks and expose CLI. [S:DESIGN v1] pass [S:ALG v1] train_from_task_pairs pass [S:TRAIN v1] dataset=train+eval pass [S:TEST v1] unit=103 pass
|
Caution Review failedThe pull request is closed. WalkthroughThe PR updates documentation progress markers; adds operation priors to beam search and integrates them into enhanced search; refactors grid equality; adds deterministic initialization and training APIs for neural guidance; introduces structured logging and enhanced-vs-baseline fallback in the solver; adds training scripts and a model config; and expands tests. Changes
Sequence Diagram(s)sequenceDiagram
autonumber
participant Client
participant ARCSolver
participant Enhanced as EnhancedSearch
participant Baseline as BaselineSearch
participant Logger
Client->>ARCSolver: solve_task_two_attempts(task)
ARCSolver->>Logger: info("start")
alt enhancements enabled
ARCSolver->>Enhanced: synthesize_with_enhancements(task)
Enhanced-->>ARCSolver: enhanced_program?
ARCSolver->>Enhanced: predict_two_enhanced(task)
Enhanced-->>ARCSolver: enhanced_attempts?
ARCSolver->>ARCSolver: _validate_solution(enhanced_attempts)
alt valid
ARCSolver->>Baseline: synthesize_baseline + predict_two_baseline(task)
Baseline-->>ARCSolver: baseline_attempts
ARCSolver->>Logger: info("enhanced success")
ARCSolver-->>Client: [attempt1=enhanced, attempt2=baseline]
else invalid or error
ARCSolver->>Logger: exception("enhanced failed")
ARCSolver->>Baseline: synthesize_baseline + predict_two_baseline(task)
Baseline-->>ARCSolver: baseline_attempts
ARCSolver->>Logger: info("fallback to baseline")
ARCSolver-->>Client: baseline_attempts
end
else enhancements disabled
ARCSolver->>Baseline: synthesize_baseline + predict_two_baseline(task)
Baseline-->>ARCSolver: baseline_attempts
ARCSolver-->>Client: baseline_attempts
end
sequenceDiagram
autonumber
participant Enhanced as EnhancedSearch
participant Guidance as NeuralGuidance
participant Beam as beam_search
Enhanced->>Guidance: score_operations(train_pairs)
Guidance-->>Enhanced: op_scores {op: score}
Enhanced->>Beam: beam_search(train_pairs, ..., op_scores=op_scores)
Beam-->>Enhanced: programs/candidates (scores weighted by op_scores)
sequenceDiagram
autonumber
participant CLI as train_guidance_on_arc.py
participant Loader as _load_tasks
participant Guidance as NeuralGuidance
participant FS as Filesystem
CLI->>Loader: load training/eval tasks
Loader-->>CLI: tasks [[(inp, out), ...], ...]
CLI->>Guidance: train_from_task_pairs(tasks, epochs)
Guidance-->>CLI: trained model in memory
CLI->>FS: save_model(models/guidance_arc.json)
FS-->>CLI: OK
Estimated code review effort🎯 4 (Complex) | ⏱️ ~60 minutes Possibly related PRs
Poem
📜 Recent review detailsConfiguration used: CodeRabbit UI Review profile: CHILL Plan: Pro 📒 Files selected for processing (15)
✨ Finishing touches
🧪 Generate unit tests
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Codex Review: Here are some suggestions.
Reply with @codex fix comments to fix any unresolved comments.
About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you open a pull request for review, mark a draft as ready, or comment "@codex review". If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex fix this CI failure" or "@codex address that feedback".
Summary
tools/train_guidance_on_arc.pyCLI and mark multi-modal reasoning step completemodels/guidance_arc.jsonTesting
pip install -r requirements.txtpytestpython tools/train_guidance_on_arc.py --epochs 5https://chatgpt.com/codex/tasks/task_e_68c3fda055488322804ea540d87df916
Summary by CodeRabbit